Regular Expression Selection

Question

Regular Expression Selection

I have a line like this.

<p class='link'>try</p>bla bla</p>

I want to get only try I tried this. /[^<\/p>]+<\/p>/

But that will not work.

How can i do this? Thank,

+3

ruby regex

Luca romagnoli Jan 31 '11 at 13:04

source share

4 answers

'/<p[^>]+>([^<]+)<\/p>/'

make you try

0

Allen Jan 31 '11 at 13:13

source share

It looks like you used this block: [^<\/p>]+intending to match anything but . Unfortunately, this is not what he does. A block []matches any of the characters inside. In your case, the part /[^<\/p>]+corresponds try</, but the expected ones did not immediately follow , so there was no coincidence.

Alex's decision to use a non-greedy classifier is how I tend to approach this problem.

0

Ray Jan 31 '11 at 13:19

source share

I tried to make it less specific for any particular tag.

(<[^/]+?\s+[^>]*>[^>]*>)

this returns:

try

0

DD-Doug Jan 31 '11 at 14:14

source share

alex · Accepted Answer · 2011-01-31T13:08:22+0000

If this is your line and you want text between tags p, then this should work ...

/<p\sclass='link'>(.*?)<\/p>/

The reason yours doesn't work is because you add <\/p>characters to your range. This does not correspond literally, but not every character checks separately.

Of course, I’m sure to mention that there are more efficient tools for parsing HTML snippets (such as the HTML parser).

Regular Expression Selection

More articles: