Regular expression must match anything inside p tags

Question

I need a regular expression to match all the <p> tags. For example, if I had the text:

 <p>Hello world</p>

The regex should match the Hello world part.

+4

regex

geoffs3310 Feb 03 '11 at 8:44

5 answers

EDIT: Do not do that. Just don't do it.

See this question

If you insist, use <p>(.+?)</p> and the result will be in the first group. This is not ideal, but no regular expression will ever solve the HTML parsing problem.

For example (in Python):

 >>> import re
 >>> r = re.compile('<p>(.+?)</p>')
 >>> r.findall("<p>fo o</p><p>ba adr</p>")
 ['fo o', 'ba adr']
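If a parser is an option, the advice above can be followed with Python's standard library alone. The sketch below uses html.parser.HTMLParser; the class name ParagraphCollector and the helper paragraph_texts are made up for illustration and are not part of the original answer.

```python
from html.parser import HTMLParser


class ParagraphCollector(HTMLParser):
    """Collects the text inside each <p> element (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.paragraphs = []   # finished paragraph texts
        self._depth = 0        # greater than 0 while inside a <p> element
        self._chunks = []      # text pieces of the current paragraph

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names, so "p" also covers <P>.
        if tag == "p":
            self._depth += 1

    def handle_endtag(self, tag):
        if tag == "p" and self._depth:
            self._depth -= 1
            if self._depth == 0:
                self.paragraphs.append("".join(self._chunks))
                self._chunks = []

    def handle_data(self, data):
        # Keep text only while a <p> is open; nested tags are skipped,
        # but their text is retained.
        if self._depth:
            self._chunks.append(data)


def paragraph_texts(html):
    """Return the text content of every <p> element in html."""
    parser = ParagraphCollector()
    parser.feed(html)
    parser.close()
    return parser.paragraphs
```

Unlike the regex, this ignores attributes on the <p> tag, does not confuse <p> with <path>, and keeps the text of nested tags such as <b>.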

+5

Kimvais Feb 03 '11 at 8:46

Regex:

 <([a-z][a-z0-9]*)\b[^>]*>(.*?)</\1>

This will work for any pair of tags.

e.g. <p>hello</p>

\1 ensures that the opening tag matches the closing tag.

The content between the tags is captured in group 2 (\2).
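As a rough illustration of the backreference, here is the same pattern in Python, written with the character class [a-z] for the first letter of the tag name. The function name tag_contents is an assumption made for this sketch.

```python
import re

# Group 1 is the tag name, group 2 is the content between the tags.
# \1 requires the closing tag to repeat whatever name group 1 captured.
TAG_PAIR = re.compile(r'<([a-z][a-z0-9]*)\b[^>]*>(.*?)</\1>')


def tag_contents(html):
    """Return (tag_name, inner_text) pairs for each matched tag pair."""
    return [(m.group(1), m.group(2)) for m in TAG_PAIR.finditer(html)]
```

As written, the pattern only matches lowercase tag names, and . does not cross line breaks unless re.DOTALL is passed to re.compile.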

+1

dogbane Feb 03 '11 at 8:58

It seems that the solutions proposed above will fail to do at least one of the following:

return the text inside <p>...</p> tags if it contains other tags, such as <a>, etc., or
distinguish between <p> and <path>, or
match <p> tags that carry attributes.

Consider using this regex:

<p(|\s+[^>]*)>(.*?)<\/p\s*>

The resulting text will be recorded in group 2.

Obviously, this solution will not work properly when the closing tag </p> is for some reason wrapped in comment tags <!-- ... -->.
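A small Python sketch of this regex, checking the three cases listed above. The name p_contents is hypothetical, and the forward slash is left unescaped because Python patterns have no delimiter.

```python
import re

# Group 1: empty, or whitespace followed by attributes, so <path> is rejected.
# Group 2: the content, matched lazily up to the first closing </p>.
P_TAG = re.compile(r'<p(|\s+[^>]*)>(.*?)</p\s*>')


def p_contents(html):
    """Return the inner content (group 2) of every <p> element found."""
    return [m.group(2) for m in P_TAG.finditer(html)]
```

Called on '<path d="M0 0">shape</path><p class="a">Hi <a href="#">link</a></p >', it skips the <path> element and returns the inner markup of the <p> element, nested <a> tag included.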

0

Alexander Romanov Nov 29 '18 at 20:30

You can use this in Python:

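 import re

 your_variable = 'An HTML text that has <p>tags</p>'
 # str has no find_all method; re.findall returns the captured group of every match
 result = re.findall(r'<p>(.*?)</p>', your_variable)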

0

Ali Jul 15 '19 at 16:50

xzyfer · Accepted Answer · 2011-02-03T08:48:35+0000

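In JavaScript:

 var str = "<p>Hello world</p>";
 // match() exposes the capture group at index 1; search() would only return the match position
 var match = str.match(/<\s*p[^>]*>([^<]*)<\s*\/\s*p\s*>/);
 var text = match ? match[1] : null;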

In PHP:

 $str = "<p>Hello world</p>";
 // $matches[1] holds the text captured between the tags
 preg_match_all('/<\s*p[^>]*>([^<]*)<\s*\/\s*p\s*>/', $str, $matches);

These will also match something more complicated, such as:

 < p style= "font-weight: bold;" >Hello world < / p >
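For comparison, a Python sketch of the same pattern. The function name p_text is an assumption made for this example, and the slash is unescaped because Python patterns have no delimiter.

```python
import re

# Tolerates whitespace inside the tags and attributes on the opening tag.
# [^<]* stops at the first "<", so content with nested tags is not matched.
P_TEXT = re.compile(r'<\s*p[^>]*>([^<]*)<\s*/\s*p\s*>')


def p_text(html):
    """Return the captured text of every <p> element without nested tags."""
    return P_TEXT.findall(html)
```

Because the content group excludes "<", a paragraph such as <p>Hi <b>there</b></p> yields no match at all, which is the trade-off of this approach.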
