Htmlentities with exceptions

Question

Htmlentities with exceptions

I have several possible tags, for example "<main>", "<text>", "<tag>". The rest of the characters that I would like to handle using htmlentities (htmlspecialchars)

<main>
<text>
<tag> <>  X&Y <  <falsetag> <tag attr="123" /> </tag>
</text>
</main>

The result should be

<main>
<text>
<tag> &lt;&gt;  X&amp;Y &lt;  &lt;falsetag&gt; <tag attr="123" /> </tag>
</text>
</main>

What is the best way to do this.

+3

xml php special-characters

liysd Nov 06 '10 at 17:45

source share

3 answers

The only solution I see is to load it into an XML parser, and then recursively build the output string yourself, but this will take a bit of work.

. (, ) , >.

+1

Alin Purcaru 06 . '10 17:59

I have a simple solution that worked well for me:

$text = htmlentities($text, ENT_QUOTES, "UTF-8");
$text = htmlspecialchars_decode($text);
$text = strip_tags($text, "<p><b><h2>");

+1

user37337 May 6, '13 at 23:41

source share

Galen · Accepted Answer · 2010-11-06T18:11:18+0000

You can run htmlentities in text and then use regex to replace valid tags <>

Example...

$str = '<main>
<text>
<tag> <>  X&Y <  <falsetag> <tag attr="123" /> </tag>
</text>
</main>
';

$allowed_tags = array( 'tag', 'text', 'main' );

$escaped_str = htmlentities( $str );

$replace_what = array_map( function($v){ return "~&lt;(/?)$v(.*?)&gt;~"; }, $allowed_tags );
$replace_with = array_map( function($v){ return "<$1$v$2>"; }, $allowed_tags );

echo preg_replace( $replace_what, $replace_with, $escaped_str );

Htmlentities with exceptions

More articles: