I have some set of possible tags for example "<main>", "<text>", "<tag>"
. Rest of characters I would like to treat with htmlentities (htmlspecialchars)
<main>
<text>
<tag> <> X&Y < <falsetag> <tag attr="123" /> </tag>
</text>
</main>
The result should be
<main>
<text>
<tag> <> X&Y < <falsetag> <tag attr="123" /> </tag>
</text>
</main>
What's the best way to do it.
The only solution I see is to load it into an XML parser and then recursively build the output string yourself, but this will take a little work.
Note: A regular expression solution (like what Galen proposed) will not work in all cases because attribute values may contain
>
.You can run htmlentities on the text then use a regular expression to replace the allowed tags
<>
Example...
I have a simple Solution that worked good for me: