Given the text
<b>This is some text</b>
I want to write it to my page so that it shows up like this:
<b>This is some text</b>
and not like this
This is some text
using escape("<b>This is some text</b>")
gives me this lovely gem in firefox
%3Cb%3EThis%20is%20some%20text%3C/b%3E
not exaclty what I'm after. Any ideas?
Traditional Escaping
If you're using XHTML, you'll need to use a
CDATA
section. You can use these in HTML, too, but HTML isn't as strict.I split up the string constants so that this code will work inline on XHTML within CDATA blocks. If you are sourcing your JavaScript as separate files, then you don't need to bother with that. Note that if you are using XHTML with inline JavaScript, then you need to enclose your code in a CDATA block, or some of this will not work. You will run into odd, subtle errors.
DOM Text Node
The "proper" way to escape text is to use the DOM function
document.createTextNode
. This doesn't actually escape the text; it just tells the browser to create a text element, which is inherently unparsed. You have to be willing to use the DOM for this method to work, however: that is, you have use methods such asappendChild
, as opposed to theinnerHTML
property and similar. This would fill an element with IDan-element
with text, which would not be parsed as (X)HTML:jQuery DOM Wrapper
jQuery provides a handy wrapper for
createTextNode
namedtext
. It's quite convenient. Here's the same functionality using jQuery:This should work for you: http://blog.nickburwell.com/2011/02/escape-html-tags-in-javascript.html
Security Warning
The function doesn't escape single and double quotes, which if used in the wrong context, may still lead to XSS. For example:
Thanks to buffer for pointing out this case. Snippet taken out of this blog post.
I use the following function that escapes every character with the &#nnn; notation except a-z A-Z 0-9 and space
Example:
Escape('<b>This is some text</b>')
returns
<b>This is some text</b>
The function is code injection attacks proof, unicode proof, pure JavaScript.
This approach is about 50 times slower than the one that creates the DOM text node but still the funcion escapes a one milion (1,000,000) characters string in 100-150 milliseconds.
(Tested on early 2011 MacBook Pro - Safari 9 - Mavericks)
I ended up doing this:
Try this htmlentities for javascript
I like @limc's answer for situations where the HTML DOM document is available.
I like @Michele Bosi's and @Paolo's answers for non HTML DOM document environment such as Node.js.
@Michael Bosi's answer can be optimized by removing the need to call replace 4 times with a single invocation of replace combined with a clever replacer function:
@Paolo's range test can be optimized with a well chosen regex and the for loop can be eliminated by using a replacer function:
As @Paolo indicated, this strategy will work for more scenarios.