Convert unicode representations on incoming string

2019-07-19 01:16发布

问题:

I'm reading some data that has already been converted to html style υ code.

I now need to convert this back to UTF-8 characters for viewing. Unfortunately I can't use a browser to view the string.

I've read around about conversion in java and it seems if you have a string of \uxxxx then the compiler will convert for you; However that wont work of course because I want to read in dynamic strings.

So can this be done?

Many thanks! Dan

回答1:

You need to use:

String StringEscapeUtils.unescapeJava(String str)

from the Apache Commons Library.

It will find \uxxxx sequences in the input string and convert them to a normal Java String.



回答2:

native2ascii

Use the "-reverse" option.



回答3:

converted to html style υ code.

Which of those? \uXXXX is a Java convention and has nothing to do with HTML. HTML entities look like ¯



标签: java utf-8