Print unicode character in java

2019-07-18 12:44发布

Displaying unicode character in java shows "?" sign. For example, i tried to print "अ". Its unicode Number is U+0905 and html representation is "अ". The below codes prints "?" instead of unicode character.

char aa = '\u0905';
String myString = aa + " result" ;
System.out.println(myString); // displays "? result"

Is there a way to display unicode character directly from unicode itself without using unicode numbers? i.e "अ" is saved in file now display the file in jsp.

3条回答
疯言疯语
2楼-- · 2019-07-18 13:05

Java defines two types of streams, byte and character.

The main reason why System.out.println() can't show Unicode characters is that System.out.println() is a byte stream that deal with only the low-order eight bits of character which is 16-bits.

In order to deal with Unicode characters(16-bit Unicode character), you have to use character based stream i.e. PrintWriter.

PrintWriter supports the print( ) and println( ) methods. Thus, you can use these methods in the same way as you used them with System.out.

PrintWriter printWriter = new PrintWriter(System.out,true);
char aa = '\u0905';
printWriter.println("aa = " + aa);
查看更多
祖国的老花朵
3楼-- · 2019-07-18 13:11

try to use utf8 character set -

        Charset utf8 = Charset.forName("UTF-8");
        Charset def = Charset.defaultCharset();

        String charToPrint = "u0905";

        byte[] bytes = charToPrint.getBytes("UTF-8");
        String message = new String(bytes , def.name());

        PrintStream printStream = new PrintStream(System.out, true, utf8.name());
        printStream.println(message); // should print your character
查看更多
Fickle 薄情
4楼-- · 2019-07-18 13:20

Your myString variable contains the perfectly correct value. The problem must be the output from System.out.println(myString) which has to send some bytes to some output to show the glyphs that you want to see.

System.out is a PrintStream using the "platform default encoding" to convert characters to byte sequences - maybe your platform doesn't support that character. E.g. on my Windows 7 computer in Germany, the default encoding is CP1252, and there's no byte sequence in this encoding that corresponds to your character.

Or maybe the encoding is correct, but simply the font that creates graphical glyphs from characters doesn't have that charater.

If you are sending your output to a Windows CMD.EXE window, then maybe both reasons apply.

But be assured, your string is correct, and if you send it to a destination that can handle it (e.g. a Swing JTextField), it'll show up correctly.

查看更多
登录 后发表回答