I am working on decoding text. I am trying to find the character code for the —
character, not to be mistaken for -
, in ASCII. I have tried unsuccessfully. Does anybody know how to convert it?
相关问题
- UrlEncodeUnicode and browser navigation errors
- How to convert a string to a byte array which is c
- Character Encoding in iframes
- Unicode issue with makemessages --all Django 1.6.2
- Including decimal equivalent of a char in a charac
相关文章
- iconv() Vs. utf8_encode()
- Why is `'↊'.isnumeric()` false?
- How to display unicode in SVG?
- When sending XML to JMS should I use TextMessage o
- Google app engine datastore string encoding proble
- UnicodeEncodeError when saving ImageField containi
- How can i get know that my String contains diacrit
- Why is TextView showing the unicode right arrow (\
This character does not exist in ASCII, but only in Unicode, usually encoded by UTF-8.
In UTF-8, characters are encoded by 2- or 3-byte sequences (or occasionally longer), where none of the two or three bytes is a valid ASCII code, where all of them are outside the ASCII range of 0 through 127.
One suspects that the foregoing only partly answers your question, but if so then this is probably because your question is, inadvertently, only partly asked. For further details, you can extend your question with more specifics.
I use this online tool to do so. Put your character into the first green box and click "Convert" (above the box)
further below you find a box titled U+hex containing your answer:
Feel free to edit this answer if the link breaks or leave a comment so I can find a replacement.
—
is known as an Em Dash. It's character code is\u2014
. It is not an ASCII character, so you cannot decode it with the ASCII character set because it is not in the ASCII character table. You would probably want to use UTF8 instead.Quotation from wiki (Em dash)
Em dash character is not a part of ASCII character set.