How to convert ANSEL text to UTF-8

2020-07-22 18:45发布

How to change text encoded in ANSEL to UTF-8 in C#?

标签: c# encoding
3条回答
爷、活的狠高调
2楼-- · 2020-07-22 18:58

There is an AnselEncoding class (deriving from the standard Encoding class) available as part of Gedcom.NET at http://sourceforge.net/projects/gedcom-net/ that also includes support for some of the LDS extensions.

查看更多
Luminary・发光体
3楼-- · 2020-07-22 19:10

This is a non-trivial conversion as Windows/.NET Framework does not have an ANSEL codepage. See here for the travails of another person attempting this conversion.

查看更多
虎瘦雄心在
4楼-- · 2020-07-22 19:13

Joshperry is correct. Eichmann's site has basically the ONLY documentation around that attempts to explain ANSEL encoding. Unfortunately there is no program code there, so you'll have to code it yourself.

There is another code table (dated Dec 2007 - I didn't know anyone was still interested) for ANSEL at: http://lcweb2.loc.gov/diglib/codetables/45.html which is worthwhile comparing Eichmann's work to, since he did admit there were a few codes he's unsure of.

If you are interested from a genealogy point of view, the GEDCOM standard release 5.5 has an abbreviated ANSEL character set and some encodings in its Appendix D.


Update Sept 2017: Long after my original answer of this question, Tamura Jones wrote an excellent series of articles about ANSEL that are worth checking out:

查看更多
登录 后发表回答