How to change text encoded in ANSEL to UTF-8 in C#?
相关问题
- Sorting 3 numbers without branching [closed]
- Graphics.DrawImage() - Throws out of memory except
- Why am I getting UnauthorizedAccessException on th
- 求获取指定qq 资料的方法
- How to know full paths to DLL's from .csproj f
There is an AnselEncoding class (deriving from the standard Encoding class) available as part of Gedcom.NET at http://sourceforge.net/projects/gedcom-net/ that also includes support for some of the LDS extensions.
This is a non-trivial conversion as Windows/.NET Framework does not have an ANSEL codepage. See here for the travails of another person attempting this conversion.
Joshperry is correct. Eichmann's site has basically the ONLY documentation around that attempts to explain ANSEL encoding. Unfortunately there is no program code there, so you'll have to code it yourself.
There is another code table (dated Dec 2007 - I didn't know anyone was still interested) for ANSEL at: http://lcweb2.loc.gov/diglib/codetables/45.html which is worthwhile comparing Eichmann's work to, since he did admit there were a few codes he's unsure of.
If you are interested from a genealogy point of view, the GEDCOM standard release 5.5 has an abbreviated ANSEL character set and some encodings in its Appendix D.
Update Sept 2017: Long after my original answer of this question, Tamura Jones wrote an excellent series of articles about ANSEL that are worth checking out: