It's supposed to be pinyin with numbers used to indicate tone. As noted in the current header to the file, there are known problems with this field which will be fixed in the next release.

On Wednesday, June 4, 2003, at 03:19 PM, Frank Tang wrote:

Anyone know what kind of system used to encode the information in kMandarin inside Unihan.txt
I try to convert that info into BoPoMoFo, anyone have open source perl or js code can do it?
--

signature


<image.tiff>

==========
John H. Jenkins
[EMAIL PROTECTED]
[EMAIL PROTECTED]
http://www.tejat.net/

Reply via email to