Re: Errors in Unihan data : simplified/traditional variants

2010-11-01 Thread John H. Jenkins
On 2010/10/30, at 下午8:42, Koxinga wrote: My quickly done parsing program counted 1154 such pairs, where the head character was the same as the character above. It seems to be always in the order kTraditionalVariant then kSimplifiedVariant, so can maybe be automatically corrected. It seems

Errors in Unihan data : simplified/traditional variants

2010-10-31 Thread Koxinga
Hello, I recently looked up the relationships traditional-simplified in the Unihan database (Unihan_Variants.txt). I knew it had mistakes and I wanted to help correct some of them, but the first thing that stand out and surprised me was the large number of lines like : U+346F