[CODE4LIB] Pinyin (Romanized Chinese) searching

2009-10-29 Thread Bess Sadler
By sending this here I hope I'm going to hit everyone on the blacklight, vufind, and solrmarc mailing lists, and maybe some other interested parties. Our East Asian Languages Librarian has approached me with a problem he wants to see solved. According to him, the typical North American

Re: [CODE4LIB] Pinyin (Romanized Chinese) searching

2009-10-29 Thread Demian Katz
What's the right place for such a piece of code? SolrMarc seems the obvious place to me. As it has been described to me so far, this doesn't seem like an issue affecting people outside the library realm, which makes it seem too niche and community-specific to get it built into the Lucene

Re: [CODE4LIB] Pinyin (Romanized Chinese) searching

2009-10-29 Thread Till Kinstler
Bess Sadler wrote: So, thoughts? Anyone know more about this than I do and want to speak up? I'd second Demian's and Jonathan's statements: Do that in Solr by using a Filter (either at indexing or search time). You want to solve that using an algorithm that translates American

Re: [CODE4LIB] Pinyin (Romanized Chinese) searching

2009-10-29 Thread Ya'aqov Ziso
Till, Bess, Ralph, assuming the algorithm for enriching the spellings of a word (from PinYin to Chinese) exists, will the result include both forms, PinYin AND non-PinYin Chinese transliteration, and will BOTH forms be indexed? The principle of indexing different forms for spelling a certain
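
For illustration only, here is a rough Python sketch of the "index both forms" idea raised in this thread. It is not Solr or SolrMarc code, and the thread does not specify the actual conversion algorithm. The VARIANTS table and the function names expand, build_index, and search are all hypothetical, and the Beijing/Peking pair is just a stand-in for a PinYin and a non-PinYin spelling of the same word.

```python
from collections import defaultdict

# Hypothetical variant table: each spelling maps to the other spellings that
# should match it. In practice this would come from a transliteration
# algorithm or lookup table, which the thread does not specify.
VARIANTS = {
    "beijing": ["peking"],
    "peking": ["beijing"],
}


def expand(token):
    """Return the lowercased token plus any known alternative spellings."""
    token = token.lower()
    return [token] + VARIANTS.get(token, [])


def build_index(docs):
    """Build an inverted index that stores every spelling variant of each token."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in text.split():
            for form in expand(token):
                index[form].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every query token."""
    results = [index.get(tok.lower(), set()) for tok in query.split()]
    return set.intersection(*results) if results else set()
```

Because the variants are written into the index, a query in either spelling finds the same records without any query-time rewriting. The trade-off, under these assumptions, is a larger index; expanding at search time instead would keep the index small but require the same table on the query side.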