Re: Spellchecking in the Chinese Language

2011-04-12 Thread Luke Lu
It doesn't make sense to spell-check individual character-sized words, but it makes a lot of sense for phrases. Because pinyin input methods (IMs) are so pervasive, it's very easy to write phrases that are semantically wrong but "sound" correct. An n-gram approach should work as long as it doesn't mangle the characters. On T
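
For illustration only, here is a minimal sketch of what character n-grams that don't mangle the characters could look like. The function name char_ngrams is hypothetical and is not part of Lucene or Solr; the sketch assumes Python 3, where a str is indexed by Unicode code point, so slicing never cuts a character in half.

```python
def char_ngrams(text, n=2):
    """Return overlapping n-grams of Unicode code points (hypothetical helper)."""
    # Drop whitespace so n-grams never straddle a space or a line break.
    chars = [c for c in text if not c.isspace()]
    # Python 3 strings are sequences of code points, so each slice holds
    # whole characters rather than fragments of a multi-byte encoding.
    return ["".join(chars[i:i + n]) for i in range(len(chars) - n + 1)]


if __name__ == "__main__":
    print(char_ngrams("中文拼写检查"))
    # prints ['中文', '文拼', '拼写', '写检', '检查']
```

A phrase-level checker could then compare these bigrams against ones seen in a trusted corpus, though how to score or correct them is outside what this sketch covers.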

Re: CJK Analyzers for Solr

2007-11-27 Thread Luke Lu
Not sure how up to date this is: http://www.basistech.com/customers/ I've only used their C++ products, which generally worked well for web search with a few exceptions. According to http://www.basistech.com/knowledge-center/chinese/chinese-language-analysis.pdf , they provide Java APIs as