On Wed, Feb 3, 2010 at 11:15 PM, Daniel Cheng <j16sdiz+freenet at gmail.com> wrote:
> On Tue, Feb 2, 2010 at 10:41 PM, Matthew Toseland
> <toad at amphibian.dyndns.org> wrote:
>> Is there any chance you could implement or at least test Chinese searching?
>> As far as I can see all we need to do is:
>> - Decode surrogates
>> - Check whether the character or surrogate is in the relevant range
>> - Treat a character (which might be two UTF16 char's) as a word
>
> I have some (out of date) Chinese-enabled xmlspider code around,
> which I would try to bring up-to-date and commit this week or next week.

I have committed some code to xmlspider that should let it create Chinese
indexes. Please test whether anything breaks. Thanks.

>
> I will try to port that code to library as well,
> but no promises -- library has phrase search/boolean operators that may
> need a new tokenizer.
>
>>
>> Thanks!
>>
>> I do have a possible tester if you don't have time, he put me onto the
>> urgency of this for widespread use in China...
>>
>> https://bugs.freenetproject.org/view.php?id=3837
>>
>
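
For anyone testing, the three steps quoted above (decode surrogates, check the
range, treat each character as a word) can be sketched roughly as below. This is
not the code committed to xmlspider, only a minimal illustration. The class name
CjkTokenizerSketch and the helpers isCjk, tokenize and flush are hypothetical,
and it assumes "the relevant range" means the Unicode Han script; the real
tokenizer may use different ranges and different rules for non-Chinese text.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not the xmlspider implementation.
final class CjkTokenizerSketch {

    // Assumption: "the relevant range" is taken to be the Han script.
    static boolean isCjk(int codePoint) {
        return Character.UnicodeScript.of(codePoint) == Character.UnicodeScript.HAN;
    }

    static List<String> tokenize(String text) {
        List<String> words = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        int i = 0;
        while (i < text.length()) {
            // codePointAt combines a valid surrogate pair into one code point.
            int cp = text.codePointAt(i);
            i += Character.charCount(cp); // advances by 1 or 2 UTF-16 chars

            if (isCjk(cp)) {
                // Each Han character is a word of its own.
                flush(current, words);
                words.add(new String(Character.toChars(cp)));
            } else if (Character.isLetterOrDigit(cp)) {
                // Other letters and digits accumulate into one lowercased word.
                current.appendCodePoint(Character.toLowerCase(cp));
            } else {
                // Whitespace and punctuation end the current word.
                flush(current, words);
            }
        }
        flush(current, words);
        return words;
    }

    // Emits the pending non-CJK word, if any, and resets the buffer.
    private static void flush(StringBuilder current, List<String> words) {
        if (current.length() > 0) {
            words.add(current.toString());
            current.setLength(0);
        }
    }
}
```

With this sketch, tokenize("Freenet 中文 test") gives the words "freenet", "中",
"文" and "test". A character outside the BMP such as U+20000 comes out as a
single word made of two UTF-16 chars.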
