http://bugs.koha-community.org/bugzilla3/show_bug.cgi?id=10729
--- Comment #29 from Zeno Tajoli <[email protected]> --- (In reply to Pongtawat from comment #26) > I did some test by starting with phrases-icu.xml and change it towards > words-icu.xml. Here is what I got: > > 1. Adding the first two transliterate rules in words-icu.xml to > phrases-icu.xml. > --> Stack Smashed. > > 2. Changing this line: > > <transform rule="[:Punctuation:] Remove"/> > > to > > <transform rule="[[:WhiteSpace:][:Punctuation:]] Remove"/> > > Now it index fine. > > So, if I just add the WhiteSpace remove rule to phrases-icu.xml, what effect > it will cause? Hi, in fact the conf of phrases-icu.xml is still very similar of words-icu.xml. The conf is tested well on a Latin enviroment, with latin special char (è, à, Ł, Ń, etc.). So in a Thai enviroment, like you, probably is better to use a different configuration. An help on setup is here: http://www.indexdata.com/yaz/doc/yaz-icu.html In my opinion is good to have two files, one for words and one for phrase, but many (particulary outsite Europe/America) need to personalize them. -- You are receiving this mail because: You are watching all bug changes. _______________________________________________ Koha-bugs mailing list [email protected] http://lists.koha-community.org/cgi-bin/mailman/listinfo/koha-bugs website : http://www.koha-community.org/ git : http://git.koha-community.org/ bugs : http://bugs.koha-community.org/
