Use TextIndexNG...it is better suited for such purposes.
-aj
--On 14. Juni 2005 16:54:19 +0200 Yuri [EMAIL PROTECTED] wrote:
How could I can tell the Splitter of ZCText intedex to not split words as
aaabbb in aaa and bbb?
I would like to tell zope that , and so on are alphanumeric
letters... In Splitter.c I have:
class Splitter:
import re
rx = re.compile(r(?L)\w+)
?L match as the locale, but I have multilingual latin-1 contents... \w
would match only [a..z,A..Z]!
TIA
P.S. I've written a small Class for the ZCTextindex pipeline that
convert all the accented characters in non accented ones, so I can index
perch as perche. It would work only if I can solve this splitter
problem...
___
Zope maillist - Zope@zope.org
http://mail.zope.org/mailman/listinfo/zope
** No cross posts or HTML encoding! **
(Related lists - http://mail.zope.org/mailman/listinfo/zope-announce
http://mail.zope.org/mailman/listinfo/zope-dev )
pgpP8uWtBRMgS.pgp
Description: PGP signature
___
Zope maillist - Zope@zope.org
http://mail.zope.org/mailman/listinfo/zope
** No cross posts or HTML encoding! **
(Related lists -
http://mail.zope.org/mailman/listinfo/zope-announce
http://mail.zope.org/mailman/listinfo/zope-dev )