Re: [Zope-dev] What catalog/index to use ...
In the original design of ZCTextIndex we (PythonLabs mostly) considered stemming and found that it has been found to have dubious value in many information theorists views (The fact that Google does no stemming was also a factor in the decision). So we decided to leave it out entirely. ZCTextIndex is extensible and third parties can add additional text processing facilites (called pipeline elements) to the system without modifying ZCTextIndex. This could be a way to add stemming and any other conceivable feature involving preprocessing the index source and query text. Granted that feature could use better(!) documentation... (I should just add that to my email sig ;^) -Casey On Friday 08 November 2002 08:19 am, Jens Vagelpohl wrote: > > Depends on your needs. ZCTextIndex is very easy to use and supports > > relevance > > ranking, TextIndexNG is supposed to be some kind of > > eier-legende-wollmilch-sau. > > Compare the features and make your choice. > > > > -aj > > > > isn't TextIndexNG much better with international character encodings > and that stuff? and it has a lot more stemmers for various languages. > > jens ___ Zope-Dev maillist - [EMAIL PROTECTED] http://lists.zope.org/mailman/listinfo/zope-dev ** No cross posts or HTML encoding! ** (Related lists - http://lists.zope.org/mailman/listinfo/zope-announce http://lists.zope.org/mailman/listinfo/zope )
Re: [Zope-dev] What catalog/index to use ...
ZCTextIndex is to become the full replacement for old TextIndex. There are a couple of outstanding patches for making the ZCTextIndex splitter, etc. locale friendly. Whether those solve your problem I don't know. We are happy to improve ZCTextIndex for international use, however we at Zope Corp are not authorities in the matter, so we will require some help from those who are. If you can create a collector issue that illustrates the problems you have experienced (perhaps posting some sample content), that we be a great start. -Casey On Thursday 07 November 2002 07:44 pm, Joachim Werner wrote: > Hi! > > Currently there are at least three options for doing full text indexing with > ZCatalog: > > - good old TextIndex > - ZCTextIndex > - TextIndexNG > > TextIndex basically works fine for me and handles German umlauts well (if > you use the right locale settings in the Zope start skript), but ZCTextIndex > is generally better, except that it does not handle umlauts correctly as far > as I can see. So without a bug fix ZCTextIndex is good for US, but not for > us ;-) > > Then there is Andreas Jung's TextIndexNG, which seems to be really > impressive. > > What are the plans for Zope 2.6.x/2.7? Will ZCTextIndex be replaced by > TextIndexNG? > > Does it make sense to get ZCTextIndex fixed (there seems to be a patch in > the collector already) or should I go with TextIndexNG? If yes, is it ready > for production environments? > > Cheers > > Joachim > > _ > > Joachim Werner > > iuveno AG > Wittelsbacherstraße 23b > 90475 Nürnberg > > Tel. +49 (0) 911 / 988398-4 > Fax +49 (0) 911 / 988398-5 > > Mail: [EMAIL PROTECTED] > WWW: http://www.iuveno.de > > > > > ___ > Zope-Dev maillist - [EMAIL PROTECTED] > http://lists.zope.org/mailman/listinfo/zope-dev > ** No cross posts or HTML encoding! ** > (Related lists - > http://lists.zope.org/mailman/listinfo/zope-announce > http://lists.zope.org/mailman/listinfo/zope ) > ___ Zope-Dev maillist - [EMAIL PROTECTED] http://lists.zope.org/mailman/listinfo/zope-dev ** No cross posts or HTML encoding! ** (Related lists - http://lists.zope.org/mailman/listinfo/zope-announce http://lists.zope.org/mailman/listinfo/zope )
Re: [Zope-dev] What catalog/index to use ...
Depends on your needs. ZCTextIndex is very easy to use and supports relevance ranking, TextIndexNG is supposed to be some kind of eier-legende-wollmilch-sau. Compare the features and make your choice. -aj isn't TextIndexNG much better with international character encodings and that stuff? and it has a lot more stemmers for various languages. jens ___ Zope-Dev maillist - [EMAIL PROTECTED] http://lists.zope.org/mailman/listinfo/zope-dev ** No cross posts or HTML encoding! ** (Related lists - http://lists.zope.org/mailman/listinfo/zope-announce http://lists.zope.org/mailman/listinfo/zope )
Re: [Zope-dev] What catalog/index to use ...
--On Freitag, 8. November 2002 01:44 +0100 Joachim Werner <[EMAIL PROTECTED]> wrote: What are the plans for Zope 2.6.x/2.7? Will ZCTextIndex be replaced by TextIndexNG? No, they will coexist. ZCTextIndex is maintained by Zope Corp, TextIndexNG is maintained by myself. Does it make sense to get ZCTextIndex fixed (there seems to be a patch in the collector already) or should I go with TextIndexNG? If yes, is it ready for production environments? Depends on your needs. ZCTextIndex is very easy to use and supports relevance ranking, TextIndexNG is supposed to be some kind of eier-legende-wollmilch-sau. Compare the features and make your choice. -aj ___ Zope-Dev maillist - [EMAIL PROTECTED] http://lists.zope.org/mailman/listinfo/zope-dev ** No cross posts or HTML encoding! ** (Related lists - http://lists.zope.org/mailman/listinfo/zope-announce http://lists.zope.org/mailman/listinfo/zope )