In the original design of ZCTextIndex we (PythonLabs mostly) considered 
stemming and found that it has been found to have dubious value in many 
information theorists views (The fact that Google does no stemming was also a 
factor in the decision). So we decided to leave it out entirely.

ZCTextIndex is extensible and third parties can add additional text processing 
facilites (called pipeline elements) to the system without modifying 
ZCTextIndex. This could be a way to add stemming and any other conceivable 
feature involving preprocessing the index source and query text.

Granted that feature could use better(!) documentation... (I should just add 
that to my email sig ;^)


On Friday 08 November 2002 08:19 am, Jens Vagelpohl wrote:
> > Depends on your needs. ZCTextIndex is very easy to use and supports 
> > relevance
> > ranking, TextIndexNG is supposed to be some kind of 
> > eier-legende-wollmilch-sau.
> > Compare the features and make your choice.
> >
> > -aj
> >
> isn't TextIndexNG much better with international character encodings 
> and that stuff? and it has a lot more stemmers for various languages.
> jens

Zope-Dev maillist  -  [EMAIL PROTECTED]
**  No cross posts or HTML encoding!  **
(Related lists - )

Reply via email to