In the original design of ZCTextIndex we (PythonLabs mostly) considered
stemming and found that it has been found to have dubious value in many
information theorists views (The fact that Google does no stemming was also a
factor in the decision). So we decided to leave it out entirely.
ZCTextIndex is extensible and third parties can add additional text processing
facilites (called pipeline elements) to the system without modifying
ZCTextIndex. This could be a way to add stemming and any other conceivable
feature involving preprocessing the index source and query text.
Granted that feature could use better(!) documentation... (I should just add
that to my email sig ;^)
On Friday 08 November 2002 08:19 am, Jens Vagelpohl wrote:
> > Depends on your needs. ZCTextIndex is very easy to use and supports
> > relevance
> > ranking, TextIndexNG is supposed to be some kind of
> > eier-legende-wollmilch-sau.
> > Compare the features and make your choice.
> > -aj
> isn't TextIndexNG much better with international character encodings
> and that stuff? and it has a lot more stemmers for various languages.
Zope-Dev maillist - [EMAIL PROTECTED]
** No cross posts or HTML encoding! **
(Related lists -