Re: [pylucene-dev] language dependent analyzer

Andi Vajda Mon, 10 Dec 2007 10:36:36 -0800


On Mon, 10 Dec 2007, Helmut Jarausch wrote:

sorry, but I need some more help.

I'm trying to index our libarary. Each book entry contains the table of
contents (TOC). 'Analyzing' this should be dependent on the language the
book is written in.
So, I need a customized Analyzer (probably using
PerFieldAnalyzerWrapper)
which 'analyzes' the TOC dependent on the (recorded) language of the
book.

Is there an example of an customized analyzer whose action depends
on the data currently being indexed?

For general Lucene usage questions such as this one, you are encouraged tocontact the Lucene user list at [EMAIL PROTECTED] The solutionyou'd find there is directly applicable to PyLucene. This list is aboutPython-specific or PyLucene-specific issues.The Lucene user list gets a lot more traffic and you're more likely to findan answer there.

As for language specific analyzers, PyLucene supports all the Java Luceneanalyzers [1] and the snowball package [2] by including them into the buildby default. It is trivial to add more such packages to jcc-PyLucene sinceall the wrappers are machine-generated. See its Makefile [3] for an exampleof how the current set of Lucene .jar files is wrapped.


Andi..

[1] 
http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/analyzers/src/java/org/apache/lucene/analysis/
[2] http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/snowball/
[3] http://svn.osafoundation.org/pylucene/trunk/jcc/Makefile
_______________________________________________
pylucene-dev mailing list
[email protected]
http://lists.osafoundation.org/mailman/listinfo/pylucene-dev

Re: [pylucene-dev] language dependent analyzer

Reply via email to