On Mon, 10 Dec 2007, Helmut Jarausch wrote:

sorry, but I need some more help.

I'm trying to index our libarary. Each book entry contains the table of
contents (TOC). 'Analyzing' this should be dependent on the language the
book is written in.
So, I need a customized Analyzer (probably using
PerFieldAnalyzerWrapper)
which 'analyzes' the TOC dependent on the (recorded) language of the
book.

Is there an example of an customized analyzer whose action depends
on the data currently being indexed?

For general Lucene usage questions such as this one, you are encouraged to contact the Lucene user list at [EMAIL PROTECTED] The solution you'd find there is directly applicable to PyLucene. This list is about Python-specific or PyLucene-specific issues. The Lucene user list gets a lot more traffic and you're more likely to find an answer there.

As for language specific analyzers, PyLucene supports all the Java Lucene analyzers [1] and the snowball package [2] by including them into the build by default. It is trivial to add more such packages to jcc-PyLucene since all the wrappers are machine-generated. See its Makefile [3] for an example of how the current set of Lucene .jar files is wrapped.

Andi..

[1] 
http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/analyzers/src/java/org/apache/lucene/analysis/
[2] http://svn.apache.org/repos/asf/lucene/java/trunk/contrib/snowball/
[3] http://svn.osafoundation.org/pylucene/trunk/jcc/Makefile
_______________________________________________
pylucene-dev mailing list
[email protected]
http://lists.osafoundation.org/mailman/listinfo/pylucene-dev

Reply via email to