Jérôme Charron wrote:
Hi,

I would like to add some language specific analysis plugins. In this first approach, each plugin would be simply a wrapper of the lucene's analyzers. So each analysis-<lang> plugin need to import lucene-analyzers-1.9-rc1-dev.jar in its lib directory. In order to avoid adding this jar in many plugins, I would like to add the lucene-analyzers-1.9-rc1-dev.jar in the nutch core lib.
Any comments? Any objection?

I'm wondering if you could implement this plugin as a more or less automatic wrapper around any Lucene classes that implement Analyzer, i.e. so that it doesn't require recompiling to change/select the language, or add a non-standard analyzer from the classpath. I think it's possible to do this, but you would have to code a special-case for Snowball analyzers, where the default constructor requires an argument. All of this could be read from the plugin.xml or n utch-default.xml files.


--
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com

Reply via email to