Jérôme Charron wrote:
Hi,
I would like to add some language specific analysis plugins. In this first
approach, each plugin would be simply a wrapper of the lucene's analyzers.
So each analysis-<lang> plugin need to import
lucene-analyzers-1.9-rc1-dev.jar in its lib directory. In order to avoid
adding this jar in many plugins,
I would like to add the lucene-analyzers-1.9-rc1-dev.jar in the nutch core
lib.
Any comments? Any objection?
I'm wondering if you could implement this plugin as a more or less
automatic wrapper around any Lucene classes that implement Analyzer,
i.e. so that it doesn't require recompiling to change/select the
language, or add a non-standard analyzer from the classpath. I think
it's possible to do this, but you would have to code a special-case for
Snowball analyzers, where the default constructor requires an argument.
All of this could be read from the plugin.xml or n utch-default.xml files.
--
Best regards,
Andrzej Bialecki <><
___. ___ ___ ___ _ _ __________________________________
[__ || __|__/|__||\/| Information Retrieval, Semantic Web
___|||__|| \| || | Embedded Unix, System Integration
http://www.sigram.com Contact: info at sigram dot com