> I am wondering Analyzer of nutch in svn trunk is chosen by
> languageidentifer plugin or not? (I knew in nutch 0.7.1-dev it did).
It's not really choosen by the languageidentifier, but coosen regarding the
value of the lang attribute (for now, that's right, only the
languageidentifier add this attribute).
> In org.apache.nutch.indexer.Indexer.class line 104
> writer.addDocument((Document)((ObjectWritable)value).get());
> It should be
> NutchAnalyzer analyzer = AnalyzerFactory.get(doc.get("lang"));
> writer.addDocument((Document)((ObjectWritable)value).get(), analyzer );
> right?
Yes, it should.
Thanks for noticing this.
Merge problem?
(I don't remember to add this in nutch-0.7 ...)
> Once more,query parsing should call AnalyzerFactory?? The query input
> is multi-lingual also.
The query part is not yet implemented.
Jérôme
--
http://motrech.free.fr/
http://www.frutch.org/