Hi Uygar. The version of Nutch you're using was compiled with a version of
Carrot2 that did not have Turkish stopwords and language resources, hence the
warning and behavior you described.
There has been an upgrade of Carrot2 libraries with Nutch --
https://issues.apache.org/jira/browse/NUTCH-544
This patch should be applicable to Nutch 0.9 (perhaps with some minor code
changes).
Dawid
Uygar BAYAR wrote:
hi
we use nutch 0.9 with carrot 2.1. When we search turkish words (we also
enable default lang "tr" in nutch.site.xml) in nutch We get below errors..
We get results but stop words don't work ..
OnlineClustererFactory - Using the first clustering extension found:
Carrot2-Lingo
2007-10-30 17:37:12,985 INFO Clusterer - Default language: tr
2007-10-30 17:37:12,985 INFO Clusterer - Enabled languages: [en, nl, da,
fi, fr, de, it, no, pl, pt, ru, es, sv, tr, ro, hu]
2007-10-30 17:37:12,985 WARN Clusterer - Language not supported in Carrot2:
pl
2007-10-30 17:37:12,986 WARN Clusterer - Language not supported in Carrot2:
tr
2007-10-30 17:37:12,986 WARN Clusterer - Language not supported in Carrot2:
ro
2007-10-30 17:37:12,986 WARN Clusterer - Language not supported in Carrot2:
hu