Sorry for belated reply, Doğacan. We have Turkish support in Carrot2 -- this includes the stemmer available in Snowball (don't know what it is exactly) and a set of stopwords I compiled from various Web resources.

Dawid

Doğacan Güney wrote:
Hi Dawid,

On 10/31/07, Dawid Weiss <[EMAIL PROTECTED]> wrote:
Hi Uygar. The version of Nutch you're using was compiled with a version of
Carrot2 that did not have Turkish stopwords and language resources, hence the
warning and behavior you described.

There has been an upgrade of Carrot2 libraries with Nutch --

https://issues.apache.org/jira/browse/NUTCH-544

This patch should be applicable to Nutch 0.9 (perhaps with some minor code
changes).

So current Carrot2 supports Turkish? That's good to hear. Does it just
include a stop word list or does it include a stemmer too (zemberek,
perhaps?)?

Dawid


Uygar BAYAR wrote:
hi
we use nutch 0.9 with carrot 2.1. When we search turkish words (we also
enable default lang "tr" in nutch.site.xml) in nutch We get below errors..
We get results but stop words don't work ..

 OnlineClustererFactory - Using the first clustering extension found:
Carrot2-Lingo
2007-10-30 17:37:12,985 INFO  Clusterer - Default language: tr
2007-10-30 17:37:12,985 INFO  Clusterer - Enabled languages: [en, nl, da,
fi, fr, de, it, no, pl, pt, ru, es, sv, tr, ro, hu]
2007-10-30 17:37:12,985 WARN  Clusterer - Language not supported in Carrot2:
pl
2007-10-30 17:37:12,986 WARN  Clusterer - Language not supported in Carrot2:
tr
2007-10-30 17:37:12,986 WARN  Clusterer - Language not supported in Carrot2:
ro
2007-10-30 17:37:12,986 WARN  Clusterer - Language not supported in Carrot2:
hu



Reply via email to