that's right, carrot2 just used to clustering documents, but not to classify them.
On 2/5/07, Stanislaw Osinski <[EMAIL PROTECTED]> wrote:
Hi, Carrot2 performs document clustering, which, as opposed to document classification, is an unsupervised technique (no predefined categories). Therefore, Carrot2 doesn't seem suitable for this particular problem. Thanks, Stanislaw On 2/4/07, kauu <[EMAIL PROTECTED]> wrote: > > hi chee wu : > the easiest way is to realize ur goal i think. but the carrot2 's > performance is not very good. and the another important thing is that u > should input the data with as little spam as possible , or u will get > useless result. > > On 2/4/07, chee wu <[EMAIL PROTECTED]> wrote: > > > > Hi, > > I am trying to divide all the web pages crawled to predefined > > categories,does anybody have successfully fulfilled classification > based > > on Nutch? I did find some threads talking about this,but none of them > are > > clear enough. Below are some possible solutions mentioned in the past > > threads : > > 1. Using SVM-Light, but it seems a C based program ? > > 2. Can I fulfill this based on Carrot2? > > 3. Other open source software packages like Rainbow or IBM UIMA ? > > I want to do a deeper research on the three options above,which one > should > > I study first? Any other hints or experiences also are welcome! > > > > Thanks > > -Chee > > > > > > -- > www.babatu.com > > -- Stanislaw Osinski, [EMAIL PROTECTED] http://www.carrot-search.com
-- www.babatu.com
