We wrote a custom Nutch parse plugin that uses a Mahout classifier to classify docs.
Mathijs Homminga On Jul 1, 2012, at 21:02, Alexander Aristov <[email protected]> wrote: > People > > can you give me some advises? > > I want to integrate nutch and mahout to classify crawled pages. > > 1st question: Has someone tried this and are there any libraries available? > > next: What is better/easier? Improve nutch and inject mahout classifier into > the project OR improve mahout to add an ability to read and write nutch files? > > Best Regards > Alexander Aristov

