Hi All,
I am trying to setup Nutch 1.10 in Eclipse and got partially succeeded in the
same. I am able to do operations: Injector, Generator etc. individually using
the following code snippet:
String crawlArg = "demo/ urls";
// Run Crawl tool
try {
ToolRunner.run(NutchConfiguration.create(), new
Injector(), tokenize(crawlArg));
} catch (Exception e) {
e.printStackTrace();
}
Can someone help me in merging the individual steps to a crawl script as showin
below. I am unable to find Crawl class inside nutch.crawl package in Nutch
1.10. This class is not available from Nutch 1.8.
String<http://www.google.com/search?hl=en&q=allinurl%3Astring+java.sun.com&btnI=I%27m%20Feeling%20Lucky>
crawlArg = "urls -dir crawl -threads 5 -depth 3 -topN 20";
// Run Crawl tool
try {
ToolRunner.run(NutchConfiguration.create(), new Crawl(),
tokenize(crawlArg));
} catch
(Exception<http://www.google.com/search?hl=en&q=allinurl%3Aexception+java.sun.com&btnI=I%27m%20Feeling%20Lucky>
e) {
e.printStackTrace();
return;
}
Thanks,
Murali