Hi,
you both are right. If you edit the script you can see the call to the class
org.apache.nutch.crawl.Crawl, and if you edit this class you can see its
behavior:
...
for (int i = 0; i < args.length; i++) {
if ("-dir".equals(args[i])) {
dir = new Path(args[i+1]);
i++;
} else if ("-threads".equals(args[i])) {
threads = Integer.parseInt(args[i+1]);
i++;
}...
but I wonder if there is any documentation about doing the whole process by
your own, like this example:
http://code.google.com/p/daicaheb/source/browse/trunk/nutch-trunk-bb/src/test/org/apache/nutch/fetcher/TestFetcher.java?spec=svn77&r=77
http://code.google.com/p/daicaheb/source/browse/trunk/nutch-trunk-bb/src/test/org/apache/nutch/fetcher/TestFetcher.java?spec=svn77&r=77
--
View this message in context:
http://lucene.472066.n3.nabble.com/nutch-crawling-with-java-not-shellscript-tp617212p1962188.html
Sent from the Nutch - User mailing list archive at Nabble.com.