Hi,

you both are right. If you edit the script you can see the call to the class
org.apache.nutch.crawl.Crawl, and if you edit this class you can see its
behavior:

...
   for (int i = 0; i < args.length; i++) {
      if ("-dir".equals(args[i])) {
        dir = new Path(args[i+1]);
        i++;
      } else if ("-threads".equals(args[i])) {
        threads = Integer.parseInt(args[i+1]);
        i++;
      }...

but I wonder if there is any documentation about doing the whole process by
your own, like this example:

http://code.google.com/p/daicaheb/source/browse/trunk/nutch-trunk-bb/src/test/org/apache/nutch/fetcher/TestFetcher.java?spec=svn77&r=77
http://code.google.com/p/daicaheb/source/browse/trunk/nutch-trunk-bb/src/test/org/apache/nutch/fetcher/TestFetcher.java?spec=svn77&r=77
 




-- 
View this message in context: 
http://lucene.472066.n3.nabble.com/nutch-crawling-with-java-not-shellscript-tp617212p1962188.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to