Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "NutchTutorial" page has been changed by SebastianNagel:
https://wiki.apache.org/nutch/NutchTutorial?action=diff&rev1=88&rev2=89

Comment:
Add core name (default "nutch") to solr server URL: 
http://localhost:8983/solr/nutch

        Seed Dir        Directory in which to look for a seeds file
        Crawl Dir       Directory where the crawl/link/segments dirs are saved
        Num Rounds      The number of rounds to run this crawl for
-      Example: bin/crawl -i -D solr.server.url=http://localhost:8983/solr/ urls/ TestCrawl/  2
+      Example: bin/crawl -i -D solr.server.url=http://localhost:8983/solr/nutch urls/ TestCrawl/  2
  }}}
  The crawl script sets a lot of parameters, and you can modify them to suit your needs. It is best to understand the parameters before setting up big crawls.
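
As a rough illustration of the change above, the following sketch assembles the Solr URL from a base URL and a core name and prints the resulting crawl command rather than running it. The variable names (SOLR_BASE, SOLR_CORE, SEED_DIR, CRAWL_DIR, NUM_ROUNDS, CRAWL_CMD) are hypothetical and not part of Nutch; only the shape of the final command is taken from the example in the diff.

```shell
#!/bin/sh
# Hypothetical helper variables; only the final command shape comes from the tutorial example.
SOLR_BASE="http://localhost:8983/solr"   # Solr base URL used in the example
SOLR_CORE="nutch"                        # core name (default "nutch" per the change comment)
SEED_DIR="urls/"                         # directory in which to look for a seeds file
CRAWL_DIR="TestCrawl/"                   # directory where crawl/link/segments dirs are saved
NUM_ROUNDS=2                             # number of rounds to run this crawl for

# Join base URL and core name, dropping a trailing slash from the base if present.
SOLR_URL="${SOLR_BASE%/}/${SOLR_CORE}"

# Build and print the command so it can be reviewed before it is actually run.
CRAWL_CMD="bin/crawl -i -D solr.server.url=${SOLR_URL} ${SEED_DIR} ${CRAWL_DIR} ${NUM_ROUNDS}"
echo "${CRAWL_CMD}"
```

Printing the command first makes it easy to confirm that the URL ends in the core name before starting a crawl.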
  
