> rootUrlDir = -topN50

Wrong arguments were provided.

1. The crawler was not told the directory where the URL list is kept (as the log shows, it took "-topN50" as that directory).
2. Use "-topN 50", not "-topN50".

A sample command can be as below:

mnt/div/nutch/startsiden <URLSList> -depth 3 -topN 50
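To see why the log prints "rootUrlDir = -topN50", here is a minimal sketch of how an argument loop of this kind could behave. It is an assumption based only on the log output, not the actual Nutch source: the class name CrawlArgsSketch and the method parse are made up for illustration, and only the flags that appear in this thread (-dir, -depth, -topN) are handled. The idea is that any token that is not exactly a known flag is taken as the URL directory, and the last such token wins.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical, simplified illustration; NOT the real org.apache.nutch.crawl.Crawl.
public class CrawlArgsSketch {

    // Parses the arguments into a map. A known flag consumes the next token
    // as its value; anything else is treated as the URL directory.
    static Map<String, String> parse(String[] args) {
        Map<String, String> opts = new LinkedHashMap<>();
        for (int i = 0; i < args.length; i++) {
            String a = args[i];
            if (a.equals("-dir") || a.equals("-depth") || a.equals("-topN")) {
                if (i + 1 >= args.length) {
                    throw new IllegalArgumentException("missing value for " + a);
                }
                opts.put(a.substring(1), args[++i]);
            } else {
                // "-topN50" is not equal to "-topN", so it ends up here and
                // overwrites whatever URL directory was given earlier.
                opts.put("rootUrlDir", a);
            }
        }
        return opts;
    }

    public static void main(String[] argv) {
        String[] broken = {"startsiden.no", "-dir", "/mnt/div/nutch/startsiden",
                           "-depth", "3", "-topN50"};
        String[] fixed = {"startsiden.no", "-dir", "/mnt/div/nutch/startsiden",
                          "-depth", "3", "-topN", "50"};
        System.out.println("broken: " + parse(broken));
        System.out.println("fixed:  " + parse(fixed));
    }
}
```

Under this assumption, the original command line ends up with rootUrlDir set to "-topN50" and no topN value at all, which matches the log. With a space between "-topN" and "50", the URL directory stays "startsiden.no" and topN is read as 50.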
----- Original Message -----
From: "Tor Harald Thorland" <[EMAIL PROTECTED]>
To: <[email protected]>
Sent: Wednesday, January 10, 2007 9:22 PM
Subject: Starting nutch fails

>
> I'm totally new to this, and is "stuck"
>
> Can someone explain this some further..
>
> [EMAIL PROTECTED]:/mnt/div/nutch/urls$
> /home/tortho/Desktop/nutch-0.8.1/bin/nutch crawl startsiden.no -dir
> /mnt/div/nutch/startsiden -depth 3 -topN50
> crawl started in: /mnt/div/nutch/startsiden
> rootUrlDir = -topN50
> threads = 10
> depth = 3
> Injector: starting
> Injector: crawlDb: /mnt/div/nutch/startsiden/crawldb
> Injector: urlDir: -topN50
> Injector: Converting injected urls to crawl db entries.
> Exception in thread "main" java.io.IOException: Input directory
> /mnt/div/nutch/urls/-topN50 in local is invalid.
>         at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:274)
>         at org.apache.hadoop.mapred.JobClient.runJob(JobClient.java:327)
>         at org.apache.nutch.crawl.Injector.inject(Injector.java:138)
>         at org.apache.nutch.crawl.Crawl.main(Crawl.java:105)
> [EMAIL PROTECTED]:/mnt/div/nutch/urls$
>
> Thanks,
>

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
