Hi You get that error while running earlier 0.7 nutch tutorial running on 0.8dev nutch.
Use the tutorial for 0.8 dev http://wiki.media-style.com/display/nutchDocu/quick+tutorial+for+nutch+0.8+and+later. Or add following property to nutch-site.xml. <property> <name>mapred.input.dir</name> <value>C:/cygwin/usr/local/src/nutch-nightly/conf</value> <description>The proxy port.</description> </property> P >Hi all, >Having some problems getting nutch to run on XP/Cygwin. >This is re nutch-2006-01-17 >Intranet crawl........ >When I do this (after making urls file, etc.): > bin/nutch crawl urls -dir cdir -depth 2 >&log >I get this in the log: >060117 114833 parsing >file:/C:/cygwin/usr/local/src/nutch-nightly/conf/nutch->default.xml 060117 114834 parsing file:/C:/cygwin/usr/local/src/nutch-nightly/conf/crawl-tool.xml 060117 114834 parsing file:/C:/cygwin/usr/local/src/nutch-nightly/conf/mapred-default.xml 060117 114834 parsing file:/C:/cygwin/usr/local/src/nutch-nightly/conf/nutch-site.xml 060117 114834 crawl started in: cdir 060117 114834 rootUrlDir = urls 060117 114834 threads = 10 060117 114834 depth = 2 060117 114834 parsing file:/C:/cygwin/usr/local/src/nutch-nightly/conf/nutch-default.xml 060117 114834 parsing file:/C:/cygwin/usr/local/src/nutch-nightly/conf/crawl-tool.xml 060117 114834 parsing file:/C:/cygwin/usr/local/src/nutch-nightly/conf/nutch-site.xml 060117 114834 Injector: starting 060117 114834 Injector: crawlDb: cdir\crawldb 060117 114834 Injector: urlDir: urls 060117 114834 Injector: Converting injected urls to crawl db entries. 060117 114834 parsing file:/C:/cygwin/usr/local/src/nutch-nightly/conf/nutch-default.xml 060117 114834 parsing file:/C:/cygwin/usr/local/src/nutch-nightly/conf/crawl-tool.xml 060117 114834 parsing file:/C:/cygwin/usr/local/src/nutch-nightly/conf/mapred-default.xml 060117 114834 parsing file:/C:/cygwin/usr/local/src/nutch-nightly/conf/mapred-default.xml 060117 114834 parsing file:/C:/cygwin/usr/local/src/nutch-nightly/conf/nutch-site.xml 060117 114834 Running job: job_krj0e1 060117 114834 parsing file:/C:/cygwin/usr/local/src/nutch-nightly/conf/nutch-default.xml 060117 114834 parsing file:/C:/cygwin/usr/local/src/nutch-nightly/conf/mapred-default.xml 060117 114835 parsing \tmp\nutch\mapred\local\localRunner\job_krj0e1.xml 060117 114835 parsing file:/C:/cygwin/usr/local/src/nutch-nightly/conf/nutch-site.xml java.io.IOException: No input directories specified in: NutchConf: nutch-default.xml , mapred-default.xml , \tmp\nutch\mapred\local\localRunner\job_krj0e1.xml , nutch-site.xml at org.apache.nutch.mapred.InputFormatBase.listFiles(InputFormatBase.java:85) at org.apache.nutch.mapred.InputFormatBase.getSplits(InputFormatBase.java:95) at org.apache.nutch.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:63) 060117 114835 map 0% java.io.IOException: Job failed! at org.apache.nutch.mapred.JobClient.runJob(JobClient.java:308) at org.apache.nutch.crawl.Injector.inject(Injector.java:102) at org.apache.nutch.crawl.Crawl.main(Crawl.java:105) Exception in thread "main" I see that: nutch-site.xml is empty mapred-default is empty Whole Web setup............................ When I do this: (after mkdirs) bin/nutch admin db -create I get this at the prompt: Exception in thread "main" java.lang.NoClassDefFoundError: admin I don't speak Java, so I'm not sure what it's saying. Please help. TIA. __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com __________________________________________________ Do You Yahoo!? Tired of spam? Yahoo! Mail has the best spam protection around http://mail.yahoo.com
