Hi Nils Make sure the Adpater configuration is right in your linux box. And you can search thread "nutch and linux box" in nutch maillist. I think I posted the problem before. Regards /Jack > On 8/28/05, Nils Hoeller <[EMAIL PROTECTED]> wrote: > > Hi > > > > my Problem is: > > > > I ve done everything as descriped in the Getting Started Tutorial at > > nutch.org. > > > > When I now run the command: bin/nutch crawl urls -dir crawl.test -depth > > 3 >& crawl.log > > > > I get this Exception in the log file: > > run java in /usr/java/jdk1.5.0_04 > > 050828 104004 parsing > > file:/home/nils/Studienarbeit/nutch-nightly/conf/nutch-default.xml > > 050828 104004 parsing > > file:/home/nils/Studienarbeit/nutch-nightly/conf/crawl-tool.xml > > 050828 104004 parsing > > file:/home/nils/Studienarbeit/nutch-nightly/conf/nutch-site.xml > > 050828 104004 No FS indicated, using default:local > > 050828 104004 crawl started in: crawl.test > > 050828 104004 rootUrlFile = urls > > 050828 104004 threads = 10 > > 050828 104004 depth = 3 > > Exception in thread "main" java.lang.RuntimeException: > > java.net.UnknownHostException: linux: linux > > at org.apache.nutch.io.SequenceFile > > $Writer.<init>(SequenceFile.java:67) > > at org.apache.nutch.io.MapFile$Writer.<init>(MapFile.java:94) > > at org.apache.nutch.db.WebDBWriter.<init>(WebDBWriter.java:1507) > > at > > org.apache.nutch.db.WebDBWriter.createWebDB(WebDBWriter.java:1438) > > at > > org.apache.nutch.tools.WebDBAdminTool.main(WebDBAdminTool.java:172) > > at org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:133) > > Caused by: java.net.UnknownHostException: linux: linux > > at java.net.InetAddress.getLocalHost(InetAddress.java:1308) > > at org.apache.nutch.io.SequenceFile > > $Writer.<init>(SequenceFile.java:64) > > ... 5 more > > > > > > My urls file looks like this: > > > > http://www.nutch.org/ > > > > I ve also tried: > > > > http://www.ifis.uni-luebeck.de/ which I d like to get nutched > > > > Also in the urlfilter conf is written > > > > +^http://([a-z0-9]*\.)*ifis.uni-luebeck.de/ > > +^http://([a-z0-9]*\.)*nutch.org/ > > > > > > Can anyone give me a Hint? > > Where is the error? > > > > Thanks Nils > > > > > > > -- > Keep Discovering ... ... > http://www.jroller.com/page/jmars >
-- Keep Discovering ... ... http://www.jroller.com/page/jmars ------------------------------------------------------- SF.Net email is sponsored by: Discover Easy Linux Migration Strategies from IBM. Find simple to follow Roadmaps, straightforward articles, informative Webcasts and more! Get everything you need to get up to speed, fast. http://ads.osdn.com/?ad_idt77&alloc_id492&op=click _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
