try reinstall a new version J2EE? I guess JVM has problem to interface to file system,
Michael, --- Nils Hoeller <[EMAIL PROTECTED]> wrote: > Now what I tried (after what you said): > > 1. I started the command out of the Superuser > Terminal (Suse 9.3) > ยด= same Problem > > 2. I stopped Suse s firewall in Yast2 = same Problem > > 3. the file is "urls" without any extension > > To the misconfiguration of network: > > I m not that pro in linux, so where do I have to > search? > Actually I m going into internet over PPPoE , > tomorrow when my router arrives I go directly over > lan. > As i mentioned: Stoping the firewall (also what I > thought > to be the reason for the exception) doesn t help. > > What else could be configured ? > > The exception is everytime: > > run java in /usr/java/jdk1.5.0_04 > 050729 131449 parsing > file:/home/nils/Studienarbeit/nutch-nightly/conf/nutch-default.xml > 050729 131449 parsing > file:/home/nils/Studienarbeit/nutch-nightly/conf/crawl-tool.xml > 050729 131449 parsing > file:/home/nils/Studienarbeit/nutch-nightly/conf/nutch-site.xml > 050729 131449 No FS indicated, using default:local > 050729 131449 crawl started in: crawl.test > 050729 131449 rootUrlFile = urls > 050729 131449 threads = 10 > 050729 131449 depth = 3 > Exception in thread "main" > java.lang.RuntimeException: > java.net.UnknownHostException: linux: linux > at org.apache.nutch.io.SequenceFile > $Writer.<init>(SequenceFile.java:67) > at > org.apache.nutch.io.MapFile$Writer.<init>(MapFile.java:94) > at > org.apache.nutch.db.WebDBWriter.<init>(WebDBWriter.java:1507) > at > org.apache.nutch.db.WebDBWriter.createWebDB(WebDBWriter.java:1438) > at > org.apache.nutch.tools.WebDBAdminTool.main(WebDBAdminTool.java:172) > at > org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:133) > Caused by: java.net.UnknownHostException: linux: > linux > at > java.net.InetAddress.getLocalHost(InetAddress.java:1308) > at org.apache.nutch.io.SequenceFile > $Writer.<init>(SequenceFile.java:64) > "crawl.log" 20L, 1180C > 1,1 > Anfang > > > Thanks for your help > > Nils > > > Am Donnerstag, den 28.07.2005, 18:41 -0700 schrieb > Feng (Michael) Ji: > > try change your user-mode to superuser in linux? > seems > > it is an IO error from JVM, > > > > Michael > > > > --- Nils Hoeller <[EMAIL PROTECTED]> wrote: > > > > > Hi > > > > > > my Problem is: > > > > > > I ve done everything as descriped in the Getting > > > Started Tutorial at > > > nutch.org. > > > > > > When I now run the command: bin/nutch crawl urls > > > -dir crawl.test -depth > > > 3 >& crawl.log > > > > > > I get this Exception in the log file: > > > run java in /usr/java/jdk1.5.0_04 > > > 050828 104004 parsing > > > > > > file:/home/nils/Studienarbeit/nutch-nightly/conf/nutch-default.xml > > > 050828 104004 parsing > > > > > > file:/home/nils/Studienarbeit/nutch-nightly/conf/crawl-tool.xml > > > 050828 104004 parsing > > > > > > file:/home/nils/Studienarbeit/nutch-nightly/conf/nutch-site.xml > > > 050828 104004 No FS indicated, using > default:local > > > 050828 104004 crawl started in: crawl.test > > > 050828 104004 rootUrlFile = urls > > > 050828 104004 threads = 10 > > > 050828 104004 depth = 3 > > > Exception in thread "main" > > > java.lang.RuntimeException: > > > java.net.UnknownHostException: linux: linux > > > at org.apache.nutch.io.SequenceFile > > > $Writer.<init>(SequenceFile.java:67) > > > at > > > > > > org.apache.nutch.io.MapFile$Writer.<init>(MapFile.java:94) > > > at > > > > > > org.apache.nutch.db.WebDBWriter.<init>(WebDBWriter.java:1507) > > > at > > > > > > org.apache.nutch.db.WebDBWriter.createWebDB(WebDBWriter.java:1438) > > > at > > > > > > org.apache.nutch.tools.WebDBAdminTool.main(WebDBAdminTool.java:172) > > > at > > > > > > org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:133) > > > Caused by: java.net.UnknownHostException: linux: > > > linux > > > at > > > > > > java.net.InetAddress.getLocalHost(InetAddress.java:1308) > > > at org.apache.nutch.io.SequenceFile > > > $Writer.<init>(SequenceFile.java:64) > > > ... 5 more > > > > > > > > > My urls file looks like this: > > > > > > http://www.nutch.org/ > > > > > > I ve also tried: > > > > > > http://www.ifis.uni-luebeck.de/ which I d like > to > > > get nutched > > > > > > Also in the urlfilter conf is written > > > > > > +^http://([a-z0-9]*\.)*ifis.uni-luebeck.de/ > > > +^http://([a-z0-9]*\.)*nutch.org/ > > > > > > > > > Can anyone give me a Hint? > > > Where is the error? > > > > > > Thanks Nils > > > > > > > > > > > > > > > > > ____________________________________________________ > > Start your day with Yahoo! - make it your home > page > > http://www.yahoo.com/r/hs > > > > ____________________________________________________ Start your day with Yahoo! - make it your home page http://www.yahoo.com/r/hs ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
