I ve now downloaded the newest J2EE from java.sun.com I ve installed it with by executing the bin file. Should I do anything more?
The Problem is: I ve got still the exception. java -version gives me (if this matters) java version "1.5.0_04" Java(TM) 2 Runtime Environment, Standard Edition (build 1.5.0_04-b05) Java HotSpot(TM) Client VM (build 1.5.0_04-b05, mixed mode, sharing) This are my Env. Var. in my .bashrc export NUTCH_JAVA_HOME=/usr/java/jdk1.5.0_04 export JAVA_HOME=/usr/java/jdk1.5.0_04 export CATALINA_HOME=/home/nils/jakarta-tomcat-4.1.27 For Tomcat they are working, so I guess they ll do also for nutch (the java path) It s getting really frustrating...:-( Thanks anyway Nils Am Freitag, den 29.07.2005, 05:05 -0700 schrieb Feng (Michael) Ji: > try reinstall a new version J2EE? > > I guess JVM has problem to interface to file system, > > Michael, > > --- Nils Hoeller <[EMAIL PROTECTED]> wrote: > > > Now what I tried (after what you said): > > > > 1. I started the command out of the Superuser > > Terminal (Suse 9.3) > > ยด= same Problem > > > > 2. I stopped Suse s firewall in Yast2 = same Problem > > > > 3. the file is "urls" without any extension > > > > To the misconfiguration of network: > > > > I m not that pro in linux, so where do I have to > > search? > > Actually I m going into internet over PPPoE , > > tomorrow when my router arrives I go directly over > > lan. > > As i mentioned: Stoping the firewall (also what I > > thought > > to be the reason for the exception) doesn t help. > > > > What else could be configured ? > > > > The exception is everytime: > > > > run java in /usr/java/jdk1.5.0_04 > > 050729 131449 parsing > > > file:/home/nils/Studienarbeit/nutch-nightly/conf/nutch-default.xml > > 050729 131449 parsing > > > file:/home/nils/Studienarbeit/nutch-nightly/conf/crawl-tool.xml > > 050729 131449 parsing > > > file:/home/nils/Studienarbeit/nutch-nightly/conf/nutch-site.xml > > 050729 131449 No FS indicated, using default:local > > 050729 131449 crawl started in: crawl.test > > 050729 131449 rootUrlFile = urls > > 050729 131449 threads = 10 > > 050729 131449 depth = 3 > > Exception in thread "main" > > java.lang.RuntimeException: > > java.net.UnknownHostException: linux: linux > > at org.apache.nutch.io.SequenceFile > > $Writer.<init>(SequenceFile.java:67) > > at > > > org.apache.nutch.io.MapFile$Writer.<init>(MapFile.java:94) > > at > > > org.apache.nutch.db.WebDBWriter.<init>(WebDBWriter.java:1507) > > at > > > org.apache.nutch.db.WebDBWriter.createWebDB(WebDBWriter.java:1438) > > at > > > org.apache.nutch.tools.WebDBAdminTool.main(WebDBAdminTool.java:172) > > at > > > org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:133) > > Caused by: java.net.UnknownHostException: linux: > > linux > > at > > > java.net.InetAddress.getLocalHost(InetAddress.java:1308) > > at org.apache.nutch.io.SequenceFile > > $Writer.<init>(SequenceFile.java:64) > > "crawl.log" 20L, 1180C > > 1,1 > > Anfang > > > > > > Thanks for your help > > > > Nils > > > > > > Am Donnerstag, den 28.07.2005, 18:41 -0700 schrieb > > Feng (Michael) Ji: > > > try change your user-mode to superuser in linux? > > seems > > > it is an IO error from JVM, > > > > > > Michael > > > > > > --- Nils Hoeller <[EMAIL PROTECTED]> wrote: > > > > > > > Hi > > > > > > > > my Problem is: > > > > > > > > I ve done everything as descriped in the Getting > > > > Started Tutorial at > > > > nutch.org. > > > > > > > > When I now run the command: bin/nutch crawl urls > > > > -dir crawl.test -depth > > > > 3 >& crawl.log > > > > > > > > I get this Exception in the log file: > > > > run java in /usr/java/jdk1.5.0_04 > > > > 050828 104004 parsing > > > > > > > > > > file:/home/nils/Studienarbeit/nutch-nightly/conf/nutch-default.xml > > > > 050828 104004 parsing > > > > > > > > > > file:/home/nils/Studienarbeit/nutch-nightly/conf/crawl-tool.xml > > > > 050828 104004 parsing > > > > > > > > > > file:/home/nils/Studienarbeit/nutch-nightly/conf/nutch-site.xml > > > > 050828 104004 No FS indicated, using > > default:local > > > > 050828 104004 crawl started in: crawl.test > > > > 050828 104004 rootUrlFile = urls > > > > 050828 104004 threads = 10 > > > > 050828 104004 depth = 3 > > > > Exception in thread "main" > > > > java.lang.RuntimeException: > > > > java.net.UnknownHostException: linux: linux > > > > at org.apache.nutch.io.SequenceFile > > > > $Writer.<init>(SequenceFile.java:67) > > > > at > > > > > > > > > > org.apache.nutch.io.MapFile$Writer.<init>(MapFile.java:94) > > > > at > > > > > > > > > > org.apache.nutch.db.WebDBWriter.<init>(WebDBWriter.java:1507) > > > > at > > > > > > > > > > org.apache.nutch.db.WebDBWriter.createWebDB(WebDBWriter.java:1438) > > > > at > > > > > > > > > > org.apache.nutch.tools.WebDBAdminTool.main(WebDBAdminTool.java:172) > > > > at > > > > > > > > > > org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:133) > > > > Caused by: java.net.UnknownHostException: linux: > > > > linux > > > > at > > > > > > > > > > java.net.InetAddress.getLocalHost(InetAddress.java:1308) > > > > at org.apache.nutch.io.SequenceFile > > > > $Writer.<init>(SequenceFile.java:64) > > > > ... 5 more > > > > > > > > > > > > My urls file looks like this: > > > > > > > > http://www.nutch.org/ > > > > > > > > I ve also tried: > > > > > > > > http://www.ifis.uni-luebeck.de/ which I d like > > to > > > > get nutched > > > > > > > > Also in the urlfilter conf is written > > > > > > > > +^http://([a-z0-9]*\.)*ifis.uni-luebeck.de/ > > > > +^http://([a-z0-9]*\.)*nutch.org/ > > > > > > > > > > > > Can anyone give me a Hint? > > > > Where is the error? > > > > > > > > Thanks Nils > > > > > > > > > > > > > > > > > > > > > > > > > ____________________________________________________ > > > Start your day with Yahoo! - make it your home > > page > > > http://www.yahoo.com/r/hs > > > > > > > > > > > > ____________________________________________________ > Start your day with Yahoo! - make it your home page > http://www.yahoo.com/r/hs > ------------------------------------------------------- SF.Net email is Sponsored by the Better Software Conference & EXPO September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
