try reinstall a new version J2EE?

I guess JVM has problem to interface to file system,

Michael,

--- Nils Hoeller <[EMAIL PROTECTED]> wrote:

> Now what I tried (after what you said):
> 
> 1. I started the command out of the Superuser
> Terminal (Suse 9.3)
>    ยด= same Problem 
> 
> 2. I stopped Suse s firewall in Yast2 = same Problem
> 
> 3. the file is "urls" without any extension
> 
> To the misconfiguration of network:
> 
> I m not that pro in linux, so where do I have to
> search? 
> Actually I m going into internet over PPPoE ,
> tomorrow when my router arrives I go directly over
> lan.
> As i mentioned: Stoping the firewall (also what I
> thought
> to be the reason for the exception) doesn t help.
> 
> What else could be configured ? 
> 
> The exception is everytime:
> 
> run java in /usr/java/jdk1.5.0_04
> 050729 131449 parsing
>
file:/home/nils/Studienarbeit/nutch-nightly/conf/nutch-default.xml
> 050729 131449 parsing
>
file:/home/nils/Studienarbeit/nutch-nightly/conf/crawl-tool.xml
> 050729 131449 parsing
>
file:/home/nils/Studienarbeit/nutch-nightly/conf/nutch-site.xml
> 050729 131449 No FS indicated, using default:local
> 050729 131449 crawl started in: crawl.test
> 050729 131449 rootUrlFile = urls
> 050729 131449 threads = 10
> 050729 131449 depth = 3
> Exception in thread "main"
> java.lang.RuntimeException:
> java.net.UnknownHostException: linux: linux
>         at org.apache.nutch.io.SequenceFile
> $Writer.<init>(SequenceFile.java:67)
>         at
>
org.apache.nutch.io.MapFile$Writer.<init>(MapFile.java:94)
>         at
>
org.apache.nutch.db.WebDBWriter.<init>(WebDBWriter.java:1507)
>         at
>
org.apache.nutch.db.WebDBWriter.createWebDB(WebDBWriter.java:1438)
>         at
>
org.apache.nutch.tools.WebDBAdminTool.main(WebDBAdminTool.java:172)
>         at
>
org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:133)
> Caused by: java.net.UnknownHostException: linux:
> linux
>         at
>
java.net.InetAddress.getLocalHost(InetAddress.java:1308)
>         at org.apache.nutch.io.SequenceFile
> $Writer.<init>(SequenceFile.java:64)
> "crawl.log" 20L, 1180C                              
>          1,1
> Anfang
> 
> 
> Thanks for your help
> 
> Nils
> 
> 
> Am Donnerstag, den 28.07.2005, 18:41 -0700 schrieb
> Feng (Michael) Ji:
> > try change your user-mode to superuser in linux?
> seems
> > it is an IO error from JVM,
> > 
> > Michael
> > 
> > --- Nils Hoeller <[EMAIL PROTECTED]> wrote:
> > 
> > > Hi 
> > > 
> > > my Problem is:
> > > 
> > > I ve done everything as descriped in the Getting
> > > Started Tutorial at
> > > nutch.org. 
> > > 
> > > When I now run the command: bin/nutch crawl urls
> > > -dir crawl.test -depth
> > > 3 >& crawl.log
> > > 
> > > I get this Exception in the log file:
> > > run java in /usr/java/jdk1.5.0_04
> > > 050828 104004 parsing
> > >
> >
>
file:/home/nils/Studienarbeit/nutch-nightly/conf/nutch-default.xml
> > > 050828 104004 parsing
> > >
> >
>
file:/home/nils/Studienarbeit/nutch-nightly/conf/crawl-tool.xml
> > > 050828 104004 parsing
> > >
> >
>
file:/home/nils/Studienarbeit/nutch-nightly/conf/nutch-site.xml
> > > 050828 104004 No FS indicated, using
> default:local
> > > 050828 104004 crawl started in: crawl.test
> > > 050828 104004 rootUrlFile = urls
> > > 050828 104004 threads = 10
> > > 050828 104004 depth = 3
> > > Exception in thread "main"
> > > java.lang.RuntimeException:
> > > java.net.UnknownHostException: linux: linux
> > >         at org.apache.nutch.io.SequenceFile
> > > $Writer.<init>(SequenceFile.java:67)
> > >         at
> > >
> >
>
org.apache.nutch.io.MapFile$Writer.<init>(MapFile.java:94)
> > >         at
> > >
> >
>
org.apache.nutch.db.WebDBWriter.<init>(WebDBWriter.java:1507)
> > >         at
> > >
> >
>
org.apache.nutch.db.WebDBWriter.createWebDB(WebDBWriter.java:1438)
> > >         at
> > >
> >
>
org.apache.nutch.tools.WebDBAdminTool.main(WebDBAdminTool.java:172)
> > >         at
> > >
> >
>
org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:133)
> > > Caused by: java.net.UnknownHostException: linux:
> > > linux
> > >         at
> > >
> >
>
java.net.InetAddress.getLocalHost(InetAddress.java:1308)
> > >         at org.apache.nutch.io.SequenceFile
> > > $Writer.<init>(SequenceFile.java:64)
> > >         ... 5 more
> > > 
> > > 
> > > My urls file looks like this:
> > > 
> > > http://www.nutch.org/
> > > 
> > > I ve also tried:
> > > 
> > > http://www.ifis.uni-luebeck.de/ which I d like
> to
> > > get nutched
> > > 
> > > Also in the urlfilter conf is written
> > > 
> > > +^http://([a-z0-9]*\.)*ifis.uni-luebeck.de/
> > > +^http://([a-z0-9]*\.)*nutch.org/
> > > 
> > > 
> > > Can anyone give me a Hint?
> > > Where is the error?
> > > 
> > > Thanks Nils
> > > 
> > > 
> > 
> > 
> > 
> >             
> >
> ____________________________________________________
> > Start your day with Yahoo! - make it your home
> page 
> > http://www.yahoo.com/r/hs 
> > 
> 
> 



                
____________________________________________________
Start your day with Yahoo! - make it your home page 
http://www.yahoo.com/r/hs 
 


-------------------------------------------------------
SF.Net email is Sponsored by the Better Software Conference & EXPO September
19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to