I configured Eclipse following RunNutchInEclipse0.9 guide and got following:

 When I run Nutch 0.9 using bin\nutch crawl  command inside cygwin -
everything is OK, apache.org site (I just use what is written in the
Tutorial) is processed as it should be. When  I do the same  from Eclipse -
it runs, but doesn't process any URLs - one of printouts on the console says
"No URLs to fetch - check your seed list and URL filters." 

I have checked bin/nutch script and added following as VM arguments:

 

-Xmx1000m

-Dhadoop.log.dir=c:\nutch-0.9\logs

-Dhadoop.log.file=hadoop.log

-Djava.library.path=c:\nutch-0.9\lib\native\Windows_2003-x86-32  

 

It didn't help.

 

Looking into the list history, I found that other people also had this
problem with nutch 0.9.

What configuration/anything else is missing? Can someone confirm that  nutch
0.9  runs successfully  in eclipse, and advise about settings?

 

I would very appreciate any help.

 

Thank you,

 

Lev Kantorovich

Reply via email to