Hi Ralf, I got Nutch running with MySQL and run it like this >
The MySQL backend in GORA is broken and not supported any more in recent versions. You'd be better off using a different backend like HBase or Cassandra > bin/nutch inject urls > > bin/nutch generate -topN 200 > bin/nutch fetch -all > bin/nutch parse -all > bin/nutch updatedb > > > > It seems to take a long time to fetch URLS, yet my CPU's are only used > 1%... > is this normal? Can this be done faster since it doesn't see a limitation > of > system resources? > How long is a piece of string? you did not tell us how long it takes, how many URLs are in your DB etc... as for the fetching, it's all about network not CPU or memory so that's pretty normal. Maybe try Nutch 1.x for a comparison or if you want to stick to 2.x move to HBase HTH Julien > > > > > > Thnx > > -- * *Open Source Solutions for Text Engineering http://digitalpebble.blogspot.com/ http://www.digitalpebble.com http://twitter.com/digitalpebble

