Hi Ralf,

I got Nutch running with MySQL and run it like this
>

The MySQL backend in GORA is broken and not supported any more in recent
versions. You'd be better off using a different backend like HBase or
Cassandra



> bin/nutch inject urls
>
> bin/nutch generate -topN 200
> bin/nutch fetch -all
> bin/nutch parse -all
> bin/nutch updatedb
>
>
>
> It seems to take a long time to fetch URLS, yet my CPU's are only used
> 1%...
> is this normal? Can this be done faster since it doesn't see a limitation
> of
> system resources?
>

How long is a piece of string? you did not tell us how long it takes, how
many URLs are in your DB etc... as for the fetching, it's all about network
not CPU or memory so that's pretty normal.

Maybe try Nutch 1.x for a comparison or if you want to stick to 2.x move to
HBase

HTH

Julien



>
>
>
>
>
> Thnx
>
>


-- 
*
*Open Source Solutions for Text Engineering

http://digitalpebble.blogspot.com/
http://www.digitalpebble.com
http://twitter.com/digitalpebble

Reply via email to