Hi,

I have Nutch 2 configured with a Cassandra backend, as described here:
http://sujitpal.blogspot.com/2012/01/exploring-nutch-gora-with-cassandra.html

It fails to fetch pages after the first iteration. That is, it successfully
goes through the home pages, but then the fetcher gets 0 pages on subsequent
iterations.

These are the commands I am issuing:

bin/nutch inject seed
bin/nutch generate
bin/nutch fetch ID1

At this point the log shows:
...
0/0 spinwaiting/active, 4 pages


bin/nutch parse ID1
bin/nutch updatedb
bin/nutch generate
bin/nutch fetch ID2

QueueFeeder finished: total 0 records. Hit by time limit :0
-finishing thread FetcherThread0, activeThreads=0
...
0/0 spinwaiting/active, 0 pages, 0 errors, 0.0 0.0 pages/s,

The exact same config works with v1.5.1.

Also, I was getting a NullPointerException on inject before
changing conf/gora-cassandra-mapping.xml

from:
  <class keyClass="java.lang.String" name="org.apache.nutch.storage.WebPage">
to:
  <class keyClass="java.lang.String" name="org.apache.nutch.storage.WebPage" keyspace="webpage">

http.content.limit is set to -1, as was suggested a while back in a
similar thread, but it does not help.

Regards
