Hi, Lewis. I checked the links but I can't get into a conclusion. I think we would need to have the output of readdb after each phase:
nutch inject readdb nuch generate readdb nutch fetch readdb nutch parse readdb nutch updatedb readdb And then much probably we could find something. Thanks! Alfonso Nishikawa 2015-02-26 0:46 GMT+01:00 Lewis John Mcgibbney <[email protected]>: > Hi Folks, > Several threads have popped up over on the Nutch mailing lists regarding > use of gora-cassandra 0.5 within Nutch 2.3. > > http://www.mail-archive.com/user%40nutch.apache.org/msg13228.html > http://www.mail-archive.com/user%40nutch.apache.org/msg13235.html > http://www.mail-archive.com/user%40nutch.apache.org/msg13237.html > http://www.mail-archive.com/user%40nutch.apache.org/msg13250.html > > I think we can expect a 0.6.1 release pretty soon if this is discovered to > be a major bug. > I have not been using gora-cassandra for a number of months (2 or so), so I > am not immediately sure right now what is wrong. > We appear to be loosing data between ParserJob and FetcherJob states with 0 > Map input records being provided to the ParserJob Map Reduce framework. > Any help from this team on deploying a test configuration and testing would > be highly appreciated. > Suggested software stack is as follows > > Nutch 2.4-SNAPSHOT (HEAD) > Gora 0.5, Gora Cassandra 0.5 > Cassandra 2.0.2 > > Thanks > Lewis > > > -- > *Lewis* >

