Hi, Lewis. Add the question that appeared today at stackoverflow:
http://stackoverflow.com/questions/28813709/how-to-extract-nutch-2-3-data-from-cassandra-with-gora/28815075 Nutch 2.2.3 + Gora 0.5 We dug until we got here: WARN mapreduce.GoraRecordWriter - Exception at GoraRecordWriter.class while closing datastore.InvalidRequestException(why:supercolumn parameter is not optional for super CF sc) It looks to me as something related with gora-cassandra-mapping.xml OR CassandraStore when flush()/close() is called and it starts persisting fields. Alfonso Nishikawa 2015-02-26 10:31 GMT-01:00 Alfonso Nishikawa <[email protected]>: > Hi, Lewis. > > I checked the links but I can't get into a conclusion. > I think we would need to have the output of readdb after each phase: > > nutch inject > readdb > nuch generate > readdb > nutch fetch > readdb > nutch parse > readdb > nutch updatedb > readdb > > And then much probably we could find something. > > Thanks! > > Alfonso Nishikawa > > 2015-02-26 0:46 GMT+01:00 Lewis John Mcgibbney <[email protected]> > : > >> Hi Folks, >> Several threads have popped up over on the Nutch mailing lists regarding >> use of gora-cassandra 0.5 within Nutch 2.3. >> >> http://www.mail-archive.com/user%40nutch.apache.org/msg13228.html >> http://www.mail-archive.com/user%40nutch.apache.org/msg13235.html >> http://www.mail-archive.com/user%40nutch.apache.org/msg13237.html >> http://www.mail-archive.com/user%40nutch.apache.org/msg13250.html >> >> I think we can expect a 0.6.1 release pretty soon if this is discovered to >> be a major bug. >> I have not been using gora-cassandra for a number of months (2 or so), so >> I >> am not immediately sure right now what is wrong. >> We appear to be loosing data between ParserJob and FetcherJob states with >> 0 >> Map input records being provided to the ParserJob Map Reduce framework. >> Any help from this team on deploying a test configuration and testing >> would >> be highly appreciated. >> Suggested software stack is as follows >> >> Nutch 2.4-SNAPSHOT (HEAD) >> Gora 0.5, Gora Cassandra 0.5 >> Cassandra 2.0.2 >> >> Thanks >> Lewis >> >> >> -- >> *Lewis* >> > >

