newer Cassandra + Hadoop = TimedOutException()

Patrik Modesto Fri, 24 Feb 2012 05:21:37 -0800

Hi,

I can see some strange behaviour on my test cluster and in production.
Both running cassandra 0.8.10. Strange is that when I compile my
mapreduce job against cassandra-all 0.8.7 everything is ok, but if I
use higher version I get quite a lots of TimedOutException.


java.lang.RuntimeException: TimedOutException()
        at 
org.apache.cassandra.hadoop.ColumnFamilyRecordReader$RowIterator.maybeInit(ColumnFamilyRecordReader.java:319)
        at 
org.apache.cassandra.hadoop.ColumnFamilyRecordReader$RowIterator.computeNext(ColumnFamilyRecordReader.java:333)
        at 
org.apache.cassandra.hadoop.ColumnFamilyRecordReader$RowIterator.computeNext(ColumnFamilyRecordReader.java:207)
        at 
com.google.common.collect.AbstractIterator.tryToComputeNext(AbstractIterator.java:135)
        at 
com.google.common.collect.AbstractIterator.hasNext(AbstractIterator.java:130)
        at 
org.apache.cassandra.hadoop.ColumnFamilyRecordReader.nextKeyValue(ColumnFamilyRecordReader.java:163)
        at 
org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.nextKeyValue(MapTask.java:456)
        at 
org.apache.hadoop.mapreduce.MapContext.nextKeyValue(MapContext.java:67)
        at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:143)
        at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:647)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:32

There is nothing in the cassandra log, the cluster is idle, no one
else is accessing the cluster.

There are just few rows, nothing big:

INFO  mapred.JobClient:     Reduce input groups=14
INFO  mapred.JobClient:     Combine output records=0
INFO  mapred.JobClient:     Map input records=544009
INFO  mapred.JobClient:     Reduce shuffle bytes=33876
INFO  mapred.JobClient:     Reduce output records=0
INFO  mapred.JobClient:     Spilled Records=38
INFO  mapred.JobClient:     Map output bytes=33656
INFO  mapred.JobClient:     Combine input records=0
INFO  mapred.JobClient:     Map output records=19
INFO  mapred.JobClient:     SPLIT_RAW_BYTES=3937
INFO  mapred.JobClient:     Reduce input records=19

What could be the problem?

Regards,
P.

newer Cassandra + Hadoop = TimedOutException()

Reply via email to