Andrew Purtell wrote:
Scanners got this:

java.io.IOException: java.io.IOException: java.lang.IndexOutOfBoundsException
        at java.io.DataInputStream.readFully(DataInputStream.java:175)
        at 
org.apache.hadoop.io.DataOutputBuffer$Buffer.write(DataOutputBuffer.java:64)
        at 
org.apache.hadoop.io.DataOutputBuffer.write(DataOutputBuffer.java:102)
        at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:1933)
        at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:1833)
        at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:1879)
        at org.apache.hadoop.io.MapFile$Reader.next(MapFile.java:516)
        at 
org.apache.hadoop.hbase.regionserver.StoreFileScanner.getNext(StoreFileScanner.java:312)
        at 
org.apache.hadoop.hbase.regionserver.StoreFileScanner.next(StoreFileScanner.java:183)
        at 
org.apache.hadoop.hbase.regionserver.HStoreScanner.next(HStoreScanner.java:196)
        at 
org.apache.hadoop.hbase.regionserver.HRegion$HScanner.next(HRegion.java:2027)
        at 
org.apache.hadoop.hbase.regionserver.HRegionServer.next(HRegionServer.java:1087)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
        at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.apache.hadoop.hbase.ipc.HbaseRPC$Server.call(HbaseRPC.java:554)
        at org.apache.hadoop.ipc.Server$Handler.run(Server.java:888)

After a restart all seems well. But that restart was a painful
manual intervention. Have you seen this before? I can file a JIRA
or comment on an existing one.
I've not seen it before. Exception should note the file it was trying to read from I'd say at a minimum. Looks like failure trying to read in MapFile(SequenceFile) content. And you've not seen it since the restart? (Would be odd that a problematic file would heal itself).

What about the files you made when crawler had no upper-bound on sizes pulled down? Are they still in your hbase?

Disabling compression brought on a bunch of splits but otherwise, it seems to be working?

St.Ack

Reply via email to