Andrew Purtell wrote:
Scanners got this:
java.io.IOException: java.io.IOException: java.lang.IndexOutOfBoundsException
at java.io.DataInputStream.readFully(DataInputStream.java:175)
at
org.apache.hadoop.io.DataOutputBuffer$Buffer.write(DataOutputBuffer.java:64)
at
org.apache.hadoop.io.DataOutputBuffer.write(DataOutputBuffer.java:102)
at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:1933)
at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:1833)
at org.apache.hadoop.io.SequenceFile$Reader.next(SequenceFile.java:1879)
at org.apache.hadoop.io.MapFile$Reader.next(MapFile.java:516)
at
org.apache.hadoop.hbase.regionserver.StoreFileScanner.getNext(StoreFileScanner.java:312)
at
org.apache.hadoop.hbase.regionserver.StoreFileScanner.next(StoreFileScanner.java:183)
at
org.apache.hadoop.hbase.regionserver.HStoreScanner.next(HStoreScanner.java:196)
at
org.apache.hadoop.hbase.regionserver.HRegion$HScanner.next(HRegion.java:2027)
at
org.apache.hadoop.hbase.regionserver.HRegionServer.next(HRegionServer.java:1087)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.hbase.ipc.HbaseRPC$Server.call(HbaseRPC.java:554)
at org.apache.hadoop.ipc.Server$Handler.run(Server.java:888)
After a restart all seems well. But that restart was a painful
manual intervention. Have you seen this before? I can file a JIRA
or comment on an existing one.
I've not seen it before. Exception should note the file it was trying
to read from I'd say at a minimum. Looks like failure trying to read in
MapFile(SequenceFile) content. And you've not seen it since the
restart? (Would be odd that a problematic file would heal itself).
What about the files you made when crawler had no upper-bound on sizes
pulled down? Are they still in your hbase?
Disabling compression brought on a bunch of splits but otherwise, it
seems to be working?
St.Ack