[ https://issues.apache.org/jira/browse/HBASE-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12858171#action_12858171 ]
Andrew Purtell commented on HBASE-2461:
---------------------------------------
I think this, and all related tightening up of what we do when the FS throws
an IOException (IOE), should be a subtask of HBASE-1964. At the Hackathon we'd
like to start chasing these down with a modified DFSClient into which we can
inject faults, either randomly with adjustable probabilities or always on
particular code paths.
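As a rough illustration of the fault-injection idea, here is a minimal sketch
of an injector keyed by named code paths. The class FaultInjector, its methods,
and the point names are hypothetical and are not part of DFSClient or HBase; a
modified client would have to call maybeFail() at the places it wants to be
able to break.

```java
import java.io.IOException;
import java.util.Map;
import java.util.Random;
import java.util.concurrent.ConcurrentHashMap;

/**
 * Hypothetical fault injector (an assumption for illustration, not an
 * existing HDFS or HBase class). Each named injection point fails never
 * (probability 0.0), always (1.0), or randomly in between.
 */
public class FaultInjector {
    private final Map<String, Double> probabilities = new ConcurrentHashMap<>();
    private final Random random;

    /** A fixed seed makes a random failure sequence reproducible. */
    public FaultInjector(long seed) {
        this.random = new Random(seed);
    }

    /** Sets the failure probability for one named code path. */
    public void setProbability(String point, double probability) {
        if (probability < 0.0 || probability > 1.0) {
            throw new IllegalArgumentException("probability must be in [0, 1]");
        }
        probabilities.put(point, probability);
    }

    /** Throws an IOException if the named point is chosen to fail on this call. */
    public void maybeFail(String point) throws IOException {
        double p = probabilities.getOrDefault(point, 0.0);
        // nextDouble() returns a value in [0, 1), so 1.0 always fails
        // and 0.0 (the default for unknown points) never does.
        if (random.nextDouble() < p) {
            throw new IOException("Injected fault at " + point);
        }
    }
}
```

With this shape, setProbability("close", 1.0) covers the "always on a
particular code path" case, and a value such as 0.01 covers the random case.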
> Split doesn't handle IOExceptions when creating new region reference files
> --------------------------------------------------------------------------
>
> Key: HBASE-2461
> URL: https://issues.apache.org/jira/browse/HBASE-2461
> Project: Hadoop HBase
> Issue Type: Bug
> Components: regionserver
> Reporter: Todd Lipcon
> Priority: Blocker
> Fix For: 0.20.5, 0.21.0
>
>
> I was testing an HDFS patch which had a bug in it, so it happened to throw an
> NPE during a split with the following trace:
> 2010-04-16 19:18:20,727 ERROR org.apache.hadoop.hbase.regionserver.CompactSplitThread: Compaction failed for region TestTable,-1945465867<1271449232310>,1271453785648
> java.lang.NullPointerException
>     at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.enqueueCurrentPacket(DFSClient.java:3124)
>     at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.flushInternal(DFSClient.java:3220)
>     at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.closeInternal(DFSClient.java:3306)
>     at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.close(DFSClient.java:3255)
>     at org.apache.hadoop.fs.FSDataOutputStream$PositionCache.close(FSDataOutputStream.java:61)
>     at org.apache.hadoop.fs.FSDataOutputStream.close(FSDataOutputStream.java:86)
>     at org.apache.hadoop.fs.FileSystem.createNewFile(FileSystem.java:560)
>     at org.apache.hadoop.hbase.util.FSUtils.create(FSUtils.java:95)
>     at org.apache.hadoop.hbase.io.Reference.write(Reference.java:129)
>     at org.apache.hadoop.hbase.regionserver.StoreFile.split(StoreFile.java:498)
>     at org.apache.hadoop.hbase.regionserver.HRegion.splitRegion(HRegion.java:682)
>     at org.apache.hadoop.hbase.regionserver.CompactSplitThread.split(CompactSplitThread.java:162)
>     at org.apache.hadoop.hbase.regionserver.CompactSplitThread.run(CompactSplitThread.java:95)
> After that, my region was gone, and any further writes to it would fail.
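
One possible shape for handling this kind of failure is sketched below: track
every reference file the split starts to write and delete them all if any
write fails, so the caller can leave the parent region in service. This is
not the HBase fix. The class SplitRollbackSketch and the ReferenceWriter
interface are hypothetical, and java.nio.file stands in for the Hadoop
FileSystem API to keep the example self-contained. Note that the trace above
shows a NullPointerException, which is unchecked, so catching only
IOException would not have covered it.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch of all-or-nothing reference file creation. */
public class SplitRollbackSketch {

    /** Hypothetical stand-in for whatever writes a single reference file. */
    public interface ReferenceWriter {
        void write(Path target) throws IOException;
    }

    /**
     * Writes every target, or removes whatever was written and rethrows.
     * The caller is assumed to keep the parent region online on failure.
     */
    public static List<Path> createReferences(List<Path> targets,
                                              ReferenceWriter writer)
            throws IOException {
        List<Path> started = new ArrayList<>();
        try {
            for (Path target : targets) {
                // Record the path before writing so a half-written file
                // is cleaned up as well.
                started.add(target);
                writer.write(target);
            }
            return started;
        } catch (IOException | RuntimeException e) {
            for (Path p : started) {
                try {
                    Files.deleteIfExists(p);
                } catch (IOException cleanupFailure) {
                    // Keep the original failure as the primary exception.
                    e.addSuppressed(cleanupFailure);
                }
            }
            throw e;
        }
    }
}
```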