[ 
https://issues.apache.org/jira/browse/HBASE-2461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12858171#action_12858171
 ] 

Andrew Purtell commented on HBASE-2461:
---------------------------------------

I think this issue, along with all the related work to tighten up what we do
when we get an IOE from the FS, should be a subtask of HBASE-1964. At the
Hackathon we'd like to start chasing these down with a modified DFSClient into
which we can inject faults, either randomly with adjustable probabilities or
always on particular code paths.
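
To make the fault injection idea concrete, here is a minimal sketch of the kind
of hook a modified DFSClient could call. Everything in it is an assumption for
illustration: the FaultInjector class, its method names, and the code path
labels are hypothetical and are not part of Hadoop or HBase.

```java
import java.io.IOException;
import java.util.Map;
import java.util.Random;
import java.util.concurrent.ConcurrentHashMap;

/**
 * Hypothetical fault injector (not a Hadoop or HBase class).
 * Each named code path can be given a failure probability:
 * 1.0 means "always fail", anything lower means "fail at random".
 */
class FaultInjector {
    private final Random random;
    // Code path label -> probability of injecting a fault, in [0.0, 1.0].
    private final Map<String, Double> probabilities = new ConcurrentHashMap<>();

    /** A fixed seed makes a random failure sequence reproducible. */
    FaultInjector(long seed) {
        this.random = new Random(seed);
    }

    /** Fail the named code path with the given probability. */
    void setProbability(String codePath, double probability) {
        if (probability < 0.0 || probability > 1.0) {
            throw new IllegalArgumentException("probability must be in [0, 1]");
        }
        probabilities.put(codePath, probability);
    }

    /** Fail the named code path on every call. */
    void alwaysFail(String codePath) {
        setProbability(codePath, 1.0);
    }

    /** Stop injecting faults on the named code path. */
    void clear(String codePath) {
        probabilities.remove(codePath);
    }

    /**
     * Call at the top of an instrumented method. Throws an IOException
     * if a fault is configured for this code path and the dice say so.
     */
    void maybeFail(String codePath) throws IOException {
        Double p = probabilities.get(codePath);
        // nextDouble() is in [0.0, 1.0), so p == 1.0 always fails
        // and p == 0.0 never fails.
        if (p != null && random.nextDouble() < p) {
            throw new IOException("Injected fault at " + codePath);
        }
    }
}
```

An instrumented client method would start with something like
injector.maybeFail("DFSOutputStream.close"), and a test would then call
alwaysFail or setProbability for that label before driving a split.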

> Split doesn't handle IOExceptions when creating new region reference files
> --------------------------------------------------------------------------
>
>                 Key: HBASE-2461
>                 URL: https://issues.apache.org/jira/browse/HBASE-2461
>             Project: Hadoop HBase
>          Issue Type: Bug
>          Components: regionserver
>            Reporter: Todd Lipcon
>            Priority: Blocker
>             Fix For: 0.20.5, 0.21.0
>
>
> I was testing an HDFS patch which had a bug in it, so it happened to throw an 
> NPE during a split with the following trace:
> 2010-04-16 19:18:20,727 ERROR org.apache.hadoop.hbase.regionserver.CompactSplitThread: Compaction failed for region TestTable,-1945465867<1271449232310>,1271453785648
> java.lang.NullPointerException
>         at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.enqueueCurrentPacket(DFSClient.java:3124)
>         at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.flushInternal(DFSClient.java:3220)
>         at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.closeInternal(DFSClient.java:3306)
>         at org.apache.hadoop.hdfs.DFSClient$DFSOutputStream.close(DFSClient.java:3255)
>         at org.apache.hadoop.fs.FSDataOutputStream$PositionCache.close(FSDataOutputStream.java:61)
>         at org.apache.hadoop.fs.FSDataOutputStream.close(FSDataOutputStream.java:86)
>         at org.apache.hadoop.fs.FileSystem.createNewFile(FileSystem.java:560)
>         at org.apache.hadoop.hbase.util.FSUtils.create(FSUtils.java:95)
>         at org.apache.hadoop.hbase.io.Reference.write(Reference.java:129)
>         at org.apache.hadoop.hbase.regionserver.StoreFile.split(StoreFile.java:498)
>         at org.apache.hadoop.hbase.regionserver.HRegion.splitRegion(HRegion.java:682)
>         at org.apache.hadoop.hbase.regionserver.CompactSplitThread.split(CompactSplitThread.java:162)
>         at org.apache.hadoop.hbase.regionserver.CompactSplitThread.run(CompactSplitThread.java:95)
> After that, my region was gone, and any further writes to it would fail.


        
