[ 
https://issues.apache.org/jira/browse/HBASE-6649?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Devaraj Das updated HBASE-6649:
-------------------------------

    Attachment: 6649-fix-io-exception-handling.patch

Attaching a more complete fix (for 0.94).

[~jdcryans], could you please try this patch out in your cluster?

The more I think about it, the more I believe that the fix here is to set the 
position so that it always points to a valid location.
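
To illustrate the idea only (this is not the attached patch, and none of the 
names below come from HBase): a minimal sketch, assuming a log of 
length-prefixed records, in which the saved position is advanced only after a 
complete record has been read. The class SafePositionReader and its methods 
are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.DataInputStream;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch, not HBase code: the saved position only ever moves
// to the end of a fully read record, so it always points to a valid location.
class SafePositionReader {
    private final byte[] log;      // stands in for a log file on disk
    private long lastGoodPosition; // always a record boundary

    SafePositionReader(byte[] log) {
        this.log = log;
        this.lastGoodPosition = 0;
    }

    long getLastGoodPosition() {
        return lastGoodPosition;
    }

    // Reads every complete record (4-byte length, then payload) starting at
    // the last good position. A truncated record at the tail raises an
    // IOException, which leaves the saved position untouched.
    List<byte[]> readAvailable() {
        List<byte[]> records = new ArrayList<>();
        int start = (int) lastGoodPosition;
        DataInputStream in = new DataInputStream(
            new ByteArrayInputStream(log, start, log.length - start));
        long pos = lastGoodPosition;
        try {
            while (true) {
                int len = in.readInt();
                if (len < 0) {
                    throw new IOException("corrupt record length: " + len);
                }
                byte[] payload = new byte[len];
                in.readFully(payload);   // throws EOFException on a partial record
                records.add(payload);
                pos += 4 + len;
                lastGoodPosition = pos;  // advance only after a full record
            }
        } catch (IOException e) {
            // End of data or a partial record: stop here and keep the last
            // good position so that a later retry resumes at a boundary.
        }
        return records;
    }
}
```

With a log holding one complete 3-byte record followed by a truncated one, 
readAvailable() returns the single complete record and the saved position 
stays at 7, the end of that record.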

[~lhofhansl] I have seen data loss issues (via the unit test) without this 
patch.
                
> [0.92 UNIT TESTS] TestReplication.queueFailover occasionally fails [Part-1]
> ---------------------------------------------------------------------------
>
>                 Key: HBASE-6649
>                 URL: https://issues.apache.org/jira/browse/HBASE-6649
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Devaraj Das
>            Assignee: Devaraj Das
>             Fix For: 0.96.0, 0.92.3, 0.94.2
>
>         Attachments: 6649-0.92.patch, 6649-1.patch, 6649-2.txt, 
> 6649-fix-io-exception-handling.patch, 6649-fix-io-exception-handling.patch, 
> 6649-trunk.patch, 6649-trunk.patch, 6649.txt, HBase-0.92 #495 test - 
> queueFailover [Jenkins].html, HBase-0.92 #502 test - queueFailover 
> [Jenkins].html
>
>
> Have seen it twice in the recent past: http://bit.ly/MPCykB and 
> http://bit.ly/O79Dq7.
> A brief look at the logs hints at a pattern: in both failed test 
> instances, there was an RS crash while the test was running.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
