[ 
https://issues.apache.org/jira/browse/HBASE-6649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13458404#comment-13458404
 ] 

Lars Hofhansl commented on HBASE-6649:
--------------------------------------

I say we revert from 0.94.2 and retry in 0.94.3.

Although from DD's comment:
bq. If the second call (within the while loop) throws an exception (like 
EOFException), it basically destroys the work done up until then. Therefore, 
some rows would never be replicated.

This would be a data loss issue without the fix.

I find that a bit confusing. Since J-D eventually saw the ignored exception 
on all machines in the test cluster, it would seem that data was lost in all 
versions before 0.94.2? That seems very unlikely.
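
To make the failure mode in DD's comment concrete, here is a minimal sketch of the general pattern. It is not the HBase replication code or the attached patch: EntryReader, readLossy, readKeepingProgress and truncatedReader are hypothetical names made up for illustration, and the entries are plain strings.

```java
import java.io.EOFException;
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class BatchReadSketch {

    /** Hypothetical stand-in for a log reader; not an HBase class. */
    public interface EntryReader {
        /** Returns the next entry, or null when there are no more. */
        String next() throws IOException;
    }

    /** Lossy variant: an exception inside the loop discards the entries collected so far. */
    public static List<String> readLossy(EntryReader reader) throws IOException {
        List<String> batch = new ArrayList<>();
        String entry = reader.next();      // first call, before the loop
        while (entry != null) {
            batch.add(entry);
            entry = reader.next();         // second call: may throw mid-batch
        }
        return batch;                      // not reached if next() throws
    }

    /** Variant that stops on EOFException but keeps what was already read. */
    public static List<String> readKeepingProgress(EntryReader reader) throws IOException {
        List<String> batch = new ArrayList<>();
        String entry = reader.next();
        while (entry != null) {
            batch.add(entry);
            try {
                entry = reader.next();
            } catch (EOFException e) {
                break;                     // stop reading, keep the partial batch
            }
        }
        return batch;
    }

    /** Test helper: yields the given entries, then throws EOFException. */
    public static EntryReader truncatedReader(String... entries) {
        Iterator<String> it = Arrays.asList(entries).iterator();
        return () -> {
            if (it.hasNext()) {
                return it.next();
            }
            throw new EOFException("simulated truncated log");
        };
    }

    public static void main(String[] args) throws IOException {
        try {
            readLossy(truncatedReader("row1", "row2"));
        } catch (EOFException e) {
            System.out.println("lossy: batch discarded");
        }
        System.out.println("kept: " + readKeepingProgress(truncatedReader("row1", "row2")));
    }
}
```

In the lossy variant the caller never sees row1 and row2, which is the "destroys the work done up until then" behaviour in the quote. Whether those rows are lost for good in a real system depends on whether the caller retries from the same position, which this sketch does not model.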

> [0.92 UNIT TESTS] TestReplication.queueFailover occasionally fails [Part-1]
> ---------------------------------------------------------------------------
>
>                 Key: HBASE-6649
>                 URL: https://issues.apache.org/jira/browse/HBASE-6649
>             Project: HBase
>          Issue Type: Bug
>            Reporter: Devaraj Das
>            Assignee: Devaraj Das
>             Fix For: 0.96.0, 0.92.3, 0.94.2
>
>         Attachments: 6649-0.92.patch, 6649-1.patch, 6649-2.txt, 
> 6649-fix-io-exception-handling.patch, 6649-trunk.patch, 6649-trunk.patch, 
> 6649.txt, HBase-0.92 #495 test - queueFailover [Jenkins].html, HBase-0.92 
> #502 test - queueFailover [Jenkins].html
>
>
> Have seen it twice in the recent past: http://bit.ly/MPCykB & 
> http://bit.ly/O79Dq7 .. 
> Looking briefly at the logs hints at a pattern - in both the failed test 
> instances, there was an RS crash while the test was running.
