[
https://issues.apache.org/jira/browse/HBASE-6649?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13454189#comment-13454189
]
Jean-Daniel Cryans commented on HBASE-6649:
-------------------------------------------
What I meant is that the reader gets this 10 times:
{noformat}
java.io.EOFException:
hdfs://localhost:60044/user/hudson/hbase/.oldlogs/vesta.apache.org%2C57779%2C1345217521341.1345217601487,
entryStart=40929, pos=40960, end=40960, edit=3
{noformat}
So if I'm reading this correctly it's able to read the file and got 3 edits but
gets an EOF. Is something half written? Then it gives up on the file:
{noformat}
2012-08-17 15:33:50,099 INFO
[ReplicationExecutor-0.replicationSource,2-vesta.apache.org,57779,1345217521341]
regionserver.ReplicationSourceManager(352): Done with the recovered queue
2-vesta.apache.org,57779,1345217521341
{noformat}
And there's data loss.
> [0.92 UNIT TESTS] TestReplication.queueFailover occasionally fails [Part-1]
> ---------------------------------------------------------------------------
>
> Key: HBASE-6649
> URL: https://issues.apache.org/jira/browse/HBASE-6649
> Project: HBase
> Issue Type: Bug
> Reporter: Devaraj Das
> Assignee: Devaraj Das
> Fix For: 0.96.0, 0.92.3, 0.94.2
>
> Attachments: 6649-0.92.patch, 6649-1.patch, 6649-2.txt,
> 6649-trunk.patch, 6649-trunk.patch, 6649.txt, HBase-0.92 #495 test -
> queueFailover [Jenkins].html, HBase-0.92 #502 test - queueFailover
> [Jenkins].html
>
>
> Have seen it twice in the recent past: http://bit.ly/MPCykB &
> http://bit.ly/O79Dq7 ..
> Looking briefly at the logs hints at a pattern - in both the failed test
> instances, there was an RS crash while the test was running.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira