[
https://issues.apache.org/jira/browse/HBASE-4058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13062197#comment-13062197
]
stack commented on HBASE-4058:
------------------------------
I took a look at the logs Wayne posted. The master shows a few regionservers
losing their leases and its having trouble connecting to a particular server.
The regionserver snippet posted shows a regionserver aborting because it can't
roll its wal log. It gets an EOFE. The datanode snippet shows connection
refused trying to connect to the same server (130) that the master is trying to
contact (NN?).
Its hard to tell much from snippets posted.
> Extend TestHBaseFsck with a complete .META. recovery scenario
> -------------------------------------------------------------
>
> Key: HBASE-4058
> URL: https://issues.apache.org/jira/browse/HBASE-4058
> Project: HBase
> Issue Type: Improvement
> Reporter: Andrew Purtell
> Assignee: Andrew Purtell
> Fix For: 0.92.0
>
>
> We should have a unit test that launches a minicluster and constructs a few
> tables, then deletes META files on disk, then bounces the master, then
> recovers the result with HBCK. Perhaps it is possible to extend TestHBaseFsck
> to do this.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira