default walog copy/sort uses replication of 1
---------------------------------------------
Key: ACCUMULO-509
URL: https://issues.apache.org/jira/browse/ACCUMULO-509
Project: Accumulo
Issue Type: Bug
Components: logger
Environment: medium size cluster
Reporter: Eric Newton
Assignee: Eric Newton
Priority: Trivial
Fix For: 1.4.1
During recovery, the logger copied/sorted a recovery walog to hdfs. The copy
was ok, but there was a checksum error when replaying the data. The system did
not recover without manual intervention. The work-around was to find the
datanode serving the back block, and stop it. Then I removed the bad recovery
file and restarted the master. The copy/sort took place again, and used a
different datanode. Recovery proceeded successfully.
We need to use a higher replication and/or a more sophisticated approach to
verifying and restarting recoveries.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira