Josh Elser created ACCUMULO-3937:
------------------------------------
Summary: Hard-coded HDFS failure tolerance
Key: ACCUMULO-3937
URL: https://issues.apache.org/jira/browse/ACCUMULO-3937
Project: Accumulo
Issue Type: Bug
Components: tserver
Affects Versions: 1.7.0
Reporter: Josh Elser
Assignee: Josh Elser
Priority: Blocker
Fix For: 1.7.1, 1.8.0
ACCUMULO-2480 added an error cache to the TabletServer which makes the tserver
kill itself after 5 errors creating a new WAL file within 10 seconds.
This is painful because it now causes Accumulo to kill itself if HDFS is
restarted beneath Accumulo. Previously, I would have expected Accumulo to just
keep on chugging if HDFS goes away. Now, I'll have to restart it when HDFS
returns.
This should be a configuration property instead of being hard-coded.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)