Christopher Tubbs created ACCUMULO-4536:
-------------------------------------------

             Summary: Infinite loop creating empty WAL files when disk space is 
low
                 Key: ACCUMULO-4536
                 URL: https://issues.apache.org/jira/browse/ACCUMULO-4536
             Project: Accumulo
          Issue Type: Bug
          Components: tserver
    Affects Versions: 1.6.6
            Reporter: Christopher Tubbs
            Priority: Minor


Saw this on 1.6.6 with a small disk for testing (32GB disk). The default walog 
size is around 1GB, and only 3.4GB were left available on each data node.

The namenode reported that no data nodes had space available when trying to 
write the first block, so the tserver failed to write the file. It kept 
retrying, resulting in the namenode filling up with thousands of zero-length 
WAL files.

The fix was to lower the {{tserver.walog.max.size}} to {{100M}}. Another 
solution would be to use a larger disk.

The infinite loop problem, constantly creating new empty WAL files is still a 
problem, but it should only happen when low on disk space, which is likely 
going to cause other, more serious problems... and could be avoided with good 
system monitoring.

I have not tested on versions newer than 1.6.6, but I imagine it's still a 
problem.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to