[
https://issues.apache.org/jira/browse/FLUME-2245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13997290#comment-13997290
]
Juhani Connolly commented on FLUME-2245:
----------------------------------------
I've not been working much with flume recently so please feel free to go ahead
with this. We've been running something like what I submitted earlier but it
would be nice to get rid of that if you can fix it more cleanly.
> HDFS files with errors unable to close
> --------------------------------------
>
> Key: FLUME-2245
> URL: https://issues.apache.org/jira/browse/FLUME-2245
> Project: Flume
> Issue Type: Bug
> Reporter: Juhani Connolly
> Attachments: FLUME-2245.patch, flume.log.1133, flume.log.file
>
>
> This is running on a snapshot of Flume-1.5 with the git hash
> 99db32ccd163daf9d7685f0e8485941701e1133d
> When a datanode goes unresponsive for a significant amount of time(for
> example a big gc) an append failure will occur followed by repeated time outs
> appearing in the log, and failure to close the stream. Relevant section of
> logs attached(where it first starts appearing.
> The same log repeats periodically, consistently running into a
> TimeoutException.
> Restarting flume(or presumably just the HDFSSink) solves the issue.
> Probable cause in comments
--
This message was sent by Atlassian JIRA
(v6.2#6252)