escapedFormatDfs goes into a file creation frenzy
-------------------------------------------------

                 Key: FLUME-734
                 URL: https://issues.apache.org/jira/browse/FLUME-734
             Project: Flume
          Issue Type: Bug
          Components: Sinks+Sources
    Affects Versions: v0.9.4
         Environment: CentOS 5.6
            Reporter: Eran Kutner
            Priority: Critical
         Attachments: flume.log

Using this configuration:
collectorSource(54001) | collector(600000) { 
escapedFormatDfs("hdfs://hadoop1-m1:8020/raw-events/%Y-%m-%d/", 
"events-%{rolltag}-col1.snappy", seqfile("SnappyCodec")) }

The expected behavior is to see a new file created every 10 minutes. However, 
once in a while the collector would go into a file creation frenzy, creating 
new files every second.
The log indicates that writing has failed with error: "OutputFormat instance 
can only write to the same OutputStream" causing the file to be closed a new 
one to be opened just to be closed again.

Looking at the code I'm not even sure how the output stream could change but 
the behavior I'm seeing feels like some sort of a race condition. It is 
happening much more under heavy load than under low load.

See attached log excerpt.


--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to