Jeff Lord created FLUME-2128:
--------------------------------
Summary: HDFS Sink rollSize is calculated based off of
uncompressed size of cumulative events.
Key: FLUME-2128
URL: https://issues.apache.org/jira/browse/FLUME-2128
Project: Flume
Issue Type: Bug
Components: Sinks+Sources
Affects Versions: v1.3.1, v1.4.0
Reporter: Jeff Lord
The HDFS sink compares the rollSize parameter against the cumulative
uncompressed size of the events written to a file.
The net effect is that if you are using compression and expect your files on
HDFS to be rolled once they reach the value set for rollSize, then your files
will be much smaller than that, because the roll is triggered by the
uncompressed byte count while the file on HDFS holds the compressed data.
When compression is enabled, the sink should take it into account and roll
based on the compressed size of the file on HDFS.
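
To illustrate the effect described above, here is a minimal sketch in Python.
It is not Flume code: roll_points, the sample event, and the 10,000-byte
threshold are hypothetical, and gzip stands in for whatever codec the sink is
configured with. It only mimics a writer that rolls when a counter of
uncompressed event bytes reaches the roll size, then records how large each
rolled file actually is once compressed.

```python
import gzip
import io


def roll_points(events, roll_size):
    """Simulate a writer that rolls on the uncompressed byte count.

    Returns one (uncompressed_bytes, compressed_bytes) pair per rolled file.
    This is an illustration only; it does not call any Flume API.
    """
    files = []
    buf = io.BytesIO()
    gz = gzip.GzipFile(fileobj=buf, mode="wb")
    counter = 0  # cumulative uncompressed size of events in the current file

    for event in events:
        gz.write(event)
        counter += len(event)
        # The roll decision looks at the uncompressed counter only.
        if counter >= roll_size:
            gz.close()  # flush the compressed stream before measuring it
            files.append((counter, len(buf.getvalue())))
            buf = io.BytesIO()
            gz = gzip.GzipFile(fileobj=buf, mode="wb")
            counter = 0

    gz.close()  # discard the trailing, not-yet-rolled file
    return files


# Hypothetical, highly repetitive log events.
events = [b"2013-07-17 INFO request served in 12 ms\n"] * 1000
rolled = roll_points(events, roll_size=10_000)

for uncompressed, compressed in rolled:
    print(f"rolled at {uncompressed} uncompressed bytes, "
          f"{compressed} bytes on disk")
```

Every file is rolled as soon as the uncompressed counter reaches the
threshold, so the size on disk ends up well below the configured value.
Rolling on the compressed size would instead require checking the number of
bytes actually written to the output stream.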
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira