Joey Echeverria created FLUME-1239:
--------------------------------------
Summary: Flume uses codec file extensions for compressed sequence
files
Key: FLUME-1239
URL: https://issues.apache.org/jira/browse/FLUME-1239
Project: Flume
Issue Type: Bug
Affects Versions: v1.1.0
Reporter: Joey Echeverria
The convention in hadoop is for a file to end in a codec's default extension if
and only if it's a plain file that's fully compressed. When you configure the
HDFSSink to output compressed sequence files, you end up with files that have
that codec's default extension. This causes problems when using some Hadoop
utilities (e.g. hadoop fs -text) as they assume that the file is not in a
container format. It would be nice if all sequence files ended in a common
extension (e.g. .seq).
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira