[
https://issues.apache.org/jira/browse/FLUME-2321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13925505#comment-13925505
]
Gopinathan A commented on FLUME-2321:
-------------------------------------
[~guillermo.of] please reopen this issue; this JIRA should be in the PATCH
AVAILABLE state.
Also, create a review request at https://reviews.apache.org.
> New HDFSTextWritableSerializer for HDFS Sink
> --------------------------------------------
>
> Key: FLUME-2321
> URL: https://issues.apache.org/jira/browse/FLUME-2321
> Project: Flume
> Issue Type: New Feature
> Components: Configuration, Sinks+Sources
> Affects Versions: v1.4.0
> Environment: Centos 6.4, Java6_u34
> Reporter: Guillermo Ortiz Fernández, Pragsis.
> Priority: Minor
> Attachments: FLUME-2321-0.patch
>
>
> I have added a new way to write SequenceFiles. So far, Flume has two
> different write formats for HDFS (Text and Writable), selected in the
> configuration file like this:
> colector.sinks.SinkWritable.hdfs.writeFormat = Writable.
> I have created a new type, TextWritable, which generates a SequenceFile
> with Text as the key and BytesWritable as the value.
> The key format is configured in flume.conf, for example:
> colector.sinks.SinkWritable.hdfs.writeFormat = TextWritable.
> colector.sinks.SinkWritable.hdfs.textWritable.keyFormat = %{file}_%{host}_hello.
> When Flume generates the SequenceFile with this setting, the key is built
> from the file name and the host name; these variables are fully
> configurable. If you don't define the keyFormat property, the key will be
> the timestamp.
> The code is done. I plan to improve the Javadoc a little, update the
> documentation today, and upload the patch.
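
For illustration only, the key format described above could be expanded roughly as in the sketch below. This is not the code in FLUME-2321-0.patch: the class name KeyFormatSketch and the method buildKey are hypothetical, it assumes that %{name} refers to an event header called "name", and it assumes that a missing header expands to an empty string. The real patch may behave differently on all of these points.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch; names and behavior are assumptions, not the patch itself.
public class KeyFormatSketch {

    // Matches placeholders of the form %{headerName}.
    private static final Pattern HEADER = Pattern.compile("%\\{([^}]+)\\}");

    // Builds the SequenceFile key string from a key format and event headers.
    // Falls back to the timestamp when no key format is configured.
    public static String buildKey(String keyFormat,
                                  Map<String, String> headers,
                                  long timestampMillis) {
        if (keyFormat == null || keyFormat.isEmpty()) {
            return Long.toString(timestampMillis);
        }
        Matcher m = HEADER.matcher(keyFormat);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            String value = headers.get(m.group(1));
            // Assumption: a missing header expands to an empty string.
            m.appendReplacement(out,
                    Matcher.quoteReplacement(value == null ? "" : value));
        }
        m.appendTail(out);
        return out.toString();
    }

    public static void main(String[] args) {
        Map<String, String> headers = new HashMap<>();
        headers.put("file", "access.log");
        headers.put("host", "node1");

        // Key format from the example configuration above.
        System.out.println(buildKey("%{file}_%{host}_hello", headers, 0L));

        // No key format configured: the timestamp is used as the key.
        System.out.println(buildKey(null, headers, 1394380800000L));
    }
}
```

In a serializer, the resulting string would presumably be wrapped in a Hadoop Text object and paired with the event body as a BytesWritable value; that part is left out here so the sketch runs without Hadoop on the classpath.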
--
This message was sent by Atlassian JIRA
(v6.2#6252)