[
https://issues.apache.org/jira/browse/FLUME-2321?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13903020#comment-13903020
]
Pragsis, Guillermo Ortiz commented on FLUME-2321:
-------------------------------------------------
I have never contributed in JIRA before; I hope I have followed the steps correctly.
> New HDFSTextWritableSerializer for HDFS Sink
> --------------------------------------------
>
> Key: FLUME-2321
> URL: https://issues.apache.org/jira/browse/FLUME-2321
> Project: Flume
> Issue Type: New Feature
> Components: Configuration, Sinks+Sources
> Affects Versions: v1.4.0
> Environment: CentOS 6.4, Java6_u34
> Reporter: Pragsis, Guillermo Ortiz
> Priority: Minor
> Attachments: FLUME-2321-0.patch
>
>
> This adds a new way to write SequenceFiles. So far, Flume has two write
> formats for HDFS (Text and Writable), selected in the configuration file
> like this:
> colector.sinks.SinkWritable.hdfs.writeFormat = Writable
> I have created a new type, TextWritable, which generates a SequenceFile
> with Text as the key and BytesWritable as the value.
> The key is configured in flume.conf, for example:
> colector.sinks.SinkWritable.hdfs.writeFormat = TextWritable
> colector.sinks.SinkWritable.hdfs.textWritable.keyFormat = %{file}_%{host}_hello
> With this configuration, when Flume generates the SequenceFile, the key is
> built from the file name and the host name, followed by the literal suffix
> _hello; these variables are fully configurable. If you do not define the
> keyFormat property, the key will be the timestamp.
> The code is done. I plan to improve the Javadoc a little, update the
> documentation today, and upload the patch.
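
To illustrate the key behaviour described above, here is a minimal sketch of how a keyFormat such as %{file}_%{host}_hello could be expanded from event headers, falling back to a timestamp when no format is set. It is not the code from FLUME-2321-0.patch: the class name KeyFormatSketch, the method formatKey, and the choice to replace a missing header with an empty string are all assumptions made for illustration. The description also does not say which timestamp is used, so the caller passes one in.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of Flume or of the attached patch.
final class KeyFormatSketch {
    // Matches placeholders of the form %{headerName}.
    private static final Pattern PLACEHOLDER = Pattern.compile("%\\{([^}]+)\\}");

    /**
     * Expands %{name} placeholders using the given event headers.
     * If keyFormat is null or empty, the timestamp is used as the key.
     */
    static String formatKey(String keyFormat, Map<String, String> headers, long timestampMillis) {
        if (keyFormat == null || keyFormat.isEmpty()) {
            return Long.toString(timestampMillis);
        }
        Matcher m = PLACEHOLDER.matcher(keyFormat);
        StringBuffer out = new StringBuffer();
        while (m.find()) {
            String value = headers.get(m.group(1));
            // Assumption: a header that is not present expands to an empty string.
            m.appendReplacement(out, Matcher.quoteReplacement(value == null ? "" : value));
        }
        m.appendTail(out);
        return out.toString();
    }

    public static void main(String[] args) {
        Map<String, String> headers = new HashMap<>();
        headers.put("file", "access.log");
        headers.put("host", "node1");
        // Prints: access.log_node1_hello
        System.out.println(formatKey("%{file}_%{host}_hello", headers, System.currentTimeMillis()));
    }
}
```

In a real serializer the resulting string would be wrapped in a Text key and the event body in a BytesWritable value before being appended to the SequenceFile.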
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)