[ https://issues.apache.org/jira/browse/STORM-1971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15390106#comment-15390106 ]
Aaron Dossett commented on STORM-1971:
--------------------------------------
I did not create that approach (although I did carry it over in a rewrite), but
one advantage that comes to mind is that data will potentially appear in HDFS
sooner. Also, not all implementations of AbstractHdfsBolt may be able to
support the "write one big message" approach. A text file bolt certainly
could, but what about Sequence File or Avro? I don't know off the top of my
head.
If you pursue that and are able to contribute a patch, I will definitely be
available for review.
> HDFS Timed Synchronous Policy
> -----------------------------
>
> Key: STORM-1971
> URL: https://issues.apache.org/jira/browse/STORM-1971
> Project: Apache Storm
> Issue Type: Bug
> Components: storm-hdfs
> Affects Versions: 0.10.0, 1.0.0
> Reporter: darion yaphet
> Assignee: darion yaphet
>
> When the volume of data to be written to HDFS is not very large, we
> need a timed synchronous policy to flush cached data into HDFS periodically.
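
For illustration only, a time-based policy along the lines requested above could look roughly like the sketch below. It is not taken from any patch on this issue. The class name TimedSyncPolicy, its constructors, and the injectable clock are hypothetical; the mark/reset pair is only modeled loosely on the shape of the storm-hdfs SyncPolicy interface, and the tuple is typed as Object so the sketch compiles without Storm on the classpath.

```java
import java.util.function.LongSupplier;

/**
 * Hypothetical time-based sync policy (illustration only, not Storm code).
 * mark() reports that a sync is due once at least intervalMillis have
 * elapsed since the last reset().
 */
class TimedSyncPolicy {
    private final long intervalMillis;
    private final LongSupplier clock;   // injectable so the policy can be tested
    private long lastSyncMillis;

    TimedSyncPolicy(long intervalMillis, LongSupplier clock) {
        if (intervalMillis <= 0) {
            throw new IllegalArgumentException("intervalMillis must be positive");
        }
        this.intervalMillis = intervalMillis;
        this.clock = clock;
        this.lastSyncMillis = clock.getAsLong();
    }

    /** Convenience constructor that uses the system wall clock. */
    TimedSyncPolicy(long intervalMillis) {
        this(intervalMillis, System::currentTimeMillis);
    }

    /**
     * Called once per written tuple. The tuple and offset are ignored here;
     * only the elapsed time decides whether a sync is due.
     */
    boolean mark(Object tuple, long offset) {
        return clock.getAsLong() - lastSyncMillis >= intervalMillis;
    }

    /** Called after a sync has been performed; restarts the timer. */
    void reset() {
        lastSyncMillis = clock.getAsLong();
    }
}
```

Note that a policy of this shape is only consulted when a tuple arrives, so on a completely idle stream nothing would trigger the flush. Some periodic trigger (for example Storm tick tuples) would presumably still be needed to cover that case.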
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)