[ 
https://issues.apache.org/jira/browse/STORM-1971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15390106#comment-15390106
 ] 

Aaron Dossett commented on STORM-1971:
--------------------------------------

I did not create that approach (although I did carry it over in a re-write) but 
one advantage that comes to mind is that data will potentially appear in HDFS 
sooner.  Also, not all implementations of AbstractHdfsBolt may be able to 
support the "write one big message" approach.  A text file bolt certainly 
could, but what about Sequence File or Avro?  I don't know off the top of my 
head.

If you pursue that and are able to contribute a patch, I will definitely be 
available for review.

> HDFS Timed Synchronous Policy
> -----------------------------
>
>                 Key: STORM-1971
>                 URL: https://issues.apache.org/jira/browse/STORM-1971
>             Project: Apache Storm
>          Issue Type: Bug
>          Components: storm-hdfs
>    Affects Versions: 0.10.0, 1.0.0
>            Reporter: darion yaphet
>            Assignee: darion yaphet
>
> When the data need to be wrote to HDFS is not very large in quantity . We 
> need a timed synchronous policy to flush cached date into HDFS periodically.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to