[ 
https://issues.apache.org/jira/browse/STORM-960?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14638754#comment-14638754
 ] 

ASF GitHub Bot commented on STORM-960:
--------------------------------------

GitHub user dossett opened a pull request:

    https://github.com/apache/storm/pull/653

    STORM-960 HiveBolt should ack tuples only after flushing

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/dossett/storm HiveBoltAck

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/storm/pull/653.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #653
    
----
commit aac496ade94210888a0ca473df648855c408496b
Author: Sriharsha Chintalapani <[email protected]>
Date:   2015-07-21T22:42:22Z

    STORM-951. Storm Hive connector leaking connections.

commit 6a5d0dcb44b58063035bfdaac4ebddba401dc914
Author: Aaron Dossett <[email protected]>
Date:   2015-07-23T12:50:25Z

    STORM 960: HiveBolt should only ack after succesful flush

----


> Hive-Bolt can lose tuples when flushing data
> --------------------------------------------
>
>                 Key: STORM-960
>                 URL: https://issues.apache.org/jira/browse/STORM-960
>             Project: Apache Storm
>          Issue Type: Improvement
>          Components: external
>            Reporter: Aaron Dossett
>            Priority: Minor
>
> In HiveBolt's execute method tuples are ack'd as they are received.  When a 
> batchsize of tuples has been received, the writers are flushed.  However, if 
> the flush fails only the most recent tuple will be marked as failed.  All 
> prior tuples will already have been ack'd.  This creates a window for data 
> loss.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to