[ 
https://issues.apache.org/jira/browse/NIFI-2631?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15432727#comment-15432727
 ] 

ASF GitHub Bot commented on NIFI-2631:
--------------------------------------

GitHub user gresockj opened a pull request:

    https://github.com/apache/nifi/pull/917

    NIFI-2631: Adding 'Commit Mode' and 'Use Versions' to ListS3

    

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/gresockj/nifi NIFI-2631-master

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/nifi/pull/917.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #917
    
----
commit 02e94ed885f86d67e02eb9b4e783ac4ef56397e4
Author: Joe Gresock <[email protected]>
Date:   2016-08-23T12:46:12Z

    NIFI-2631: Adding 'Commit Mode' and 'Use Versions' to ListS3

----


> ListS3 improvements: "Use versions" and "Commit mode"
> -----------------------------------------------------
>
>                 Key: NIFI-2631
>                 URL: https://issues.apache.org/jira/browse/NIFI-2631
>             Project: Apache NiFi
>          Issue Type: Improvement
>    Affects Versions: 0.7.0
>            Reporter: Joseph Gresock
>            Assignee: Joseph Gresock
>            Priority: Minor
>             Fix For: 1.0.0, 0.8.0
>
>
> Our team needs to be able to list individual versions in S3.  We also ran 
> into a use case where a bucket with many objects (over 1 million in our case) 
> seemed to cause ListS3 to run forever.  The S3 list command finished in a few 
> minutes, but we believe it was taking a very long time for NiFi to commit all 
> the flow files at once.
> To handle this use case, we added a Commit Mode property to ListS3 that 
> allows you to specify whether to commit "Per page" or "Once".  This has 
> proven to emit the flow files correctly as the S3 paging progresses.
> We also implemented support for S3 List Versions, which includes the 
> "s3.version" and "s3.isLatest" attributes if applicable.  The "s3.version" 
> attribute can in turn be used in the FetchS3 processor.
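
The difference between the two commit modes, and the attributes attached
to each listed version, can be illustrated with a minimal sketch. This is
not the code from the pull request: it is a plain Python simulation, and
the names ObjectVersion, list_pages, to_attributes and list_s3, the
"filename" attribute, and the "true"/"false" formatting of "s3.isLatest"
are assumptions made for illustration. Only the "Per page" and "Once"
mode names and the "s3.version" and "s3.isLatest" attribute names come
from the description above.

```python
from dataclasses import dataclass
from typing import Dict, Iterator, List


@dataclass
class ObjectVersion:
    """Hypothetical stand-in for one entry of an S3 version listing."""
    key: str
    version_id: str
    is_latest: bool


def list_pages(versions: List[ObjectVersion],
               page_size: int) -> Iterator[List[ObjectVersion]]:
    """Simulate a paged listing by yielding one page of results at a time."""
    for start in range(0, len(versions), page_size):
        yield versions[start:start + page_size]


def to_attributes(version: ObjectVersion) -> Dict[str, str]:
    """Build the flow file attributes for one listed version.

    The "filename" key and the lowercase boolean string are assumptions.
    """
    return {
        "filename": version.key,
        "s3.version": version.version_id,
        "s3.isLatest": str(version.is_latest).lower(),
    }


def list_s3(versions: List[ObjectVersion], page_size: int,
            commit_mode: str) -> List[List[Dict[str, str]]]:
    """Return the batches of flow files in the order they are committed."""
    if commit_mode not in ("Per page", "Once"):
        raise ValueError(f"unknown commit mode: {commit_mode}")

    commits: List[List[Dict[str, str]]] = []
    pending: List[Dict[str, str]] = []
    for page in list_pages(versions, page_size):
        pending.extend(to_attributes(v) for v in page)
        if commit_mode == "Per page":
            # Commit as soon as a page is processed, so downstream
            # processors see flow files while the listing continues.
            commits.append(pending)
            pending = []
    if pending:
        # "Once": everything accumulated so far goes out in one commit.
        commits.append(pending)
    return commits


if __name__ == "__main__":
    listing = [
        ObjectVersion("a.txt", "v2", True),
        ObjectVersion("a.txt", "v1", False),
        ObjectVersion("b.txt", "v1", True),
    ]
    for mode in ("Per page", "Once"):
        batches = list_s3(listing, page_size=2, commit_mode=mode)
        print(mode, [len(batch) for batch in batches])
```

With "Per page", the size of each commit is bounded by the page size,
which is the behaviour the description credits with avoiding one very
large commit on a bucket of over 1 million objects. With "Once", the
whole listing is held until paging has finished.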



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
