[jira] [Commented] (NIFI-6762) New Processor - Watch Multiple S3 Buckets for content changes

Aram S Openden (Jira) Fri, 11 Oct 2019 09:32:17 -0700


    [ 
https://issues.apache.org/jira/browse/NIFI-6762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16949599#comment-16949599
 ]


Aram S Openden commented on NIFI-6762:
--------------------------------------

{quote}How do you plan to watch? Scan?
{quote}
Is "Watch" the wrong word perhaps?  I was using the word Watch at the 
suggestion of [~joewitt], as per his dev mailing list reply on 9/17/2019:
{quote}The purpose of the List* processors is 'once told to look at a thing' it
should generate a listing of what it is there AND it should continue to
look at the thing to see what changes/shows up later.  Arguably we should
have called that Watch*.

If your use case is 'once told to look at a thing' it should generate a
listing of what is there at that time and not worry about later updates
happening because you can always redo a listing again later.  The
difference there is subtle but important.  And for this case/desire the
name 'List*' makes a lot of sense and may be why we've seen folks want it
to work this way rather than how it does.
{quote}
 

Perhaps I should explain the workflow in a bit more.

The workflow is similar to the existing ListS3 Processor, with the exception 
that it also fires its {{on_trigger()}} behavior from an incoming FlowFile 
event that tells it to looks for a bucket that matches the input expression in 
the "Bucket" input attribute.

So, should we name this component something different. Very much *open to 
feedback* on that question.  

> New Processor - Watch Multiple S3 Buckets for content changes
> -------------------------------------------------------------
>
>                 Key: NIFI-6762
>                 URL: https://issues.apache.org/jira/browse/NIFI-6762
>             Project: Apache NiFi
>          Issue Type: New Feature
>          Components: Extensions
>    Affects Versions: 1.9.2
>            Reporter: Aram S Openden
>            Priority: Major
>              Labels: Processor, S3, watch
>   Original Estimate: 384h
>  Remaining Estimate: 384h
>
> Proposing New S3 processor that is a "variation" on the existing ListS3 
> Processor. For lack of better term, I will call this "Watch Multiple S3 
> Buckets Processor" (naming here is open to feedback). This new Processor 
> takes an input attribute "Bucket" from the FlowFile, which could use NiFi 
> ExpressionLanguage  to compute that value. Hence, the Processor could "watch" 
> different buckets in an S3 Object Store.
> This processor must handle the State Management logic for multiple S3 
> buckets. The simplest solution for this problem add to modify the State 
> Management logic to prepend the bucket name to this key/objectName that is 
> being used to track state.
> Finally, the Dev should document the maximum number of buckets it will 
> monitor at once, maximum amount of state/storage it can have to track the 
> various listings in various buckets.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Commented] (NIFI-6762) New Processor - Watch Multiple S3 Buckets for content changes

Reply via email to