[ 
https://issues.apache.org/jira/browse/NIFI-3497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15880699#comment-15880699
 ] 

ASF GitHub Bot commented on NIFI-3497:
--------------------------------------

GitHub user joetrite opened a pull request:

    https://github.com/apache/nifi/pull/1536

    NIFI-3497 ScanAttribute Tagging

    Adding tagging to ScanAttribute.  

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/joetrite/nifi master

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/nifi/pull/1536.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #1536
    
----
commit f52e1f2a064b31f87d4165af6075716fa7d55046
Author: Joe Trite <[email protected]>
Date:   2017-02-22T21:36:08Z

    NIFI-3497 - Added metadata option
    
    Added optional to post additional metadata as new attributed if a match is 
found in the dictionary.

commit 8eb54a50193897cf564eb7d222aae35481168af4
Author: Joe Trite <[email protected]>
Date:   2017-02-22T21:46:13Z

    NIFI-3497 test cases for metadata updates
    
    Adding test cases to support metadata option update.

commit a7a7b6ace80380416c342809ce95a4f4087bb066
Author: Joe Trite <[email protected]>
Date:   2017-02-22T21:48:10Z

    NIFI-3497 - New dictionary files for test
    
    Adding new dictionary files to support metadata dictionary option.

commit d71426037b142da8ca04dae38952c164d1614806
Author: Joe Trite <[email protected]>
Date:   2017-02-23T15:19:01Z

    NIFI-3497 - excluding test files
    
    Adding new test data files to exclude list.

----


> ScanAttribute should support tagging a flowfile with metadata value from the 
> supplied dictionary
> ------------------------------------------------------------------------------------------------
>
>                 Key: NIFI-3497
>                 URL: https://issues.apache.org/jira/browse/NIFI-3497
>             Project: Apache NiFi
>          Issue Type: Improvement
>            Reporter: Joseph Witt
>
> Today ScanAttribute just looks through the supplied dictionary and given 
> object for a string matching hit.  If it hits then it is a match otherwise it 
> is a 'not found'.  However, when a hit occurs it can often be quite useful to 
> gather additional metadata about that hit.  This makes cases like 
> enrichment/tagging much easier.
> So, plan is to have ScanAttribute support a dictionary value demarcator which 
> would separate the dictionary term from some string response that will be 
> added to the flowfile.  For instance a dictionary might have
> apples:These are red or green
> bananas:These are yellow unless you should toss them or make bread
> Then if a hit occurs on 'apples' the flowfile that contained such an 
> attribute would have a new attribute such as 'dictionary.hit.term' = 'apple' 
> and 'dictionary.hit.metadata' = 'These are red or green'.
> This means downstream processors could extract that metadata and do 
> interesting things with it.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to