[
https://issues.apache.org/jira/browse/NIFI-3497?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15880699#comment-15880699
]
ASF GitHub Bot commented on NIFI-3497:
--------------------------------------
GitHub user joetrite opened a pull request:
https://github.com/apache/nifi/pull/1536
NIFI-3497 ScanAttribute Tagging
Adding tagging to ScanAttribute.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/joetrite/nifi master
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/nifi/pull/1536.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #1536
----
commit f52e1f2a064b31f87d4165af6075716fa7d55046
Author: Joe Trite <[email protected]>
Date: 2017-02-22T21:36:08Z
NIFI-3497 - Added metadata option
Added optional to post additional metadata as new attributed if a match is
found in the dictionary.
commit 8eb54a50193897cf564eb7d222aae35481168af4
Author: Joe Trite <[email protected]>
Date: 2017-02-22T21:46:13Z
NIFI-3497 test cases for metadata updates
Adding test cases to support metadata option update.
commit a7a7b6ace80380416c342809ce95a4f4087bb066
Author: Joe Trite <[email protected]>
Date: 2017-02-22T21:48:10Z
NIFI-3497 - New dictionary files for test
Adding new dictionary files to support metadata dictionary option.
commit d71426037b142da8ca04dae38952c164d1614806
Author: Joe Trite <[email protected]>
Date: 2017-02-23T15:19:01Z
NIFI-3497 - excluding test files
Adding new test data files to exclude list.
----
> ScanAttribute should support tagging a flowfile with metadata value from the
> supplied dictionary
> ------------------------------------------------------------------------------------------------
>
> Key: NIFI-3497
> URL: https://issues.apache.org/jira/browse/NIFI-3497
> Project: Apache NiFi
> Issue Type: Improvement
> Reporter: Joseph Witt
>
> Today ScanAttribute just looks through the supplied dictionary and given
> object for a string matching hit. If it hits then it is a match otherwise it
> is a 'not found'. However, when a hit occurs it can often be quite useful to
> gather additional metadata about that hit. This makes cases like
> enrichment/tagging much easier.
> So, plan is to have ScanAttribute support a dictionary value demarcator which
> would separate the dictionary term from some string response that will be
> added to the flowfile. For instance a dictionary might have
> apples:These are red or green
> bananas:These are yellow unless you should toss them or make bread
> Then if a hit occurs on 'apples' the flowfile that contained such an
> attribute would have a new attribute such as 'dictionary.hit.term' = 'apple'
> and 'dictionary.hit.metadata' = 'These are red or green'.
> This means downstream processors could extract that metadata and do
> interesting things with it.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)