[ 
https://issues.apache.org/jira/browse/MINIFI-244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15936544#comment-15936544
 ] 

Joseph Witt commented on MINIFI-244:
------------------------------------

[~achristianson] could you describe how this might be used in a flow?  I don't 
think the purpose of the processor is clear, though it might just be the name 
that is throwing me off.

If the notion is to extract data or metadata from an archive-formatted data 
object (such as tar, zip, 7z, rar, etc.), then we'd want to call the processor 
ExtractArchive, and it would perhaps have a mode indicating whether it is 
content or metadata that is of interest, some way to query what the 
definition/range of the lens is, etc.  The lens notion is interesting, of 
course, but it doesn't seem like that is the part that would be interesting to 
an end user so much as it is interesting to the developer of the processor.

If I'm way off base in my understanding, can you please help reel me back in 
with an end-to-end example of how it might be used in a flow?

> Create ArchiveLens processor
> ----------------------------
>
>                 Key: MINIFI-244
>                 URL: https://issues.apache.org/jira/browse/MINIFI-244
>             Project: Apache NiFi MiNiFi
>          Issue Type: Task
>          Components: C++, Extensions
>            Reporter: Andrew Christianson
>            Assignee: Andrew Christianson
>            Priority: Minor
>
> Create an ArchiveLens processor. A concise, though informal, definition of a 
> lens is as follows:
> "Essentially, they represent the act of “peering into” or “focusing in on” 
> some particular piece/path of a complex data object such that you can more 
> precisely target particular operations without losing the context or 
> structure of the overall data you’re working with." 
> https://medium.com/@dtipson/functional-lenses-d1aba9e52254#.hdgsvbraq
> Why an ArchiveLens in MiNiFi? Simply put, it will enable us to "focus in on" 
> an entry in the archive, perform processing *in-context* of that entry, then 
> re-focus on the overall archive. This allows for transformation or other 
> processing of an entry in the archive without losing the overall context of 
> the archive.
> Initial format support is tar, due to its simplicity and ubiquity.



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
