[ https://issues.apache.org/jira/browse/MINIFI-244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15936544#comment-15936544 ]
Joseph Witt commented on MINIFI-244:
------------------------------------

[~achristianson] could you describe how this might be used in a flow? I think perhaps it isn't clear what the purpose of the processor is meant to be, but it might just be the name that is throwing me off. If the notion is to extract data or metadata from an archive-formatted data object (such as tar, zip, 7z, rar, etc.) then we'd want to call the processor ExtractArchive, and it would perhaps have a mode indicating whether it is content or metadata that is of interest, some way to query what the definition/range of the lens is, etc. The lens notion is interesting of course, but it doesn't seem like that is the part that would be interesting to an end user so much as it is interesting to the developer of the processor.

If I'm way off base on understanding, can you please help reel me back in with an end-to-end example of how it might be used in a flow?

> Create ArchiveLens processor
> ----------------------------
>
>                 Key: MINIFI-244
>                 URL: https://issues.apache.org/jira/browse/MINIFI-244
>             Project: Apache NiFi MiNiFi
>          Issue Type: Task
>          Components: C++, Extensions
>            Reporter: Andrew Christianson
>            Assignee: Andrew Christianson
>            Priority: Minor
>
> Create an ArchiveLens processor. A concise, though informal, definition of a
> lens is as follows:
> "Essentially, they represent the act of “peering into” or “focusing in on”
> some particular piece/path of a complex data object such that you can more
> precisely target particular operations without losing the context or
> structure of the overall data you’re working with."
> https://medium.com/@dtipson/functional-lenses-d1aba9e52254#.hdgsvbraq
> Why an ArchiveLens in MiNiFi? Simply put, it will enable us to "focus in on"
> an entry in the archive, perform processing *in-context* of that entry, then
> re-focus on the overall archive.
> This allows for transformation or other processing of an entry in the
> archive without losing the overall context of the archive.
> Initial format support is tar, due to its simplicity and ubiquity.

--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
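
As a rough illustration of the "focus in on an entry, process it, then re-focus on the overall archive" idea in the issue description, here is a minimal sketch. It assumes Python's standard tarfile module rather than the MiNiFi C++ API, and the names over_entry, make_tar and read_tar are hypothetical helpers made up for this example; they are not part of MiNiFi or of the proposed processor.

```python
import io
import tarfile


def over_entry(archive: bytes, entry_name: str, transform) -> bytes:
    """Apply `transform` to one entry's bytes and return a new tar archive.

    Every other member is copied through unchanged, so the overall
    structure (the "context") of the archive is preserved.
    """
    out = io.BytesIO()
    with tarfile.open(fileobj=io.BytesIO(archive), mode="r:") as src, \
            tarfile.open(fileobj=out, mode="w:") as dst:
        for member in src:
            data = src.extractfile(member) if member.isfile() else None
            if data is not None and member.name == entry_name:
                # "Focus": operate only on the selected entry's content.
                payload = transform(data.read())
                member.size = len(payload)
                data = io.BytesIO(payload)
            # "Re-focus": write the member back into the whole archive.
            dst.addfile(member, data)
    return out.getvalue()


def make_tar(files: dict) -> bytes:
    """Hypothetical helper: build an in-memory tar from {name: bytes}."""
    out = io.BytesIO()
    with tarfile.open(fileobj=out, mode="w:") as tar:
        for name, payload in files.items():
            info = tarfile.TarInfo(name)
            info.size = len(payload)
            tar.addfile(info, io.BytesIO(payload))
    return out.getvalue()


def read_tar(archive: bytes) -> dict:
    """Hypothetical helper: read an in-memory tar back into {name: bytes}."""
    with tarfile.open(fileobj=io.BytesIO(archive), mode="r:") as tar:
        return {m.name: tar.extractfile(m).read() for m in tar if m.isfile()}


if __name__ == "__main__":
    original = make_tar({"a.txt": b"hello", "b.txt": b"world"})
    updated = over_entry(original, "a.txt", bytes.upper)
    print(read_tar(updated))
```

In a flow, the transform step would presumably be whatever processing sits between the "focus" and "re-focus" stages; here it is reduced to a single function call to keep the sketch short.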