Dear NiFi community,

I often face the use case where I import flow files whose content is on the order of 1 GB to 10 GB, already compressed. Let's say I need to branch off of a flow: the actual flow file should be processed further, and on a side branch I just want to do some logging or similar, without accessing the flow file's contents. In that situation it is clearly wasteful to duplicate the flow file including its content.

For this case I wrote a processor that defines two relationships, "original" and "attributes only", so that the flow file attributes can be accessed separately from the content. I will gladly prepare a PR if anyone finds that worth incorporating into NiFi.

The only remaining question for me is whether to implement this as an individual processor or to add it to an existing one, e.g. the DuplicateFlowFile processor. The former seems cleaner to me. The proposed name would be something like ForkProcessor (no better idea yet).
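To make the two-relationship idea concrete, here is a minimal sketch in plain Java. It deliberately does not use the NiFi API and is not the actual processor code: the names ForkSketch, FlowFile, ForkResult and fork are hypothetical stand-ins, and an empty byte array stands in for "no content".

```java
import java.util.HashMap;
import java.util.Map;

public class ForkSketch {

    /** Hypothetical stand-in for a flow file: attributes plus a content payload. */
    static final class FlowFile {
        final Map<String, String> attributes;
        final byte[] content;

        FlowFile(Map<String, String> attributes, byte[] content) {
            this.attributes = attributes;
            this.content = content;
        }
    }

    /** One flow file per proposed relationship. */
    static final class ForkResult {
        final FlowFile original;        // would go to "original"
        final FlowFile attributesOnly;  // would go to "attributes only"

        ForkResult(FlowFile original, FlowFile attributesOnly) {
            this.original = original;
            this.attributesOnly = attributesOnly;
        }
    }

    /** Passes the input through untouched and emits a content-free sibling. */
    static ForkResult fork(FlowFile in) {
        // Only the attribute map is copied; the content bytes are never read or copied.
        FlowFile attributesOnly = new FlowFile(new HashMap<>(in.attributes), new byte[0]);
        return new ForkResult(in, attributesOnly);
    }

    public static void main(String[] args) {
        Map<String, String> attrs = new HashMap<>();
        attrs.put("filename", "dump.tar.gz");
        FlowFile in = new FlowFile(attrs, new byte[1024]);

        ForkResult r = fork(in);
        System.out.println("original bytes: " + r.original.content.length);
        System.out.println("attributes-only bytes: " + r.attributesOnly.content.length);
        System.out.println("attributes-only filename: " + r.attributesOnly.attributes.get("filename"));
    }
}
```

The side branch only ever sees the attributes-only flow file, so logging there never touches the large payload.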
Thanks in advance!

Best,
Lars