[ 
https://issues.apache.org/jira/browse/NIFI-2632?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15433539#comment-15433539
 ] 

ASF GitHub Bot commented on NIFI-2632:
--------------------------------------

Github user YolandaMDavis commented on the issue:

    https://github.com/apache/nifi/pull/919
  
    @mattyb149 Understood on the above comment.  One note for a possible 
future refactor: it might be nice to have an abstract SplitProcessor that holds 
the attributes, relationships, and some of the split/original flow file 
handling that is similar across many of the split processors.
    
    This looks good. I tested with flows using JSON and XML, and splits were 
executed as expected.  I extracted attributes in both cases and confirmed the 
new attributes on flow files transferred to the split queue (the original files 
did not contain the attributes).  The attributes were also available to 
Expression Language.
    
    +1
    
    I'll merge this in shortly.



> Add fragment attributes for SplitJson and SplitXml
> --------------------------------------------------
>
>                 Key: NIFI-2632
>                 URL: https://issues.apache.org/jira/browse/NIFI-2632
>             Project: Apache NiFi
>          Issue Type: Improvement
>          Components: Extensions
>            Reporter: Matt Burgess
>            Assignee: Matt Burgess
>
> Some "splitting" processors such as SplitText and SplitContent write 
> attributes to the split flow files indicating their fragment index, the total 
> count, and the filename of the original flow file. This is done to support a 
> form of "micro-batching" and/or a split-join pattern in a data flow (i.e. 
> split the original file, do work on the individual pieces, and possibly merge 
> them together later).
> For consistency and capability, the SplitJson and SplitXml processors should 
> write these same attributes for their split flow files.
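
The split-join pattern described above can be illustrated with a minimal
sketch. This is not NiFi code and not the change made in the pull request: the
functions split_json_array and merge_fragments are hypothetical, the attribute
names are assumptions modelled on the ones the existing split processors are
described as writing, and the zero-based index is an arbitrary choice made for
the example.

```python
import json
import uuid


def split_json_array(content, original_filename):
    """Split a JSON array into one fragment per element.

    Returns a list of (body, attributes) pairs. Attribute names here are
    assumptions for illustration, not taken from the NiFi source.
    """
    elements = json.loads(content)
    if not isinstance(elements, list):
        raise ValueError("expected a JSON array at the top level")

    # One identifier shared by every fragment of the same original file.
    fragment_id = str(uuid.uuid4())
    fragments = []
    for index, element in enumerate(elements):
        attributes = {
            "fragment.identifier": fragment_id,
            "fragment.index": str(index),  # zero-based in this sketch
            "fragment.count": str(len(elements)),
            "segment.original.filename": original_filename,
        }
        fragments.append((json.dumps(element), attributes))
    return fragments


def merge_fragments(fragments):
    """Rejoin fragments in index order, refusing to merge an incomplete set."""
    if not fragments:
        raise ValueError("no fragments to merge")
    ordered = sorted(fragments, key=lambda f: int(f[1]["fragment.index"]))
    expected = int(ordered[0][1]["fragment.count"])
    if len(ordered) != expected:
        raise ValueError(f"expected {expected} fragments, got {len(ordered)}")
    return json.dumps([json.loads(body) for body, _ in ordered])
```

The index and count are what allow a later merge step to restore the original
order and to detect that a fragment is still missing, which is why having the
same attributes on every split processor matters for the join side of the
pattern.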



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
