Hi,

For:


   - Correlation Attribute Name: ${myFlowfileAttributeName}

Should be set to: myFlowfileAttributeName

NOT ${myFlowfileAttibuteName)

Hope that helps.

On Tue, Aug 28, 2018 at 12:07 PM Tim Dean <[email protected]> wrote:

> I have a flow that sends a large number of JSON files into a MergeContent
> processor. The job of that processor is to combine all the incoming flow
> files with a particular flow file attribute into a single flow file,
> creating a JSON array containing each of the input flow files’ JSON.
>
> I have configured the MergeContent as processor as follows:
>
>    - Merge Strategy: Bin-Packing Algorithm
>    - Merge Format: Binary Concatenation
>    - Correlation Attribute Name: ${myFlowfileAttributeName}
>    - Minimum number of entries: 1
>    - Maximum number of entries: 5000
>    - Minimum group size: 0 B
>    - Maximum group size: <no value set>
>    - Max bin age: 30 min
>    - Maximum number of bins: 10
>    - Delimiter strategy: Text
>    - Header: [
>    - Footer: ]
>    - Demarcator: ,
>
>
> When I run data through this flow, I am seeing a large number of small-ish
> merged flow files being sent to the merged relationship, I was expecting
> ALL of the files for a given flow file attribute value to be binned
> together, but they are not coming through that way. To give a example, I
> pushed through data containing 262 input JSON files. Of these 262, 2 of
> them have a flow file attribute value of ‘A’, 2 of them have a flow file
> attribute value of ‘B’, and 258 have a flow file attribute of ‘C’. I was
> expecting the merged relationship to deliver 3 flow files, one each for
> value A, B, and C. But.I am seeing 24 flow files on the merged
> relationship, 1 for a value of A, 1 for a value of B, and 22 of varying
> sizes with the value of C.
>
> Can someone help me understand what other criteria MergeContent might be
> using to determine when to send along its merged flow files?
>
> Thanks
>


-- 
Juan Carlos Sequeiros

Reply via email to