Koji Kawamura created NIFI-6599:
-----------------------------------
Summary: MergeRecord fails if 'fragment.count' attribute equals
the number of records within a FlowFile where it should wait for remaining
FlowFiles
Key: NIFI-6599
URL: https://issues.apache.org/jira/browse/NIFI-6599
Project: Apache NiFi
Issue Type: Bug
Components: Extensions
Reporter: Koji Kawamura
Assignee: Koji Kawamura
RecordBinManager.createThresholds has following code block:
{code:java}
if (MergeRecord.MERGE_STRATEGY_DEFRAGMENT.getValue().equals(mergeStrategy)) {
fragmentCountAttribute = MergeContent.FRAGMENT_COUNT_ATTRIBUTE;
if (!StringUtils.isEmpty(flowfile.getAttribute(fragmentCountAttribute))) {
minRecords =
Integer.parseInt(flowfile.getAttribute(fragmentCountAttribute));
}
} else {
fragmentCountAttribute = null;
}
{code}
The code uses 'fragment.count' as the minRecords. This is wrong because
'fragment.count' represents the number of fragments, i.e. number of FlowFiles
holding partial record set.
This causes a FlowFile to be sent 'failure' relationship where it should be
hold in the incoming connection. For example, when a FlowFile is split into two
FlowFiles, and each has 2 records in it, 'fragment.count' will be 2. In this
case, MergeContent thinks the minRecords is 2, where 4 is correct. Then the
first FlowFile is processed, while the 2nd one hasn't arrived, MergeContent
misunderstood that the bin reached to the minimum number of records. But since
there's only one FlowFile, it sends the FlowFile to 'failure'.
The issue can be reproduced by the attached template.
--
This message was sent by Atlassian Jira
(v8.3.2#803003)