[ 
https://issues.apache.org/jira/browse/NIFI-5642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17798751#comment-17798751
 ] 

Matt Burgess commented on NIFI-5642:
------------------------------------

The "maxRowsPerFlowFile" variable is changed to match the rows available 
without fetching (default is 5000), so only 5000 rows get put into the FlowFile 
even if the Max Rows Per Flow File property is set to zero. I have a test table 
with 20k rows, if I run QueryCassandra once I get 4 output FlowFiles when I 
should only get one. There's a logic error or two in there that need to be 
fixed, but since it has been released as of 1.22.0 and 2.0.0-M1, I will open a 
new Jira to fix them.

> QueryCassandra processor : output FlowFiles as soon fetch_size is reached
> -------------------------------------------------------------------------
>
>                 Key: NIFI-5642
>                 URL: https://issues.apache.org/jira/browse/NIFI-5642
>             Project: Apache NiFi
>          Issue Type: Bug
>    Affects Versions: 1.7.1
>            Reporter: André Gomes Lamas Otero
>            Assignee: Levi Lentz
>            Priority: Major
>             Fix For: 2.0.0-M1, 1.22.0
>
>          Time Spent: 4h 10m
>  Remaining Estimate: 0h
>
> When I'm using QueryCassandra alongside with fetch_size parameter I expected 
> that as soon my reader reaches the fetch_size the processor outputs some data 
> to be processed by the next processor, but QueryCassandra reads all the data, 
> then output the flow files.
> I'll start to work on a patch for this situation, I'll appreciate any 
> suggestion.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to