Matt Burgess created NIFI-12731:
-----------------------------------
Summary: GetHBase should save state whenever the session is
committed
Key: NIFI-12731
URL: https://issues.apache.org/jira/browse/NIFI-12731
Project: Apache NiFi
Issue Type: Bug
Components: Extensions
Reporter: Matt Burgess
Fix For: 2.0.0, 1.26.0
Currently there is a place in the GetHBase code where the session is committed
after each set of 500 rows/FlowFiles (so as not to run out of memory buffering
millions of rows/FlowFiles) but the state is not updated. If an error occurs
during processing of the entire table, the state is not updated but FlowFiles
have already been sent downstream, so restarting the processor results in
duplicate data.
GetHBase should save the current state whenever the session is committed.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)