Matt Burgess created NIFI-12731:
-----------------------------------

             Summary: GetHBase should save state whenever the session is 
committed
                 Key: NIFI-12731
                 URL: https://issues.apache.org/jira/browse/NIFI-12731
             Project: Apache NiFi
          Issue Type: Bug
          Components: Extensions
            Reporter: Matt Burgess
             Fix For: 2.0.0, 1.26.0


Currently there is a place in the GetHBase code where the session is committed 
after each set of 500 rows/FlowFiles (so as not to run out of memory buffering 
millions of rows/FlowFiles) but the state is not updated. If an error occurs 
during processing of the entire table, the state is not updated but FlowFiles 
have already been sent downstream, so restarting the processor results in 
duplicate data.

GetHBase should save the current state whenever the session is committed.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to