Josef Zahner created NIFI-8483:
----------------------------------

             Summary: Restart NiFi - duplicate SFTP flowfiles
                 Key: NIFI-8483
                 URL: https://issues.apache.org/jira/browse/NIFI-8483
             Project: Apache NiFi
          Issue Type: Bug
    Affects Versions: 1.13.2
         Environment: Java 1.8.0_282, CentOS 7, 8-Node Cluster
            Reporter: Josef Zahner
         Attachments: SFTP_failure.png

Since the upgrade from NiFi 1.11.4 to 1.13.2 we faced an issue with the 
FetchSFTP & PutSFTP processors. We have a 8-Node NiFi cluster. Pattern is 
always ListSFTP (tracking timestamp) - FetchSFTP (and delete) and PutSFTP.

If we do a restart of NiFi and NiFi comes back, we sometimes see flowfiles for 
FetchSFTP (not found) and PutSFTP (already present on disk) which have been 
processed successfully and have been stored already. So in fact we see 
flowfiles in a failure queue which have been save to disk with PutSFTP, which 
should never happen.  The files are always small (a few MBs) and the network 
connectivity is insanely fast. The cluster shutdown is always before the grace 
period runs out. The attached screeshot shows an example where the FetchSFTP 
and the PutSFTP failure queue has files. Especially for the FetchSFTP this 
shouldn't be possible and if I do a restart with the command below, I would 
expect that within the grace period the processor has been stopped and it can't 
be processed twice. 
{code:java}
/opt/nifi/bin/nifi.sh restart
{code}
At the moment we have no clue where the issue comes from and why it happens, so 
I can't provide an exact scenario to reproduce it. I only know that it 
sometimes happens after a restart of our 8-node cluster. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to