Jens M Kofoed created NIFI-7276:
-----------------------------------

             Summary: PutFTP create alot of received trafic when tx small files
                 Key: NIFI-7276
                 URL: https://issues.apache.org/jira/browse/NIFI-7276
             Project: Apache NiFi
          Issue Type: Bug
          Components: Extensions
    Affects Versions: 1.11.4
         Environment: NIFI: Ubuntu 18.04. NIFI 1.11.4
FTP: Ubuntu 18.04 with vsftpd
            Reporter: Jens M Kofoed


I don't know if this is a bug, but the result of my investigation is that it 
seems like the PutFTP receive between 1,6 - 2,7MB of data per file it transmit. 
For me it seems like a lot of RX data.

I'm using nload and nethogs to monitor the network trafic.

I have a lot (100.000+) of small files (<1MB), which I can't combine.

When starting the PutFTP process the nifi server starts receiving data from the 
ftp server and it continues with with an avg. of 50 Mbit/s, constantly.

To increasing the throughput I've set the run duration to 2sec. and playing 
with the Batch size. With a batch size of 100 and over 5 min. I can see at the 
ftp server that nifi is logging in 5 times and start a vsftpd process. within 
those 5 min. nifi has tx 700 files and 65.32 MB, but the ftp server has sent 
1544MB data back to nifi.

With a batch size of 1000. It took nifi around 8 minutes for transferring 1000 
files with a total sum of 64MB, but the ftp server has this time TX 2700MB data 
back to nifi.

I tried with larger files and a batch size at 100. NIFI sent 249 files with at 
total sum of 32 GB of data within 3½ min. and this time the ftp server sent 
400MB data back to nifi.

It seems like the PutFTP receive between 1,6 - 2,7MB of data per file it 
transmit. For me it seems like alot of RX data.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to