Andy LoPresto created NIFI-7002:
-----------------------------------

             Summary: Handle large files internally in EC toolkit
                 Key: NIFI-7002
                 URL: https://issues.apache.org/jira/browse/NIFI-7002
             Project: Apache NiFi
          Issue Type: Sub-task
          Components: Tools and Build
    Affects Versions: 1.10.0
            Reporter: Andy LoPresto


The EC toolkit cannot handle large flow definition files (>100 MB uncompressed) 
due to its use of {{String}} to hold the XML content internally through 
frequent manipulations. As Strings are immutable in Java, this causes a lot of 
heap usage and duplication. The read/write to the file system also operates on 
the String directly rather than perform a streaming read/write. 

We should change the internal representation of the file content (for all 
files) to be a {{BufferedInputStream}} or other similar stream representation 
rather than a String. We should test with very large files (1 GB) and clearly 
provide instructions and warnings if the file size exceeds that. 

The logic of value encryption/replacement will have to change to handle this 
approach. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to