Matt Burgess created NIFI-4496:
----------------------------------

             Summary: Improve performance of CSVReader
                 Key: NIFI-4496
                 URL: https://issues.apache.org/jira/browse/NIFI-4496
             Project: Apache NiFi
          Issue Type: Improvement
          Components: Extensions
            Reporter: Matt Burgess


During some throughput testing, it was noted that the CSVReader was not as fast
as desired, processing fewer than 50k records per second. A look at [this
benchmark|https://github.com/uniVocity/csv-parsers-comparison] suggests that the
Apache Commons CSV parser (used by CSVReader) is quite slow compared to others.
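
A records-per-second figure like the one above could be measured with a small harness along the following lines. This is only a sketch under stated assumptions, not the test that was actually run: the class name CsvThroughputSketch, the synthetic three-column data, and the naive comma splitter (which ignores quoting) are all hypothetical stand-ins, and a real comparison would plug in the CSV parsers under evaluation and use a proper benchmarking tool.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

public class CsvThroughputSketch {

    /** Hypothetical stand-in parser: splits on commas, no quoting support. */
    static String[] naiveParse(String line) {
        return line.split(",", -1); // -1 keeps trailing empty fields
    }

    /** Builds `count` synthetic records of the form "i,name-i,value-i". */
    static List<String> syntheticLines(int count) {
        List<String> lines = new ArrayList<>(count);
        for (int i = 0; i < count; i++) {
            lines.add(i + ",name-" + i + ",value-" + i);
        }
        return lines;
    }

    /** Returns records per second for one pass of `parser` over `lines`. */
    static double recordsPerSecond(List<String> lines,
                                   Function<String, String[]> parser) {
        long fieldCount = 0; // consumed below so the parse call is not optimized away
        long start = System.nanoTime();
        for (String line : lines) {
            fieldCount += parser.apply(line).length;
        }
        long elapsedNanos = Math.max(1, System.nanoTime() - start);
        if (fieldCount < 0) {
            throw new IllegalStateException("field count overflowed");
        }
        return lines.size() * 1_000_000_000.0 / elapsedNanos;
    }

    public static void main(String[] args) {
        List<String> lines = syntheticLines(200_000);
        // One warm-up pass so the timed pass is less dominated by JIT compilation.
        recordsPerSecond(lines, CsvThroughputSketch::naiveParse);
        double rate = recordsPerSecond(lines, CsvThroughputSketch::naiveParse);
        System.out.printf("%.0f records/second%n", rate);
    }
}
```

Because the parser is passed in as a Function, the same loop can time different implementations against identical input, which is the kind of side-by-side comparison the linked benchmark performs.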

From that benchmark it appears that CSVReader could be enhanced by using a
different CSV parser under the hood. Perhaps Jackson is the best choice, as it
is fast when values are quoted, and is a mature and maintained codebase.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
