Matt Burgess created NIFI-4496:
----------------------------------
Summary: Improve performance of CSVReader
Key: NIFI-4496
URL: https://issues.apache.org/jira/browse/NIFI-4496
Project: Apache NiFi
Issue Type: Improvement
Components: Extensions
Reporter: Matt Burgess
During some throughput testing, it was noted that the CSVReader was not as fast
as desired, processing less than 50k records per second. A look at [this
benchmark|https://github.com/uniVocity/csv-parsers-comparison] implies that the
Apache Commons CSV parser (used by CSVReader) is quite slow compared to others.
>From that benchmark it appears that CSVReader could be enhanced by using a
>different CSV parser under the hood. Perhaps Jackson is the best choice, as it
>is fast when values are quoted, and is a mature and maintained codebase.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)