Wesley L Lawrence created NIFI-3503:
---------------------------------------
Summary: Create a 'SplitCSV' processor
Key: NIFI-3503
URL: https://issues.apache.org/jira/browse/NIFI-3503
Project: Apache NiFi
Issue Type: New Feature
Reporter: Wesley L Lawrence
Priority: Minor
While the 'SplitText' processor helps break up newline separated records into
individual files, it's not uncommon to have CSV files where records span
multiple lines, and 'SplitText' isn't able or meant to handle this.
Currently, one can replace, remove, or escape newline characters that exist in
a single CSV record by searching within quoted columns with 'ReplaceText',
before passing the data onto 'SplitText'. However, this may not work in all
cases, or could potentially remove the valid newline character at the end of a
CSV record, if all edge cases aren't properly covered with regex.
Having a dedicated 'SplitCSV' processor will solve this problem, and be a
simpler approach for users.
See the following [Apache NiFi user email
thread|https://mail-archives.apache.org/mod_mbox/nifi-users/201702.mbox/%3CCAFuL2BbgymFXwu5fRyd8pP-zu6WkToqPE2Ek7bkyBg0_-cknqQ%40mail.gmail.com%3E]
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)