Wesley L Lawrence created NIFI-3503:
---------------------------------------

             Summary: Create a 'SplitCSV' processor
                 Key: NIFI-3503
                 URL: https://issues.apache.org/jira/browse/NIFI-3503
             Project: Apache NiFi
          Issue Type: New Feature
            Reporter: Wesley L Lawrence
            Priority: Minor


While the 'SplitText' processor helps break up newline-separated records into 
individual files, it's not uncommon to have CSV files where records span 
multiple lines, and 'SplitText' is neither meant nor able to handle this.
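
To illustrate the problem, here is a minimal Python sketch (not NiFi code; the 
sample data is made up) that compares a plain newline split with a CSV-aware 
parse of the same input:

```python
import csv
import io

# Hypothetical sample: the second record has a newline inside a quoted column.
data = 'id,comment\n1,"single line"\n2,"spans\ntwo lines"\n'

# Splitting on newlines, as a line-oriented splitter would, cuts record 2 in half.
line_split = data.splitlines()

# A CSV parser treats the quoted newline as part of the field.
records = list(csv.reader(io.StringIO(data, newline='')))

print(len(line_split))  # 4 pieces, one of them a broken half-record
print(len(records))     # 3 rows: the header plus two intact records
```

The line-based split yields four pieces for what is really a header and two 
records, which is the behaviour described above.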

Currently, one can use 'ReplaceText' to search within quoted columns and 
replace, remove, or escape the newline characters inside a single CSV record 
before passing the data on to 'SplitText'. However, this may not work in all 
cases, and if the regex does not cover every edge case it could remove the 
valid newline character at the end of a CSV record.

A dedicated 'SplitCSV' processor would solve this problem and be a simpler 
approach for users.
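
As a rough sketch of the idea only (Python rather than NiFi's Java, and not a 
proposed implementation), the function below splits CSV text into one output 
per record using a real CSV parser. The name split_csv, the has_header flag, 
and the choice to repeat the header in every output are all assumptions made 
for illustration; the request above does not specify them.

```python
import csv
import io


def split_csv(text, has_header=True):
    """Return a list of CSV strings, one per record (hypothetical helper)."""
    # Parse with a CSV reader so quoted newlines stay inside their record.
    rows = list(csv.reader(io.StringIO(text, newline='')))
    header = rows[0] if has_header and rows else None
    body = rows[1:] if header is not None else rows

    chunks = []
    for row in body:
        out = io.StringIO(newline='')
        writer = csv.writer(out, lineterminator='\n')
        # Assumption: each split repeats the header so it is a valid CSV file.
        if header is not None:
            writer.writerow(header)
        writer.writerow(row)
        chunks.append(out.getvalue())
    return chunks


chunks = split_csv('id,comment\n1,a\n2,"x\ny"\n')
```

Here the second chunk still contains the embedded newline inside its quoted 
column, so no regex pre-processing of the input is needed.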

See the following [Apache NiFi user email 
thread|https://mail-archives.apache.org/mod_mbox/nifi-users/201702.mbox/%3CCAFuL2BbgymFXwu5fRyd8pP-zu6WkToqPE2Ek7bkyBg0_-cknqQ%40mail.gmail.com%3E]



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
