[
https://issues.apache.org/jira/browse/NIFI-3503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15876890#comment-15876890
]
Michael Moser commented on NIFI-3503:
-------------------------------------
This is a possible duplicate of NIFI-1716.
> Create a 'SplitCSV' processor
> -----------------------------
>
> Key: NIFI-3503
> URL: https://issues.apache.org/jira/browse/NIFI-3503
> Project: Apache NiFi
> Issue Type: New Feature
> Reporter: Wesley L Lawrence
> Priority: Minor
>
> While the 'SplitText' processor helps break up newline-separated records into
> individual files, it's not uncommon to have CSV files where records span
> multiple lines, and 'SplitText' is neither able nor meant to handle this.
> Currently, one can replace, remove, or escape newline characters that exist
> in a single CSV record by searching within quoted columns with 'ReplaceText',
> before passing the data on to 'SplitText'. However, this may not work in all
> cases, and could remove the valid newline character at the end of a CSV
> record if the regex does not properly cover every edge case.
> A dedicated 'SplitCSV' processor would solve this problem and be a
> simpler approach for users.
> See the following [Apache NiFi user email
> thread|https://mail-archives.apache.org/mod_mbox/nifi-users/201702.mbox/%3CCAFuL2BbgymFXwu5fRyd8pP-zu6WkToqPE2Ek7bkyBg0_-cknqQ%40mail.gmail.com%3E]
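
To illustrate the problem described in the issue, here is a minimal sketch in Python (not NiFi code) that contrasts line-based splitting with CSV-aware parsing. The sample data and variable names are made up for illustration. It only shows why a quoted newline breaks a line-based splitter, and is not a proposal for how 'SplitCSV' should be implemented.

```python
import csv
import io

# Hypothetical sample: the first data record has a quoted field
# that contains an embedded newline.
data = 'id,comment\n1,"first line\nsecond line"\n2,plain\n'

# Line-based splitting treats every newline as a record boundary,
# so the quoted record is cut in half.
naive = data.splitlines()

# A CSV-aware parser keeps the quoted newline inside one record.
records = list(csv.reader(io.StringIO(data, newline='')))

print(len(naive))    # number of physical lines
print(len(records))  # number of logical CSV records, header included
print(records[1])    # the multi-line record, kept whole
```

In this sample the line-based split yields four pieces while the CSV parser yields three records, which is the mismatch a dedicated splitter would need to avoid.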