[ 
https://issues.apache.org/jira/browse/NIFI-436?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15338835#comment-15338835
 ] 

Karthik Narayanan commented on NIFI-436:
----------------------------------------

Ok,  how would we handle the base issue of the jira, where the newline 
character may be in the data. if there are LF characters in the data and CRLF 
are the line endings, that would cause splittext to fail if AUTO-DETECT is 
enabled. Would it may be make sense to have properties for data enclosures ex. 
" , which is very frequent in case of files where data contains speical 
character. May be a property for escape character as well.

> SplitText should allow changing the endline regex
> -------------------------------------------------
>
>                 Key: NIFI-436
>                 URL: https://issues.apache.org/jira/browse/NIFI-436
>             Project: Apache NiFi
>          Issue Type: Improvement
>    Affects Versions: 0.6.1
>            Reporter: Jon Parise
>            Assignee: Karthik Narayanan
>              Labels: beginner
>             Fix For: 1.0.0
>
>         Attachments: nifi-4361x.patch
>
>
> I have a CSV file in a format that inidcates the end of a line with a crlf. 
> This file has embedded comments that have lf in them.
> When I run this file through the split text processor, it is splitting at the 
> LF characters.
> I think it would be nice to have a setting to change the line ending 
> characters for splitting text.
> I can't find anything in the documentation that indicates how I would change 
> this behavior, so I assume it does not exist.
> Also, I would be willing to try and implement this improvement, but I can't 
> seem to find the source for the SplitTextProcessor.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to