Re: How to avoid this splitting of single line as multi lines in SplitText?

2017-02-17 Thread Wes Lawrence
JIRA is created [1] . While I can't promise a time frame for when I could get around to it; it is on my radar, and I'd be happy to contribute it back to the NiFi project. =) [1] https://issues.apache.org/jira/browse/NIFI-3503 On Fri, Feb 17, 2017 at 7:26 PM, Andy LoPresto

Re: How to avoid this splitting of single line as multi lines in SplitText?

2017-02-17 Thread Andy LoPresto
Wes, Do you mind raising a Jira [1] and providing a PR with your fix once you have it translated? I know people would appreciate it. Thanks. [1] https://issues.apache.org/jira/secure/CreateIssue!default.jspa Andy LoPresto

Re: How to avoid this splitting of single line as multi lines in SplitText?

2017-02-17 Thread Wes Lawrence
This might be more of a question for the dev mailing list, but does it make sense to have a 'SplitCSV' processor? A situation we encounter a lot at Interset are CSV files whose records extend across multiple lines, similar to Prabhu's data. We currently have code written for Flume for isolating

Re: How to avoid this splitting of single line as multi lines in SplitText?

2017-02-16 Thread Andy LoPresto
This isn’t working because of known issue NIFI-3255. Oleg has submitted a PR with a patch and Koji has been reviewing. There are some outstanding questions about provenance chain decisions with original vs. split, but the code fixes the exception which was raised and I was able to make a

Re: How to avoid this splitting of single line as multi lines in SplitText?

2017-02-15 Thread prabhu Mahendran
Andy, I have used following properties in ReplaceText processor. Search Value:"(.*?)(\n)(.*?)" Replacement Value:"$1\\n$3" Character Set:UTF-8 MaximumBuffer Size:1MB Replacement Strategy:Regex Replace Evaluation Mode:Entire Text Result of this processor same as like input.It could n't

Re: How to avoid this splitting of single line as multi lines in SplitText?

2017-02-14 Thread Andy LoPresto
Prabhu, I answered this on Stack Overflow [1] but I think you could do it with ReplaceText before the SplitText using a regex like "(.*?)(\n)(.*?)" replaced with "$1\\n$3" [1] http://stackoverflow.com/a/42242665/70465 Andy LoPresto

Re: How to avoid this splitting of single line as multi lines in SplitText?

2017-02-14 Thread Lee Laim
Prabhu, You need to remove the new lines from within the last field. I'd recommend using awk in an execute stream command processor first, then splitting the text. Alternatively, you could write a custom processor to specifically handle the incoming data. Lee > On Feb 14, 2017, at 11:01