Ken Wu created PIG-4623:
---------------------------
Summary: Fixed the 'new line' character inside double-quote
causing the csv parsing failure
Key: PIG-4623
URL: https://issues.apache.org/jira/browse/PIG-4623
Project: Pig
Issue Type: Bug
Components: piggybank
Reporter: Ken Wu
Assignee: Ken Wu
A new line character should be allowed inside a double quote as a valid csv
document. For example, the following csv document should be treated as a SINGLE
valid csv data
Iphone,"{ ItemName : Cheez-It
21 Ounce}",
However, the current implementation of the getNext() inside
org.apache.pig.piggybank.storage.CSVLoader class fails to take care of this
case and it sees two lines of data while in fact it should be treated as single
line of data.
This pull request fixes the above issue.
(Note: here is a linke to validate whether a csv document: http://csvlint.io/)
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)