[
https://issues.apache.org/jira/browse/CSV-141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17654605#comment-17654605
]
Damjan Jovanovic commented on CSV-141:
--------------------------------------
That seems like a standard warning when saving though, similar to what
OpenOffice gives you on any CSV file, regardless of the content?
I've now updated my PR with another setting for the "EOF reached before
encapsulated token finished".
> Handle malformed CSV files
> --------------------------
>
> Key: CSV-141
> URL: https://issues.apache.org/jira/browse/CSV-141
> Project: Commons CSV
> Issue Type: Wish
> Components: Parser
> Affects Versions: 1.0
> Reporter: Nguyen Minh
> Priority: Minor
> Fix For: 1.x
>
> Attachments: image-2023-01-04-14-00-57-158.png
>
>
> My java application has to handle thousands of CSV files uploaded by the
> client phones everyday. So, there some CSV files have the wrong format which
> I'm not sure why.
> Here is my sample CSV. Microsoft Excel parses it correctly, but both Common
> CSV and OpenCSV can't parse it. Open CSV can't parse line 2 (due to '\'
> character) and Common CSV will crash on line 3 and 4:
> "1414770317901","android.widget.EditText","pass sem1 _84*|*","0","pass sem1
> _8"
> "1414770318470","android.widget.EditText","pass sem1 _84:*|*","0","pass sem1
> _84:\"
> "1414770318327","android.widget.EditText","pass sem1
> "1414770318628","android.widget.EditText","pass sem1 _84*|*","0","pass sem1
> Line 3: java.io.IOException: (line 5) invalid char between encapsulated token
> and delimiter
> at org.apache.commons.csv.CSVParser$1.getNextRecord(CSVParser.java:398)
> at org.apache.commons.csv.CSVParser$1.hasNext(CSVParser.java:407)
> Line 4: java.io.IOException: (startline 5) EOF reached before encapsulated
> token finished
> at org.apache.commons.csv.CSVParser$1.getNextRecord(CSVParser.java:398)
> at org.apache.commons.csv.CSVParser$1.hasNext(CSVParser.java:407)
--
This message was sent by Atlassian Jira
(v8.20.10#820010)