[jira] [Commented] (CSV-107) CSVFormat.EXCEL.parse should handle byte order marks

Thomas Neidhart (JIRA) Wed, 18 Jun 2014 07:40:24 -0700

    [ https://issues.apache.org/jira/browse/CSV-107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14035780#comment-14035780 ]


Thomas Neidhart commented on CSV-107:
-------------------------------------

Another option would be to do the same as suggested here: 
http://stackoverflow.com/questions/1835430/byte-order-mark-screws-up-file-reading-in-java

It is essentially a Unicode reader that checks the first bytes of the 
stream for a BOM and, if one is detected, skips it before decoding.
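
A minimal sketch of that idea, using only the JDK, could look like the 
following. The class name BomSkippingReaders and the method open are 
hypothetical and not part of Commons CSV, and the sketch only recognizes 
UTF-8 and UTF-16 BOMs (a UTF-32LE BOM would be mistaken for UTF-16LE). 
If no BOM is found, the caller-supplied fallback charset is used and no 
bytes are dropped.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.PushbackInputStream;
import java.io.Reader;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

/** Hypothetical helper (not part of Commons CSV): opens a Reader that skips a leading BOM. */
final class BomSkippingReaders {

    private BomSkippingReaders() {
    }

    /**
     * Peeks at up to three bytes, consumes a UTF-8 or UTF-16 BOM if present,
     * and returns a Reader using the charset implied by the BOM.
     * Without a BOM, the fallback charset is used and all bytes are kept.
     */
    static Reader open(InputStream in, Charset fallback) throws IOException {
        PushbackInputStream pushback = new PushbackInputStream(in, 3);
        byte[] head = new byte[3];

        // Read up to three bytes; short streams simply yield fewer.
        int n = 0;
        while (n < head.length) {
            int r = pushback.read(head, n, head.length - n);
            if (r < 0) {
                break;
            }
            n += r;
        }

        Charset charset = fallback;
        int bomLength = 0;
        if (n >= 3 && (head[0] & 0xFF) == 0xEF && (head[1] & 0xFF) == 0xBB
                && (head[2] & 0xFF) == 0xBF) {
            charset = StandardCharsets.UTF_8;
            bomLength = 3;
        } else if (n >= 2 && (head[0] & 0xFF) == 0xFE && (head[1] & 0xFF) == 0xFF) {
            charset = StandardCharsets.UTF_16BE;
            bomLength = 2;
        } else if (n >= 2 && (head[0] & 0xFF) == 0xFF && (head[1] & 0xFF) == 0xFE) {
            charset = StandardCharsets.UTF_16LE;
            bomLength = 2;
        }

        // Push back everything that was read beyond the BOM.
        if (n > bomLength) {
            pushback.unread(head, bomLength, n - bomLength);
        }
        return new InputStreamReader(pushback, charset);
    }
}
```

The resulting Reader could then be handed to CSVFormat.EXCEL.parse, so 
that the first header name no longer begins with the BOM character.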

> CSVFormat.EXCEL.parse should handle byte order marks
> ----------------------------------------------------
>
>                 Key: CSV-107
>                 URL: https://issues.apache.org/jira/browse/CSV-107
>             Project: Commons CSV
>          Issue Type: Bug
>          Components: Parser
>            Reporter: Kenzley Alphonse
>            Priority: Minor
>             Fix For: 1.x
>
>         Attachments: vod.csv
>
>   Original Estimate: 3h
>  Remaining Estimate: 3h
>
> CSVFormat.EXCEL.parse should take byte order marks into account when 
> reading the input stream. Files with a byte order mark fail to parse properly.
> In my example, a byte order mark precedes the headers in a CSV 
> file. Parsing fails when trying to get the header via the CSVRecord.get 
> call.
> I marked this as critical because many users will interact with Windows 
> users, who will most likely have files with a BOM.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
