[jira] [Commented] (CSV-67) UnicodeUnescapeReader should not be applied before parsing

Sebb (Commented) (JIRA) Fri, 16 Mar 2012 11:16:01 -0700

    [ 
https://issues.apache.org/jira/browse/CSV-67?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13231472#comment-13231472
 ]


Sebb commented on CSV-67:
-------------------------

OK, I'm working on removing it and moving the reader class to IO for a future 
release.
                
> UnicodeUnescapeReader should not be applied before parsing
> ----------------------------------------------------------
>
>                 Key: CSV-67
>                 URL: https://issues.apache.org/jira/browse/CSV-67
>             Project: Commons CSV
>          Issue Type: Bug
>            Reporter: Sebb
>
> The UnicodeEscapeReader is currently applied before the input file is parsed.
> This means that unicode escapes are treated differently from other escapes.
> For example, the sequence <esc>r<esc>n is not treated as a new-line for the 
> purpose of recognising the end of a record, yet \o000D\u000A is converted to 
> CRLF and would terminate the record (unless embedded in a quoted string).
> The unicode escape processing (if selected) should occur as part of the 
> parsing, just as for ordinary escape processing.
> The class can be made public so the user can wrap the input if required; this 
> preserves the existing functionality should it be required, so there is no 
> need to introduce another setting.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (CSV-67) UnicodeUnescapeReader should not be applied before parsing

Reply via email to