[
https://issues.apache.org/jira/browse/HADOOP-1097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#action_12481219
]
Milind Bhandarkar commented on HADOOP-1097:
-------------------------------------------
There is a severe bug in the binary serialization of strings as well. (It does
not work for non-ascii characters.) This bug was introduced when we moved from
Text class to Java String class. I will submit a patch by the end of the day.
> Bug in XML serialization of strings in record I/O
> -------------------------------------------------
>
> Key: HADOOP-1097
> URL: https://issues.apache.org/jira/browse/HADOOP-1097
> Project: Hadoop
> Issue Type: Improvement
> Components: record
> Affects Versions: 0.12.0
> Environment: All
> Reporter: Milind Bhandarkar
> Assigned To: Milind Bhandarkar
> Fix For: 0.13.0
>
> Attachments: jute-patch.txt
>
>
> XML serialization of strings in record I/O has a bug whicha makes the
> XmlInputArchive crash with SAXParseExcpetion (generated XML contains invalid
> Unicode characters http://www.w3.org/TR/REC-xml/#charsets).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.