[
https://issues.apache.org/jira/browse/AVRO-1348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rob Turner updated AVRO-1348:
-----------------------------
Attachment: AVRO-1348v2.patch
Surprisingly, passing the character set name string "UTF-8" performs about 15%
better (in my tests) than passing a single instance of Charset. This is due to
the former caching the StringDecoder/StringEncoder in a ThreadLocal whereas the
latter creates a new instance each time.
In patch v2 I handle the UnsupportedEncodingException by throwing an unchecked
java.nio.charset.UnsupportedCharsetException in the same way as Charset.forName.
Please review the patch and what do you think?
> Improve Utf8 to String conversion
> ---------------------------------
>
> Key: AVRO-1348
> URL: https://issues.apache.org/jira/browse/AVRO-1348
> Project: Avro
> Issue Type: Bug
> Reporter: Mark Wagner
> Assignee: Mohammad Kamrul Islam
> Attachments: AVRO-1348v2.patch, AVRO1348v1.patch
>
>
> AVRO-1241 found that the existing method of creating Strings from Utf8 byte
> arrays could be made faster. The same method is being used in the
> Utf8.toString(), and could likely be sped up by doing the same thing.
--
This message was sent by Atlassian JIRA
(v6.1#6144)