[
https://issues.apache.org/jira/browse/HIVE-375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Zheng Shao updated HIVE-375:
----------------------------
Attachment: HIVE-375.2.patch
Added 2 tests and fixed a bug.
> LazySimpleSerDe to directly serialize (append) int/long/byte/short etc to
> UTF-8 buffer
> --------------------------------------------------------------------------------------
>
> Key: HIVE-375
> URL: https://issues.apache.org/jira/browse/HIVE-375
> Project: Hadoop Hive
> Issue Type: Improvement
> Affects Versions: 0.3.0
> Reporter: Zheng Shao
> Attachments: HIVE-375.1.patch, HIVE-375.2.patch
>
>
> LazySimpleSerDe currently serialize all data into a StringBuilder, and then
> convert it to String and then Text.
> Even if the data is of type int/long/byte/short, we still do that unnecessary
> conversion.
> We should directly serialize/append int/long/byte/short to a UTF-8 buffer.
> This is a very simple change, but it is expected to save 2-3% of the time of
> a typical mapper (on a group-by query with some int/long columns).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.