[
https://issues.apache.org/jira/browse/HIVE-17611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16183736#comment-16183736
]
Lefty Leverenz commented on HIVE-17611:
---------------------------------------
Should this be documented in the wiki?
Perhaps it belongs in one of these docs:
* [Developer Guide -- Hive SerDe |
https://cwiki.apache.org/confluence/display/Hive/DeveloperGuide#DeveloperGuide-HiveSerDe]
* [SerDe | https://cwiki.apache.org/confluence/display/Hive/SerDe]
> Add new LazyBinary SerDe for faster writes
> ------------------------------------------
>
> Key: HIVE-17611
> URL: https://issues.apache.org/jira/browse/HIVE-17611
> Project: Hive
> Issue Type: Improvement
> Components: Serializers/Deserializers
> Reporter: Jason Dere
> Assignee: Jason Dere
> Fix For: 3.0.0
>
> Attachments: HIVE-17611.1.patch
>
>
> LazyBinarySerDe.serialize() ends up making
> getCategory()/getPrimitiveCategory() calls for every column of every row.
> Tried some simple tests to eliminate these calls for the non-vectorized
> version, this looks like it speeds up the writes by ~3x.
> Adding a LazyBinarySerDe2 class with this new implementation.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)