Prasanth Jayachandran created HIVE-17417:
--------------------------------------------

             Summary: Lazy Timestamp and Date serialization is very expensive
                 Key: HIVE-17417
                 URL: https://issues.apache.org/jira/browse/HIVE-17417
             Project: Hive
          Issue Type: Bug
          Components: Serializers/Deserializers
    Affects Versions: 3.0.0, 2.4.0
            Reporter: Prasanth Jayachandran
            Assignee: Prasanth Jayachandran
            Priority: Critical


In a specific case where a schema contains an array<struct> column with 
timestamp and date fields (array size > 10000), any access to this column is 
very expensive in terms of CPU, as most of the time is spent serializing the 
timestamp and date values. Refer to the attached profiles: more than 70% of 
the time is spent in serialization and toString conversions. 
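
To illustrate the shape of the cost described above, here is a minimal Java 
sketch. It is not Hive's SerDe code: the class TimestampTextSketch and its 
methods are hypothetical names invented for this example, and the fixed-width 
binary layout is only an assumed point of comparison. The text path allocates 
one String and one byte[] per element, which is the kind of per-value 
toString work the profiles point at when an array holds more than 10000 
entries.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.sql.Timestamp;

// Hypothetical stand-in for illustration only; not taken from Hive.
class TimestampTextSketch {

    // Builds an array of timestamps one second apart, all with millisecond precision.
    static Timestamp[] sampleArray(int size) {
        long base = Timestamp.valueOf("2017-08-30 12:34:56.789").getTime();
        Timestamp[] values = new Timestamp[size];
        for (int i = 0; i < size; i++) {
            values[i] = new Timestamp(base + i * 1000L);
        }
        return values;
    }

    // Text path: one toString() and one UTF-8 encoding per element.
    // Returns the total number of bytes produced.
    static int serializeAsText(Timestamp[] values) {
        int bytes = 0;
        for (Timestamp ts : values) {
            bytes += ts.toString().getBytes(StandardCharsets.UTF_8).length;
        }
        return bytes;
    }

    // Assumed comparison path: 8 bytes of epoch millis plus 4 bytes of nanos
    // per element, written into a single preallocated buffer.
    static int serializeAsBinary(Timestamp[] values) {
        ByteBuffer buf = ByteBuffer.allocate(values.length * (Long.BYTES + Integer.BYTES));
        for (Timestamp ts : values) {
            buf.putLong(ts.getTime());
            buf.putInt(ts.getNanos());
        }
        return buf.position();
    }
}
```

Timing both methods over sampleArray(10001) in a profiler should give a rough 
feel for how much of the work is string formatting, though the actual numbers 
depend on the JVM and are not claimed here.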



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
