[
https://issues.apache.org/jira/browse/HIVE-11031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14592140#comment-14592140
]
Prasanth Jayachandran commented on HIVE-11031:
----------------------------------------------
Note for backport: The branch-1.0 patch will apply cleanly, but if we run
orc_merge_incompat1.q it can fail on some platforms. To make it more consistent
we need HIVE-8801 patch which makes the orc_merge_incompat1.q test more
consistent across platforms.
> ORC concatenation of old files can fail while merging column statistics
> -----------------------------------------------------------------------
>
> Key: HIVE-11031
> URL: https://issues.apache.org/jira/browse/HIVE-11031
> Project: Hive
> Issue Type: Bug
> Affects Versions: 0.13.0, 0.14.0, 1.0.0, 1.2.0, 1.1.0, 2.0.0
> Reporter: Prasanth Jayachandran
> Assignee: Prasanth Jayachandran
> Priority: Critical
> Fix For: 1.2.1, 2.0.0
>
> Attachments: HIVE-11031-branch-1.0.patch, HIVE-11031.2.patch,
> HIVE-11031.3.patch, HIVE-11031.4.patch, HIVE-11031.patch
>
>
> Column statistics in ORC are optional protobuf fields. Old ORC files might
> not have statistics for newly added types like decimal, date, timestamp etc.
> But column statistics merging assumes column statistics exists for these
> types and invokes merge. For example, merging of TimestampColumnStatistics
> directly casts the received ColumnStatistics object without doing instanceof
> check. If the ORC file contains time stamp column statistics then this will
> work else it will throw ClassCastException.
> Also, the file merge operator swallows the exception.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)