[
https://issues.apache.org/jira/browse/HIVE-10484?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Matt McCline updated HIVE-10484:
--------------------------------
Attachment: HIVE-10484.01.patch
> Vectorization : RuntimeException "Big Table Retained Mapping duplicate column"
> ------------------------------------------------------------------------------
>
> Key: HIVE-10484
> URL: https://issues.apache.org/jira/browse/HIVE-10484
> Project: Hive
> Issue Type: Bug
> Components: Tez, Vectorization
> Affects Versions: 1.2.0
> Reporter: Mostafa Mokhtar
> Assignee: Matt McCline
> Fix For: 1.2.0
>
> Attachments: HIVE-10484.01.patch
>
>
> With vectorization and tez enabled TPC-DS Q70 fails with
> {code}
> Caused by: java.lang.RuntimeException: Big Table Retained Mapping duplicate
> column 6 in ordered column map {6=(value column: 6, type name: int),
> 21=(value column: 21, type name: float), 22=(value column: 22, type name:
> int)} when adding value column 6, type int
> at
> org.apache.hadoop.hive.ql.exec.vector.VectorColumnOrderedMap.add(VectorColumnOrderedMap.java:97)
> at
> org.apache.hadoop.hive.ql.exec.vector.VectorColumnOutputMapping.add(VectorColumnOutputMapping.java:40)
> at
> org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinCommonOperator.determineCommonInfo(VectorMapJoinCommonOperator.java:320)
> at
> org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinCommonOperator.<init>(VectorMapJoinCommonOperator.java:254)
> at
> org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinGenerateResultOperator.<init>(VectorMapJoinGenerateResultOperator.java:89)
> at
> org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinInnerGenerateResultOperator.<init>(VectorMapJoinInnerGenerateResultOperator.java:97)
> at
> org.apache.hadoop.hive.ql.exec.vector.mapjoin.VectorMapJoinInnerLongOperator.<init>(VectorMapJoinInnerLongOperator.java:79)
> ... 49 more
> {code}
> Query
> {code}
> select s_state
> from (select s_state as s_state, sum(ss_net_profit),
> rank() over ( partition by s_state order by
> sum(ss_net_profit) desc) as ranking
> from store_sales, store, date_dim
> where d_month_seq between 1193 and 1193+11
> and date_dim.d_date_sk =
> store_sales.ss_sold_date_sk
> and store.s_store_sk = store_sales.ss_store_sk
> group by s_state
> ) tmp1
> where ranking <= 5
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)