[ https://issues.apache.org/jira/browse/HIVE-12535?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15029654#comment-15029654 ]
Gopal V commented on HIVE-12535:
--------------------------------
[~jdere]: any idea why the planner would generate this key reference? The
vectorizer picks up the wrong column types from it.
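
For triage, the mismatch described in the issue below can be checked
mechanically. This is only a rough sketch, not anything Hive provides: the
function name and the input shapes are hypothetical, and it assumes the keys
map and the list of incoming edges have already been copied out of the explain
output by hand. It applies to a join whose inputs all arrive over edges from
other vertices, as with MAPJOIN_4227; a broadcast map join such as
MAPJOIN_4222 legitimately names its own vertex for the streamed side.

```python
def unexpected_key_sources(input_vertices, join_keys):
    """Return key labels that do not name one of the join's input vertices."""
    return sorted(set(join_keys) - set(input_vertices))


# Copied by hand from MAPJOIN_4227 in the plan quoted below.
mapjoin_4227_inputs = ["Map 5", "Map 1"]
mapjoin_4227_keys = {
    "Reducer 2": "KEY.reducesinkkey0 (type: bigint), "
                 "KEY.reducesinkkey1 (type: int), "
                 "KEY.reducesinkkey2 (type: int)",
    "Map 5": "KEY.reducesinkkey0 (type: bigint), "
             "KEY.reducesinkkey1 (type: int), "
             "KEY.reducesinkkey2 (type: int)",
}

# Labels in the keys map that are not incoming edges of the join.
suspect_labels = unexpected_key_sources(mapjoin_4227_inputs, mapjoin_4227_keys)
print(suspect_labels)
```

With the values above, the only label flagged is the one the description
calls out as cyclic.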
> Dynamic Hash Join: Key references are cyclic
> --------------------------------------------
>
> Key: HIVE-12535
> URL: https://issues.apache.org/jira/browse/HIVE-12535
> Project: Hive
> Issue Type: Bug
> Components: Query Planning
> Affects Versions: 2.0.0
> Reporter: Gopal V
> Assignee: Jason Dere
> Attachments: philz_26.txt
>
>
> MAPJOIN_4227 is inside "Reducer 2", but refers back to "Reducer 2" in its
> keys. It should say "Map 1" there.
> {code}
> | |<-Reducer 2 [SIMPLE_EDGE] vectorized, llap
> | Reduce Output Operator [RS_4189]
> | key expressions:_col0 (type: string), _col1 (type: int)
> | Map-reduce partition columns:_col0 (type: string), _col1 (type: int)
> | sort order:++
> | Statistics:Num rows: 83 Data size: 9213 Basic stats: COMPLETE Column stats: COMPLETE
> | value expressions:_col2 (type: double)
> | Group By Operator [OP_4229]
> | aggregations:["sum(_col2)"]
> | keys:_col0 (type: string), _col1 (type: int)
> | outputColumnNames:["_col0","_col1","_col2"]
> | Statistics:Num rows: 83 Data size: 9213 Basic stats: COMPLETE Column stats: COMPLETE
> | Select Operator [OP_4228]
> | outputColumnNames:["_col0","_col1","_col2"]
> | Statistics:Num rows: 166 Data size: 26394 Basic stats: COMPLETE Column stats: COMPLETE
> | Map Join Operator [MAPJOIN_4227]
> | | condition map:[{"":"Inner Join 0 to 1"}]
> | | keys:{"Reducer 2":"KEY.reducesinkkey0 (type: bigint), KEY.reducesinkkey1 (type: int), KEY.reducesinkkey2 (type: int)","Map 5":"KEY.reducesinkkey0 (type: bigint), KEY.reducesinkkey1 (type: int), KEY.reducesinkkey2 (type: int)"}
> | | outputColumnNames:["_col1","_col3","_col5"]
> | | Statistics:Num rows: 166 Data size: 26394 Basic stats: COMPLETE Column stats: COMPLETE
> | |<-Map 5 [CUSTOM_SIMPLE_EDGE] vectorized, llap
> | | Reduce Output Operator [RS_4226]
> | | key expressions:_col1 (type: bigint), year(_col2) (type: int), month(_col2) (type: int)
> | | Map-reduce partition columns:_col1 (type: bigint), year(_col2) (type: int), month(_col2) (type: int)
> | | sort order:+++
> | | Statistics:Num rows: 74973886 Data size: 5098224248 Basic stats: COMPLETE Column stats: COMPLETE
> | | value expressions:_col0 (type: float), _col2 (type: date)
> | | Select Operator [OP_4225]
> | | outputColumnNames:["_col0","_col1","_col2"]
> | | Statistics:Num rows: 74973886 Data size: 5098224248 Basic stats: COMPLETE Column stats: COMPLETE
> | | Filter Operator [FIL_4224]
> | | predicate:((account_id is not null and month(effective_date) BETWEEN 4 AND 7) and month(effective_date) is not null) (type: boolean)
> | | Statistics:Num rows: 74973886 Data size: 5098224248 Basic stats: COMPLETE Column stats: COMPLETE
> | | TableScan [TS_4171]
> | | alias:t
> | | Statistics:Num rows: 149947772 Data size: 10196448496 Basic stats: COMPLETE Column stats: COMPLETE
> | |<-Map 1 [CUSTOM_SIMPLE_EDGE] vectorized, llap
> | Reduce Output Operator [RS_4223]
> | key expressions:_col0 (type: bigint), year(_col2) (type: int), month(_col2) (type: int)
> | Map-reduce partition columns:_col0 (type: bigint), year(_col2) (type: int), month(_col2) (type: int)
> | sort order:+++
> | Statistics:Num rows: 50289673 Data size: 8197216699 Basic stats: COMPLETE Column stats: COMPLETE
> | value expressions:_col1 (type: string)
> | Map Join Operator [MAPJOIN_4222]
> | | condition map:[{"":"Left Semi Join 0 to 1"}]
> | | keys:{"Map 1":"_col1 (type: string)","Map 4":"_col0 (type: string)"}
> | | outputColumnNames:["_col0","_col1","_col2"]
> | | Statistics:Num rows: 50289673 Data size: 8197216699 Basic stats: COMPLETE Column stats: COMPLETE
> | |<-Map 4 [BROADCAST_EDGE] vectorized, llap
> | | Reduce Output Operator [RS_4179]
> | | key expressions:_col0 (type: string)
> | | Map-reduce partition columns:_col0 (type: string)
> | | sort order:+
> | | Statistics:Num rows: 1 Data size: 99 Basic stats: COMPLETE Column stats: COMPLETE
> | | Group By Operator [OP_4219]
> | | keys:_col0 (type: string)
> | | outputColumnNames:["_col0"]
> | | Statistics:Num rows: 1 Data size: 99 Basic stats: COMPLETE Column stats: COMPLETE
> | | Select Operator [OP_4218]
> | | outputColumnNames:["_col0"]
> | | Statistics:Num rows: 3 Data size: 297 Basic stats: COMPLETE Column stats: COMPLETE
> | | Filter Operator [FIL_4217]
> | | predicate:(account_type = 'order ahead') (type: boolean)
> | | Statistics:Num rows: 3 Data size: 294 Basic stats: COMPLETE Column stats: COMPLETE
> | | TableScan [TS_4168]
> | | alias:at
> | | Statistics:Num rows: 13 Data size: 1274 Basic stats: COMPLETE Column stats: COMPLETE
> | |<-Select Operator [OP_4221]
> | outputColumnNames:["_col0","_col1","_col2"]
> | Statistics:Num rows: 50289673 Data size: 8197216699 Basic stats: COMPLETE Column stats: COMPLETE
> | Filter Operator [FIL_4220]
> | predicate:(((account_id is not null and (account_type = 'order ahead')) and year(effective_date) is not null) and month(effective_date) is not null) (type: boolean)
> | Statistics:Num rows: 50289673 Data size: 8197216699 Basic stats: COMPLETE Column stats: COMPLETE
> | TableScan [TS_4165]
> | alias:a
> | Statistics:Num rows: 201158695 Data size: 32788867285 Basic stats: COMPLETE Column stats: COMPLETE
> {code}