[ https://issues.apache.org/jira/browse/HIVE-7421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ashutosh Chauhan updated HIVE-7421: ----------------------------------- Comment: was deleted (was: Here is the explain output for query 47 with SPECIAL annotation showing the VectorExpression(s): {code} STAGE DEPENDENCIES: Stage-1 is a root stage Stage-0 depends on stages: Stage-1 STAGE PLANS: Stage: Stage-1 Map Reduce Map Operator Tree: TableScan alias: staples Statistics: Num rows: 54860 Data size: 158216240 Basic stats: COMPLETE Column stats: NONE Filter Operator predicate: (((concat(to_date(order_date_), ' 00:00:00') = '1997-01-01 00:00:00') or (concat(to_date(order_date_), ' 00:00:00') = '1997-01-03 00:00:00')) and ((to_date(order_date_) = '1997-01-01') or (to_date(order_date_) = '1997-01-03'))) (type: boolean) Statistics: Num rows: 54860 Data size: 158216240 Basic stats: COMPLETE Column stats: NONE vector filter expressions: FilterExprAndExpr[-1](FilterExprOrExpr[-1](FilterStringColEqualStringScalar[-1](StringConcatColScalar[51](VectorUDFDateString[50])) FilterStringColEqualStringScalar[-1](StringConcatColScalar[51](VectorUDFDateString[50]))) FilterExprOrExpr[-1](FilterStringColEqualStringScalar[-1](VectorUDFDateString[50]) FilterStringColEqualStringScalar[-1](VectorUDFDateString[50]))) Select Operator expressions: order_priority (type: string) outputColumnNames: order_priority Statistics: Num rows: 54860 Data size: 158216240 Basic stats: COMPLETE Column stats: NONE vector select expressions: IdentityExpression[2] Group By Operator keys: order_priority (type: string) mode: hash outputColumnNames: _col0 Statistics: Num rows: 54860 Data size: 158216240 Basic stats: COMPLETE Column stats: NONE Reduce Output Operator key expressions: _col0 (type: string) sort order: + Map-reduce partition columns: _col0 (type: string) Statistics: Num rows: 54860 Data size: 158216240 Basic stats: COMPLETE Column stats: NONE Execution mode: vectorized Reduce Operator Tree: Group By Operator keys: KEY._col0 (type: string) mode: mergepartial outputColumnNames: _col0 Statistics: Num rows: 27430 Data size: 79108120 Basic stats: COMPLETE Column stats: NONE Select Operator expressions: _col0 (type: string) outputColumnNames: _col0 Statistics: Num rows: 27430 Data size: 79108120 Basic stats: COMPLETE Column stats: NONE File Output Operator compressed: false Statistics: Num rows: 27430 Data size: 79108120 Basic stats: COMPLETE Column stats: NONE table: input format: org.apache.hadoop.mapred.TextInputFormat output format: org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe Stage: Stage-0 Fetch Operator limit: -1 Processor Tree: ListSink {code}) > Make VectorUDFDateString use the same date parsing and formatting as > GenericUDFDate > ----------------------------------------------------------------------------------- > > Key: HIVE-7421 > URL: https://issues.apache.org/jira/browse/HIVE-7421 > Project: Hive > Issue Type: Bug > Components: Vectorization > Affects Versions: 0.13.0, 0.13.1 > Reporter: Matt McCline > Assignee: Matt McCline > Fix For: 0.14.0 > > Attachments: HIVE-7421.1.patch > > > One of several found by Raj Bains. > M/R or Tez. > {code} > set hive.vectorized.execution.enabled=true; > {code} > Stack trace: > {code} > Caused by: java.lang.NullPointerException > at java.lang.System.arraycopy(Native Method) > at > org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector.setConcat(BytesColumnVector.java:190) > at > org.apache.hadoop.hive.ql.exec.vector.expressions.StringConcatColScalar.evaluate(StringConcatColScalar.java:78) > at > org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpression.evaluateChildren(VectorExpression.java:112) > at > org.apache.hadoop.hive.ql.exec.vector.expressions.VectorUDFDateDiffColCol.evaluate(VectorUDFDateDiffColCol.java:59) > at > org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpression.evaluateChildren(VectorExpression.java:112) > at > org.apache.hadoop.hive.ql.exec.vector.expressions.gen.LongScalarAddLongColumn.evaluate(LongScalarAddLongColumn.java:65) > at > org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpression.evaluateChildren(VectorExpression.java:112) > at > org.apache.hadoop.hive.ql.exec.vector.expressions.gen.LongColAddLongColumn.evaluate(LongColAddLongColumn.java:52) > at > org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpression.evaluateChildren(VectorExpression.java:112) > at > org.apache.hadoop.hive.ql.exec.vector.expressions.LongColDivideLongScalar.evaluate(LongColDivideLongScalar.java:52) > at > org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpression.evaluateChildren(VectorExpression.java:112) > at > org.apache.hadoop.hive.ql.exec.vector.expressions.gen.FuncFloorDoubleToLong.evaluate(FuncFloorDoubleToLong.java:47) > at > org.apache.hadoop.hive.ql.exec.vector.VectorHashKeyWrapperBatch.evaluateBatch(VectorHashKeyWrapperBatch.java:147) > at > org.apache.hadoop.hive.ql.exec.vector.VectorGroupByOperator$ProcessingModeHashAggregate.processBatch(VectorGroupByOperator.java:289) > at > org.apache.hadoop.hive.ql.exec.vector.VectorGroupByOperator.processOp(VectorGroupByOperator.java:711) > at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:800) > at > org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.processOp(VectorSelectOperator.java:139) > at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:800) > at > org.apache.hadoop.hive.ql.exec.TableScanOperator.processOp(TableScanOperator.java:95) > at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:800) > at > org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:43) > {code} -- This message was sent by Atlassian JIRA (v6.2#6252)