[ https://issues.apache.org/jira/browse/HIVE-10609?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Mostafa Mokhtar updated HIVE-10609:
-----------------------------------
Component/s: Vectorization
> Vectorization : Q64 fails with ClassCastException
> -------------------------------------------------
>
> Key: HIVE-10609
> URL: https://issues.apache.org/jira/browse/HIVE-10609
> Project: Hive
> Issue Type: Bug
> Components: Vectorization
> Affects Versions: 1.2.0
> Reporter: Mostafa Mokhtar
> Assignee: Matt McCline
> Fix For: 1.2.0
>
>
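> TPC-DS Q64 fails with a ClassCastException: BytesColumnVector cannot be cast
> to DoubleColumnVector, thrown from VectorMapJoinOperator.spillBigTableRow.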
> Query
> {code}
> select cs1.product_name ,cs1.store_name ,cs1.store_zip ,cs1.b_street_number
> ,cs1.b_streen_name ,cs1.b_city
> ,cs1.b_zip ,cs1.c_street_number ,cs1.c_street_name ,cs1.c_city
> ,cs1.c_zip ,cs1.syear ,cs1.cnt
> ,cs1.s1 ,cs1.s2 ,cs1.s3
> ,cs2.s1 ,cs2.s2 ,cs2.s3 ,cs2.syear ,cs2.cnt
> from
> (select i_product_name as product_name ,i_item_sk as item_sk ,s_store_name as
> store_name
> ,s_zip as store_zip ,ad1.ca_street_number as b_street_number
> ,ad1.ca_street_name as b_streen_name
> ,ad1.ca_city as b_city ,ad1.ca_zip as b_zip ,ad2.ca_street_number as
> c_street_number
> ,ad2.ca_street_name as c_street_name ,ad2.ca_city as c_city ,ad2.ca_zip
> as c_zip
> ,d1.d_year as syear ,d2.d_year as fsyear ,d3.d_year as s2year ,count(*)
> as cnt
> ,sum(ss_wholesale_cost) as s1 ,sum(ss_list_price) as s2
> ,sum(ss_coupon_amt) as s3
> FROM store_sales
> JOIN store_returns ON store_sales.ss_item_sk =
> store_returns.sr_item_sk and store_sales.ss_ticket_number =
> store_returns.sr_ticket_number
> JOIN customer ON store_sales.ss_customer_sk = customer.c_customer_sk
> JOIN date_dim d1 ON store_sales.ss_sold_date_sk = d1.d_date_sk
> JOIN date_dim d2 ON customer.c_first_sales_date_sk = d2.d_date_sk
> JOIN date_dim d3 ON customer.c_first_shipto_date_sk = d3.d_date_sk
> JOIN store ON store_sales.ss_store_sk = store.s_store_sk
> JOIN customer_demographics cd1 ON store_sales.ss_cdemo_sk=
> cd1.cd_demo_sk
> JOIN customer_demographics cd2 ON customer.c_current_cdemo_sk =
> cd2.cd_demo_sk
> JOIN promotion ON store_sales.ss_promo_sk = promotion.p_promo_sk
> JOIN household_demographics hd1 ON store_sales.ss_hdemo_sk =
> hd1.hd_demo_sk
> JOIN household_demographics hd2 ON customer.c_current_hdemo_sk =
> hd2.hd_demo_sk
> JOIN customer_address ad1 ON store_sales.ss_addr_sk =
> ad1.ca_address_sk
> JOIN customer_address ad2 ON customer.c_current_addr_sk =
> ad2.ca_address_sk
> JOIN income_band ib1 ON hd1.hd_income_band_sk = ib1.ib_income_band_sk
> JOIN income_band ib2 ON hd2.hd_income_band_sk = ib2.ib_income_band_sk
> JOIN item ON store_sales.ss_item_sk = item.i_item_sk
> JOIN
> (select cs_item_sk
> ,sum(cs_ext_list_price) as
> sale,sum(cr_refunded_cash+cr_reversed_charge+cr_store_credit) as refund
> from catalog_sales JOIN catalog_returns
> ON catalog_sales.cs_item_sk = catalog_returns.cr_item_sk
> and catalog_sales.cs_order_number = catalog_returns.cr_order_number
> group by cs_item_sk
> having
> sum(cs_ext_list_price)>2*sum(cr_refunded_cash+cr_reversed_charge+cr_store_credit))
> cs_ui
> ON store_sales.ss_item_sk = cs_ui.cs_item_sk
> WHERE
> cd1.cd_marital_status <> cd2.cd_marital_status and
> i_color in ('maroon','burnished','dim','steel','navajo','chocolate')
> and
> i_current_price between 35 and 35 + 10 and
> i_current_price between 35 + 1 and 35 + 15
> group by i_product_name ,i_item_sk ,s_store_name ,s_zip ,ad1.ca_street_number
> ,ad1.ca_street_name ,ad1.ca_city ,ad1.ca_zip ,ad2.ca_street_number
> ,ad2.ca_street_name ,ad2.ca_city ,ad2.ca_zip ,d1.d_year ,d2.d_year
> ,d3.d_year
> ) cs1
> JOIN
> (select i_product_name as product_name ,i_item_sk as item_sk ,s_store_name as
> store_name
> ,s_zip as store_zip ,ad1.ca_street_number as b_street_number
> ,ad1.ca_street_name as b_streen_name
> ,ad1.ca_city as b_city ,ad1.ca_zip as b_zip ,ad2.ca_street_number as
> c_street_number
> ,ad2.ca_street_name as c_street_name ,ad2.ca_city as c_city ,ad2.ca_zip
> as c_zip
> ,d1.d_year as syear ,d2.d_year as fsyear ,d3.d_year as s2year ,count(*)
> as cnt
> ,sum(ss_wholesale_cost) as s1 ,sum(ss_list_price) as s2
> ,sum(ss_coupon_amt) as s3
> FROM store_sales
> JOIN store_returns ON store_sales.ss_item_sk =
> store_returns.sr_item_sk and store_sales.ss_ticket_number =
> store_returns.sr_ticket_number
> JOIN customer ON store_sales.ss_customer_sk = customer.c_customer_sk
> JOIN date_dim d1 ON store_sales.ss_sold_date_sk = d1.d_date_sk
> JOIN date_dim d2 ON customer.c_first_sales_date_sk = d2.d_date_sk
> JOIN date_dim d3 ON customer.c_first_shipto_date_sk = d3.d_date_sk
> JOIN store ON store_sales.ss_store_sk = store.s_store_sk
> JOIN customer_demographics cd1 ON store_sales.ss_cdemo_sk=
> cd1.cd_demo_sk
> JOIN customer_demographics cd2 ON customer.c_current_cdemo_sk =
> cd2.cd_demo_sk
> JOIN promotion ON store_sales.ss_promo_sk = promotion.p_promo_sk
> JOIN household_demographics hd1 ON store_sales.ss_hdemo_sk =
> hd1.hd_demo_sk
> JOIN household_demographics hd2 ON customer.c_current_hdemo_sk =
> hd2.hd_demo_sk
> JOIN customer_address ad1 ON store_sales.ss_addr_sk =
> ad1.ca_address_sk
> JOIN customer_address ad2 ON customer.c_current_addr_sk =
> ad2.ca_address_sk
> JOIN income_band ib1 ON hd1.hd_income_band_sk = ib1.ib_income_band_sk
> JOIN income_band ib2 ON hd2.hd_income_band_sk = ib2.ib_income_band_sk
> JOIN item ON store_sales.ss_item_sk = item.i_item_sk
> JOIN
> (select cs_item_sk
> ,sum(cs_ext_list_price) as
> sale,sum(cr_refunded_cash+cr_reversed_charge+cr_store_credit) as refund
> from catalog_sales JOIN catalog_returns
> ON catalog_sales.cs_item_sk = catalog_returns.cr_item_sk
> and catalog_sales.cs_order_number = catalog_returns.cr_order_number
> group by cs_item_sk
> having
> sum(cs_ext_list_price)>2*sum(cr_refunded_cash+cr_reversed_charge+cr_store_credit))
> cs_ui
> ON store_sales.ss_item_sk = cs_ui.cs_item_sk
> WHERE
> cd1.cd_marital_status <> cd2.cd_marital_status and
> i_color in ('maroon','burnished','dim','steel','navajo','chocolate')
> and
> i_current_price between 35 and 35 + 10 and
> i_current_price between 35 + 1 and 35 + 15
> group by i_product_name ,i_item_sk ,s_store_name ,s_zip ,ad1.ca_street_number
> ,ad1.ca_street_name ,ad1.ca_city ,ad1.ca_zip ,ad2.ca_street_number
> ,ad2.ca_street_name ,ad2.ca_city ,ad2.ca_zip ,d1.d_year ,d2.d_year
> ,d3.d_year
> ) cs2
> ON cs1.item_sk=cs2.item_sk
> where
> cs1.syear = 2000 and
> cs2.syear = 2000 + 1 and
> cs2.cnt <= cs1.cnt and
> cs1.store_name = cs2.store_name and
> cs1.store_zip = cs2.store_zip
> order by cs1.product_name ,cs1.store_name ,cs2.cnt
> {code}
> Exception
> {code}
> ], TaskAttempt 3 failed, info=[Error: Failure while running
> task:java.lang.RuntimeException: java.lang.RuntimeException:
> org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while
> processing row
> at
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:171)
> at
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:137)
> at
> org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:337)
> at
> org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:179)
> at
> org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run(TezTaskRunner.java:171)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:415)
> at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628)
> at
> org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:171)
> at
> org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.callInternal(TezTaskRunner.java:167)
> at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36)
> at java.util.concurrent.FutureTask.run(FutureTask.java:262)
> at
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
> at
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
> at java.lang.Thread.run(Thread.java:745)
> Caused by: java.lang.RuntimeException:
> org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while
> processing row
> at
> org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:91)
> at
> org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:68)
> at
> org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:290)
> at
> org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:148)
> ... 14 more
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime
> Error while processing row
> at
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:52)
> at
> org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:83)
> ... 17 more
> Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Unexpected
> exception: org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector cannot be
> cast to org.apache.hadoop.hive.ql.exec.vector.DoubleColumnVector
> at
> org.apache.hadoop.hive.ql.exec.MapJoinOperator.process(MapJoinOperator.java:426)
> at
> org.apache.hadoop.hive.ql.exec.vector.VectorMapJoinOperator.process(VectorMapJoinOperator.java:290)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:837)
> at
> org.apache.hadoop.hive.ql.exec.vector.VectorSelectOperator.process(VectorSelectOperator.java:138)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:837)
> at
> org.apache.hadoop.hive.ql.exec.vector.VectorFilterOperator.process(VectorFilterOperator.java:114)
> at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:837)
> at
> org.apache.hadoop.hive.ql.exec.TableScanOperator.process(TableScanOperator.java:97)
> at
> org.apache.hadoop.hive.ql.exec.MapOperator$MapOpCtx.forward(MapOperator.java:162)
> at
> org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:45)
> ... 18 more
> Caused by: java.lang.ClassCastException:
> org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector cannot be cast to
> org.apache.hadoop.hive.ql.exec.vector.DoubleColumnVector
> at
> org.apache.hadoop.hive.ql.exec.vector.expressions.VectorExpressionWriterFactory$VectorExpressionWriterDouble.writeValue(VectorExpressionWriterFactory.java:225)
> at
> org.apache.hadoop.hive.ql.exec.vector.VectorMapJoinOperator.getRowObject(VectorMapJoinOperator.java:352)
> at
> org.apache.hadoop.hive.ql.exec.vector.VectorMapJoinOperator.spillBigTableRow(VectorMapJoinOperator.java:309)
> at
> org.apache.hadoop.hive.ql.exec.MapJoinOperator.process(MapJoinOperator.java:404)
> ... 27 more
> ]], Vertex failed as one or more tasks failed. failedTasks:1, Vertex
> vertex_1426707664723_4575_1_34 [Map 40] killed/failed due to:null]Vertex
> killed, vertexName=Reducer 6, vertexId=vertex_1426707664723_4575_1_42,
> diagnostics=[Vertex received Kill while in RUNNING state., Vertex killed as
> other vertex failed. failedTasks:0, Vertex vertex_1426707664723_4575_1_42
> [Reducer 6] killed/failed due to:null]Vertex killed, vertexName=Reducer 27,
> vertexId=vertex_1426707664723_4575_1_36, diagnostics=[Vertex received Kill
> while in RUNNING state., Vertex killed as other vertex failed. failedTasks:0,
> Vertex vertex_1426707664723_4575_1_36 [Reducer 27] killed/failed due
> to:null]Vertex killed, vertexName=Reducer 5,
> vertexId=vertex_1426707664723_4575_1_41, diagnostics=[Vertex received Kill
> while in RUNNING state., Vertex killed as other vertex failed. failedTasks:0,
> Vertex vertex_1426707664723_4575_1_41 [Reducer 5] killed/failed due
> to:null]Vertex killed, vertexName=Map 26,
> vertexId=vertex_1426707664723_4575_1_35, diagnostics=[Vertex received Kill
> while in RUNNING state., Vertex killed as other vertex failed. failedTasks:0,
> Vertex vertex_1426707664723_4575_1_35 [Map 26] killed/failed due
> to:null]Vertex killed, vertexName=Map 19,
> vertexId=vertex_1426707664723_4575_1_39, diagnostics=[Vertex received Kill
> while in RUNNING state., Vertex killed as other vertex failed. failedTasks:0,
> Vertex vertex_1426707664723_4575_1_39 [Map 19] killed/failed due
> to:null]Vertex killed, vertexName=Map 4,
> vertexId=vertex_1426707664723_4575_1_40, diagnostics=[Vertex received Kill
> while in RUNNING state., Vertex killed as other vertex failed. failedTasks:0,
> Vertex vertex_1426707664723_4575_1_40 [Map 4] killed/failed due to:null]DAG
> failed due to vertex failure. failedVertices:1 killedVertices:6
> {code}
> Plan
> {code}
> STAGE PLANS:
> Stage: Stage-1
> Tez
> Edges:
> Map 1 <- Map 2 (BROADCAST_EDGE)
> Map 10 <- Map 11 (BROADCAST_EDGE), Map 12 (BROADCAST_EDGE), Map 13
> (BROADCAST_EDGE), Map 18 (BROADCAST_EDGE)
> Map 14 <- Map 15 (BROADCAST_EDGE)
> Map 15 <- Map 16 (BROADCAST_EDGE), Map 17 (BROADCAST_EDGE)
> Map 18 <- Map 14 (BROADCAST_EDGE)
> Map 19 <- Map 10 (BROADCAST_EDGE), Map 7 (BROADCAST_EDGE), Map 9
> (BROADCAST_EDGE), Reducer 21 (BROADCAST_EDGE)
> Map 20 <- Map 22 (BROADCAST_EDGE)
> Map 23 <- Map 24 (BROADCAST_EDGE)
> Map 26 <- Map 23 (BROADCAST_EDGE), Map 25 (BROADCAST_EDGE), Map 40
> (BROADCAST_EDGE)
> Map 28 <- Map 29 (BROADCAST_EDGE)
> Map 31 <- Map 32 (BROADCAST_EDGE), Map 33 (BROADCAST_EDGE), Map 34
> (BROADCAST_EDGE), Map 39 (BROADCAST_EDGE)
> Map 35 <- Map 36 (BROADCAST_EDGE)
> Map 36 <- Map 37 (BROADCAST_EDGE), Map 38 (BROADCAST_EDGE)
> Map 39 <- Map 35 (BROADCAST_EDGE)
> Map 4 <- Map 1 (BROADCAST_EDGE), Map 19 (BROADCAST_EDGE), Map 3
> (BROADCAST_EDGE)
> Map 40 <- Map 28 (BROADCAST_EDGE), Map 30 (BROADCAST_EDGE), Map 31
> (BROADCAST_EDGE), Reducer 42 (BROADCAST_EDGE)
> Map 41 <- Map 43 (BROADCAST_EDGE)
> Map 7 <- Map 8 (BROADCAST_EDGE)
> Reducer 21 <- Map 20 (SIMPLE_EDGE)
> Reducer 27 <- Map 26 (SIMPLE_EDGE)
> Reducer 42 <- Map 41 (SIMPLE_EDGE)
> Reducer 5 <- Map 4 (SIMPLE_EDGE), Reducer 27 (BROADCAST_EDGE)
> Reducer 6 <- Reducer 5 (SIMPLE_EDGE)
> DagName: jenkins_20150503192312_dad6086a-e3ca-4519-aab9-d1179786ba1d:1
> Vertices:
> Map 1
> Map Operator Tree:
> TableScan
> alias: hd1
> filterExpr: (hd_income_band_sk is not null and hd_demo_sk
> is not null) (type: boolean)
> Statistics: Num rows: 7200 Data size: 770400 Basic stats:
> COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: (hd_income_band_sk is not null and hd_demo_sk
> is not null) (type: boolean)
> Statistics: Num rows: 7200 Data size: 57600 Basic stats:
> COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: hd_demo_sk (type: int), hd_income_band_sk
> (type: int)
> outputColumnNames: _col0, _col1
> Statistics: Num rows: 7200 Data size: 57600 Basic
> stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col1 (type: int)
> 1 _col0 (type: int)
> outputColumnNames: _col0
> input vertices:
> 1 Map 2
> Statistics: Num rows: 7200 Data size: 28800 Basic
> stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 7200 Data size: 28800 Basic
> stats: COMPLETE Column stats: COMPLETE
> Execution mode: vectorized
> Map 10
> Map Operator Tree:
> TableScan
> alias: cd1
> filterExpr: cd_demo_sk is not null (type: boolean)
> Statistics: Num rows: 1920800 Data size: 718379200 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: cd_demo_sk is not null (type: boolean)
> Statistics: Num rows: 1920800 Data size: 170951200 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: cd_demo_sk (type: int), cd_marital_status
> (type: string)
> outputColumnNames: _col0, _col1
> Statistics: Num rows: 1920800 Data size: 170951200
> Basic stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col1 (type: int)
> outputColumnNames: _col1, _col2, _col4, _col5, _col6,
> _col7
> input vertices:
> 1 Map 11
> Statistics: Num rows: 1387730 Data size: 145711650
> Basic stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col7 (type: int)
> 1 _col0 (type: int)
> outputColumnNames: _col1, _col2, _col4, _col5,
> _col6, _col9
> input vertices:
> 1 Map 12
> Statistics: Num rows: 1387730 Data size: 145711650
> Basic stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col6 (type: int)
> 1 _col0 (type: int)
> outputColumnNames: _col1, _col2, _col4, _col5,
> _col9, _col11
> input vertices:
> 1 Map 13
> Statistics: Num rows: 1387730 Data size:
> 145711650 Basic stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col2 (type: int)
> 1 _col4 (type: int)
> outputColumnNames: _col1, _col4, _col5, _col9,
> _col11, _col13, _col15, _col18, _col19, _col21, _col23, _col24, _col25,
> _col29, _col31, _col32
> input vertices:
> 1 Map 18
> Statistics: Num rows: 68 Data size: 26860 Basic
> stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Filter Operator
> predicate: (_col13 <> _col1) (type: boolean)
> Statistics: Num rows: 68 Data size: 26860
> Basic stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: _col15 (type: int), _col24
> (type: float), _col25 (type: float), _col4 (type: int), _col5 (type: int),
> _col29 (type: int), _col9 (type: int), _col11 (type: int), _col31 (type:
> string), _col32 (type: string), _col18 (type: int), _col19 (type: int),
> _col21 (type: int), _col23 (type: float)
> outputColumnNames: _col1, _col10, _col11,
> _col16, _col17, _col21, _col23, _col25, _col27, _col28, _col4, _col5, _col7,
> _col9
> Statistics: Num rows: 68 Data size: 15300
> Basic stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col1 (type: int)
> sort order: +
> Map-reduce partition columns: _col1
> (type: int)
> Statistics: Num rows: 68 Data size: 15300
> Basic stats: COMPLETE Column stats: COMPLETE
> value expressions: _col4 (type: int),
> _col5 (type: int), _col7 (type: int), _col9 (type: float), _col10 (type:
> float), _col11 (type: float), _col16 (type: int), _col17 (type: int), _col21
> (type: int), _col23 (type: int), _col25 (type: int), _col27 (type: string),
> _col28 (type: string)
> Execution mode: vectorized
> Map 11
> Map Operator Tree:
> TableScan
> alias: customer
> filterExpr: (((((c_current_cdemo_sk is not null and
> c_first_sales_date_sk is not null) and c_first_shipto_date_sk is not null)
> and c_customer_sk is not null) and c_current_hdemo_sk is not null) and
> c_current_addr_sk is not null) (type: boolean)
> Statistics: Num rows: 1600000 Data size: 1241633212 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: (((((c_current_cdemo_sk is not null and
> c_first_sales_date_sk is not null) and c_first_shipto_date_sk is not null)
> and c_customer_sk is not null) and c_current_hdemo_sk is not null) and
> c_current_addr_sk is not null) (type: boolean)
> Statistics: Num rows: 1387730 Data size: 32529324 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: c_customer_sk (type: int),
> c_current_cdemo_sk (type: int), c_current_hdemo_sk (type: int),
> c_current_addr_sk (type: int), c_first_shipto_date_sk (type: int),
> c_first_sales_date_sk (type: int)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4,
> _col5
> Statistics: Num rows: 1387730 Data size: 32529324 Basic
> stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col1 (type: int)
> sort order: +
> Map-reduce partition columns: _col1 (type: int)
> Statistics: Num rows: 1387730 Data size: 32529324
> Basic stats: COMPLETE Column stats: COMPLETE
> value expressions: _col0 (type: int), _col2 (type:
> int), _col3 (type: int), _col4 (type: int), _col5 (type: int)
> Execution mode: vectorized
> Map 12
> Map Operator Tree:
> TableScan
> alias: d1
> filterExpr: d_date_sk is not null (type: boolean)
> Statistics: Num rows: 73049 Data size: 81741831 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: d_date_sk is not null (type: boolean)
> Statistics: Num rows: 73049 Data size: 584392 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: d_date_sk (type: int), d_year (type: int)
> outputColumnNames: _col0, _col1
> Statistics: Num rows: 73049 Data size: 584392 Basic
> stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 73049 Data size: 584392 Basic
> stats: COMPLETE Column stats: COMPLETE
> value expressions: _col1 (type: int)
> Execution mode: vectorized
> Map 13
> Map Operator Tree:
> TableScan
> alias: d1
> filterExpr: d_date_sk is not null (type: boolean)
> Statistics: Num rows: 73049 Data size: 81741831 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: d_date_sk is not null (type: boolean)
> Statistics: Num rows: 73049 Data size: 584392 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: d_date_sk (type: int), d_year (type: int)
> outputColumnNames: _col0, _col1
> Statistics: Num rows: 73049 Data size: 584392 Basic
> stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 73049 Data size: 584392 Basic
> stats: COMPLETE Column stats: COMPLETE
> value expressions: _col1 (type: int)
> Execution mode: vectorized
> Map 14
> Map Operator Tree:
> TableScan
> alias: cd1
> filterExpr: cd_demo_sk is not null (type: boolean)
> Statistics: Num rows: 1920800 Data size: 718379200 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: cd_demo_sk is not null (type: boolean)
> Statistics: Num rows: 1920800 Data size: 170951200 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: cd_demo_sk (type: int), cd_marital_status
> (type: string)
> outputColumnNames: _col0, _col1
> Statistics: Num rows: 1920800 Data size: 170951200
> Basic stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col3 (type: int)
> outputColumnNames: _col1, _col3, _col4, _col6, _col7,
> _col8, _col9, _col11, _col12, _col13, _col17
> input vertices:
> 1 Map 15
> Statistics: Num rows: 78 Data size: 9750 Basic stats:
> COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Reduce Output Operator
> key expressions: _col8 (type: int)
> sort order: +
> Map-reduce partition columns: _col8 (type: int)
> Statistics: Num rows: 78 Data size: 9750 Basic
> stats: COMPLETE Column stats: COMPLETE
> value expressions: _col1 (type: string), _col3
> (type: int), _col4 (type: int), _col6 (type: int), _col7 (type: int), _col9
> (type: int), _col11 (type: float), _col12 (type: float), _col13 (type:
> float), _col17 (type: int)
> Execution mode: vectorized
> Map 15
> Map Operator Tree:
> TableScan
> alias: store_sales
> filterExpr: ((((((((ss_item_sk is not null and
> ss_ticket_number is not null) and ss_sold_date_sk is not null) and
> ss_cdemo_sk is not null) and ss_store_sk is not null) and ss_customer_sk is
> not null) and ss_promo_sk is not null) and ss_addr_sk is not null) and
> ss_hdemo_sk is not null) (type: boolean)
> Statistics: Num rows: 550076554 Data size: 101277521188
> Basic stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ((((((((ss_item_sk is not null and
> ss_ticket_number is not null) and ss_sold_date_sk is not null) and
> ss_cdemo_sk is not null) and ss_store_sk is not null) and ss_customer_sk is
> not null) and ss_promo_sk is not null) and ss_addr_sk is not null) and
> ss_hdemo_sk is not null) (type: boolean)
> Statistics: Num rows: 476766968 Data size: 22480476572
> Basic stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: ss_sold_date_sk (type: int), ss_item_sk
> (type: int), ss_customer_sk (type: int), ss_cdemo_sk (type: int), ss_hdemo_sk
> (type: int), ss_addr_sk (type: int), ss_store_sk (type: int), ss_promo_sk
> (type: int), ss_ticket_number (type: int), ss_wholesale_cost (type: float),
> ss_list_price (type: float), ss_coupon_amt (type: float)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4,
> _col5, _col6, _col7, _col8, _col9, _col10, _col11
> Statistics: Num rows: 476766968 Data size: 22480476572
> Basic stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col1 (type: int), _col8 (type: int)
> 1 _col0 (type: int), _col1 (type: int)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4,
> _col5, _col6, _col7, _col9, _col10, _col11
> input vertices:
> 1 Map 16
> Statistics: Num rows: 8686 Data size: 382184 Basic
> stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col0 (type: int)
> outputColumnNames: _col1, _col2, _col3, _col4,
> _col5, _col6, _col7, _col9, _col10, _col11
> input vertices:
> 1 Map 17
> Statistics: Num rows: 78 Data size: 3120 Basic
> stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col1 (type: int), _col10 (type:
> float), _col11 (type: float), 2000 (type: int), _col2 (type: int), _col3
> (type: int), _col4 (type: int), _col5 (type: int), _col6 (type: int), _col7
> (type: int), _col9 (type: float)
> outputColumnNames: _col1, _col10, _col11, _col15,
> _col2, _col3, _col4, _col5, _col6, _col7, _col9
> Statistics: Num rows: 78 Data size: 3432 Basic
> stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col3 (type: int)
> sort order: +
> Map-reduce partition columns: _col3 (type: int)
> Statistics: Num rows: 78 Data size: 3432 Basic
> stats: COMPLETE Column stats: COMPLETE
> value expressions: _col1 (type: int), _col2
> (type: int), _col4 (type: int), _col5 (type: int), _col6 (type: int), _col7
> (type: int), _col9 (type: float), _col10 (type: float), _col11 (type: float),
> _col15 (type: int)
> Execution mode: vectorized
> Map 16
> Map Operator Tree:
> TableScan
> alias: store_returns
> filterExpr: (sr_item_sk is not null and sr_ticket_number is
> not null) (type: boolean)
> Statistics: Num rows: 55578005 Data size: 9601960106 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: (sr_item_sk is not null and sr_ticket_number
> is not null) (type: boolean)
> Statistics: Num rows: 55578005 Data size: 444624040 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: sr_item_sk (type: int), sr_ticket_number
> (type: int)
> outputColumnNames: _col0, _col1
> Statistics: Num rows: 55578005 Data size: 444624040
> Basic stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int), _col1 (type: int)
> sort order: ++
> Map-reduce partition columns: _col0 (type: int),
> _col1 (type: int)
> Statistics: Num rows: 55578005 Data size: 444624040
> Basic stats: COMPLETE Column stats: COMPLETE
> Execution mode: vectorized
> Map 17
> Map Operator Tree:
> TableScan
> alias: d1
> filterExpr: ((d_year = 2000) and d_date_sk is not null)
> (type: boolean)
> Statistics: Num rows: 73049 Data size: 81741831 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ((d_year = 2000) and d_date_sk is not null)
> (type: boolean)
> Statistics: Num rows: 652 Data size: 5216 Basic stats:
> COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: d_date_sk (type: int)
> outputColumnNames: _col0
> Statistics: Num rows: 652 Data size: 2608 Basic stats:
> COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 652 Data size: 2608 Basic
> stats: COMPLETE Column stats: COMPLETE
> Execution mode: vectorized
> Map 18
> Map Operator Tree:
> TableScan
> alias: store
> filterExpr: ((s_store_sk is not null and s_store_name is
> not null) and s_zip is not null) (type: boolean)
> Statistics: Num rows: 212 Data size: 405680 Basic stats:
> COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ((s_store_sk is not null and s_store_name is
> not null) and s_zip is not null) (type: boolean)
> Statistics: Num rows: 212 Data size: 38372 Basic stats:
> COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: s_store_sk (type: int), s_store_name
> (type: string), s_zip (type: string)
> outputColumnNames: _col0, _col1, _col2
> Statistics: Num rows: 212 Data size: 38372 Basic stats:
> COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col8 (type: int)
> 1 _col0 (type: int)
> outputColumnNames: _col1, _col3, _col4, _col6, _col7,
> _col9, _col11, _col12, _col13, _col17, _col19, _col20
> input vertices:
> 0 Map 14
> Statistics: Num rows: 78 Data size: 23244 Basic
> stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col1 (type: string), _col11 (type:
> float), _col12 (type: float), _col13 (type: float), _col17 (type: int),
> _col19 (type: string), _col20 (type: string), _col3 (type: int), _col4 (type:
> int), _col6 (type: int), _col7 (type: int), _col9 (type: int)
> outputColumnNames: _col1, _col11, _col12, _col13,
> _col17, _col19, _col20, _col3, _col4, _col6, _col7, _col9
> Statistics: Num rows: 78 Data size: 23244 Basic
> stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col4 (type: int)
> sort order: +
> Map-reduce partition columns: _col4 (type: int)
> Statistics: Num rows: 78 Data size: 23244 Basic
> stats: COMPLETE Column stats: COMPLETE
> value expressions: _col1 (type: string), _col3
> (type: int), _col6 (type: int), _col7 (type: int), _col9 (type: int), _col11
> (type: float), _col12 (type: float), _col13 (type: float), _col17 (type:
> int), _col19 (type: string), _col20 (type: string)
> Execution mode: vectorized
> Map 19
> Map Operator Tree:
> TableScan
> alias: item
> filterExpr: ((((i_color) IN ('maroon', 'burnished', 'dim',
> 'steel', 'navajo', 'chocolate') and i_current_price BETWEEN 35 AND 45) and
> i_current_price BETWEEN 36 AND 50) and i_item_sk is not null) (type: boolean)
> Statistics: Num rows: 48000 Data size: 68732712 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ((((i_color) IN ('maroon', 'burnished', 'dim',
> 'steel', 'navajo', 'chocolate') and i_current_price BETWEEN 35 AND 45) and
> i_current_price BETWEEN 36 AND 50) and i_item_sk is not null) (type: boolean)
> Statistics: Num rows: 6000 Data size: 1199936 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: i_item_sk (type: int), i_product_name
> (type: string)
> outputColumnNames: _col0, _col3
> Statistics: Num rows: 6000 Data size: 642000 Basic
> stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> Inner Join 0 to 2
> keys:
> 0 _col1 (type: int)
> 1 _col0 (type: int)
> 2 _col0 (type: int)
> outputColumnNames: _col4, _col5, _col7, _col9,
> _col10, _col11, _col16, _col17, _col21, _col23, _col25, _col27, _col28,
> _col33, _col36
> input vertices:
> 0 Map 10
> 2 Reducer 21
> Statistics: Num rows: 13200 Data size: 1412400 Basic
> stats: COMPLETE Column stats: NONE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col10 (type: float), _col11 (type:
> float), _col16 (type: int), _col17 (type: int), _col21 (type: int), _col23
> (type: int), _col25 (type: int), _col27 (type: string), _col28 (type:
> string), _col33 (type: int), _col36 (type: string), _col4 (type: int), _col5
> (type: int), _col7 (type: int), _col9 (type: float)
> outputColumnNames: _col10, _col11, _col16, _col17,
> _col21, _col23, _col25, _col27, _col28, _col33, _col36, _col4, _col5, _col7,
> _col9
> Statistics: Num rows: 13200 Data size: 1412400
> Basic stats: COMPLETE Column stats: NONE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col7 (type: int)
> outputColumnNames: _col5, _col6, _col10, _col11,
> _col12, _col17, _col18, _col22, _col24, _col26, _col28, _col29, _col34, _col37
> input vertices:
> 0 Map 9
> Statistics: Num rows: 14520 Data size: 1553640
> Basic stats: COMPLETE Column stats: NONE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col10 (type: float), _col11
> (type: float), _col12 (type: float), _col17 (type: int), _col18 (type: int),
> _col22 (type: int), _col24 (type: int), _col26 (type: int), _col28 (type:
> string), _col29 (type: string), _col34 (type: int), _col37 (type: string),
> _col5 (type: int), _col6 (type: int)
> outputColumnNames: _col10, _col11, _col12,
> _col17, _col18, _col22, _col24, _col26, _col28, _col29, _col34, _col37,
> _col5, _col6
> Statistics: Num rows: 14520 Data size: 1553640
> Basic stats: COMPLETE Column stats: NONE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col17 (type: int)
> outputColumnNames: _col8, _col9, _col13,
> _col14, _col15, _col21, _col25, _col27, _col29, _col31, _col32, _col37, _col40
> input vertices:
> 0 Map 7
> Statistics: Num rows: 15972 Data size:
> 1709004 Basic stats: COMPLETE Column stats: NONE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col13 (type: float), _col14
> (type: float), _col15 (type: float), _col21 (type: int), _col25 (type: int),
> _col27 (type: int), _col29 (type: int), _col31 (type: string), _col32 (type:
> string), _col37 (type: int), _col40 (type: string), _col8 (type: int), _col9
> (type: int)
> outputColumnNames: _col13, _col14, _col15,
> _col21, _col25, _col27, _col29, _col31, _col32, _col37, _col40, _col8, _col9
> Statistics: Num rows: 15972 Data size:
> 1709004 Basic stats: COMPLETE Column stats: NONE
> Reduce Output Operator
> key expressions: _col9 (type: int)
> sort order: +
> Map-reduce partition columns: _col9
> (type: int)
> Statistics: Num rows: 15972 Data size:
> 1709004 Basic stats: COMPLETE Column stats: NONE
> value expressions: _col8 (type: int),
> _col13 (type: float), _col14 (type: float), _col15 (type: float), _col21
> (type: int), _col25 (type: int), _col27 (type: int), _col29 (type: int),
> _col31 (type: string), _col32 (type: string), _col37 (type: int), _col40
> (type: string)
> Execution mode: vectorized
> Map 2
> Map Operator Tree:
> TableScan
> alias: ib1
> filterExpr: ib_income_band_sk is not null (type: boolean)
> Statistics: Num rows: 20 Data size: 240 Basic stats:
> COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ib_income_band_sk is not null (type: boolean)
> Statistics: Num rows: 20 Data size: 80 Basic stats:
> COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: ib_income_band_sk (type: int)
> outputColumnNames: _col0
> Statistics: Num rows: 20 Data size: 80 Basic stats:
> COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 20 Data size: 80 Basic stats:
> COMPLETE Column stats: COMPLETE
> Execution mode: vectorized
> Map 20
> Map Operator Tree:
> TableScan
> alias: catalog_sales
> filterExpr: (cs_item_sk is not null and cs_order_number is
> not null) (type: boolean)
> Statistics: Num rows: 286549727 Data size: 65825832570
> Basic stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: (cs_item_sk is not null and cs_order_number is
> not null) (type: boolean)
> Statistics: Num rows: 286549727 Data size: 3435718732
> Basic stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: cs_item_sk (type: int), cs_order_number
> (type: int), cs_ext_list_price (type: float)
> outputColumnNames: _col0, _col1, _col2
> Statistics: Num rows: 286549727 Data size: 3435718732
> Basic stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int), _col1 (type: int)
> 1 _col0 (type: int), _col1 (type: int)
> outputColumnNames: _col0, _col2, _col5, _col6, _col7
> input vertices:
> 1 Map 22
> Statistics: Num rows: 2911 Data size: 58220 Basic
> stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col0 (type: int), _col2 (type:
> float), ((_col5 + _col6) + _col7) (type: float)
> outputColumnNames: _col0, _col1, _col2
> Statistics: Num rows: 2911 Data size: 58220 Basic
> stats: COMPLETE Column stats: COMPLETE
> Group By Operator
> aggregations: sum(_col1), sum(_col2)
> keys: _col0 (type: int)
> mode: hash
> outputColumnNames: _col0, _col1, _col2
> Statistics: Num rows: 1 Data size: 20 Basic
> stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 1 Data size: 20 Basic
> stats: COMPLETE Column stats: COMPLETE
> value expressions: _col1 (type: double), _col2
> (type: double)
> Execution mode: vectorized
> Map 22
> Map Operator Tree:
> TableScan
> alias: catalog_returns
> filterExpr: (cr_item_sk is not null and cr_order_number is
> not null) (type: boolean)
> Statistics: Num rows: 28798881 Data size: 5764329494 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: (cr_item_sk is not null and cr_order_number is
> not null) (type: boolean)
> Statistics: Num rows: 28798881 Data size: 569059536 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: cr_item_sk (type: int), cr_order_number
> (type: int), cr_refunded_cash (type: float), cr_reversed_charge (type:
> float), cr_store_credit (type: float)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4
> Statistics: Num rows: 28798881 Data size: 569059536
> Basic stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int), _col1 (type: int)
> sort order: ++
> Map-reduce partition columns: _col0 (type: int),
> _col1 (type: int)
> Statistics: Num rows: 28798881 Data size: 569059536
> Basic stats: COMPLETE Column stats: COMPLETE
> value expressions: _col2 (type: float), _col3 (type:
> float), _col4 (type: float)
> Execution mode: vectorized
> Map 23
> Map Operator Tree:
> TableScan
> alias: hd1
> filterExpr: (hd_income_band_sk is not null and hd_demo_sk
> is not null) (type: boolean)
> Statistics: Num rows: 7200 Data size: 770400 Basic stats:
> COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: (hd_income_band_sk is not null and hd_demo_sk
> is not null) (type: boolean)
> Statistics: Num rows: 7200 Data size: 57600 Basic stats:
> COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: hd_demo_sk (type: int), hd_income_band_sk
> (type: int)
> outputColumnNames: _col0, _col1
> Statistics: Num rows: 7200 Data size: 57600 Basic
> stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col1 (type: int)
> 1 _col0 (type: int)
> outputColumnNames: _col0
> input vertices:
> 1 Map 24
> Statistics: Num rows: 7200 Data size: 28800 Basic
> stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 7200 Data size: 28800 Basic
> stats: COMPLETE Column stats: COMPLETE
> Execution mode: vectorized
> Map 24
> Map Operator Tree:
> TableScan
> alias: ib1
> filterExpr: ib_income_band_sk is not null (type: boolean)
> Statistics: Num rows: 20 Data size: 240 Basic stats:
> COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ib_income_band_sk is not null (type: boolean)
> Statistics: Num rows: 20 Data size: 80 Basic stats:
> COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: ib_income_band_sk (type: int)
> outputColumnNames: _col0
> Statistics: Num rows: 20 Data size: 80 Basic stats:
> COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 20 Data size: 80 Basic stats:
> COMPLETE Column stats: COMPLETE
> Execution mode: vectorized
> Map 25
> Map Operator Tree:
> TableScan
> alias: ad1
> filterExpr: ca_address_sk is not null (type: boolean)
> Statistics: Num rows: 800000 Data size: 811903688 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ca_address_sk is not null (type: boolean)
> Statistics: Num rows: 800000 Data size: 292000000 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: ca_address_sk (type: int),
> ca_street_number (type: string), ca_street_name (type: string), ca_city
> (type: string), ca_zip (type: string)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4
> Statistics: Num rows: 800000 Data size: 292000000 Basic
> stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 800000 Data size: 292000000
> Basic stats: COMPLETE Column stats: COMPLETE
> value expressions: _col1 (type: string), _col2 (type:
> string), _col3 (type: string), _col4 (type: string)
> Execution mode: vectorized
> Map 26
> Map Operator Tree:
> TableScan
> alias: ad1
> filterExpr: ca_address_sk is not null (type: boolean)
> Statistics: Num rows: 800000 Data size: 811903688 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ca_address_sk is not null (type: boolean)
> Statistics: Num rows: 800000 Data size: 292000000 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: ca_address_sk (type: int),
> ca_street_number (type: string), ca_street_name (type: string), ca_city
> (type: string), ca_zip (type: string)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4
> Statistics: Num rows: 800000 Data size: 292000000 Basic
> stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col9 (type: int)
> outputColumnNames: _col1, _col2, _col3, _col4,
> _col13, _col18, _col19, _col20, _col26, _col30, _col32, _col34, _col36,
> _col37, _col42, _col45
> input vertices:
> 1 Map 40
> Statistics: Num rows: 880000 Data size: 321200006
> Basic stats: COMPLETE Column stats: NONE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col1 (type: string), _col13 (type:
> int), _col18 (type: float), _col19 (type: float), _col2 (type: string),
> _col20 (type: float), _col26 (type: int), _col3 (type: string), _col30 (type:
> int), _col32 (type: int), _col34 (type: int), _col36 (type: string), _col37
> (type: string), _col4 (type: string), _col42 (type: int), _col45 (type:
> string)
> outputColumnNames: _col1, _col13, _col18, _col19,
> _col2, _col20, _col26, _col3, _col30, _col32, _col34, _col36, _col37, _col4,
> _col42, _col45
> Statistics: Num rows: 880000 Data size: 321200006
> Basic stats: COMPLETE Column stats: NONE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col26 (type: int)
> outputColumnNames: _col1, _col2, _col3, _col4,
> _col6, _col7, _col8, _col9, _col18, _col23, _col24, _col25, _col35, _col37,
> _col39, _col41, _col42, _col47, _col50
> input vertices:
> 0 Map 25
> Statistics: Num rows: 968000 Data size: 353320014
> Basic stats: COMPLETE Column stats: NONE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col1 (type: string), _col18
> (type: int), _col2 (type: string), _col23 (type: float), _col24 (type:
> float), _col25 (type: float), _col3 (type: string), _col35 (type: int),
> _col37 (type: int), _col39 (type: int), _col4 (type: string), _col41 (type:
> string), _col42 (type: string), _col47 (type: int), _col50 (type: string),
> _col6 (type: string), _col7 (type: string), _col8 (type: string), _col9
> (type: string)
> outputColumnNames: _col1, _col18, _col2,
> _col23, _col24, _col25, _col3, _col35, _col37, _col39, _col4, _col41, _col42,
> _col47, _col50, _col6, _col7, _col8, _col9
> Statistics: Num rows: 968000 Data size:
> 353320014 Basic stats: COMPLETE Column stats: NONE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col18 (type: int)
> outputColumnNames: _col4, _col5, _col6,
> _col7, _col9, _col10, _col11, _col12, _col26, _col27, _col28, _col38, _col40,
> _col42, _col44, _col45, _col50, _col53
> input vertices:
> 0 Map 23
> Statistics: Num rows: 1064800 Data size:
> 388652023 Basic stats: COMPLETE Column stats: NONE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col53 (type: string), _col50
> (type: int), _col44 (type: string), _col45 (type: string), _col9 (type:
> string), _col10 (type: string), _col11 (type: string), _col12 (type: string),
> _col4 (type: string), _col5 (type: string), _col6 (type: string), _col7
> (type: string), _col38 (type: int), _col40 (type: int), _col42 (type: int),
> _col26 (type: float), _col27 (type: float), _col28 (type: float)
> outputColumnNames: _col0, _col1, _col2,
> _col3, _col4, _col5, _col6, _col7, _col8, _col9, _col10, _col11, _col12,
> _col13, _col14, _col15, _col16, _col17
> Statistics: Num rows: 1064800 Data size:
> 388652023 Basic stats: COMPLETE Column stats: NONE
> Group By Operator
> aggregations: count(), sum(_col15),
> sum(_col16), sum(_col17)
> keys: _col0 (type: string), _col1 (type:
> int), _col2 (type: string), _col3 (type: string), _col4 (type: string), _col5
> (type: string), _col6 (type: string), _col7 (type: string), _col8 (type:
> string), _col9 (type: string), _col10 (type: string), _col11 (type: string),
> _col12 (type: int), _col13 (type: int), _col14 (type: int)
> mode: hash
> outputColumnNames: _col0, _col1, _col2,
> _col3, _col4, _col5, _col6, _col7, _col8, _col9, _col10, _col11, _col12,
> _col13, _col14, _col15, _col16, _col17, _col18
> Statistics: Num rows: 1064800 Data size:
> 388652023 Basic stats: COMPLETE Column stats: NONE
> Reduce Output Operator
> key expressions: _col0 (type: string),
> _col1 (type: int), _col2 (type: string), _col3 (type: string), _col4 (type:
> string), _col5 (type: string), _col6 (type: string), _col7 (type: string),
> _col8 (type: string), _col9 (type: string), _col10 (type: string), _col11
> (type: string), _col12 (type: int), _col13 (type: int), _col14 (type: int)
> sort order: +++++++++++++++
> Map-reduce partition columns: _col0
> (type: string), _col1 (type: int), _col2 (type: string), _col3 (type:
> string), _col4 (type: string), _col5 (type: string), _col6 (type: string),
> _col7 (type: string), _col8 (type: string), _col9 (type: string), _col10
> (type: string), _col11 (type: string), _col12 (type: int), _col13 (type:
> int), _col14 (type: int)
> Statistics: Num rows: 1064800 Data
> size: 388652023 Basic stats: COMPLETE Column stats: NONE
> value expressions: _col15 (type:
> bigint), _col16 (type: double), _col17 (type: double), _col18 (type: double)
> Execution mode: vectorized
> Map 28
> Map Operator Tree:
> TableScan
> alias: hd1
> filterExpr: (hd_income_band_sk is not null and hd_demo_sk
> is not null) (type: boolean)
> Statistics: Num rows: 7200 Data size: 770400 Basic stats:
> COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: (hd_income_band_sk is not null and hd_demo_sk
> is not null) (type: boolean)
> Statistics: Num rows: 7200 Data size: 57600 Basic stats:
> COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: hd_demo_sk (type: int), hd_income_band_sk
> (type: int)
> outputColumnNames: _col0, _col1
> Statistics: Num rows: 7200 Data size: 57600 Basic
> stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col1 (type: int)
> 1 _col0 (type: int)
> outputColumnNames: _col0
> input vertices:
> 1 Map 29
> Statistics: Num rows: 7200 Data size: 28800 Basic
> stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 7200 Data size: 28800 Basic
> stats: COMPLETE Column stats: COMPLETE
> Execution mode: vectorized
> Map 29
> Map Operator Tree:
> TableScan
> alias: ib1
> filterExpr: ib_income_band_sk is not null (type: boolean)
> Statistics: Num rows: 20 Data size: 240 Basic stats:
> COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ib_income_band_sk is not null (type: boolean)
> Statistics: Num rows: 20 Data size: 80 Basic stats:
> COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: ib_income_band_sk (type: int)
> outputColumnNames: _col0
> Statistics: Num rows: 20 Data size: 80 Basic stats:
> COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 20 Data size: 80 Basic stats:
> COMPLETE Column stats: COMPLETE
> Execution mode: vectorized
> Map 3
> Map Operator Tree:
> TableScan
> alias: ad1
> filterExpr: ca_address_sk is not null (type: boolean)
> Statistics: Num rows: 800000 Data size: 811903688 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ca_address_sk is not null (type: boolean)
> Statistics: Num rows: 800000 Data size: 292000000 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: ca_address_sk (type: int),
> ca_street_number (type: string), ca_street_name (type: string), ca_city
> (type: string), ca_zip (type: string)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4
> Statistics: Num rows: 800000 Data size: 292000000 Basic
> stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 800000 Data size: 292000000
> Basic stats: COMPLETE Column stats: COMPLETE
> value expressions: _col1 (type: string), _col2 (type:
> string), _col3 (type: string), _col4 (type: string)
> Execution mode: vectorized
> Map 30
> Map Operator Tree:
> TableScan
> alias: promotion
> filterExpr: p_promo_sk is not null (type: boolean)
> Statistics: Num rows: 450 Data size: 530848 Basic stats:
> COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: p_promo_sk is not null (type: boolean)
> Statistics: Num rows: 450 Data size: 1800 Basic stats:
> COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: p_promo_sk (type: int)
> outputColumnNames: _col0
> Statistics: Num rows: 450 Data size: 1800 Basic stats:
> COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 450 Data size: 1800 Basic
> stats: COMPLETE Column stats: COMPLETE
> Execution mode: vectorized
> Map 31
> Map Operator Tree:
> TableScan
> alias: cd1
> filterExpr: cd_demo_sk is not null (type: boolean)
> Statistics: Num rows: 1920800 Data size: 718379200 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: cd_demo_sk is not null (type: boolean)
> Statistics: Num rows: 1920800 Data size: 170951200 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: cd_demo_sk (type: int), cd_marital_status
> (type: string)
> outputColumnNames: _col0, _col1
> Statistics: Num rows: 1920800 Data size: 170951200
> Basic stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col1 (type: int)
> outputColumnNames: _col1, _col2, _col4, _col5, _col6,
> _col7
> input vertices:
> 1 Map 32
> Statistics: Num rows: 1387730 Data size: 145711650
> Basic stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col7 (type: int)
> 1 _col0 (type: int)
> outputColumnNames: _col1, _col2, _col4, _col5,
> _col6, _col9
> input vertices:
> 1 Map 33
> Statistics: Num rows: 1387730 Data size: 145711650
> Basic stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col6 (type: int)
> 1 _col0 (type: int)
> outputColumnNames: _col1, _col2, _col4, _col5,
> _col9, _col11
> input vertices:
> 1 Map 34
> Statistics: Num rows: 1387730 Data size:
> 145711650 Basic stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col2 (type: int)
> 1 _col4 (type: int)
> outputColumnNames: _col1, _col4, _col5, _col9,
> _col11, _col13, _col15, _col18, _col19, _col21, _col23, _col24, _col25,
> _col29, _col31, _col32
> input vertices:
> 1 Map 39
> Statistics: Num rows: 68 Data size: 26860 Basic
> stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Filter Operator
> predicate: (_col13 <> _col1) (type: boolean)
> Statistics: Num rows: 68 Data size: 26860
> Basic stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: _col15 (type: int), _col24
> (type: float), _col25 (type: float), _col4 (type: int), _col5 (type: int),
> _col29 (type: int), _col9 (type: int), _col11 (type: int), _col31 (type:
> string), _col32 (type: string), _col18 (type: int), _col19 (type: int),
> _col21 (type: int), _col23 (type: float)
> outputColumnNames: _col1, _col10, _col11,
> _col16, _col17, _col21, _col23, _col25, _col27, _col28, _col4, _col5, _col7,
> _col9
> Statistics: Num rows: 68 Data size: 15300
> Basic stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col1 (type: int)
> sort order: +
> Map-reduce partition columns: _col1
> (type: int)
> Statistics: Num rows: 68 Data size: 15300
> Basic stats: COMPLETE Column stats: COMPLETE
> value expressions: _col4 (type: int),
> _col5 (type: int), _col7 (type: int), _col9 (type: float), _col10 (type:
> float), _col11 (type: float), _col16 (type: int), _col17 (type: int), _col21
> (type: int), _col23 (type: int), _col25 (type: int), _col27 (type: string),
> _col28 (type: string)
> Execution mode: vectorized
> Map 32
> Map Operator Tree:
> TableScan
> alias: customer
> filterExpr: (((((c_current_cdemo_sk is not null and
> c_first_sales_date_sk is not null) and c_first_shipto_date_sk is not null)
> and c_customer_sk is not null) and c_current_hdemo_sk is not null) and
> c_current_addr_sk is not null) (type: boolean)
> Statistics: Num rows: 1600000 Data size: 1241633212 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: (((((c_current_cdemo_sk is not null and
> c_first_sales_date_sk is not null) and c_first_shipto_date_sk is not null)
> and c_customer_sk is not null) and c_current_hdemo_sk is not null) and
> c_current_addr_sk is not null) (type: boolean)
> Statistics: Num rows: 1387730 Data size: 32529324 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: c_customer_sk (type: int),
> c_current_cdemo_sk (type: int), c_current_hdemo_sk (type: int),
> c_current_addr_sk (type: int), c_first_shipto_date_sk (type: int),
> c_first_sales_date_sk (type: int)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4,
> _col5
> Statistics: Num rows: 1387730 Data size: 32529324 Basic
> stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col1 (type: int)
> sort order: +
> Map-reduce partition columns: _col1 (type: int)
> Statistics: Num rows: 1387730 Data size: 32529324
> Basic stats: COMPLETE Column stats: COMPLETE
> value expressions: _col0 (type: int), _col2 (type:
> int), _col3 (type: int), _col4 (type: int), _col5 (type: int)
> Execution mode: vectorized
> Map 33
> Map Operator Tree:
> TableScan
> alias: d1
> filterExpr: d_date_sk is not null (type: boolean)
> Statistics: Num rows: 73049 Data size: 81741831 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: d_date_sk is not null (type: boolean)
> Statistics: Num rows: 73049 Data size: 584392 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: d_date_sk (type: int), d_year (type: int)
> outputColumnNames: _col0, _col1
> Statistics: Num rows: 73049 Data size: 584392 Basic
> stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 73049 Data size: 584392 Basic
> stats: COMPLETE Column stats: COMPLETE
> value expressions: _col1 (type: int)
> Execution mode: vectorized
> Map 34
> Map Operator Tree:
> TableScan
> alias: d1
> filterExpr: d_date_sk is not null (type: boolean)
> Statistics: Num rows: 73049 Data size: 81741831 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: d_date_sk is not null (type: boolean)
> Statistics: Num rows: 73049 Data size: 584392 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: d_date_sk (type: int), d_year (type: int)
> outputColumnNames: _col0, _col1
> Statistics: Num rows: 73049 Data size: 584392 Basic
> stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 73049 Data size: 584392 Basic
> stats: COMPLETE Column stats: COMPLETE
> value expressions: _col1 (type: int)
> Execution mode: vectorized
> Map 35
> Map Operator Tree:
> TableScan
> alias: cd1
> filterExpr: cd_demo_sk is not null (type: boolean)
> Statistics: Num rows: 1920800 Data size: 718379200 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: cd_demo_sk is not null (type: boolean)
> Statistics: Num rows: 1920800 Data size: 170951200 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: cd_demo_sk (type: int), cd_marital_status
> (type: string)
> outputColumnNames: _col0, _col1
> Statistics: Num rows: 1920800 Data size: 170951200
> Basic stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col3 (type: int)
> outputColumnNames: _col1, _col3, _col4, _col6, _col7,
> _col8, _col9, _col11, _col12, _col13, _col17
> input vertices:
> 1 Map 36
> Statistics: Num rows: 78 Data size: 9750 Basic stats:
> COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Reduce Output Operator
> key expressions: _col8 (type: int)
> sort order: +
> Map-reduce partition columns: _col8 (type: int)
> Statistics: Num rows: 78 Data size: 9750 Basic
> stats: COMPLETE Column stats: COMPLETE
> value expressions: _col1 (type: string), _col3
> (type: int), _col4 (type: int), _col6 (type: int), _col7 (type: int), _col9
> (type: int), _col11 (type: float), _col12 (type: float), _col13 (type:
> float), _col17 (type: int)
> Execution mode: vectorized
> Map 36
> Map Operator Tree:
> TableScan
> alias: store_sales
> filterExpr: ((((((((ss_item_sk is not null and
> ss_ticket_number is not null) and ss_sold_date_sk is not null) and
> ss_cdemo_sk is not null) and ss_store_sk is not null) and ss_customer_sk is
> not null) and ss_promo_sk is not null) and ss_addr_sk is not null) and
> ss_hdemo_sk is not null) (type: boolean)
> Statistics: Num rows: 550076554 Data size: 101277521188
> Basic stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ((((((((ss_item_sk is not null and
> ss_ticket_number is not null) and ss_sold_date_sk is not null) and
> ss_cdemo_sk is not null) and ss_store_sk is not null) and ss_customer_sk is
> not null) and ss_promo_sk is not null) and ss_addr_sk is not null) and
> ss_hdemo_sk is not null) (type: boolean)
> Statistics: Num rows: 476766968 Data size: 22480476572
> Basic stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: ss_sold_date_sk (type: int), ss_item_sk
> (type: int), ss_customer_sk (type: int), ss_cdemo_sk (type: int), ss_hdemo_sk
> (type: int), ss_addr_sk (type: int), ss_store_sk (type: int), ss_promo_sk
> (type: int), ss_ticket_number (type: int), ss_wholesale_cost (type: float),
> ss_list_price (type: float), ss_coupon_amt (type: float)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4,
> _col5, _col6, _col7, _col8, _col9, _col10, _col11
> Statistics: Num rows: 476766968 Data size: 22480476572
> Basic stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col1 (type: int), _col8 (type: int)
> 1 _col0 (type: int), _col1 (type: int)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4,
> _col5, _col6, _col7, _col9, _col10, _col11
> input vertices:
> 1 Map 37
> Statistics: Num rows: 8686 Data size: 382184 Basic
> stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col0 (type: int)
> outputColumnNames: _col1, _col2, _col3, _col4,
> _col5, _col6, _col7, _col9, _col10, _col11
> input vertices:
> 1 Map 38
> Statistics: Num rows: 78 Data size: 3120 Basic
> stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col1 (type: int), _col10 (type:
> float), _col11 (type: float), 2001 (type: int), _col2 (type: int), _col3
> (type: int), _col4 (type: int), _col5 (type: int), _col6 (type: int), _col7
> (type: int), _col9 (type: float)
> outputColumnNames: _col1, _col10, _col11, _col15,
> _col2, _col3, _col4, _col5, _col6, _col7, _col9
> Statistics: Num rows: 78 Data size: 3432 Basic
> stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col3 (type: int)
> sort order: +
> Map-reduce partition columns: _col3 (type: int)
> Statistics: Num rows: 78 Data size: 3432 Basic
> stats: COMPLETE Column stats: COMPLETE
> value expressions: _col1 (type: int), _col2
> (type: int), _col4 (type: int), _col5 (type: int), _col6 (type: int), _col7
> (type: int), _col9 (type: float), _col10 (type: float), _col11 (type: float),
> _col15 (type: int)
> Execution mode: vectorized
> Map 37
> Map Operator Tree:
> TableScan
> alias: store_returns
> filterExpr: (sr_item_sk is not null and sr_ticket_number is
> not null) (type: boolean)
> Statistics: Num rows: 55578005 Data size: 9601960106 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: (sr_item_sk is not null and sr_ticket_number
> is not null) (type: boolean)
> Statistics: Num rows: 55578005 Data size: 444624040 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: sr_item_sk (type: int), sr_ticket_number
> (type: int)
> outputColumnNames: _col0, _col1
> Statistics: Num rows: 55578005 Data size: 444624040
> Basic stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int), _col1 (type: int)
> sort order: ++
> Map-reduce partition columns: _col0 (type: int),
> _col1 (type: int)
> Statistics: Num rows: 55578005 Data size: 444624040
> Basic stats: COMPLETE Column stats: COMPLETE
> Execution mode: vectorized
> Map 38
> Map Operator Tree:
> TableScan
> alias: d1
> filterExpr: ((d_year = 2001) and d_date_sk is not null)
> (type: boolean)
> Statistics: Num rows: 73049 Data size: 81741831 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ((d_year = 2001) and d_date_sk is not null)
> (type: boolean)
> Statistics: Num rows: 652 Data size: 5216 Basic stats:
> COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: d_date_sk (type: int)
> outputColumnNames: _col0
> Statistics: Num rows: 652 Data size: 2608 Basic stats:
> COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 652 Data size: 2608 Basic
> stats: COMPLETE Column stats: COMPLETE
> Execution mode: vectorized
> Map 39
> Map Operator Tree:
> TableScan
> alias: store
> filterExpr: ((s_store_sk is not null and s_store_name is
> not null) and s_zip is not null) (type: boolean)
> Statistics: Num rows: 212 Data size: 405680 Basic stats:
> COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ((s_store_sk is not null and s_store_name is
> not null) and s_zip is not null) (type: boolean)
> Statistics: Num rows: 212 Data size: 38372 Basic stats:
> COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: s_store_sk (type: int), s_store_name
> (type: string), s_zip (type: string)
> outputColumnNames: _col0, _col1, _col2
> Statistics: Num rows: 212 Data size: 38372 Basic stats:
> COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col8 (type: int)
> 1 _col0 (type: int)
> outputColumnNames: _col1, _col3, _col4, _col6, _col7,
> _col9, _col11, _col12, _col13, _col17, _col19, _col20
> input vertices:
> 0 Map 35
> Statistics: Num rows: 78 Data size: 23244 Basic
> stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col1 (type: string), _col11 (type:
> float), _col12 (type: float), _col13 (type: float), _col17 (type: int),
> _col19 (type: string), _col20 (type: string), _col3 (type: int), _col4 (type:
> int), _col6 (type: int), _col7 (type: int), _col9 (type: int)
> outputColumnNames: _col1, _col11, _col12, _col13,
> _col17, _col19, _col20, _col3, _col4, _col6, _col7, _col9
> Statistics: Num rows: 78 Data size: 23244 Basic
> stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col4 (type: int)
> sort order: +
> Map-reduce partition columns: _col4 (type: int)
> Statistics: Num rows: 78 Data size: 23244 Basic
> stats: COMPLETE Column stats: COMPLETE
> value expressions: _col1 (type: string), _col3
> (type: int), _col6 (type: int), _col7 (type: int), _col9 (type: int), _col11
> (type: float), _col12 (type: float), _col13 (type: float), _col17 (type:
> int), _col19 (type: string), _col20 (type: string)
> Execution mode: vectorized
> Map 4
> Map Operator Tree:
> TableScan
> alias: ad1
> filterExpr: ca_address_sk is not null (type: boolean)
> Statistics: Num rows: 800000 Data size: 811903688 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ca_address_sk is not null (type: boolean)
> Statistics: Num rows: 800000 Data size: 292000000 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: ca_address_sk (type: int),
> ca_street_number (type: string), ca_street_name (type: string), ca_city
> (type: string), ca_zip (type: string)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4
> Statistics: Num rows: 800000 Data size: 292000000 Basic
> stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col9 (type: int)
> outputColumnNames: _col1, _col2, _col3, _col4,
> _col13, _col18, _col19, _col20, _col26, _col30, _col32, _col34, _col36,
> _col37, _col42, _col45
> input vertices:
> 1 Map 19
> Statistics: Num rows: 880000 Data size: 321200006
> Basic stats: COMPLETE Column stats: NONE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col1 (type: string), _col13 (type:
> int), _col18 (type: float), _col19 (type: float), _col2 (type: string),
> _col20 (type: float), _col26 (type: int), _col3 (type: string), _col30 (type:
> int), _col32 (type: int), _col34 (type: int), _col36 (type: string), _col37
> (type: string), _col4 (type: string), _col42 (type: int), _col45 (type:
> string)
> outputColumnNames: _col1, _col13, _col18, _col19,
> _col2, _col20, _col26, _col3, _col30, _col32, _col34, _col36, _col37, _col4,
> _col42, _col45
> Statistics: Num rows: 880000 Data size: 321200006
> Basic stats: COMPLETE Column stats: NONE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col26 (type: int)
> outputColumnNames: _col1, _col2, _col3, _col4,
> _col6, _col7, _col8, _col9, _col18, _col23, _col24, _col25, _col35, _col37,
> _col39, _col41, _col42, _col47, _col50
> input vertices:
> 0 Map 3
> Statistics: Num rows: 968000 Data size: 353320014
> Basic stats: COMPLETE Column stats: NONE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col1 (type: string), _col18
> (type: int), _col2 (type: string), _col23 (type: float), _col24 (type:
> float), _col25 (type: float), _col3 (type: string), _col35 (type: int),
> _col37 (type: int), _col39 (type: int), _col4 (type: string), _col41 (type:
> string), _col42 (type: string), _col47 (type: int), _col50 (type: string),
> _col6 (type: string), _col7 (type: string), _col8 (type: string), _col9
> (type: string)
> outputColumnNames: _col1, _col18, _col2,
> _col23, _col24, _col25, _col3, _col35, _col37, _col39, _col4, _col41, _col42,
> _col47, _col50, _col6, _col7, _col8, _col9
> Statistics: Num rows: 968000 Data size:
> 353320014 Basic stats: COMPLETE Column stats: NONE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col18 (type: int)
> outputColumnNames: _col4, _col5, _col6,
> _col7, _col9, _col10, _col11, _col12, _col26, _col27, _col28, _col38, _col40,
> _col42, _col44, _col45, _col50, _col53
> input vertices:
> 0 Map 1
> Statistics: Num rows: 1064800 Data size:
> 388652023 Basic stats: COMPLETE Column stats: NONE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col53 (type: string), _col50
> (type: int), _col44 (type: string), _col45 (type: string), _col9 (type:
> string), _col10 (type: string), _col11 (type: string), _col12 (type: string),
> _col4 (type: string), _col5 (type: string), _col6 (type: string), _col7
> (type: string), _col38 (type: int), _col40 (type: int), _col42 (type: int),
> _col26 (type: float), _col27 (type: float), _col28 (type: float)
> outputColumnNames: _col0, _col1, _col2,
> _col3, _col4, _col5, _col6, _col7, _col8, _col9, _col10, _col11, _col12,
> _col13, _col14, _col15, _col16, _col17
> Statistics: Num rows: 1064800 Data size:
> 388652023 Basic stats: COMPLETE Column stats: NONE
> Group By Operator
> aggregations: count(), sum(_col15),
> sum(_col16), sum(_col17)
> keys: _col0 (type: string), _col1 (type:
> int), _col2 (type: string), _col3 (type: string), _col4 (type: string), _col5
> (type: string), _col6 (type: string), _col7 (type: string), _col8 (type:
> string), _col9 (type: string), _col10 (type: string), _col11 (type: string),
> _col12 (type: int), _col13 (type: int), _col14 (type: int)
> mode: hash
> outputColumnNames: _col0, _col1, _col2,
> _col3, _col4, _col5, _col6, _col7, _col8, _col9, _col10, _col11, _col12,
> _col13, _col14, _col15, _col16, _col17, _col18
> Statistics: Num rows: 1064800 Data size:
> 388652023 Basic stats: COMPLETE Column stats: NONE
> Reduce Output Operator
> key expressions: _col0 (type: string),
> _col1 (type: int), _col2 (type: string), _col3 (type: string), _col4 (type:
> string), _col5 (type: string), _col6 (type: string), _col7 (type: string),
> _col8 (type: string), _col9 (type: string), _col10 (type: string), _col11
> (type: string), _col12 (type: int), _col13 (type: int), _col14 (type: int)
> sort order: +++++++++++++++
> Map-reduce partition columns: _col0
> (type: string), _col1 (type: int), _col2 (type: string), _col3 (type:
> string), _col4 (type: string), _col5 (type: string), _col6 (type: string),
> _col7 (type: string), _col8 (type: string), _col9 (type: string), _col10
> (type: string), _col11 (type: string), _col12 (type: int), _col13 (type:
> int), _col14 (type: int)
> Statistics: Num rows: 1064800 Data
> size: 388652023 Basic stats: COMPLETE Column stats: NONE
> value expressions: _col15 (type:
> bigint), _col16 (type: double), _col17 (type: double), _col18 (type: double)
> Execution mode: vectorized
> Map 40
> Map Operator Tree:
> TableScan
> alias: item
> filterExpr: ((((i_color) IN ('maroon', 'burnished', 'dim',
> 'steel', 'navajo', 'chocolate') and i_current_price BETWEEN 35 AND 45) and
> i_current_price BETWEEN 36 AND 50) and i_item_sk is not null) (type: boolean)
> Statistics: Num rows: 48000 Data size: 68732712 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ((((i_color) IN ('maroon', 'burnished', 'dim',
> 'steel', 'navajo', 'chocolate') and i_current_price BETWEEN 35 AND 45) and
> i_current_price BETWEEN 36 AND 50) and i_item_sk is not null) (type: boolean)
> Statistics: Num rows: 6000 Data size: 1199936 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: i_item_sk (type: int), i_product_name
> (type: string)
> outputColumnNames: _col0, _col3
> Statistics: Num rows: 6000 Data size: 642000 Basic
> stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> Inner Join 0 to 2
> keys:
> 0 _col1 (type: int)
> 1 _col0 (type: int)
> 2 _col0 (type: int)
> outputColumnNames: _col4, _col5, _col7, _col9,
> _col10, _col11, _col16, _col17, _col21, _col23, _col25, _col27, _col28,
> _col33, _col36
> input vertices:
> 0 Map 31
> 2 Reducer 42
> Statistics: Num rows: 13200 Data size: 1412400 Basic
> stats: COMPLETE Column stats: NONE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col10 (type: float), _col11 (type:
> float), _col16 (type: int), _col17 (type: int), _col21 (type: int), _col23
> (type: int), _col25 (type: int), _col27 (type: string), _col28 (type:
> string), _col33 (type: int), _col36 (type: string), _col4 (type: int), _col5
> (type: int), _col7 (type: int), _col9 (type: float)
> outputColumnNames: _col10, _col11, _col16, _col17,
> _col21, _col23, _col25, _col27, _col28, _col33, _col36, _col4, _col5, _col7,
> _col9
> Statistics: Num rows: 13200 Data size: 1412400
> Basic stats: COMPLETE Column stats: NONE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col7 (type: int)
> outputColumnNames: _col5, _col6, _col10, _col11,
> _col12, _col17, _col18, _col22, _col24, _col26, _col28, _col29, _col34, _col37
> input vertices:
> 0 Map 30
> Statistics: Num rows: 14520 Data size: 1553640
> Basic stats: COMPLETE Column stats: NONE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col10 (type: float), _col11
> (type: float), _col12 (type: float), _col17 (type: int), _col18 (type: int),
> _col22 (type: int), _col24 (type: int), _col26 (type: int), _col28 (type:
> string), _col29 (type: string), _col34 (type: int), _col37 (type: string),
> _col5 (type: int), _col6 (type: int)
> outputColumnNames: _col10, _col11, _col12,
> _col17, _col18, _col22, _col24, _col26, _col28, _col29, _col34, _col37,
> _col5, _col6
> Statistics: Num rows: 14520 Data size: 1553640
> Basic stats: COMPLETE Column stats: NONE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int)
> 1 _col17 (type: int)
> outputColumnNames: _col8, _col9, _col13,
> _col14, _col15, _col21, _col25, _col27, _col29, _col31, _col32, _col37, _col40
> input vertices:
> 0 Map 28
> Statistics: Num rows: 15972 Data size:
> 1709004 Basic stats: COMPLETE Column stats: NONE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col13 (type: float), _col14
> (type: float), _col15 (type: float), _col21 (type: int), _col25 (type: int),
> _col27 (type: int), _col29 (type: int), _col31 (type: string), _col32 (type:
> string), _col37 (type: int), _col40 (type: string), _col8 (type: int), _col9
> (type: int)
> outputColumnNames: _col13, _col14, _col15,
> _col21, _col25, _col27, _col29, _col31, _col32, _col37, _col40, _col8, _col9
> Statistics: Num rows: 15972 Data size:
> 1709004 Basic stats: COMPLETE Column stats: NONE
> Reduce Output Operator
> key expressions: _col9 (type: int)
> sort order: +
> Map-reduce partition columns: _col9
> (type: int)
> Statistics: Num rows: 15972 Data size:
> 1709004 Basic stats: COMPLETE Column stats: NONE
> value expressions: _col8 (type: int),
> _col13 (type: float), _col14 (type: float), _col15 (type: float), _col21
> (type: int), _col25 (type: int), _col27 (type: int), _col29 (type: int),
> _col31 (type: string), _col32 (type: string), _col37 (type: int), _col40
> (type: string)
> Execution mode: vectorized
> Map 41
> Map Operator Tree:
> TableScan
> alias: catalog_sales
> filterExpr: (cs_item_sk is not null and cs_order_number is
> not null) (type: boolean)
> Statistics: Num rows: 286549727 Data size: 65825832570
> Basic stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: (cs_item_sk is not null and cs_order_number is
> not null) (type: boolean)
> Statistics: Num rows: 286549727 Data size: 3435718732
> Basic stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: cs_item_sk (type: int), cs_order_number
> (type: int), cs_ext_list_price (type: float)
> outputColumnNames: _col0, _col1, _col2
> Statistics: Num rows: 286549727 Data size: 3435718732
> Basic stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col0 (type: int), _col1 (type: int)
> 1 _col0 (type: int), _col1 (type: int)
> outputColumnNames: _col0, _col2, _col5, _col6, _col7
> input vertices:
> 1 Map 43
> Statistics: Num rows: 2911 Data size: 58220 Basic
> stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Select Operator
> expressions: _col0 (type: int), _col2 (type:
> float), ((_col5 + _col6) + _col7) (type: float)
> outputColumnNames: _col0, _col1, _col2
> Statistics: Num rows: 2911 Data size: 58220 Basic
> stats: COMPLETE Column stats: COMPLETE
> Group By Operator
> aggregations: sum(_col1), sum(_col2)
> keys: _col0 (type: int)
> mode: hash
> outputColumnNames: _col0, _col1, _col2
> Statistics: Num rows: 1 Data size: 20 Basic
> stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 1 Data size: 20 Basic
> stats: COMPLETE Column stats: COMPLETE
> value expressions: _col1 (type: double), _col2
> (type: double)
> Execution mode: vectorized
> Map 43
> Map Operator Tree:
> TableScan
> alias: catalog_returns
> filterExpr: (cr_item_sk is not null and cr_order_number is
> not null) (type: boolean)
> Statistics: Num rows: 28798881 Data size: 5764329494 Basic
> stats: COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: (cr_item_sk is not null and cr_order_number is
> not null) (type: boolean)
> Statistics: Num rows: 28798881 Data size: 569059536 Basic
> stats: COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: cr_item_sk (type: int), cr_order_number
> (type: int), cr_refunded_cash (type: float), cr_reversed_charge (type:
> float), cr_store_credit (type: float)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4
> Statistics: Num rows: 28798881 Data size: 569059536
> Basic stats: COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int), _col1 (type: int)
> sort order: ++
> Map-reduce partition columns: _col0 (type: int),
> _col1 (type: int)
> Statistics: Num rows: 28798881 Data size: 569059536
> Basic stats: COMPLETE Column stats: COMPLETE
> value expressions: _col2 (type: float), _col3 (type:
> float), _col4 (type: float)
> Execution mode: vectorized
> Map 7
> Map Operator Tree:
> TableScan
> alias: hd1
> filterExpr: (hd_income_band_sk is not null and hd_demo_sk
> is not null) (type: boolean)
> Statistics: Num rows: 7200 Data size: 770400 Basic stats:
> COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: (hd_income_band_sk is not null and hd_demo_sk
> is not null) (type: boolean)
> Statistics: Num rows: 7200 Data size: 57600 Basic stats:
> COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: hd_demo_sk (type: int), hd_income_band_sk
> (type: int)
> outputColumnNames: _col0, _col1
> Statistics: Num rows: 7200 Data size: 57600 Basic
> stats: COMPLETE Column stats: COMPLETE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col1 (type: int)
> 1 _col0 (type: int)
> outputColumnNames: _col0
> input vertices:
> 1 Map 8
> Statistics: Num rows: 7200 Data size: 28800 Basic
> stats: COMPLETE Column stats: COMPLETE
> HybridGraceHashJoin: true
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 7200 Data size: 28800 Basic
> stats: COMPLETE Column stats: COMPLETE
> Execution mode: vectorized
> Map 8
> Map Operator Tree:
> TableScan
> alias: ib1
> filterExpr: ib_income_band_sk is not null (type: boolean)
> Statistics: Num rows: 20 Data size: 240 Basic stats:
> COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: ib_income_band_sk is not null (type: boolean)
> Statistics: Num rows: 20 Data size: 80 Basic stats:
> COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: ib_income_band_sk (type: int)
> outputColumnNames: _col0
> Statistics: Num rows: 20 Data size: 80 Basic stats:
> COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 20 Data size: 80 Basic stats:
> COMPLETE Column stats: COMPLETE
> Execution mode: vectorized
> Map 9
> Map Operator Tree:
> TableScan
> alias: promotion
> filterExpr: p_promo_sk is not null (type: boolean)
> Statistics: Num rows: 450 Data size: 530848 Basic stats:
> COMPLETE Column stats: COMPLETE
> Filter Operator
> predicate: p_promo_sk is not null (type: boolean)
> Statistics: Num rows: 450 Data size: 1800 Basic stats:
> COMPLETE Column stats: COMPLETE
> Select Operator
> expressions: p_promo_sk (type: int)
> outputColumnNames: _col0
> Statistics: Num rows: 450 Data size: 1800 Basic stats:
> COMPLETE Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 450 Data size: 1800 Basic
> stats: COMPLETE Column stats: COMPLETE
> Execution mode: vectorized
> Reducer 21
> Reduce Operator Tree:
> Group By Operator
> aggregations: sum(VALUE._col0), sum(VALUE._col1)
> keys: KEY._col0 (type: int)
> mode: mergepartial
> outputColumnNames: _col0, _col1, _col2
> Statistics: Num rows: 1 Data size: 20 Basic stats: COMPLETE
> Column stats: COMPLETE
> Filter Operator
> predicate: (_col1 > (2.0 * _col2)) (type: boolean)
> Statistics: Num rows: 0 Data size: 0 Basic stats: NONE
> Column stats: COMPLETE
> Select Operator
> expressions: _col0 (type: int)
> outputColumnNames: _col0
> Statistics: Num rows: 0 Data size: 0 Basic stats: NONE
> Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 0 Data size: 0 Basic stats: NONE
> Column stats: COMPLETE
> Reducer 27
> Reduce Operator Tree:
> Group By Operator
> aggregations: count(VALUE._col0), sum(VALUE._col1),
> sum(VALUE._col2), sum(VALUE._col3)
> keys: KEY._col0 (type: string), KEY._col1 (type: int),
> KEY._col2 (type: string), KEY._col3 (type: string), KEY._col4 (type: string),
> KEY._col5 (type: string), KEY._col6 (type: string), KEY._col7 (type: string),
> KEY._col8 (type: string), KEY._col9 (type: string), KEY._col10 (type:
> string), KEY._col11 (type: string), KEY._col12 (type: int), KEY._col13 (type:
> int), KEY._col14 (type: int)
> mode: mergepartial
> outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5,
> _col6, _col7, _col8, _col9, _col10, _col11, _col12, _col13, _col14, _col15,
> _col16, _col17, _col18
> Statistics: Num rows: 532400 Data size: 194326011 Basic
> stats: COMPLETE Column stats: NONE
> Select Operator
> expressions: _col1 (type: int), _col2 (type: string), _col3
> (type: string), _col12 (type: int), _col15 (type: bigint), _col16 (type:
> double), _col17 (type: double), _col18 (type: double)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4,
> _col5, _col6, _col7
> Statistics: Num rows: 532400 Data size: 194326011 Basic
> stats: COMPLETE Column stats: NONE
> Reduce Output Operator
> key expressions: _col0 (type: int), _col1 (type: string),
> _col2 (type: string)
> sort order: +++
> Map-reduce partition columns: _col0 (type: int), _col1
> (type: string), _col2 (type: string)
> Statistics: Num rows: 532400 Data size: 194326011 Basic
> stats: COMPLETE Column stats: NONE
> value expressions: _col3 (type: int), _col4 (type:
> bigint), _col5 (type: double), _col6 (type: double), _col7 (type: double)
> Reducer 42
> Reduce Operator Tree:
> Group By Operator
> aggregations: sum(VALUE._col0), sum(VALUE._col1)
> keys: KEY._col0 (type: int)
> mode: mergepartial
> outputColumnNames: _col0, _col1, _col2
> Statistics: Num rows: 1 Data size: 20 Basic stats: COMPLETE
> Column stats: COMPLETE
> Filter Operator
> predicate: (_col1 > (2.0 * _col2)) (type: boolean)
> Statistics: Num rows: 0 Data size: 0 Basic stats: NONE
> Column stats: COMPLETE
> Select Operator
> expressions: _col0 (type: int)
> outputColumnNames: _col0
> Statistics: Num rows: 0 Data size: 0 Basic stats: NONE
> Column stats: COMPLETE
> Reduce Output Operator
> key expressions: _col0 (type: int)
> sort order: +
> Map-reduce partition columns: _col0 (type: int)
> Statistics: Num rows: 0 Data size: 0 Basic stats: NONE
> Column stats: COMPLETE
> Reducer 5
> Reduce Operator Tree:
> Group By Operator
> aggregations: count(VALUE._col0), sum(VALUE._col1),
> sum(VALUE._col2), sum(VALUE._col3)
> keys: KEY._col0 (type: string), KEY._col1 (type: int),
> KEY._col2 (type: string), KEY._col3 (type: string), KEY._col4 (type: string),
> KEY._col5 (type: string), KEY._col6 (type: string), KEY._col7 (type: string),
> KEY._col8 (type: string), KEY._col9 (type: string), KEY._col10 (type:
> string), KEY._col11 (type: string), KEY._col12 (type: int), KEY._col13 (type:
> int), KEY._col14 (type: int)
> mode: mergepartial
> outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5,
> _col6, _col7, _col8, _col9, _col10, _col11, _col12, _col13, _col14, _col15,
> _col16, _col17, _col18
> Statistics: Num rows: 532400 Data size: 194326011 Basic
> stats: COMPLETE Column stats: NONE
> Select Operator
> expressions: _col0 (type: string), _col1 (type: int), _col2
> (type: string), _col3 (type: string), _col4 (type: string), _col5 (type:
> string), _col6 (type: string), _col7 (type: string), _col8 (type: string),
> _col9 (type: string), _col10 (type: string), _col11 (type: string), _col12
> (type: int), _col15 (type: bigint), _col16 (type: double), _col17 (type:
> double), _col18 (type: double)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4,
> _col5, _col6, _col7, _col8, _col9, _col10, _col11, _col12, _col13, _col14,
> _col15, _col16
> Statistics: Num rows: 532400 Data size: 194326011 Basic
> stats: COMPLETE Column stats: NONE
> Map Join Operator
> condition map:
> Inner Join 0 to 1
> keys:
> 0 _col1 (type: int), _col2 (type: string), _col3 (type:
> string)
> 1 _col0 (type: int), _col1 (type: string), _col2 (type:
> string)
> outputColumnNames: _col0, _col2, _col3, _col4, _col5,
> _col6, _col7, _col8, _col9, _col10, _col11, _col12, _col13, _col14, _col15,
> _col16, _col20, _col21, _col22, _col23, _col24
> input vertices:
> 1 Reducer 27
> Statistics: Num rows: 585640 Data size: 213758616 Basic
> stats: COMPLETE Column stats: NONE
> HybridGraceHashJoin: true
> Filter Operator
> predicate: (_col21 <= _col13) (type: boolean)
> Statistics: Num rows: 195213 Data size: 71252750 Basic
> stats: COMPLETE Column stats: NONE
> Select Operator
> expressions: _col0 (type: string), _col2 (type:
> string), _col3 (type: string), _col4 (type: string), _col5 (type: string),
> _col6 (type: string), _col7 (type: string), _col8 (type: string), _col9
> (type: string), _col10 (type: string), _col11 (type: string), _col12 (type:
> int), _col13 (type: bigint), _col14 (type: double), _col15 (type: double),
> _col16 (type: double), _col22 (type: double), _col23 (type: double), _col24
> (type: double), _col20 (type: int), _col21 (type: bigint)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4,
> _col5, _col6, _col7, _col8, _col9, _col10, _col11, _col12, _col13, _col14,
> _col15, _col16, _col17, _col18, _col19, _col20
> Statistics: Num rows: 195213 Data size: 71252750
> Basic stats: COMPLETE Column stats: NONE
> Reduce Output Operator
> key expressions: _col0 (type: string), _col1 (type:
> string), _col20 (type: bigint)
> sort order: +++
> Statistics: Num rows: 195213 Data size: 71252750
> Basic stats: COMPLETE Column stats: NONE
> value expressions: _col2 (type: string), _col3
> (type: string), _col4 (type: string), _col5 (type: string), _col6 (type:
> string), _col7 (type: string), _col8 (type: string), _col9 (type: string),
> _col10 (type: string), _col11 (type: int), _col12 (type: bigint), _col13
> (type: double), _col14 (type: double), _col15 (type: double), _col16 (type:
> double), _col17 (type: double), _col18 (type: double), _col19 (type: int)
> Reducer 6
> Reduce Operator Tree:
> Select Operator
> expressions: KEY.reducesinkkey0 (type: string),
> KEY.reducesinkkey1 (type: string), VALUE._col0 (type: string), VALUE._col1
> (type: string), VALUE._col2 (type: string), VALUE._col3 (type: string),
> VALUE._col4 (type: string), VALUE._col5 (type: string), VALUE._col6 (type:
> string), VALUE._col7 (type: string), VALUE._col8 (type: string), VALUE._col9
> (type: int), VALUE._col10 (type: bigint), VALUE._col11 (type: double),
> VALUE._col12 (type: double), VALUE._col13 (type: double), VALUE._col14 (type:
> double), VALUE._col15 (type: double), VALUE._col16 (type: double),
> VALUE._col17 (type: int), KEY.reducesinkkey2 (type: bigint)
> outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5,
> _col6, _col7, _col8, _col9, _col10, _col11, _col12, _col13, _col14, _col15,
> _col16, _col17, _col18, _col19, _col20
> Statistics: Num rows: 195213 Data size: 71252750 Basic stats:
> COMPLETE Column stats: NONE
> File Output Operator
> compressed: false
> Statistics: Num rows: 195213 Data size: 71252750 Basic
> stats: COMPLETE Column stats: NONE
> table:
> input format: org.apache.hadoop.mapred.TextInputFormat
> output format:
> org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
> serde:
> org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
> Stage: Stage-0
> Fetch Operator
> limit: -1
> Processor Tree:
> ListSink
> {code}