[
https://issues.apache.org/jira/browse/HIVE-17102?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16106067#comment-16106067
]
Lefty Leverenz commented on HIVE-17102:
---------------------------------------
[~anubhavtarar], your new example does not show what is supposed to be shown in
that section: "Vectorized execution: true" notation in an explain output. We
need an example showing vectorization as well as your example showing absence
of vectorization.
Can you modify the original example to make it accurate, then give your example
after it? (Page History is available in the ". . ." drop-down menu on the
upper right corner of the page.)
* [Vectorized Query Execution -- Seeing whether vectorization is used for a
query |
https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=73630110#VectorizedQueryExecution-Seeingwhethervectorizationisusedforaquery]
> Example For Vectorized Execution in Hive in Cwiki not Seems to Work
> -------------------------------------------------------------------
>
> Key: HIVE-17102
> URL: https://issues.apache.org/jira/browse/HIVE-17102
> Project: Hive
> Issue Type: Bug
> Components: Documentation
> Affects Versions: 1.2.0
> Reporter: anubhav tarar
> Assignee: anubhav tarar
>
> I tried vectorized execution in Hive by following the Hive cwiki, but the
> example does not seem to work.
> Step 1: Create an ORC table.
> hive> create table Addresses (
> > name string,
> > street string,
> > city string,
> > state string,
> > zip int
> > ) stored as orc tblproperties ("orc.compress"="NONE");
> Step 2: Insert values into the table.
> hive> insert into Addresses values('anubhav','ggn','ggn','haryana','122001');
> Query ID = hduser_20170716093152_14774003-d2c4-4620-b773-ca17cafd902b
> Total jobs = 1
> Launching Job 1 out of 1
> Number of reduce tasks is set to 0 since there's no reduce operator
> Listening for transport dt_socket at address: 5005
> Job running in-process (local Hadoop)
> 2017-07-16 09:31:59,689 Stage-1 map = 100%, reduce = 0%
> Ended Job = job_local1858411694_0004
> Stage-4 is selected by condition resolver.
> Stage-3 is filtered out by condition resolver.
> Stage-5 is filtered out by condition resolver.
> Moving data to:
> hdfs://localhost:54310/user/hive/warehouse/addresses/.hive-staging_hive_2017-07-16_09-31-52_428_7861150459629073282-1/-ext-10000
> Loading data to table default.addresses
> Table default.addresses stats: [numFiles=1, numRows=1, totalSize=713,
> rawDataSize=360]
> MapReduce Jobs Launched:
> Stage-Stage-1: HDFS Read: 778 HDFS Write: 818 SUCCESS
> Total MapReduce CPU Time Spent: 0 msec
> Step 3: Query the table with the explain command.
> hive> set hive.vectorized.execution.enabled = true;
> hive> explain select name from Addresses where zip>1;
> OK
> STAGE DEPENDENCIES:
> Stage-0 is a root stage
> STAGE PLANS:
> Stage: Stage-0
> Fetch Operator
> limit: -1
> Processor Tree:
> TableScan
> alias: addresses
> Statistics: Num rows: 1 Data size: 360 Basic stats: COMPLETE Column
> stats: NONE
> Filter Operator
> predicate: (zip > 1) (type: boolean)
> Statistics: Num rows: 1 Data size: 360 Basic stats: COMPLETE
> Column stats: NONE
> Select Operator
> expressions: name (type: string)
> outputColumnNames: _col0
> Statistics: Num rows: 1 Data size: 360 Basic stats: COMPLETE
> Column stats: NONE
> ListSink
> Time taken: 0.081 seconds, Fetched: 20 row(s)
> Note: the explain output shows no vectorized reader being applied.
> The reason for the failure is that when Fetch is used in the plan instead of
> Map, the query is not vectorized.
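
One way to check the diagnosis above is to stop Hive from converting the query
into a Fetch task, so that the plan contains a Map stage. The sketch below
assumes the same Addresses table and relies on the hive.fetch.task.conversion
setting. Whether the resulting plan is actually marked as vectorized depends on
the Hive version and the query, so no output is shown here.

```sql
-- Enable vectorized execution, as in step 3 of the report.
set hive.vectorized.execution.enabled = true;
-- Assumption: disabling fetch-task conversion makes this simple select
-- run as a Map task instead of a Fetch Operator.
set hive.fetch.task.conversion = none;
-- Inspect the plan for a Map stage and a vectorization marker.
explain select name from Addresses where zip > 1;
```

Restoring hive.fetch.task.conversion to its previous value afterwards brings
back the Fetch shortcut for simple queries.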