[ 
https://issues.apache.org/jira/browse/HIVE-5185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yin Huai updated HIVE-5185:
---------------------------

    Description: 
The file "reduce_deduplicate_exclude_gby.q" contains
{code:sql}
create table t1( key_int1 int, key_int2 int, key_string1 string, key_string2 
string);

set hive.optimize.reducededuplication=false;

set hive.map.aggr=false;
select Q1.key_int1, sum(Q1.key_int1) from (select * from t1 cluster by 
key_int1) Q1 group by Q1.key_int1;

drop table t1;
{code}

Since the table is not populated, there is no result will be in the .out file.

The same thing in "reducesink-dedup.q"
{code:sql}
DROP TABLE part;
-- data setup
CREATE TABLE part( 
    p_partkey INT,
    p_name STRING,
    p_mfgr STRING,
    p_brand STRING,
    p_type STRING,
    p_size INT,
    p_container STRING,
    p_retailprice DOUBLE,
    p_comment STRING
);
select p_name 
from (select p_name from part distribute by 1 sort by 1) p 
distribute by 1 sort by 1
;
{code}

  was:
The file "reduce_deduplicate_exclude_gby.q" contains
{code:sql}
create table t1( key_int1 int, key_int2 int, key_string1 string, key_string2 
string);

set hive.optimize.reducededuplication=false;

set hive.map.aggr=false;
select Q1.key_int1, sum(Q1.key_int1) from (select * from t1 cluster by 
key_int1) Q1 group by Q1.key_int1;

drop table t1;
{\code}

Since the table is not populated, there is no result will be in the .out file.

The same thing in "reducesink-dedup.q"
{code:sql}

{\code}

    
> test query files reduce_deduplicate_exclude_gby.q and reducesink_dedup.q are 
> useless
> ------------------------------------------------------------------------------------
>
>                 Key: HIVE-5185
>                 URL: https://issues.apache.org/jira/browse/HIVE-5185
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Yin Huai
>            Assignee: Yin Huai
>            Priority: Minor
>
> The file "reduce_deduplicate_exclude_gby.q" contains
> {code:sql}
> create table t1( key_int1 int, key_int2 int, key_string1 string, key_string2 
> string);
> set hive.optimize.reducededuplication=false;
> set hive.map.aggr=false;
> select Q1.key_int1, sum(Q1.key_int1) from (select * from t1 cluster by 
> key_int1) Q1 group by Q1.key_int1;
> drop table t1;
> {code}
> Since the table is not populated, there is no result will be in the .out file.
> The same thing in "reducesink-dedup.q"
> {code:sql}
> DROP TABLE part;
> -- data setup
> CREATE TABLE part( 
>     p_partkey INT,
>     p_name STRING,
>     p_mfgr STRING,
>     p_brand STRING,
>     p_type STRING,
>     p_size INT,
>     p_container STRING,
>     p_retailprice DOUBLE,
>     p_comment STRING
> );
> select p_name 
> from (select p_name from part distribute by 1 sort by 1) p 
> distribute by 1 sort by 1
> ;
> {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to