Jet Guo created HIVE-9052:
-----------------------------

             Summary: Missing grouping rows when multi-insert
                 Key: HIVE-9052
                 URL: https://issues.apache.org/jira/browse/HIVE-9052
             Project: Hive
          Issue Type: Bug
    Affects Versions: 0.14.0
            Reporter: Jet Guo


Given the following table and data:

create table score (class string, student string, score int) ROW FORMAT 
DELIMITED FIELDS TERMINATED BY ',' ;
------------------Data---------------
class1,Jack,7
class1,Mike,8
class2,Tom,7

The following HQL:

from score
INSERT OVERWRITE DIRECTORY '/tmp/dpp/hql1'
select class, student, count(score)
group by class, student grouping sets ((class), (class, student))

produces this result:
----------hql1--------------
class1\N2
class1Jack1
class1Mike1
class2\N1
class2Tom1

Likewise, the following HQL:

from score
INSERT OVERWRITE DIRECTORY '/tmp/dpp/hql2'
select class, student, sum(score)
group by class, student grouping sets ((class), (class, student))

produces this result:
----------hql2--------------
class1\N15
class1Jack7
class1Mike8
class2\N7
class2Tom7



However, if the two inserts above are combined into a single multi-insert
statement:

from score
INSERT OVERWRITE DIRECTORY '/tmp/dpp/hql1'
select class, student, count(score)
group by class, student grouping sets ((class), (class, student))
INSERT OVERWRITE DIRECTORY '/tmp/dpp/hql2'
select class, student, sum(score)
group by class, student grouping sets ((class), (class, student))

the rows for the (class) grouping set are missing from both outputs:

----------hql1--------------
class1Jack1
class1Mike1
class2Tom1

----------hql2--------------
class1Jack7
class1Mike8
class2Tom7
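
For convenience, the steps in this report can be collected into one script. This is only a sketch: the local path /tmp/score.csv is a hypothetical location for the three data rows listed above, and the statements themselves are copied from this report.

```sql
-- Assumption: /tmp/score.csv is a hypothetical local file that holds
-- the three comma-separated data rows from this report.
create table score (class string, student string, score int)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ',';

LOAD DATA LOCAL INPATH '/tmp/score.csv' INTO TABLE score;

-- Multi-insert form from this report: on 0.14.0 the (class)-only
-- grouping rows are reported missing from both output directories.
from score
INSERT OVERWRITE DIRECTORY '/tmp/dpp/hql1'
  select class, student, count(score)
  group by class, student grouping sets ((class), (class, student))
INSERT OVERWRITE DIRECTORY '/tmp/dpp/hql2'
  select class, student, sum(score)
  group by class, student grouping sets ((class), (class, student));
```

Running the two inserts as separate statements, as in the first two examples, returns all five rows for each query.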






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
