Krisztian Kasa created HIVE-25918:
-------------------------------------

             Summary: Invalid stats after multi inserting into the same 
partition
                 Key: HIVE-25918
                 URL: https://issues.apache.org/jira/browse/HIVE-25918
             Project: Hive
          Issue Type: Bug
          Components: Statistics
            Reporter: Krisztian Kasa
            Assignee: Krisztian Kasa


{code}
create table source(p int, key int,value string);
insert into source(p, key, value) values (101,42,'string42');

create table stats_part(key int,value string) partitioned by (p int);

from source
insert into stats_part select key, value, p
insert into stats_part select key, value, p;

select count(*) from stats_part;
{code}

In this case {{StatsOptimizer}} helps serving this query because the result 
should be {{rowNum}} of the partition {{p=101}}. The result is
{code}
1
{code}
however it shloud be
{code}
2
{code}
because both insert branches inserts 1-1 records.




--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to