Joe McDonnell created IMPALA-13688:
--------------------------------------

             Summary: Add test for cume_dist with duplicate values
                 Key: IMPALA-13688
                 URL: https://issues.apache.org/jira/browse/IMPALA-13688
             Project: IMPALA
          Issue Type: Task
          Components: Test
    Affects Versions: Impala 4.5.0
            Reporter: Joe McDonnell


A crucial piece of the behavior of cume_dist() is handling duplicates properly. 
Our existing analytic test cases don't adequately test this. We should add a 
test that verifies cume_dist() for duplicates.

Here is an example:
{noformat}
create table cume_dist_test (i int);
insert into cume_dist_test values (1);
insert into cume_dist_test values (1);
insert into cume_dist_test values (3);

select i, cume_dist() over (order by i) from cume_dist_test order by i;

# Expected values:
+---+-----------------------+
| i | cume_dist() OVER(...) |
+---+-----------------------+
| 1 | 0.666666666667        |
| 1 | 0.666666666667        |
| 3 | 1.0                   |
+---+-----------------------+
Fetched 3 row(s) in 0.12s{noformat}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to