Joe McDonnell created IMPALA-13688:
--------------------------------------
Summary: Add test for cume_dist with duplicate values
Key: IMPALA-13688
URL: https://issues.apache.org/jira/browse/IMPALA-13688
Project: IMPALA
Issue Type: Task
Components: Test
Affects Versions: Impala 4.5.0
Reporter: Joe McDonnell
A crucial piece of the behavior of cume_dist() is handling duplicates properly.
Our existing analytic test cases don't adequately test this. We should add a
test that verifies cume_dist() for duplicates.
Here is an example:
{noformat}
create table cume_dist_test (i int);
insert into cume_dist_test values (1);
insert into cume_dist_test values (1);
insert into cume_dist_test values (3);
select i, cume_dist() over (order by i) from cume_dist_test order by i;
# Expected values:
+---+-----------------------+
| i | cume_dist() OVER(...) |
+---+-----------------------+
| 1 | 0.666666666667 |
| 1 | 0.666666666667 |
| 3 | 1.0 |
+---+-----------------------+
Fetched 3 row(s) in 0.12s{noformat}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)