[ 
https://issues.apache.org/jira/browse/HIVE-12574?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aihua Xu updated HIVE-12574:
----------------------------
    Description: 
In PTF windowing, when the partition is small and the window size is larger 
than the partition size, we are seeing incorrect result. It happens for max, 
min, first_value, last_value and sum functions. 

{noformat}
CREATE TABLE sdy1(
ord int,
type string);
{noformat}

The data is:
{noformat}
2 a
3 a
1 a 
{noformat}

The result is as follows for the query {{select ord, min(ord) over (partition 
by type order by ord rows between 1 preceding and 7 following)}}
{noformat}
1 1
2 1
3 1 
{noformat}

The expected result is:
{noformat}
1 1
2 1
3 2
{noformat}

  was:
In PTF windowing, when the partition is small and the window size is larger 
than the partition size, we are seeing incorrect result. It happens for max, 
min, first_value, last_value, sum and collect_set functions.

{noformat}
CREATE TABLE sdy1(
ord int,
type string);
{noformat}

The data is:
{noformat}
2 a
3 a
1 a 
{noformat}

The result is as follows for the query {{select ord, collect_set(ord) over 
(partition by type order by ord rows between 1 preceding and 7 following)}}
{noformat}
1 [1,2,3]
2 [1,2,3]
3 [1,2,3] 
{noformat}

The expected result is:
{noformat}
1 [1,2,3]
2 [1,2,3]
3 [2,3]
{noformat}


> windowing function returns incorrect result when the window size is larger 
> than the partition size
> --------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-12574
>                 URL: https://issues.apache.org/jira/browse/HIVE-12574
>             Project: Hive
>          Issue Type: Sub-task
>          Components: PTF-Windowing
>    Affects Versions: 2.0.0
>            Reporter: Aihua Xu
>            Assignee: Aihua Xu
>             Fix For: 1.3.0, 2.0.0
>
>
> In PTF windowing, when the partition is small and the window size is larger 
> than the partition size, we are seeing incorrect result. It happens for max, 
> min, first_value, last_value and sum functions. 
> {noformat}
> CREATE TABLE sdy1(
> ord int,
> type string);
> {noformat}
> The data is:
> {noformat}
> 2 a
> 3 a
> 1 a 
> {noformat}
> The result is as follows for the query {{select ord, min(ord) over (partition 
> by type order by ord rows between 1 preceding and 7 following)}}
> {noformat}
> 1 1
> 2 1
> 3 1 
> {noformat}
> The expected result is:
> {noformat}
> 1 1
> 2 1
> 3 2
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to