Khurram Faraaz created DRILL-3633:
-------------------------------------

             Summary: FIRST_VALUE, LAST_VALUE functions take too long to complete
                 Key: DRILL-3633
                 URL: https://issues.apache.org/jira/browse/DRILL-3633
             Project: Apache Drill
          Issue Type: Bug
          Components: Execution - Flow
    Affects Versions: 1.2.0
         Environment: private-branch https://github.com/adeneche/incubator-drill/tree/new-window-funcs
            Reporter: Khurram Faraaz
            Assignee: Chris Westin


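A query that uses the FIRST_VALUE function takes twelve minutes (720.229 
seconds) to complete on the developer's private branch. The same query with 
LAST_VALUE takes about four minutes (239.109 seconds).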

{code}
select first_value(key1) over(partition by key2 order by key1) firstValue from 
`twoKeyJsn.json`; 
...

26,212,355 rows selected (720.229 seconds)
0: jdbc:drill:schema=dfs.tmp> 
{code}

{code}
select last_value(key1) over(partition by key2 order by key1) firstValue from 
`twoKeyJsn.json`; 
...

+------------------+
26,212,355 rows selected (239.109 seconds)
{code}

Number of rows in the JSON file:
{code}
0: jdbc:drill:schema=dfs.tmp> select count(*) from `twoKeyJsn.json`;
+-----------+
|  EXPR$0   |
+-----------+
| 26212355  |
+-----------+
1 row selected (13.949 seconds)
{code}
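
To put the three timings side by side, here is a minimal Python sketch that 
relates each window query to the plain count(*) scan of the same file. The 
helper names (slowdown, rows_per_second) are made up for illustration and are 
not part of Drill; the only inputs are the row count and the elapsed times 
reported above.

```python
# Figures copied from the sqlline output in this report.
ROWS = 26_212_355
SCAN_SECONDS = 13.949          # select count(*)
FIRST_VALUE_SECONDS = 720.229  # first_value(...) over(...)
LAST_VALUE_SECONDS = 239.109   # last_value(...) over(...)


def slowdown(query_seconds: float, scan_seconds: float) -> float:
    """How many times longer the query ran than the plain scan."""
    return query_seconds / scan_seconds


def rows_per_second(rows: int, seconds: float) -> float:
    """Average number of rows produced per second of wall-clock time."""
    return rows / seconds


if __name__ == "__main__":
    for name, secs in [("FIRST_VALUE", FIRST_VALUE_SECONDS),
                       ("LAST_VALUE", LAST_VALUE_SECONDS)]:
        print(f"{name}: {slowdown(secs, SCAN_SECONDS):.1f}x the scan time, "
              f"{rows_per_second(ROWS, secs):,.0f} rows/s")
```

The count(*) scan is only a rough baseline, since it neither sorts nor 
partitions the data, so the ratios indicate scale rather than the overhead 
of the window functions alone.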



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
