Khurram Faraaz created DRILL-3633:
-------------------------------------
Summary: FIRST_VALUE , LAST_VALUE functions take too long to
complete
Key: DRILL-3633
URL: https://issues.apache.org/jira/browse/DRILL-3633
Project: Apache Drill
Issue Type: Bug
Components: Execution - Flow
Affects Versions: 1.2.0
Environment: private-branch
https://github.com/adeneche/incubator-drill/tree/new-window-funcs
Reporter: Khurram Faraaz
Assignee: Chris Westin
Query that uses FIRST_VALUE function takes twelve minutes to complete on
developers private branch.
{code}
select first_value(key1) over(partition by key2 order by key1) firstValue from
`twoKeyJsn.json`;
...
26,212,355 rows selected (720.229 seconds)
0: jdbc:drill:schema=dfs.tmp>
{code}
{code}
select last_value(key1) over(partition by key2 order by key1) firstValue from
`twoKeyJsn.json`;
...
+------------------+
26,212,355 rows selected (239.109 seconds)
{code}
number of rows in the JSON file
{code}
0: jdbc:drill:schema=dfs.tmp> select count(*) from `twoKeyJsn.json`;
+-----------+
| EXPR$0 |
+-----------+
| 26212355 |
+-----------+
1 row selected (13.949 seconds)
{code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)