[ https://issues.apache.org/jira/browse/DRILL-3633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14694136#comment-14694136 ]
Khurram Faraaz commented on DRILL-3633:
---------------------------------------
Good point. Total time taken = (time spent reading input + time spent
computing results + time spent printing results).
To exclude the time spent printing results, I ran a count over that query, and
it took close to two minutes (108.507 seconds) to complete on Drill. I have not
run the same query on another system to compare execution times, so for now we
don't know whether two minutes is the expected time for this query.
{code}
0: jdbc:drill:schema=dfs.tmp> select count(firstValue) from (select
first_value(key1) over(partition by key2 order by key1) as firstValue from
`twoKeyJsn.json`);
+-----------+
| EXPR$0 |
+-----------+
| 26212355 |
+-----------+
1 row selected (108.507 seconds)
{code}
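As a rough illustration of the breakdown above, the timings reported in this issue can be split into approximate read, compute, and print parts. This is only a sketch under stated assumptions: it assumes the count(...) wrapper runs the same window computation as the original FIRST_VALUE query, that count(*) approximates the cost of reading the file, and that the runs are comparable even though they were taken at different times. The helper name `estimate_breakdown` is hypothetical and not part of Drill.

```python
# Rough breakdown of the reported timings (all values in seconds).
# Assumptions (not verified): the count(...) wrapper performs the same window
# computation as the original query, and count(*) approximates the read cost.


def estimate_breakdown(full_query_s, count_over_query_s, count_star_s):
    """Split the full query time into approximate read/compute/print parts."""
    read_s = count_star_s
    compute_s = count_over_query_s - count_star_s
    print_s = full_query_s - count_over_query_s
    return {
        "read": round(read_s, 3),
        "compute": round(compute_s, 3),
        "print": round(print_s, 3),
    }


if __name__ == "__main__":
    # Timings copied from the sqlline output in this issue (FIRST_VALUE case).
    total_s = 720.229
    parts = estimate_breakdown(total_s, 108.507, 13.949)
    for name, seconds in parts.items():
        print(f"{name:>8}: {seconds:9.3f} s ({seconds / total_s:.1%} of total)")
```

If these assumptions hold, most of the twelve minutes would be attributable to printing 26,212,355 rows rather than to the window function itself, but a query profile would be needed to confirm that.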
> FIRST_VALUE , LAST_VALUE functions take too long to complete
> ------------------------------------------------------------
>
> Key: DRILL-3633
> URL: https://issues.apache.org/jira/browse/DRILL-3633
> Project: Apache Drill
> Issue Type: Bug
> Components: Execution - Flow
> Affects Versions: 1.2.0
> Environment: private-branch
> https://github.com/adeneche/incubator-drill/tree/new-window-funcs
> Reporter: Khurram Faraaz
> Assignee: Deneche A. Hakim
> Labels: window_funcion
>
> A query that uses the FIRST_VALUE function takes twelve minutes to complete
> on the developer's private branch.
> {code}
> select first_value(key1) over(partition by key2 order by key1) firstValue
> from `twoKeyJsn.json`;
> ...
> 26,212,355 rows selected (720.229 seconds)
> 0: jdbc:drill:schema=dfs.tmp>
> {code}
> {code}
> select last_value(key1) over(partition by key2 order by key1) firstValue from
> `twoKeyJsn.json`;
> ...
> +------------------+
> 26,212,355 rows selected (239.109 seconds)
> {code}
> Number of rows in the JSON file:
> {code}
> 0: jdbc:drill:schema=dfs.tmp> select count(*) from `twoKeyJsn.json`;
> +-----------+
> | EXPR$0 |
> +-----------+
> | 26212355 |
> +-----------+
> 1 row selected (13.949 seconds)
> {code}