[
https://issues.apache.org/jira/browse/HIVE-2925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13423792#comment-13423792
]
Namit Jain commented on HIVE-2925:
----------------------------------
It would be very difficult to deploy it this way.
In general, I can think of the following:
1. For queries with limits, this optimization should be enabled.
2. Ideally, it would be good, if there is a threshold of the limit.
3. For queries without limits, given the fact that we dont a cost based
optimizer, it might be a good to
have a threshold on the total input data.
I mean, in general, non MR fetch might make sense for the following.
1. select from a small table (where small is configurable)
2. select from a big table is OK if there is a limit
Note that, it is still possible to get a plan where this optimization might
make not sense.
For eg: select col1 from T where col2 = 10 limit 10;
It is possible that there are very rows for which col2 is 10, so not having a
MR job may really slow down this query.
Solving that would be more difficult without more statistics.
But, it may be a good idea to add more config parameters to tune the
hive.aggresive.fetch.task.conversion appropriately.
It can also be done in a follow-up patch, and is independent of this.
> Support non-MR fetching for simple queries with select/limit/filter
> operations only
> -----------------------------------------------------------------------------------
>
> Key: HIVE-2925
> URL: https://issues.apache.org/jira/browse/HIVE-2925
> Project: Hive
> Issue Type: Improvement
> Affects Versions: 0.10.0
> Reporter: Navis
> Assignee: Navis
> Priority: Trivial
> Attachments: HIVE-2925.D2607.1.patch, HIVE-2925.D2607.2.patch,
> HIVE-2925.D2607.3.patch, HIVE-2925.D2607.4.patch
>
>
> It's trivial but frequently asked by end-users. Currently, select queries
> with simple conditions or limit should run MR job which takes some time
> especially for big tables, making the people irritated.
> For that kind of simple queries, using fetch task would make them happy.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira