add option to let hive automatically run in local mode based on tunable 
heuristics
----------------------------------------------------------------------------------

                 Key: HIVE-1408
                 URL: https://issues.apache.org/jira/browse/HIVE-1408
             Project: Hadoop Hive
          Issue Type: New Feature
          Components: Query Processor
            Reporter: Joydeep Sen Sarma
            Assignee: Joydeep Sen Sarma


as a followup to HIVE-543 - we should have a simple option (enabled by default) 
to let hive run in local mode if possible.

two levels of options are desirable:
1. hive.exec.mode.local.auto=true/false // control whether local mode is 
automatically chosen
2. Options to control different heuristics, some naiive examples:
     hive.exec.mode.local.auto.input.size.max=1G // don't choose local mode if 
data > 1G
     hive.exec.mode.local.auto.script.enable=true/false // choose if local mode 
is enabled for queries with user scripts

this can be implemented as a pre/post execution hook. It makes sense to provide 
this as a standard hook in the hive codebase since it's likely to improve 
response time for many users (especially for test queries).

the initial proposal is to choose this at a query level and not at per 
hive-task (ie. hadoop job) level. per job-level requires more changes to 
compilation (to not pre-commit to hdfs or local scratch directories at compile 
time).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to