[ 
https://issues.apache.org/jira/browse/HIVE-23831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ayush Saxena updated HIVE-23831:
--------------------------------
    Summary: Make auto reducer parallelism min threshold configurable.  (was: 
Introduce a threshold to turn on or off auto-parallelism of Tez)

> Make auto reducer parallelism min threshold configurable.
> ---------------------------------------------------------
>
>                 Key: HIVE-23831
>                 URL: https://issues.apache.org/jira/browse/HIVE-23831
>             Project: Hive
>          Issue Type: Improvement
>          Components: Hive
>            Reporter: okumin
>            Assignee: okumin
>            Priority: Major
>              Labels: pull-request-available
>
> Hive on Tez disables auto reducer parallelism and uses the maximum number of 
> partitions when the minimum number of reducers, `estimated number of reducers * 
> hive.tez.min.partition.factor`, is less than or equal to 1.
> [https://github.com/apache/hive/blob/f98e136bdd5642e3de10d2fd1a4c14d1d6762113/ql/src/java/org/apache/hadoop/hive/ql/parse/GenTezUtils.java#L121-L131]
> https://issues.apache.org/jira/browse/HIVE-14200
>  
> This can cause Tez to lose the chance to optimize the number of reducers, 
> especially in an environment with a small `hive.tez.min.partition.factor` or a 
> large `hive.tez.max.partition.factor`.
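
The rule described above can be illustrated with a minimal sketch. It follows the wording of the description only; the linked GenTezUtils code is authoritative. The class `AutoParallelismSketch`, the `Decision` holder, and the `decide` method are hypothetical names, not Hive APIs, and the factor values 0.25 and 2.0 are example inputs.

```java
public class AutoParallelismSketch {

    /** Hypothetical result holder; not a Hive class. */
    static final class Decision {
        final boolean autoParallelism;
        final int minReducers;
        final int maxReducers;

        Decision(boolean autoParallelism, int minReducers, int maxReducers) {
            this.autoParallelism = autoParallelism;
            this.minReducers = minReducers;
            this.maxReducers = maxReducers;
        }
    }

    /**
     * Applies the rule as stated in the issue description: when
     * estimated * minFactor is less than or equal to 1, auto reducer
     * parallelism is turned off and the maximum number of partitions is used.
     */
    static Decision decide(int estimatedReducers, double minFactor, double maxFactor) {
        double scaledMin = estimatedReducers * minFactor;
        int maxPartitions = Math.max(1, (int) (estimatedReducers * maxFactor));

        if (scaledMin <= 1.0) {
            // Auto-parallelism off: the reducer count is pinned to the maximum.
            return new Decision(false, maxPartitions, maxPartitions);
        }
        // Auto-parallelism on: Tez may choose any count in [min, max].
        return new Decision(true, (int) scaledMin, maxPartitions);
    }

    public static void main(String[] args) {
        // Small estimate: 3 * 0.25 = 0.75, so the job runs with 6 fixed reducers.
        Decision small = decide(3, 0.25, 2.0);
        System.out.println(small.autoParallelism + " " + small.minReducers + " " + small.maxReducers);

        // Larger estimate: 40 * 0.25 = 10, so Tez may pick between 10 and 80.
        Decision large = decide(40, 0.25, 2.0);
        System.out.println(large.autoParallelism + " " + large.minReducers + " " + large.maxReducers);
    }
}
```

In the first call the estimate is too small for the range to be considered meaningful, so the job is pinned to the upper bound of 6 reducers rather than letting Tez shrink it at runtime. That is the lost optimization the issue proposes to make configurable.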



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
