[ 
https://issues.apache.org/jira/browse/HIVE-29458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated HIVE-29458:
----------------------------------
    Labels: pull-request-available  (was: )

> Z-order and custom sort expressions in WRITE LOCALLY ORDERED BY cause data 
> scatter across reducers due to hash distribution
> ---------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HIVE-29458
>                 URL: https://issues.apache.org/jira/browse/HIVE-29458
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Kokila N
>            Assignee: Kokila N
>            Priority: Major
>              Labels: pull-request-available
>
> *Problem*
> When 
> {code:sql}
> WRITE LOCALLY ORDERED BY zorder(){code}
>  or another custom sort expression is used on an Iceberg table, the sort 
> expressions are added to the ReduceSink partition columns (distribution 
> keys). As a result, data is scattered across reducers based on 
> `hash(partition_col, sort_expr) % num_reducers`.
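
The hash distribution described above can be illustrated with a small simulation. This is a minimal sketch, not Hive code: `toy_hash` is a stand-in 31-based combiner rather than the hash function Hive actually uses, and the reducer count, row data, and helper names are all hypothetical. It counts how many distinct reducers receive rows for each partition value, first when the sort expression is part of the distribution key and then, for contrast, when only the partition column is hashed.

```python
from collections import defaultdict

NUM_REDUCERS = 4  # hypothetical reducer count


def toy_hash(key):
    """Stand-in 31-based combiner; NOT the hash function Hive uses."""
    h = 0
    for value in key:
        h = 31 * h + value
    return h


def reducers_per_partition(rows, key_fn, num_reducers=NUM_REDUCERS):
    """Count the distinct reducers that receive rows of each partition value."""
    seen = defaultdict(set)
    for partition_col, sort_expr in rows:
        key = key_fn(partition_col, sort_expr)
        seen[partition_col].add(toy_hash(key) % num_reducers)
    return {part: len(reducers) for part, reducers in seen.items()}


# Hypothetical data: 3 partition values, 100 distinct sort-expression values each.
rows = [(part, z) for part in range(3) for z in range(100)]

# Distribution key includes the sort expression, as in the reported behaviour.
with_sort_expr = reducers_per_partition(rows, lambda p, z: (p, z))

# Distribution key is the partition column only, shown for contrast.
partition_only = reducers_per_partition(rows, lambda p, z: (p,))

print("hash(partition_col, sort_expr):", with_sort_expr)
print("hash(partition_col):           ", partition_only)
```

In this toy setup every partition value is spread over all four reducers when the sort expression is part of the key, and lands on exactly one reducer when only the partition column is hashed.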



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
