[ 
https://issues.apache.org/jira/browse/SPARK-47169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mihailo Milosevic updated SPARK-47169:
--------------------------------------
    Description: 
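What changes were proposed in this pull request?
Disable bucketing on columns that use a non-default collation.

Why are the changes needed?
In the current implementation, bucket IDs are generated from the raw string 
value, which assumes that strings comparing equal always produce the same ID. 
With a non-default collation this no longer holds: two strings that are equal 
under the collation can differ in their raw values and therefore receive 
different bucket IDs.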
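
The mismatch can be illustrated with a small sketch. This is not Spark's 
implementation: SHA-256 stands in for the hash Spark actually uses, 
`NUM_BUCKETS` is a hypothetical bucket count, and `collation_key` is a toy 
case-insensitive collation built on `str.casefold`. It only shows that hashing 
the raw value and comparing under a collation can disagree.

```python
import hashlib

NUM_BUCKETS = 8  # hypothetical bucket count, chosen for illustration


def stable_hash(value: str) -> int:
    """Stand-in hash (an assumption, not Spark's): 64 bits derived from the UTF-8 bytes."""
    digest = hashlib.sha256(value.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def bucket_id(value: str, num_buckets: int = NUM_BUCKETS) -> int:
    """Map a string to a bucket by hashing its raw value."""
    return stable_hash(value) % num_buckets


def collation_key(value: str) -> str:
    """Toy case-insensitive collation: strings with the same key compare equal."""
    return value.casefold()


# These values are all equal under the toy collation...
variants = ["spark", "Spark", "SPARK"]
assert len({collation_key(v) for v in variants}) == 1

# ...but their raw hashes differ, so nothing forces them into the same bucket.
for v in variants:
    print(v, stable_hash(v), bucket_id(v))

# Hashing the collation key instead would put equal values in one bucket.
print({bucket_id(collation_key(v)) for v in variants})
```

Rows that a collation-aware join or filter treats as equal may therefore sit 
in different buckets, which is why bucketing on such columns is unsafe unless 
the bucket ID is derived from a collation-aware key.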

> Disable bucketing on collated columns
> -------------------------------------
>
>                 Key: SPARK-47169
>                 URL: https://issues.apache.org/jira/browse/SPARK-47169
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 4.0.0
>            Reporter: Mihailo Milosevic
>            Priority: Major
>              Labels: pull-request-available
>
> What changes were proposed in this pull request?
> Disable bucketing on columns that use a non-default collation.
> Why are the changes needed?
> In the current implementation, bucket IDs are generated from the raw string 
> value, which assumes that strings comparing equal always produce the same 
> ID. With a non-default collation this no longer holds: two strings that are 
> equal under the collation can differ in their raw values and therefore 
> receive different bucket IDs.



