Patrick Woody created SPARK-21317:
-------------------------------------
Summary: Avoid unnecessary sort in FileFormatWriter if data is
already bucketed
Key: SPARK-21317
URL: https://issues.apache.org/jira/browse/SPARK-21317
Project: Spark
Issue Type: Bug
Components: SQL
Affects Versions: 2.1.1
Reporter: Patrick Woody
When bucketing in FileFormatWriter, the partition is always sorted on
bucketIdExpression, the partition id produced by the hash bucketing. If the
data is already bucketed in that format, then this expression will be constant
so there is no need to sort.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]