Ryan Blue created SPARK-23889:
---------------------------------

             Summary: Add interfaces to pass required sorting and clustering 
for writes
                 Key: SPARK-23889
                 URL: https://issues.apache.org/jira/browse/SPARK-23889
             Project: Spark
          Issue Type: Sub-task
          Components: SQL
    Affects Versions: 2.3.0
            Reporter: Ryan Blue


>From a [discussion on the dev 
>list|https://lists.apache.org/thread.html/d8bb72fc9b4be8acc3f49367bfc99cbf029194a58333eba69df49717@%3Cdev.spark.apache.org%3E],
> there is consensus around adding interfaces to pass required sorting and 
>clustering to Spark. The proposal is to add:


{code:java}
interface RequiresClustering {
  Set<Expression> requiredClustering();
}

interface RequiresSort {
  List<SortOrder> requiredOrdering();
}
{code}

When only {{RequiresSort}} is present, the sort would produce a global sort. 
The partitioning introduced by that sort would be overridden by 
{{RequiresClustering}}, making the sort local to each partition.




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to