rdblue commented on a change in pull request #30562:
URL: https://github.com/apache/spark/pull/30562#discussion_r534340700
##########
File path:
sql/catalyst/src/main/java/org/apache/spark/sql/connector/catalog/SupportsDelete.java
##########
@@ -28,6 +28,25 @@
*/
@Evolving
public interface SupportsDelete {
+
+  /**
+   * Checks whether it is possible to delete data from a data source table that matches filter
+   * expressions.
+   * <p>
+   * Rows should be deleted from the data source iff all of the filter expressions match.
+   * That is, the expressions must be interpreted as a set of filters that are ANDed together.
+   * <p>
+   * Spark will call this method to check if the delete is possible without significant effort.
Review comment:
> My worry is that we may change the API back and forth if we don't have a clear big picture. Right now this patch is not useful as it only changes where we throw the exception, but I can see that this will be useful when we have the row-level delete API and we can use the canDeleteWhere to decide if we want to use the row-level API or not.

This is exactly the reason for adding this API. It is a step toward rewriting plans for row-level DELETE and MERGE operations. The current approach of throwing an exception from `deleteWhere` happens while the physical plan is running, when it is already too late to rewrite the plan for row-level changes. Adding the `canDeleteWhere` check fixes that problem.

Since you can easily see how it will be used, what is the concern about adding this?
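
For illustration only (not part of this PR), here is a minimal sketch of how a connector might implement `canDeleteWhere` alongside `deleteWhere`. The table class, the `date` partition column, and the `dropMatchingPartitions` helper are hypothetical. The sketch assumes a connector that only supports metadata-only deletes: it accepts a delete when every ANDed filter references partition columns alone and rejects it otherwise, so Spark can instead rewrite the plan for row-level changes.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.Set;

import org.apache.spark.sql.connector.catalog.SupportsDelete;
import org.apache.spark.sql.connector.catalog.Table;
import org.apache.spark.sql.connector.catalog.TableCapability;
import org.apache.spark.sql.sources.Filter;
import org.apache.spark.sql.types.StructType;

// Hypothetical connector table used only to sketch the canDeleteWhere contract.
public class PartitionedDemoTable implements Table, SupportsDelete {

  // Assumption for the sketch: the table is partitioned by a single "date" column.
  private static final Set<String> PARTITION_COLUMNS = Collections.singleton("date");

  @Override
  public boolean canDeleteWhere(Filter[] filters) {
    // Accept the delete only if every ANDed filter references partition columns,
    // i.e. the delete can be done by dropping whole partitions without rewriting
    // data files. Returning false lets Spark fall back to a row-level plan.
    return Arrays.stream(filters)
        .allMatch(f -> PARTITION_COLUMNS.containsAll(Arrays.asList(f.references())));
  }

  @Override
  public void deleteWhere(Filter[] filters) {
    // Metadata-only delete: drop the partitions matched by the ANDed filters.
    // Only reached when canDeleteWhere returned true for the same filters.
    dropMatchingPartitions(filters);
  }

  private void dropMatchingPartitions(Filter[] filters) {
    // Placeholder for connector-specific partition pruning and deletion.
  }

  @Override
  public String name() {
    return "demo.partitioned_table";
  }

  @Override
  public StructType schema() {
    return new StructType()
        .add("id", "long")
        .add("date", "string");
  }

  @Override
  public Set<TableCapability> capabilities() {
    return Collections.singleton(TableCapability.BATCH_READ);
  }
}
```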