[GitHub] [iceberg] rdblue opened a new pull request #1512: Spark: Implement equals and hashCode in SparkBatchScan

GitBox Fri, 25 Sep 2020 13:42:34 -0700


rdblue opened a new pull request #1512:
URL: https://github.com/apache/iceberg/pull/1512



   Spark creates a "canonicalized" plan to determine whether two plan trees 
produce the same result. When they do, Spark can reuse shuffle data to reduce 
the number of stages. In DSv2, the equality of Spark's `BatchScanExec` depends 
on the equality of the `Scan` produced by the source.
   
   This implements `equals` and `hashCode` in `SparkBatchScan` so that the 
canonicalized plan is equal when scans are equivalent.
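
   As a rough illustration of the idea, here is a minimal sketch of value-based 
`equals` and `hashCode` on a scan-like class. `ExampleScan` and all of its 
fields are hypothetical stand-ins; this description does not list which fields 
`SparkBatchScan` actually compares, so the choice below is an assumption and 
not the code in this PR.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Objects;

// Hypothetical stand-in for a DSv2 scan; NOT the actual SparkBatchScan code.
final class ExampleScan {
  // All fields are assumptions about what could define "the same scan".
  private final String tableName;        // which table is read
  private final long snapshotId;         // which snapshot is read
  private final List<String> filters;    // pushed-down filters, as plain strings
  private final List<String> projection; // projected column names

  ExampleScan(String tableName, long snapshotId,
              List<String> filters, List<String> projection) {
    this.tableName = Objects.requireNonNull(tableName);
    this.snapshotId = snapshotId;
    // Defensive copies so later changes to the caller's lists cannot alter equality.
    this.filters = new ArrayList<>(filters);
    this.projection = new ArrayList<>(projection);
  }

  @Override
  public boolean equals(Object o) {
    if (this == o) {
      return true;
    }
    if (o == null || getClass() != o.getClass()) {
      return false;
    }
    ExampleScan that = (ExampleScan) o;
    return snapshotId == that.snapshotId
        && tableName.equals(that.tableName)
        && filters.equals(that.filters)
        && projection.equals(that.projection);
  }

  @Override
  public int hashCode() {
    // Uses the same fields as equals, so equal scans share a hash code.
    return Objects.hash(tableName, snapshotId, filters, projection);
  }
}

class ScanEqualityDemo {
  public static void main(String[] args) {
    ExampleScan a = new ExampleScan("db.t", 1L,
        Arrays.asList("id > 5"), Arrays.asList("id", "data"));
    ExampleScan b = new ExampleScan("db.t", 1L,
        Arrays.asList("id > 5"), Arrays.asList("id", "data"));
    ExampleScan c = new ExampleScan("db.t", 2L,
        Arrays.asList("id > 5"), Arrays.asList("id", "data"));

    System.out.println(a.equals(b));                  // prints "true"
    System.out.println(a.hashCode() == b.hashCode()); // prints "true"
    System.out.println(a.equals(c));                  // prints "false"
  }
}
```

   Without such overrides, two separately constructed scans fall back to 
`Object`'s identity-based `equals`, so they would not compare equal even when 
they describe the same read.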


