GitHub user cloud-fan commented on a diff in the pull request:
https://github.com/apache/spark/pull/20397#discussion_r164347542
--- Diff:
sql/core/src/main/java/org/apache/spark/sql/sources/v2/reader/ClusteredDistribution.java
---
@@ -22,7 +22,7 @@
/**
* A concrete implementation of {@link Distribution}. Represents a distribution where records that
* share the same values for the {@link #clusteredColumns} will be produced by the same
- * {@link ReadTask}.
+ * {@link DataReaderFactory}.
--- End diff --
Actually, `DataReader` is more precise here: a `DataReaderFactory` only creates the reader, and the `DataReader` is what produces the records.
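
To illustrate the distinction, here is a minimal sketch with simplified stand-ins for the two interfaces. It assumes only the factory-creates-reader relationship; the real Spark interfaces carry more (for example, `DataReader` is also closeable), and `ReaderSketch`, `ListReaderFactory`, and `readAll` are hypothetical names used purely for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

public class ReaderSketch {
    // Simplified stand-in for DataReader: the object that actually yields records.
    interface DataReader<T> {
        boolean next();
        T get();
    }

    // Simplified stand-in for DataReaderFactory: it only creates a reader.
    interface DataReaderFactory<T> {
        DataReader<T> createDataReader();
    }

    // Hypothetical factory that holds the records of a single partition.
    static final class ListReaderFactory implements DataReaderFactory<String> {
        private final List<String> records;

        ListReaderFactory(List<String> records) {
            this.records = records;
        }

        @Override
        public DataReader<String> createDataReader() {
            final Iterator<String> it = records.iterator();
            return new DataReader<String>() {
                private String current;

                @Override
                public boolean next() {
                    if (!it.hasNext()) {
                        return false;
                    }
                    current = it.next();
                    return true;
                }

                @Override
                public String get() {
                    return current;
                }
            };
        }
    }

    // Drains one reader; every record returned here was produced by that reader,
    // not by the factory that created it.
    static List<String> readAll(DataReaderFactory<String> factory) {
        List<String> out = new ArrayList<>();
        DataReader<String> reader = factory.createDataReader();
        while (reader.next()) {
            out.add(reader.get());
        }
        return out;
    }

    public static void main(String[] args) {
        // All records sharing the clustering value "a" come out of the same reader.
        System.out.println(readAll(new ListReaderFactory(Arrays.asList("a", "a", "a"))));
    }
}
```

In this sketch the factory never hands out a record itself, which is why the Javadoc reads more accurately when it links to `DataReader`.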