Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20397#discussion_r164349078
  
    --- Diff: 
sql/core/src/main/java/org/apache/spark/sql/sources/v2/reader/Distribution.java 
---
    @@ -21,9 +21,9 @@
     
     /**
      * An interface to represent data distribution requirement, which 
specifies how the records should
    - * be distributed among the {@link ReadTask}s that are returned by
    - * {@link DataSourceV2Reader#createReadTasks()}. Note that this interface 
has nothing to do with
    - * the data ordering inside one partition(the output records of a single 
{@link ReadTask}).
    + * be distributed among the {@link DataReaderFactory}s that are returned by
    --- End diff --
    
    `distributed among the data partition(one DataReader outputs data for one 
partition).`


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to