Github user gatorsmile commented on a diff in the pull request:
https://github.com/apache/spark/pull/21145#discussion_r186140632
--- Diff:
sql/core/src/main/java/org/apache/spark/sql/sources/v2/reader/ReadTask.java ---
@@ -22,20 +22,20 @@
import org.apache.spark.annotation.InterfaceStability;
/**
- * A reader factory returned by {@link
DataSourceReader#createDataReaderFactories()} and is
+ * A read task returned by {@link DataSourceReader#createReadTasks()} and
is
--- End diff --
I still think `ReadTask` is confusing. I were asked by multiple people what
is a `ReadTask`, especially when a `Task` is already clearly defined as a unit
of execution in Spark.
https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/scheduler/Task.scala#L31-L51
Let us avoid using `Task` here.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]