Github user rdblue commented on a diff in the pull request:
https://github.com/apache/spark/pull/19623#discussion_r148849054
--- Diff:
sql/core/src/main/java/org/apache/spark/sql/sources/v2/reader/ReadTask.java ---
@@ -36,14 +36,24 @@
/**
* The preferred locations where this read task can run faster, but
Spark does not guarantee that
* this task will always run on these locations. The implementations
should make sure that it can
- * be run on any location. The location is a string representing the
host name of an executor.
+ * be run on any location. The location is a string representing the
host name.
+ *
+ * Note that if a host name cannot be recognized by Spark, it will be
ignored as it was not in
+ * the returned locations. By default this method returns empty string,
which means this task
--- End diff --
This isn't the empty string, it is a 0-length array.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]