Github user rdblue commented on a diff in the pull request:
https://github.com/apache/spark/pull/20933#discussion_r179514000
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/DataSourceScanExec.scala
---
@@ -368,8 +368,7 @@ case class FileSourceScanExec(
val bucketed =
selectedPartitions.flatMap { p =>
p.files.map { f =>
- val hosts = getBlockHosts(getBlockLocations(f), 0, f.getLen)
--- End diff --
Why does this only make sense for this PR? It looks like this is a
reasonable refactor that could be stand-alone.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]