Github user dongjoon-hyun commented on a diff in the pull request:
https://github.com/apache/spark/pull/22661#discussion_r224375578
--- Diff:
sql/core/src/test/scala/org/apache/spark/sql/execution/benchmark/JoinBenchmark.scala
---
@@ -48,13 +48,11 @@ object JoinBenchmark extends SqlBasedBenchmark {
}
}
-
def broadcastHashJoinLongKeyWithDuplicates(): Unit = {
val N = 20 << 20
val M = 1 << 16
-
+ val dim = broadcast(spark.range(M).selectExpr("cast(id/10 as long) as
k"))
--- End diff --
For this change, we need rerun the benchmark to get a new result.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]