Github user cloud-fan commented on a diff in the pull request:
https://github.com/apache/spark/pull/19781#discussion_r151959078
--- Diff:
sql/core/src/main/scala/org/apache/spark/sql/execution/joins/BroadcastHashJoinExec.scala
---
@@ -90,6 +90,10 @@ case class BroadcastHashJoinExec(
case _ => false
--- End diff --
ah i see where is the problem. Even the join produce one row for one input,
we still need to copy the result if child needs it. Since it's a broadcast
join, we only need to check the streaming side?
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]