[GitHub] [spark] c21 commented on a change in pull request #29277: [SPARK-32421][SQL] Add code-gen for shuffled hash join

GitBox Thu, 30 Jul 2020 21:11:14 -0700


c21 commented on a change in pull request #29277:
URL: https://github.com/apache/spark/pull/29277#discussion_r463395132




##########
File path: 
sql/core/src/main/scala/org/apache/spark/sql/execution/WholeStageCodegenExec.scala
##########
@@ -903,6 +904,10 @@ case class CollapseCodegenStages(
         // The children of SortMergeJoin should do codegen separately.
         j.withNewChildren(j.children.map(
           child => InputAdapter(insertWholeStageCodegen(child))))
+      case j: ShuffledHashJoinExec =>
+        // The children of ShuffledHashJoin should do codegen separately.
+        j.withNewChildren(j.children.map(

Review comment:
       @cloud-fan, @viirya - if we only codegen separately for build side, we 
would still have the same problem as above for multiple SHJs right? Essentially 
we would fuse multiple stream sides codegen together in one codegen method, so 
we will have multiple build side initialized in init(), and naming collision as 
above. Let me know if it doesn't make sense, or I can create a counter example 
here.




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [spark] c21 commented on a change in pull request #29277: [SPARK-32421][SQL] Add code-gen for shuffled hash join

Reply via email to