Github user tdas commented on a diff in the pull request:

    https://github.com/apache/spark/pull/19452#discussion_r144437093
  
    --- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/StreamingSymmetricHashJoinExec.scala
 ---
    @@ -161,6 +164,10 @@ case class StreamingSymmetricHashJoinExec(
         new SerializableConfiguration(SessionState.newHadoopConf(
           sparkContext.hadoopConfiguration, sqlContext.conf)))
     
    +
    +  val nullLeft = new 
GenericInternalRow(left.output.map(_.withNullability(true)).length)
    +  val nullRight = new 
GenericInternalRow(right.output.map(_.withNullability(true)).length)
    --- End diff --
    
    This can also be moved into the Joiner. Use `Joiner.inputAttributes`, 
instead of `left.output`/`right.output`
    Basically if you have to define variables outside the Joine with the name 
"left", then it highly likely that you can move into the Joiner, as it is meant 
capture all the context needed to process one side.


---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to