chasingegg commented on pull request #35168:
URL: https://github.com/apache/spark/pull/35168#issuecomment-1014070459


   @cloud-fan Hello. Besides the PushProjectionThroughUnion issue I explained 
above, I suspect the code below also contributes to this bug:
   ```
     private def collectFromPlan(plan: SparkPlan): Array[T] = {
       val fromRow = resolvedEnc.createDeserializer()
       plan.executeCollect().map(fromRow)
     }
   ```
   `fromRow` binds references (`bindReference`) when it deserializes each row. 
The first child of the Union determines the attributes of the final Union 
result, and that child has duplicate columns, so for rows coming from the 
second child the deserializer takes the first column twice.
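
To make the suspected mechanism concrete, here is a minimal sketch in plain Scala. It is a simplified model and not Spark's actual code: `Attr`, `bind`, and `BindingSketch` are hypothetical names, and the assumption is that binding resolves each attribute to the ordinal of the first output attribute with a matching id.

```scala
object BindingSketch {
  // Hypothetical stand-in for an attribute reference, identified by an id.
  final case class Attr(name: String, exprId: Long)

  // Assumed binding rule: each reference maps to the ordinal of the FIRST
  // schema attribute that carries the same exprId.
  def bind(refs: Seq[Attr], schema: Seq[Attr]): Seq[Int] =
    refs.map(r => schema.indexWhere(_.exprId == r.exprId))

  // The first Union child selects the same column twice, so both output
  // attributes of the Union share one exprId.
  val a: Attr = Attr("a", 1L)
  val unionOutput: Seq[Attr] = Seq(a, a)

  // Both references resolve to ordinal 0.
  val ordinals: Seq[Int] = bind(unionOutput, unionOutput)

  // A row from the second child, which has two distinct columns.
  val secondChildRow: Vector[String] = Vector("c-value", "d-value")

  // Reading the row through the bound ordinals drops the second column.
  val decoded: Seq[String] = ordinals.map(i => secondChildRow(i))

  def main(args: Array[String]): Unit = {
    println(ordinals)
    println(decoded)
  }
}
```

Under this assumption both ordinals are 0, so the second column of the second child's row is never read, which matches the symptom described above.
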
   If you have time, could you please take a look? Thanks.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
