cloud-fan opened a new pull request #25052: [SPARK-28250][SQL] 
QueryPlan#references should exclude producedAttributes
URL: https://github.com/apache/spark/pull/25052
 
 
   ## What changes were proposed in this pull request?
   
   This is a followup of the discussion in 
https://github.com/apache/spark/pull/24675#discussion_r286786053
   
   `QueryPlan#references` is an important property. The `ColumnPrunning` rule 
relies on it.
   
   Some query plan nodes have `Seq[Attribute]` parameter, which is used as its 
output attributes. For example, leaf nodes, `Generate`, 
`MapPartitionsInPandas`, etc. These nodes override `producedAttributes` to make 
`missingInputs` correct.
   
   However, these nodes also need to override `references` to make column 
pruning work. This PR proposes to exclude `producedAttributes` from the default 
implementation of `QueryPlan#references`.
   
   ## How was this patch tested?
   
   existing tests

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to