L. C. Hsieh created SPARK-49070:
-----------------------------------

             Summary: TransformWithStateExec.initialState is rewritten 
incorrectly to produce invalid query plan
                 Key: SPARK-49070
                 URL: https://issues.apache.org/jira/browse/SPARK-49070
             Project: Spark
          Issue Type: Bug
          Components: SQL, Structured Streaming
    Affects Versions: 4.0.0
            Reporter: L. C. Hsieh


SPARK-47363 added the support for users to provide initial state for streaming 
query. Such query operators like `TransformWithStateExec` might have 
`hasInitialState` as false which means the initial state related parameters are 
not used. But when query planner applies rules on the query, it will still 
apply on the initial state query plan. When `hasInitialState` is false, some 
related parameters  like `initialStateGroupingAttrs` are invalid and some rules 
will use these invalid parameters to transform the initial state query plan.

For example, `EnsureRequirements` may apply invalid Sort and Exchange on the 
initial query plan. We encountered these invalid query plan in our extension 
rules. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to