L. C. Hsieh created SPARK-33814:
-----------------------------------

             Summary: Provide preferred locations for stateful operations 
without reported state store locations
                 Key: SPARK-33814
                 URL: https://issues.apache.org/jira/browse/SPARK-33814
             Project: Spark
          Issue Type: Improvement
          Components: Structured Streaming
    Affects Versions: 3.2.0
            Reporter: L. C. Hsieh
            Assignee: L. C. Hsieh


Stateful operators in SS provides preferred locations on the previous batches 
if any. However, if there is no previous batch to follow, Spark possibly 
schedules stateful tasks in inefficient distribution. As stateful operations 
probably need to maintain large state stores, it is better we schedule stateful 
tasks across all executors.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to