Huanli Wang created SPARK-54118:
-----------------------------------

             Summary: Improve the put/merge operation in ListState when t here 
are multiple values
                 Key: SPARK-54118
                 URL: https://issues.apache.org/jira/browse/SPARK-54118
             Project: Spark
          Issue Type: Bug
          Components: Structured Streaming
    Affects Versions: 4.1.0
            Reporter: Huanli Wang


In SS TWS, when we do the `{{{}put(array)` {}}}operation in liststate, we put 
the first element and then merge the remaining elements one by one. so if we 
want to put an array with 100 elements, it means we need do 1 put + 99 merges. 
This can result in worse performance than a single put operation for the entire 
array.

 

Similar, we have the same issue in {{`merge(array)`}} 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to