Jungtaek Lim created SPARK-55732:
------------------------------------

             Summary: StateStore.putList to receive Iterator (non-materialized 
list of data)
                 Key: SPARK-55732
                 URL: https://issues.apache.org/jira/browse/SPARK-55732
             Project: Spark
          Issue Type: Task
          Components: Structured Streaming
    Affects Versions: 4.2.0
            Reporter: Jungtaek Lim


Currently, StateStore.putList() takes Array[UnsafeRow] as the value list, which 
forces every value in the array to be materialized. This is particularly 
problematic when the elements are generated by the same UnsafeProjection, which 
reuses its output row: the caller of putList() must copy each UnsafeRow into 
the array before calling the method.
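
To illustrate why the copy is needed, here is a minimal sketch that does not use Spark at all. MutableRow and ReusingProjection are hypothetical stand-ins for UnsafeRow and UnsafeProjection; the only behavior assumed is that the projection returns the same mutable row instance on every call.

```scala
// Hypothetical stand-in for UnsafeRow: a mutable row that can be copied.
final class MutableRow(var value: Int) {
  def copy(): MutableRow = new MutableRow(value)
}

// Hypothetical stand-in for an UnsafeProjection: every call overwrites
// and returns the same row instance.
final class ReusingProjection {
  private val buffer = new MutableRow(0)
  def apply(input: Int): MutableRow = { buffer.value = input * 10; buffer }
}

object AliasingSketch {
  // Without copy(): every array slot references the one shared buffer,
  // so all slots show the last projected value.
  def withoutCopy(inputs: Seq[Int]): Array[Int] = {
    val proj = new ReusingProjection
    inputs.map(i => proj(i)).toArray.map(_.value)
  }

  // With copy(): each slot holds its own snapshot, at the cost of one
  // extra allocation per element.
  def withCopy(inputs: Seq[Int]): Array[Int] = {
    val proj = new ReusingProjection
    inputs.map(i => proj(i).copy()).toArray.map(_.value)
  }
}
```

With inputs Seq(1, 2, 3), withoutCopy yields 30, 30, 30 while withCopy yields 10, 20, 30.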

We should add an overloaded version of putList() that takes an Iterator or Seq, 
so that callers do not have to copy UnsafeRows in the above case.
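
A rough sketch of what such an overload could look like follows. SketchStore and SketchRow are hypothetical and deliberately tiny; the real StateStore interface has different parameters, and nothing here reflects the eventual implementation. The point is only that an Iterator-based variant can encode each row as it is pulled, so a reused row never needs to be copied.

```scala
import scala.collection.mutable

// Hypothetical stand-in for UnsafeRow.
final class SketchRow(var value: Int)

// Hypothetical in-memory store, not the real StateStore trait.
final class SketchStore {
  private val data = mutable.Map.empty[String, Vector[Int]]

  // Existing shape: the caller hands over fully materialized rows.
  def putList(key: String, values: Array[SketchRow]): Unit =
    putList(key, values.iterator)

  // Proposed shape (assumed): each row is encoded as soon as it is
  // consumed, before the iterator advances and overwrites it.
  def putList(key: String, values: Iterator[SketchRow]): Unit = {
    val encoded = Vector.newBuilder[Int]
    values.foreach(row => encoded += row.value)
    data(key) = encoded.result()
  }

  def get(key: String): Vector[Int] = data.getOrElse(key, Vector.empty)
}

object PutListSketch {
  def run(): Vector[Int] = {
    val store = new SketchStore
    val shared = new SketchRow(0) // plays the reused projection output
    // Lazy iterator: the shared row is rewritten only when pulled.
    val rows = Iterator(1, 2, 3).map { i => shared.value = i * 10; shared }
    store.putList("k", rows) // no copy() required
    store.get("k")
  }
}
```

Keeping the Array-based method as a thin wrapper over the Iterator-based one leaves existing callers unchanged.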



