[ 
https://issues.apache.org/jira/browse/SPARK-45939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17788894#comment-17788894
 ] 

Anish Shrigondekar commented on SPARK-45939:
--------------------------------------------

Added the docs here -

SPIP: 
[https://docs.google.com/document/d/1QtC5qd4WQEia9kl1Qv74WE0TiXYy3x6zeTykygwPWig/edit?usp=sharing]

Design Doc: 
[https://docs.google.com/document/d/1QjZmNZ-fHBeeCYKninySDIoOEWfX6EmqXs2lK097u9o/edit?usp=sharing]

> SPIP: Structured Streaming - Arbitrary State API v2
> ---------------------------------------------------
>
>                 Key: SPARK-45939
>                 URL: https://issues.apache.org/jira/browse/SPARK-45939
>             Project: Spark
>          Issue Type: New Feature
>          Components: Structured Streaming
>    Affects Versions: 4.0.0
>            Reporter: Anish Shrigondekar
>            Priority: Major
>
> We plan to introduce a new operator for [Spark Structured 
> Streaming|https://spark.apache.org/streaming/] to support [arbitrary stateful 
> operations|https://spark.apache.org/docs/latest/structured-streaming-programming-guide.html#arbitrary-stateful-operations].
>  The existing operators 
> ([mapGroupsWithState/flatMapGroupsWithState|https://spark.apache.org/docs/3.1.3/api/java/org/apache/spark/sql/streaming/GroupState.html])
>  have several limitations (as noted in Q3 below) that prevent their use in 
> complex or advanced stateful stream processing use cases. We plan to remove 
> these limitations by introducing a new API for state manipulation in 
> streaming queries.
>  
> As part of this effort, we want to address:
>  * composite types
>  * multiple state variables (flexible data modeling)
>  * state TTL
>  * callback/timer improvements
>  * schema evolution
>  * initial state
>  * side outputs
>  * output modes
>  * co-process functions
>  * chaining operators after the new operator, etc.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
