[ 
https://issues.apache.org/jira/browse/SPARK-44480?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17749433#comment-17749433
 ] 

Snoot.io commented on SPARK-44480:
----------------------------------

User 'ericm-db' has created a pull request for this issue:
https://github.com/apache/spark/pull/42066

> Add option for thread pool to perform maintenance for RocksDB/HDFS State 
> Store Providers
> ----------------------------------------------------------------------------------------
>
>                 Key: SPARK-44480
>                 URL: https://issues.apache.org/jira/browse/SPARK-44480
>             Project: Spark
>          Issue Type: Improvement
>          Components: Structured Streaming
>    Affects Versions: 3.5.0
>            Reporter: Eric Marnadi
>            Priority: Major
>
> Maintenance tasks on StateStore was being done by a single background thread, 
> which is prone to straggling. In this change, the single background thread 
> would instead schedule maintenance tasks to a thread pool.
> Introduce 
> {{spark.sql.streaming.stateStore.enableStateStoreMaintenanceThreadPool}} 
> config so that the user can enable a thread pool for maintenance manually.
> Introduce {{spark.sql.streaming.stateStore.numStateStoreMaintenanceThreads}} 
> config so the thread pool size is configurable.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to