[ https://issues.apache.org/jira/browse/SPARK-42565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17693320#comment-17693320 ]
Apache Spark commented on SPARK-42565: -------------------------------------- User 'huanliwang-db' has created a pull request for this issue: https://github.com/apache/spark/pull/40161 > Error log improve ment for the lock acquisition of RocksDB state store > instance > ------------------------------------------------------------------------------- > > Key: SPARK-42565 > URL: https://issues.apache.org/jira/browse/SPARK-42565 > Project: Spark > Issue Type: Improvement > Components: Structured Streaming > Affects Versions: 3.5.0 > Reporter: Huanli Wang > Priority: Minor > > > {code:java} > "23/02/23 23:57:44 INFO Executor: Running task 2.0 in stage 57.1 (TID 363) > "23/02/23 23:58:44 ERROR RocksDB StateStoreId(opId=0,partId=3,name=default): > RocksDB instance could not be acquired by [ThreadId: Some(49), task: 3.0 in > stage 57, TID 363] as it was not released by [ThreadId: Some(51), task: 3.1 > in stage 57, TID 342] after 60002 ms.{code} > > We are seeing those error messages for a testing query. The *taskId != > partitionId* but we fail to be clear on this in the error log. > It's confusing when we see those logs: the second log entry seems to talk > about `{*}task 3.0{*}` (it's actually partition 3 and retry attempt 0), but > the `{*}TID 363{*}` is already occupied by `{*}task 2.0 in stage 57.1{*}`. > > Also, it's unclear at which stage retry attempt, the lock is acquired (or > fails to be acquired) -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org