kinjal63 opened a new issue, #10452: URL: https://github.com/apache/ignite/issues/10452
I run Ignite as a local node within my java application process where I've observed a few Ignite crashes as mentioned below intermittently. In fact, I'm using Ignite as a SQL engine and around 100k records are persisted on an average and a continuous stream of data is getting updated for a given primary key in a record. Ignite is running in a persistent mode with a low memory footprint. Error Example 1: ERROR [tcp-disco-msg-worker] Critical system error detected. Will be handled accordingly to configured handler [hnd=StopNodeFailureHandler [failureCtx=FailureContext [type=SYSTEM_WORKER_BLOCKED, err=class o.a.i.IgniteException: GridWorker [name=db-checkpoint-thread, igniteInstanceName=null, finished=false, heartbeatTs=1671441824592]]] Error Example 2: ERROR [node-stopper] Stopping local node on Ignite failure: [failureCtx=FailureContext [type=SYSTEM_CRITICAL_OPERATION_TIMEOUT, err=class o.a.i.IgniteException: Checkpoint read lock acquisition has been timed out.]] What could be the potential issue here? I've tried put in required config such as systemWorkerBlockedTimeout set to ~30sec and enabled page throtting for checkpointing process but that doesn't help. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
