Hi,
In your case, striped pool starvation messages are showing that some
Ignite related tasks
are taking longer then a preconfigured threshold. (See
https://ignite.apache.org/releases/latest/javadoc/org/apache/ignite/configuration/IgniteConfiguration.html#getFailureDetectionTimeout--)
As long as the system works through all the tasks w/out any errors, no
data is being corrupted.
Looking at the Thread stacks below I see that most of the slow tasks
are related to I/O and memory allocation.
Per the metrics, you are running at the bounds of your configured default
data region.
I would increase the amount of memory given to this data region.
Follow these guides to optimize I/O and durable memory:
https://apacheignite.readme.io/docs/performance-tips
https://apacheignite.readme.io/v2.7.0/docs/durable-memory-tuning
If you are using putAll in your code:
https://apacheignite.readme.io/docs/jcache#basic-operations
If batch operations (such as IgniteCache#putAll, IgniteCache#invokeAll,
etc.) are performed in parallel, then keys should be ordered
in the same way to avoid deadlock. Use TreeMap instead of HashMap to
guarantee consistent ordering.
Thanks, Alex
----------------------------------------------------------------------------------
Look at the free indicators for default memory region.
^-- Off-heap [used=20084MB, free=3.35%, comm=20680MB]
^-- sysMemPlc region [used=0MB, free=99.98%, comm=100MB]
^-- default region [used=20084MB, free=1.93%, comm=20480MB]
I/O operation
Thread [name="sys-stripe-2-#3", id=28, state=WAITING, blockCnt=120,
waitCnt=5301307]
at sun.misc.Unsafe.park(Native Method)
at java.util.concurrent.locks.LockSupport.park(LockSupport.java:304)
at
o.a.i.i.util.future.GridFutureAdapter.get0(GridFutureAdapter.java:177)
at
o.a.i.i.util.future.GridFutureAdapter.getUninterruptibly(GridFutureAdapter.java:145)
at
o.a.i.i.processors.cache.persistence.file.AsyncFileIO.read(AsyncFileIO.java:128)
at
o.a.i.i.processors.cache.persistence.file.AbstractFileIO$2.run(AbstractFileIO.java:89)
at
o.a.i.i.processors.cache.persistence.file.AbstractFileIO.fully(AbstractFileIO.java:52)
at
o.a.i.i.processors.cache.persistence.file.AbstractFileIO.readFully(AbstractFileIO.java:87)
at
o.a.i.i.processors.cache.persistence.file.FilePageStore.readWithFailover(FilePageStore.java:791)
off-heap memory related operation.
Thread [name="sys-stripe-1-#2", id=27, state=RUNNABLE, blockCnt=70,
waitCnt=6227939]
at
o.a.i.i.processors.cache.persistence.pagemem.PageMemoryImpl.acquirePage(PageMemoryImpl.java:735)
at
o.a.i.i.processors.cache.persistence.pagemem.PageMemoryImpl.acquirePage(PageMemoryImpl.java:698)
at
o.a.i.i.processors.cache.persistence.pagemem.PageMemoryImpl.acquirePage(PageMemoryImpl.java:687)
at
o.a.i.i.processors.cache.persistence.DataStructure.acquirePage(DataStructure.java:156)
at
o.a.i.i.processors.cache.persistence.freelist.PagesList.put(PagesList.java:686)
at
--
Sent from: http://apache-ignite-users.70518.x6.nabble.com/