Hi Ilya,

Thank you for your reply. I have run this test a few times, and the grid consistently stalls during failover, scaling, and server swapping.
I have tried tuning some parameters according to the Ignite production preparation docs <https://apacheignite.readme.io/docs/preparing-for-production>. I have increased the maximum heap size to 10GB, removed logging of metrics, and set igcfg.setFailureDetectionTimeout(60000); (60000 ms, i.e. one minute). However, this was done after the 2 tries in this thread.

If the problem persists, I will run the test once more and collect logs for the whole cluster, including GC logs, but it will take some time as I have moved on to other tests.

Meanwhile, here is the original log from my first experiment. Maybe it will give you a clue.

Once again, thank you for your time on this issue.

crash.log <http://apache-ignite-users.70518.x6.nabble.com/file/t2779/crash.log>

--
Sent from: http://apache-ignite-users.70518.x6.nabble.com/
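
A note on the timeout value mentioned above: setFailureDetectionTimeout takes its argument in milliseconds, so 60000 corresponds to one minute, and one hour would be 3600000. Below is a minimal sketch of the conversion using only java.time. The class name FailureTimeoutUnits and the helper toMillis are hypothetical names for illustration, and the Ignite call appears only as a comment because it needs Ignite on the classpath.

```java
import java.time.Duration;

public class FailureTimeoutUnits {
    // Hypothetical helper: converts a readable duration into the millisecond
    // value that a setter such as setFailureDetectionTimeout(long) expects.
    static long toMillis(Duration d) {
        return d.toMillis();
    }

    public static void main(String[] args) {
        long oneMinute = toMillis(Duration.ofMinutes(1));
        long oneHour = toMillis(Duration.ofHours(1));

        System.out.println("1 minute = " + oneMinute + " ms"); // prints "1 minute = 60000 ms"
        System.out.println("1 hour   = " + oneHour + " ms");   // prints "1 hour   = 3600000 ms"

        // With Ignite on the classpath (assumption, not compiled here),
        // the value would be applied as:
        //   igcfg.setFailureDetectionTimeout(oneMinute);
    }
}
```

Building the value from a Duration rather than typing a bare number makes the intended unit visible where the setting is made.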
