Hello Ray, I'm glad that your problem was resolved. I just want to add that on PME beginning phase we're waiting for all current client operations finishing, new operations are freezed till PME end. After node finishes all ongoing client operations it counts down latch that you see in logs which in the message "Unable to await". When all nodes finish all their operations, exchange latch completes and PME continues. This latch was added to reach data consistency on all nodes during main PME phase (partition information exchange, affinity calculation, etc.). If you have network throttling between client and server, it becomes hard to notify a client that his datastreamer operation has finished and latch completing process is slowed down.
2018-08-02 12:11 GMT+03:00 Ray <ray...@cisco.com>: > The root cause for this issue is the network throttle between client and > servers. > > When I move the clients to run in the same cluster as the servers, there's > no such problem any more. > > > > -- > Sent from: http://apache-ignite-users.70518.x6.nabble.com/ >