Re: "Unable to await partitions release latch within timeout: ServerLatch" exception causing cluster freeze

Pavel Kovalenko Thu, 02 Aug 2018 03:26:31 -0700

Hello Ray,

I'm glad that your problem was resolved. I just want to add that at the
beginning of PME (partition map exchange) we wait for all current client
operations to finish; new operations are frozen until PME ends. After a node
finishes all of its ongoing client operations, it counts down the latch that
you see in the logs in the "Unable to await" message. When all nodes have
finished their operations, the exchange latch completes and PME continues.
This latch was added to ensure data consistency on all nodes during the main
PME phase (partition information exchange, affinity calculation, etc.). If
you have network throttling between client and server, it becomes hard to
notify a client that its data streamer operation has finished, and the latch
completion is slowed down.
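
To illustrate the latch behavior described above, here is a minimal sketch.
It is not Ignite's actual implementation: it uses a plain
java.util.concurrent.CountDownLatch, and the class name ReleaseLatchSketch,
the method awaitRelease and the per-node delays are all hypothetical. Each
simulated node counts down once its ongoing operations are done, and the
waiter gives up after a timeout, which roughly corresponds to the situation
behind the "Unable to await" message.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

public class ReleaseLatchSketch {
    /**
     * Simulates one node per entry in nodeDelaysMs. Each node "finishes its
     * ongoing operations" after its delay and then counts down a shared latch.
     * Returns true if every node counted down within timeoutMs, false otherwise.
     */
    public static boolean awaitRelease(long[] nodeDelaysMs, long timeoutMs)
            throws InterruptedException {
        CountDownLatch latch = new CountDownLatch(nodeDelaysMs.length);

        for (long delay : nodeDelaysMs) {
            Thread node = new Thread(() -> {
                try {
                    // Stand-in for draining in-flight client operations.
                    Thread.sleep(delay);
                } catch (InterruptedException e) {
                    Thread.currentThread().interrupt();
                    return;
                }
                // This node is done; report it to the shared latch.
                latch.countDown();
            });
            node.setDaemon(true); // do not keep the JVM alive for slow nodes
            node.start();
        }

        // Wait for all nodes, but only up to the timeout.
        return latch.await(timeoutMs, TimeUnit.MILLISECONDS);
    }

    public static void main(String[] args) throws InterruptedException {
        // All nodes finish quickly: the latch completes in time.
        System.out.println(awaitRelease(new long[] {10, 20, 30}, 1000));
        // One node is slow (e.g. a throttled link): the wait times out.
        System.out.println(awaitRelease(new long[] {10, 20, 2000}, 200));
    }
}
```

In this sketch a single slow participant is enough to make the wait time
out, which is the same shape of problem as a throttled client delaying the
latch for the whole cluster.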


2018-08-02 12:11 GMT+03:00 Ray <ray...@cisco.com>:

> The root cause for this issue is the network throttle between client and
> servers.
>
> When I move the clients to run in the same cluster as the servers, there's
> no such problem any more.
>
>
>
> --
> Sent from: http://apache-ignite-users.70518.x6.nabble.com/
>
