Re: Cluster session replication performance

Mitch Claborn Mon, 10 Sep 2018 09:34:03 -0700

Further information and questions.

I created my own interceptor based on ThroughputInterceptor so that Icould log the timing of specific sessions to correlate them with thefailures in my health check program. I was surprised to find that inthose instances where the health check reported a failure, theinterceptor reported that the session send was accomplished in < 5 ms,while the health check app is waiting a full 1000 ms between calls tothe different tomcat instances. So now I'm more confused than ever.


Anyone have any ideas?

In a ChannelInterceptor, does when getNext().sendMessage(destination,msg, payload) returns, does that mean that the message has been sent ANDreceived by the recipient member, or does that only indicate a send?



Mitch

On 09/06/2018 01:53 PM, Mitch Claborn wrote:

I'm using a cluster with the DeltaManager between two servers on Tomcat9.0.11. I've set channelSendOptions="8" (asynchronous session replication).
I have a "health check" app that I run periodically, one of thefunctions being to check that sessions are being replicated properly.That app1) Does a GET to tomcat A, calling a Struts action that creates asession and stores a known value in it
2) Waits 2 seconds
3) Uses the session ID cookie from step 1 and makes a call to tomcat B,to an action that retrieves that value from the session4) Compares the two values from the session to make sure that they arethe same.
Most of the time this check works fine, but occasionally the call to thesecond server will find that the session does not exist on that server,presumably because it has not yet replicated there yet. 2 seconds seemsa long time for a session to replicate, especially one as small as thisone is. If I decrease the amount of wait time at step 2, the failurerate increases.
I turned on the ThroughputInterceptor and have the following observations.
- Server A has a transmit throughput around 10 MB/sec while B has onlyaround 3 MB/sec. This might be accounted for by the fact that B was thelast server to start, so A would have (I think) transmitted all of thesessions at once when B started up, so it might get good throughput fromthe big send??
Questions:
1. IS 2 seconds a long time to replicate a session?
2. Other than actual network slowness, are there internal issues thatcould cause the replication to be slow?
3. If so, is there anyway to diagnose those?
4. I'm thinking about writing my own version of ThroughputInterceptorthat will give more information on specific messages and timings. Hasanyone tried that? In that interceptor can I access the session ID? Thatwould help me correlate timings between my failure reports and theinterceptor.


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org

Re: Cluster session replication performance

Reply via email to