Sorry if I am spamming this list but I am desperate.

We are getting random ST validation errors from our CAS clients, both internal 
and SaaS applications. Results in 500 errors to the end user.

On July 14th we got over 2000 of these errors out of about 30k successful 
logins. This led to (thanks ITIL
) awareness up to the VP level. I am under the gun to find a “solution” before 
the start of school August 24th.

I have turned up log level to debug on the CAS servers. I see successful 
validations in the logs, but not unsuccessful validations.
I also see ST creation in the audit log.

Now if I understand how CAS works, there can only be 3 reasons an ST won’t 
validate: it is being reused, it has timed out or it does not exist / is 
corrupted.

I just can’t find the actual code that validates the code and can log the EXACT 
reason.

Can someone point me to the method(s) that does the validation? I just want to 
add a log.debug message at the point of failure.

Today I found a validation failure that had 2 attempts, I can see when the ST 
was created and both attempts failed, so it wasn’t a re-use error?

Other info: version 3.5.2 with Hazelcast ticket registry. I have hazelcast 
logging set to debug and see some transfer over port 1501.

Here is a diagram of our infrastructure:

https://www.lucidchart.com/invitations/accept/da009b9d-e55f-4f95-9301-e6bd23d508ab

Yeah 2 Load Balancers (?). Netscape is really a Sun App Server. Why 2, because 
Peoplesoft can’t handle SHA-2 certs on the Netscalar. Yeah a mess. Not all 
failures go through the Sun App Server, but the majority do.

Thanks for any help.

-Bryan

-- 
You are currently subscribed to cas-dev@lists.jasig.org as: 
arch...@mail-archive.com
To unsubscribe, change settings or access archives, see 
http://www.ja-sig.org/wiki/display/JSG/cas-dev

Reply via email to