Re: What causes "client errors" with mod_jk

Rainer Jung Thu, 26 May 2022 13:47:03 -0700

Hi Chris,

Am 16.05.2022 um 19:48 schrieb Christopher Schultz:

I've been looking into this a little more in my production environment.
These errors are not super common, but there seems to be a steadytrickle of errors from my two services that have human users. I see 0errors for my API-based services, which makes me think that I don't haveany performance issues... I probably have human users disappearing forrandom reasons.

Could be unstable (mobile) client connections. Or people alreadyclicking on the next frontend action before they received the completeresponse. But that is speculating. So it is right, you try to identifysome individual reasons to understand more.

The errors in mod_jk.log look like this:
[Sun May 15 04:19:15.643 2022] [5859:139625025315392] [info]ajp_process_callback::jk_ajp_common.c (2077): (myworker) Writing toclient aborted or client network problems[Sun May 15 04:19:15.644 2022] [5859:139625025315392] [info]ajp_service::jk_ajp_common.c (2773): (myworker) sending request totomcat failed (unrecoverable), because of client write error (attempt=1)[Sun May 15 04:19:15.646 2022] [5859:139625025315392] [info]service::jk_lb_worker.c (1595): service failed, worker myworker is inlocal error state[Sun May 15 04:19:15.646 2022] [5859:139625025315392] [info]service::jk_lb_worker.c (1614): unrecoverable error 200, request failed.Client failed in the middle of request, we can't recover to anotherinstance.[Sun May 15 04:19:15.646 2022] [5859:139625025315392] [info]jk_handler::mod_jk.c (2984): Aborting connection for worker=myworker
(Note that the last message "Aborting connection for worker=myworker"may have a bug; my actual worker name has a name containing a hyphen (-)and only the text before the hyphen is being emitted in that errormessage.)

Strange, never observed that, but maybe never used a hyphen. Docs say,hypens are allowed. Would be interesting to do a server startup withtrace-Logging and see where things corresponding to the name start to gowrong. But of course not related to sporadic client failures.

Anyway, when researching these errors, it would be helpful to me to knowwhich requests are failing. By looking at the access_log, I only see asingle request logged for 04:19:15 on that server so it's probably theright one, but httpd says that the response code is 302 instead of e.g.50x for an error.


What I typically do:

- log "%P:%{tid}P" in your Apache httpd custom LogFormat used for theaccess log.

- make sure, I log in in the Apache httpd access log the requesttimestamp including milli or microseconds (not default butconfigurable). Can be done by using the %{format}t syntax in theLogFormat and adding "usec_frac" to the format.


- adding %D to the access log format (duration in microseconds)

- remember that Apache logs start of request as default time stamp, butmod_jk logs at the moment of error, so later than start of request.


Finding the right access log line for a mod_jk error log line now means:

- filter the access log according to the PID:TID logged in the mod_jkerror log. In your case 5859:139625025315392. We know, that the requestshandled by one thread in one process are run strictly sequentially.

- look for the last request in this filtered list, that by access logline timestamp started before (or unlikely exactly at) the point in timegiven by the mod_jk access log. If you find one exactly add, it might bealso the one directly before.

- look at the request durations of these one or two requests to doublecheck, whether the times fit.

If you can spare the additional log line bytes, you can additionally logthe end of request timestamp in the apache access log (prefix "format"by "begin:").

When we log these kinds of errors, it would be great to know a fewthings IMO:
1. What was the URL of the request
2. How long did the client wait for the response before we found wecouldn't write to the stream (or, conversely, the start-timestamp of therequest as well as the timestamp of the error, which I think is alreadyin the log itself)
I see the place in the code where the error is generated, but I'm notfamiliar enough with the code to know how to add that kind of thing. Thefunction in question (ajp_process_callback) has a pointer to ajk_ws_service_t structure. Is it as simple as also logging like this?
   /* convert start-time to a string */
   char[32] timestamp;
apr_strftime(timestamp, NULL, 32, "%Y-%m-%d %H:%M:%S",r->r->request_time);
   /* emit a detailed log message */
   jk_log(l, JK_LOG_INFO,
          "(%s) Request to (%s %s) started at %s,%ld",
ae->worker->name, r->method, r->req_uri, timestamp,r->r->request_time.tm_usec);
Does anyone think this might be generally useful?


I'll have a look at your other mail on this next.

Best regards,

Rainer

Thanks,
-chris

On 3/25/22 08:37, Christopher Schultz wrote:
Rainer,

On 3/24/22 05:50, Rainer Jung wrote:
Hi Chris,
client errors in jk log are always errors occurring when mod_jk triesto write back what it got from the backend using web server APIs tothe client of the web server (user, browser etc.). So they point to aproblem between and including the web server and something in frontof it.
Especially during performance problems, client errors are expected asa consequence, because whenever people try to reload, the browsercloses the original connection and sending back response data viathis connection later fails.
I was pretty sure this was the case. Is that specifically documentedanywhere? If not, I'd like to clarify that in the documentation formod_jk.
Thanks,
-chris
Am 23.03.2022 um 13:08 schrieb Christopher Schultz:
All,
What kinds of things will cause a "client error" in mod_jk'saccounting? Does that mean things like unexpected disconnects on thepart of the remote client (i.e. web browser), or does it meanfailure of the jk module itself to connect (as a client) to theback-end Tomcat?
I'm starting to see situations where we have small numbers of clienterrors occurring "all the time", meaning that we accumulate maybe10-20 per day. If that's web browser disconnects then I don't careat all. If it's a problem I have with my internal networking andresource-allocation, then it's something I have to adjust.
Thanks,
-chris


---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@tomcat.apache.org
For additional commands, e-mail: users-h...@tomcat.apache.org

Re: What causes "client errors" with mod_jk

Reply via email to