Le 21/12/2017 à 08:42, Peter Lindegaard Hansen a écrit :
update:
we've disabled h2 on 1.8, and everything is running as expected again.
haproxy does not degrade performance anymore nor does it segfault.
so it issues seem to be related to the h2
Med venlig hilsen
*Peter Lindegaard Hansen*
/Softwareudvikler / Partner
/
Telefon: +45 96 500 300 | Direkte: 69 14 97 04 | Email:
[email protected] <mailto:[email protected]>
Tiger Media A/S | Gl. Gugvej 17C | 9000 Aalborg | Web: www.tigermedia.dk
<http://www.tigermedia.dk>
For supportspørgsmål kontakt os da på [email protected]
<mailto:[email protected]> eller på tlf. 96 500 300
og din henvendelse vil blive besvaret af første ledige medarbejder.
2017-12-19 11:36 GMT+01:00 Peter Lindegaard Hansen <[email protected]
<mailto:[email protected]>>:
Hi list,
We upgraded from 1.5 to 1.8 recently - then to 1.8.1
Now we're seeing segfaults and slowdowns with haproxy
Repeating:
Dec 19 11:14:26 haproxy02 kernel: [122635.295196] haproxy[29582]:
segfault at 55d5152279b2 ip 00007f9c2dcc5a28 sp 00007fff07caf4b8
error 6 in libc-2.23.so <http://libc-2.23.so>[7f9c2dc26000+1c0000]
Dec 19 11:14:26 haproxy02 systemd[1]: haproxy.service: Main process
exited, code=exited, status=139/n/a
Dec 19 11:14:26 haproxy02 systemd[1]: haproxy.service: Unit entered
failed state.
Dec 19 11:14:26 haproxy02 systemd[1]: haproxy.service: Failed with
result 'exit-code'.
Dec 19 11:14:26 haproxy02 systemd[1]: haproxy.service: Service
hold-off time over, scheduling restart.
Dec 19 11:14:26 haproxy02 systemd[1]: Stopped HAProxy Load Balancer.
Dec 19 11:14:26 haproxy02 systemd[1]: Starting HAProxy Load Balancer...
Dec 19 11:14:26 haproxy02 systemd[1]: Started HAProxy Load Balancer.
Dec 19 11:14:27 haproxy02 kernel: [122636.578738] haproxy[31479]:
segfault at 56409a8c1de2 ip 00007fa5fa349a28 sp 00007ffe66f4f688
error 6 in libc-2.23.so <http://libc-2.23.so>[7fa5fa2aa000+1c0000]
Dec 19 11:14:27 haproxy02 systemd[1]: haproxy.service: Main process
exited, code=exited, status=139/n/a
Dec 19 11:14:27 haproxy02 systemd[1]: haproxy.service: Unit entered
failed state.
Dec 19 11:14:27 haproxy02 systemd[1]: haproxy.service: Failed with
result 'exit-code'.
Dec 19 11:14:27 haproxy02 systemd[1]: haproxy.service: Service
hold-off time over, scheduling restart.
Dec 19 11:14:27 haproxy02 systemd[1]: Stopped HAProxy Load Balancer.
Dec 19 11:14:27 haproxy02 systemd[1]: Starting HAProxy Load Balancer...
Dec 19 11:14:28 haproxy02 systemd[1]: Started HAProxy Load Balancer.
Dec 19 11:14:28 haproxy02 kernel: [122637.569863] haproxy[31487]:
segfault at 55cb4bd59857 ip 00007f71e678aa28 sp 00007fffb94427b8
error 6 in libc-2.23.so <http://libc-2.23.so>[7f71e66eb000+1c0000]
Dec 19 11:14:28 haproxy02 systemd[1]: haproxy.service: Main process
exited, code=exited, status=139/n/a
Dec 19 11:14:28 haproxy02 systemd[1]: haproxy.service: Unit entered
failed state.
Dec 19 11:14:28 haproxy02 systemd[1]: haproxy.service: Failed with
result 'exit-code'.
Dec 19 11:14:28 haproxy02 systemd[1]: haproxy.service: Service
hold-off time over, scheduling restart.
Dec 19 11:14:28 haproxy02 systemd[1]: Stopped HAProxy Load Balancer.
Dec 19 11:14:28 haproxy02 systemd[1]: Starting HAProxy Load Balancer...
Dec 19 11:14:29 haproxy02 systemd[1]: Started HAProxy Load Balancer.
At same time in haproxy.log
(lots of ssl handshake failures...) then
Dec 19 11:14:26 haproxy02 haproxy[29579]: [ALERT] 352/090058 (29579)
: Current worker 29582 left with exit code 139
Dec 19 11:14:26 haproxy02 haproxy[29579]: [ALERT] 352/090058 (29579)
: exit-on-failure: killing every workers with SIGTERM
Dec 19 11:14:26 haproxy02 haproxy[29579]: [WARNING] 352/090058
(29579) : All workers are left. Leaving... (139)
Dec 19 11:14:27 haproxy02 haproxy[31476]: [ALERT] 352/111426 (31476)
: Current worker 31479 left with exit code 139
Dec 19 11:14:27 haproxy02 haproxy[31476]: [ALERT] 352/111426 (31476)
: exit-on-failure: killing every workers with SIGTERM
Dec 19 11:14:27 haproxy02 haproxy[31476]: [WARNING] 352/111426
(31476) : All workers are left. Leaving... (139)
Dec 19 11:14:28 haproxy02 haproxy[31485]: [ALERT] 352/111428 (31485)
: Current worker 31487 left with exit code 139
Dec 19 11:14:28 haproxy02 haproxy[31485]: [ALERT] 352/111428 (31485)
: exit-on-failure: killing every workers with SIGTERM
Dec 19 11:14:28 haproxy02 haproxy[31485]: [WARNING] 352/111428
(31485) : All workers are left. Leaving... (139)
Dec 19 11:14:29 haproxy02 haproxy[31493]: [ALERT] 352/111429 (31493)
: Current worker 31496 left with exit code 139
Dec 19 11:14:29 haproxy02 haproxy[31493]: [ALERT] 352/111429 (31493)
: exit-on-failure: killing every workers with SIGTERM
Dec 19 11:14:29 haproxy02 haproxy[31493]: [WARNING] 352/111429
(31493) : All workers are left. Leaving... (139)
Dec 19 11:14:30 haproxy02 haproxy[31503]: [ALERT] 352/111429 (31503)
: Current worker 31505 left with exit code 139
Dec 19 11:14:30 haproxy02 haproxy[31503]: [ALERT] 352/111429 (31503)
: exit-on-failure: killing every workers with SIGTERM
Dec 19 11:14:30 haproxy02 haproxy[31503]: [WARNING] 352/111429
(31503) : All workers are left. Leaving... (139)
Dec 19 11:14:30 haproxy02 haproxy[31511]: [ALERT] 352/111430 (31511)
: Current worker 31515 left with exit code 139
Dec 19 11:14:30 haproxy02 haproxy[31511]: [ALERT] 352/111430 (31511)
: exit-on-failure: killing every workers with SIGTERM
Dec 19 11:14:30 haproxy02 haproxy[31511]: [WARNING] 352/111430
(31511) : All workers are left. Leaving... (139)
Until haproxy does not respond to requests :-(
I dont know if it is related, but we do see huge slowdows running
1.8 (with h2 enabled)
After say 10-20 hours haproxy will degrade to serving requests at
4-500kb/s.. when we do a restart of the haproxy service, it will go
back to regular full-line speeds.
We're running ubuntu servers and using Vincent Bernat's PPA.
I can post conf if needed.
Any help appreciated.
Is it related to h2 or 1.8?
We've been running 1.5.9 (ubuntu repos default) for years on the
same requests with no problem.
Thanks
Med venlig hilsen
*Peter Lindegaard Hansen*
/Softwareudvikler / Partner
/
Telefon: +45 96 500 300 | Direkte: 69 14 97 04 | Email:
[email protected] <mailto:[email protected]>
Tiger Media A/S | Gl. Gugvej 17C | 9000 Aalborg | Web:
www.tigermedia.dk <http://www.tigermedia.dk>
For supportspørgsmål kontakt os da på [email protected]
<mailto:[email protected]> eller på tlf. 96 500 300
og din henvendelse vil blive besvaret af første ledige medarbejder.
Hi,
There have been several bugfixes on the H2 part since the 1.8.1. Could
you try the latest 1.8 source snapshot[1] please ?
Then, the segfault part may be unrelated. This is really strange. How
and when does it happens ? after a reload or when the traffic slow down
? And what is your CPU/RAM usage when it happens ?
If possible, share your configuration, it could help. And finally, if
you compile HAProxy by hand, compile it with debug info and enable
coredumps. Then, send us the full backtrace of the segfault. It will be
really useful to know what happens.
[1]
http://www.haproxy.org/download/1.8/src/snapshot/haproxy-ss-LATEST.tar.gz
--
Christopher Faulet