GitHub user mwaag created a discussion: Webserver.service health-checks keep 
failing for many vrouters

CloudStack: 4.18.2.4 (in preparation to upgrade to a newer version)
Hypervisor: VMware ESXi 7.0.3
Management-Server OS: Ubuntu 24.04.3 LTS
vRouter version: 4.18.1
vRouter Software version: 4.18.2.4

Hi there,

we have a lot of virtual routers raising an alert because of a failing health 
check. The failing check is always 'webserver.service', and it keeps failing 
on many vrouters, creating a lot of alerts:

<img width="1548" height="726" alt="11_37_12-000320" 
src="https://github.com/user-attachments/assets/0cddd54c-c36c-40fd-9f72-3c66f771d026"
 />

<img width="1469" height="630" alt="11_41_37-000321" 
src="https://github.com/user-attachments/assets/d80a90b6-fa2e-4992-a0b1-223b643832d1"
 />

We find this in the management server log as well:
2025-12-01 21:16:52,482 WARN  [c.c.n.r.VirtualNetworkApplianceManagerImpl] 
(RouterStatusMonitor-1:ctx-860476b6) (logid:7813569b) Health checks failed: 1 
failing checks on router r-178445-VM / 826d1cc5-3a3e-486c-9731-5ea140d9f18c. 
Checking failed health checks to see if router needs recreate

We usually find nothing in cloud.log on the vrouter: we grepped for 'health', 
'fail', 'check', 'apache2' and 'webserver' and got zero matches.
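
For reference, the keyword search can be reproduced in one pass. This is only a sketch: the helper name count_health_matches is made up for illustration, and the default path /var/log/cloud.log is an assumption about where the log lives on the vrouter.

```shell
# Hypothetical helper: count the lines of a log file that match any of the
# keywords we searched for, ignoring case.
# The default path is an assumption; pass another file as the first argument.
count_health_matches() {
    logfile="${1:-/var/log/cloud.log}"
    # grep -c prints the number of matching lines. It exits with status 1
    # when that number is 0, so "|| true" keeps the helper usable under "set -e".
    grep -icE 'health|fail|check|apache2|webserver' "$logfile" || true
}
```

Calling count_health_matches without arguments on an affected vrouter should print 0 if the log really contains none of the keywords.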

Systemctl-output:
root@r-178445-VM:~# systemctl is-active apache2
failed
root@r-178445-VM:~# systemctl status apache2
● apache2.service - The Apache HTTP Server
     Loaded: loaded (/lib/systemd/system/apache2.service; enabled; vendor 
preset: enabled)
     Active: failed (Result: exit-code) since Mon 2025-12-01 20:21:10 UTC; 34s 
ago
       Docs: https://httpd.apache.org/docs/2.4/
    Process: 115818 ExecStart=/usr/sbin/apachectl start (code=exited, 
status=1/FAILURE)
        CPU: 11ms

Dec 01 20:21:10 r-178445-VM systemd[1]: Starting The Apache HTTP Server...
Dec 01 20:21:10 r-178445-VM apachectl[115821]: AH00558: apache2: Could not 
reliably determine the server's fully qualified domain name, using 127.0.1.1. 
Set the 'ServerName' directive globally to suppress this message
Dec 01 20:21:10 r-178445-VM apachectl[115821]: no listening sockets available, 
shutting down
Dec 01 20:21:10 r-178445-VM apachectl[115821]: AH00015: Unable to open logs
Dec 01 20:21:10 r-178445-VM apachectl[115818]: Action 'start' failed.
Dec 01 20:21:10 r-178445-VM apachectl[115818]: The Apache error log may have 
more information.
Dec 01 20:21:10 r-178445-VM systemd[1]: apache2.service: Control process 
exited, code=exited, status=1/FAILURE
Dec 01 20:21:10 r-178445-VM systemd[1]: apache2.service: Failed with result 
'exit-code'.
Dec 01 20:21:10 r-178445-VM systemd[1]: Failed to start The Apache HTTP Server.

apache2-journal:
Dec 01 20:18:11 r-178445-VM systemd[1]: Starting The Apache HTTP Server...
Dec 01 20:18:11 r-178445-VM apachectl[115680]: AH00558: apache2: Could not 
reliably determine the server's fully qualified domain name, using 127.0.1.1. 
Set the 'ServerName' directive globally to suppress this message
Dec 01 20:18:11 r-178445-VM apachectl[115680]: no listening sockets available, 
shutting down
Dec 01 20:18:11 r-178445-VM apachectl[115680]: AH00015: Unable to open logs
Dec 01 20:18:11 r-178445-VM apachectl[115677]: Action 'start' failed.
Dec 01 20:18:11 r-178445-VM apachectl[115677]: The Apache error log may have 
more information.
Dec 01 20:18:11 r-178445-VM systemd[1]: apache2.service: Control process 
exited, code=exited, status=1/FAILURE
Dec 01 20:18:11 r-178445-VM systemd[1]: apache2.service: Failed with result 
'exit-code'.
Dec 01 20:18:11 r-178445-VM systemd[1]: Failed to start The Apache HTTP Server.
Dec 01 20:21:06 r-178445-VM systemd[1]: Starting The Apache HTTP Server...
Dec 01 20:21:06 r-178445-VM apachectl[115773]: AH00558: apache2: Could not 
reliably determine the server's fully qualified domain name, using 127.0.1.1. 
Set the 'ServerName' directive globally to suppress this message
Dec 01 20:21:06 r-178445-VM apachectl[115773]: no listening sockets available, 
shutting down
Dec 01 20:21:06 r-178445-VM apachectl[115773]: AH00015: Unable to open logs
Dec 01 20:21:06 r-178445-VM apachectl[115770]: Action 'start' failed.
Dec 01 20:21:06 r-178445-VM apachectl[115770]: The Apache error log may have 
more information.
Dec 01 20:21:06 r-178445-VM systemd[1]: apache2.service: Control process 
exited, code=exited, status=1/FAILURE
Dec 01 20:21:06 r-178445-VM systemd[1]: apache2.service: Failed with result 
'exit-code'.
Dec 01 20:21:06 r-178445-VM systemd[1]: Failed to start The Apache HTTP Server.
Dec 01 20:21:07 r-178445-VM systemd[1]: Starting The Apache HTTP Server...
Dec 01 20:21:07 r-178445-VM apachectl[115785]: AH00558: apache2: Could not 
reliably determine the server's fully qualified domain name, using 127.0.1.1. 
Set the 'ServerName' directive globally to suppress this message
Dec 01 20:21:07 r-178445-VM apachectl[115785]: no listening sockets available, 
shutting down
Dec 01 20:21:07 r-178445-VM apachectl[115785]: AH00015: Unable to open logs
Dec 01 20:21:07 r-178445-VM apachectl[115782]: Action 'start' failed.
Dec 01 20:21:07 r-178445-VM apachectl[115782]: The Apache error log may have 
more information.
Dec 01 20:21:07 r-178445-VM systemd[1]: apache2.service: Control process 
exited, code=exited, status=1/FAILURE
Dec 01 20:21:07 r-178445-VM systemd[1]: apache2.service: Failed with result 
'exit-code'.
Dec 01 20:21:07 r-178445-VM systemd[1]: Failed to start The Apache HTTP Server.
Dec 01 20:21:08 r-178445-VM systemd[1]: Starting The Apache HTTP Server...
Dec 01 20:21:08 r-178445-VM apachectl[115797]: AH00558: apache2: Could not 
reliably determine the server's fully qualified domain name, using 127.0.1.1. 
Set the 'ServerName' directive globally to suppress this message
Dec 01 20:21:08 r-178445-VM apachectl[115797]: no listening sockets available, 
shutting down
Dec 01 20:21:08 r-178445-VM apachectl[115797]: AH00015: Unable to open logs
Dec 01 20:21:08 r-178445-VM apachectl[115794]: Action 'start' failed.
Dec 01 20:21:08 r-178445-VM apachectl[115794]: The Apache error log may have 
more information.
Dec 01 20:21:08 r-178445-VM systemd[1]: apache2.service: Control process 
exited, code=exited, status=1/FAILURE
Dec 01 20:21:08 r-178445-VM systemd[1]: apache2.service: Failed with result 
'exit-code'.
Dec 01 20:21:08 r-178445-VM systemd[1]: Failed to start The Apache HTTP Server.
Dec 01 20:21:09 r-178445-VM systemd[1]: Starting The Apache HTTP Server...
Dec 01 20:21:09 r-178445-VM apachectl[115809]: AH00558: apache2: Could not 
reliably determine the server's fully qualified domain name, using 127.0.1.1. 
Set the 'ServerName' directive globally to suppress this message
Dec 01 20:21:09 r-178445-VM apachectl[115809]: no listening sockets available, 
shutting down
Dec 01 20:21:09 r-178445-VM apachectl[115809]: AH00015: Unable to open logs
Dec 01 20:21:09 r-178445-VM apachectl[115806]: Action 'start' failed.
Dec 01 20:21:09 r-178445-VM apachectl[115806]: The Apache error log may have 
more information.
Dec 01 20:21:09 r-178445-VM systemd[1]: apache2.service: Control process 
exited, code=exited, status=1/FAILURE
Dec 01 20:21:09 r-178445-VM systemd[1]: apache2.service: Failed with result 
'exit-code'.
Dec 01 20:21:09 r-178445-VM systemd[1]: Failed to start The Apache HTTP Server.
Dec 01 20:21:10 r-178445-VM systemd[1]: Starting The Apache HTTP Server...
Dec 01 20:21:10 r-178445-VM apachectl[115821]: AH00558: apache2: Could not 
reliably determine the server's fully qualified domain name, using 127.0.1.1. 
Set the 'ServerName' directive globally to suppress this message
Dec 01 20:21:10 r-178445-VM apachectl[115821]: no listening sockets available, 
shutting down
Dec 01 20:21:10 r-178445-VM apachectl[115821]: AH00015: Unable to open logs
Dec 01 20:21:10 r-178445-VM apachectl[115818]: Action 'start' failed.
Dec 01 20:21:10 r-178445-VM apachectl[115818]: The Apache error log may have 
more information.
Dec 01 20:21:10 r-178445-VM systemd[1]: apache2.service: Control process 
exited, code=exited, status=1/FAILURE
Dec 01 20:21:10 r-178445-VM systemd[1]: apache2.service: Failed with result 
'exit-code'.
Dec 01 20:21:10 r-178445-VM systemd[1]: Failed to start The Apache HTTP Server.

Syntax seems to be ok:
root@r-178445-VM:~# apache2ctl configtest
AH00558: apache2: Could not reliably determine the server's fully qualified 
domain name, using 127.0.1.1. Set the 'ServerName' directive globally to 
suppress this message
Syntax OK

Any other processes bound to port 80/443?
root@r-178445-VM:# ss -tulpn | grep -E ':80|:443'
(no output)
root@r-178445-VM:# lsof -nP -iTCP:80 -sTCP:LISTEN
(no output)
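
Since nothing else is bound to these ports and the journal shows "no listening sockets available" without a preceding bind error, another thing that may be worth checking is whether the enabled Apache configuration contains any Listen directive at all. The following is a sketch only: the helper name list_listen_directives is made up, and the paths in the usage note assume the standard Debian Apache layout, which may differ on the vrouter.

```shell
# Hypothetical helper: print every active (not commented-out) "Listen"
# directive found in the given Apache config files or directories.
list_listen_directives() {
    # -R: recurse and follow symlinks (sites-enabled usually holds symlinks),
    # -h: do not prefix matches with file names, -E: extended regex.
    # The pattern requires "Listen" to be the first word on the line, so
    # lines starting with "#" are skipped.
    grep -RhE '^[[:space:]]*Listen[[:space:]]' "$@" 2>/dev/null \
        | sed -E 's/^[[:space:]]+//' \
        | sort -u
}
```

Usage, with assumed paths: list_listen_directives /etc/apache2/apache2.conf /etc/apache2/ports.conf /etc/apache2/sites-enabled /etc/apache2/conf-enabled. Empty output would mean Apache has no socket to open. Any addresses that are printed can be compared with the output of ip -o addr show.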

Checking permissions on the log directory:
root@r-178445-VM:# ls -ld /var/log/apache2
drwxr-x--- 2 root adm 4096 Mar 18  2025 /var/log/apache2
root@r-178445-VM:# test -w /var/log/apache2 && echo "log-dir writable" || echo 
"log-dir NOT writable"
log-dir writable

root@r-178445-VM:# uptime
 20:31:29 up 265 days,  5:05,  1 user,  load average: 0.00, 0.00, 0.00

Currently we have 52 of 398 vRouters in total showing this symptom.

We tried recreating one vrouter via a VPC restart with cleanup, but 
unfortunately with the same result.

Can we do some more troubleshooting? Unfortunately this keeps us from 
setting alert.email.addresses, because the volume is just too high: every 
router sends an alert every 10 minutes or so.

Any hints on how we can further troubleshoot or eliminate this issue?



GitHub link: https://github.com/apache/cloudstack/discussions/12170
