Hi all,
on my 7 node cluster, I see the occasional - every 5-10 tests - bunch of
messages dropped during a burst; usually on the DC (what a surprise), on
the order of ~200 messages dropped per incident.
This occurs only with debug 1, and only above >5 nodes or so.
So yes, my cluster is fully virtualized. However, the physical host has
8 x 2.66 Ghz cores; the guests don't write the messages to their own
image, but relay it via syslog-ng to the host, where it gets "written"
to a RAM disk, so no IO bottleneck. Each guest essentially has 1 core to
itself + 512MB RAM.
The network is fully virtual, so I can't be hitting that limit.
syslog-ng is running with a fifosize of 40000 lines, and I upped logd to
2048 sendqlen/recvqlen.
As a data point: I was experiencing the very same drop message rate and
doubled the buffers on syslog-ng and logd then; no change.
Any suggestions?
Regards,
Lars
--
Teamlead Kernel, SuSE Labs, Research and Development
SUSE LINUX Products GmbH, GF: Markus Rex, HRB 16746 (AG Nürnberg)
"Experience is the name everyone gives to their mistakes." -- Oscar Wilde
_______________________________________________________
Linux-HA-Dev: [email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev
Home Page: http://linux-ha.org/