Still more information and recap:
- The following daemons continuously fail:
  - zenmodeler (when running as a daemon)
  - zenwin (segfaults)
  - zeneventlog
- I am able to successfully connect to Windows servers using:
  wmic -U <user>%<password> //<server> "select * from Win32_ComputerSystem"
- CentOS 4.7, RPM upgrade from 2.2.4 to 2.3.2 to 2.3.3
(zenoss-2.3.3-257.el4.rpm)
- Zenoss 2.3.2 consumed significantly more resources than expected
(load averages jumped from 0.7 to well over 4.0)
- Zenoss 2.3.2: several daemons were dying and being restarted by the
watchdog (after upgrading to 2.3.3 I learned this was not related to the
load average)
- The WMI services segfault when trying to connect to Windows
devices. The processes complete successfully if I put all Windows
machines into maintenance (see strace below).
- zenmodeler executes without error when run manually. When run as a
daemon it stops responding (it never heartbeats during the initial
600-second pause). If left running, I get hundreds of the following log
entries at a time:
- INFO zen.ZenModeler: Collecting for path /Devices
- This may be related to this topic, but I can't be sure:
http://forums.zenoss.com/viewtopic.php?t=8058
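
Since the wmic check above was run against one server at a time, a loop like the following could confirm whether the same query succeeds for every Windows device. This is only a sketch: probe_hosts, WMIC, WMI_USER and WMI_PASS are hypothetical names introduced here, not Zenoss settings, and the query is the one quoted above. An exit status of 139 (128 + signal 11) would mean the client itself was killed by SIGSEGV.

```shell
# Hypothetical helper: run the same wmic query against each host read from
# stdin and report how the client exited.
# WMIC, WMI_USER and WMI_PASS are assumed environment variables.
probe_hosts() {
    wmic_bin="${WMIC:-wmic}"
    while read -r host; do
        # Skip blank lines in the host list.
        [ -n "$host" ] || continue
        # Redirect stdin so the client cannot consume the rest of the host list.
        "$wmic_bin" -U "${WMI_USER:-}%${WMI_PASS:-}" "//$host" \
            "select * from Win32_ComputerSystem" >/dev/null 2>&1 </dev/null
        status=$?
        case "$status" in
            0)   echo "$host ok" ;;
            139) echo "$host segfault" ;;          # 128 + SIGSEGV (11)
            *)   echo "$host failed($status)" ;;
        esac
    done
}
```

Usage, assuming a file hosts.txt with one Windows server name per line: `WMI_USER=... WMI_PASS=... probe_hosts < hosts.txt`.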
........
write(2, "INFO:zen.zenwin:Scanning WINDOWSSERVER"...,
44INFO:zen.zenwin:Scanning WINDOWSSERVER.DOMAIN.com
) = 44
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
futex(0xa488530, FUTEX_WAKE, 1) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
uname({sys="Linux", node="ZENOSSSERVER.DOMAIN.com", ...}) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
open("/etc/resolv.conf", O_RDONLY) = 4
fstat64(4, {st_mode=S_IFREG|0644, st_size=87, ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1,
0) = 0xb6d6a000
read(4, "search DOMAIN.com\nnameserver 10."..., 4096) = 87
read(4, "", 4096) = 0
close(4) = 0
munmap(0xb6d6a000, 4096) = 0
open("/etc/hosts", O_RDONLY) = 4
fcntl64(4, F_GETFD) = 0
fcntl64(4, F_SETFD, FD_CLOEXEC) = 0
fstat64(4, {st_mode=S_IFREG|0644, st_size=456, ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1,
0) = 0xb6d6a000
read(4, "# Do not remove the following li"..., 4096) = 456
read(4, "", 4096) = 0
close(4) = 0
munmap(0xb6d6a000, 4096) = 0
open("/etc/hosts", O_RDONLY) = 4
fcntl64(4, F_GETFD) = 0
fcntl64(4, F_SETFD, FD_CLOEXEC) = 0
fstat64(4, {st_mode=S_IFREG|0644, st_size=456, ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1,
0) = 0xb6d6a000
read(4, "# Do not remove the following li"..., 4096) = 456
close(4) = 0
munmap(0xb6d6a000, 4096) = 0
socket(PF_INET, SOCK_DGRAM, IPPROTO_IP) = 4
connect(4, {sa_family=AF_INET, sin_port=htons(0),
sin_addr=inet_addr("127.0.0.1")}, 16) = 0
getsockname(4, {sa_family=AF_INET, sin_port=htons(55787),
sin_addr=inet_addr("127.0.0.1")}, [16]) = 0
close(4) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
open("/etc/hosts", O_RDONLY) = 4
fcntl64(4, F_GETFD) = 0
fcntl64(4, F_SETFD, FD_CLOEXEC) = 0
fstat64(4, {st_mode=S_IFREG|0644, st_size=456, ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1,
0) = 0xb6d6a000
read(4, "# Do not remove the following li"..., 4096) = 456
close(4) = 0
munmap(0xb6d6a000, 4096) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
gettimeofday({1235742860, 96981}, NULL) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
write(2, "DEBUG:zen.WMIClient:connect to 1"...,
62DEBUG:zen.WMIClient:connect to XX.XX.XX.XX, user 'ZENOSS-USER'
) = 62
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
futex(0xa488530, FUTEX_WAKE, 1) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
futex(0x9f00a48, FUTEX_WAKE, 1) = 0
--- SIGSEGV (Segmentation fault) @ 0 (0) ---
+++ killed by SIGSEGV +++
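
Most of the trace above is futex wake-ups, which makes it hard to see what the process last did before the crash. A small filter such as this one could print only the last few non-futex system calls before the SIGSEGV line. It is a sketch under assumptions: strace_tail_before_segv is a hypothetical name, and it expects the strace output saved to a file in the same format as shown above.

```shell
# Hypothetical helper: print the last N non-futex syscalls that appear
# before the first SIGSEGV marker in an strace log read from stdin.
strace_tail_before_segv() {
    keep="${1:-5}"
    awk -v keep="$keep" '
        # Stop reading at the signal line; the END block still runs.
        /^--- SIGSEGV/ { exit }
        # Only lines that start a syscall, e.g. "open(...". Wrapped
        # continuation lines and interleaved stderr text do not match.
        /^[a-z_0-9]+\(/ {
            name = $0
            sub(/\(.*/, "", name)
            if (name != "futex") buf[++n] = $0
        }
        END {
            start = n - keep + 1
            if (start < 1) start = 1
            for (i = start; i <= n; i++) print buf[i]
        }
    '
}
```

Usage, assuming the trace was saved to a file named zenwin.trace: `strace_tail_before_segv 10 < zenwin.trace`.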
James Roman wrote:
Not sure if this is related or not, but when I do an rpm verify on
zenoss, it shows over 3000 files missing. Can anyone tell me if that is
normal for zenoss-2.3.3-257.el4.rpm?
[...@server]$ rpm -V zenoss | grep missing | wc -l
3257
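
A raw count does not show where the missing files are. Grouping the "missing" lines by leading directory might show whether they are concentrated in one part of the install. This is a minimal sketch: summarize_missing is a hypothetical helper, it assumes the path is the last whitespace-separated field of each rpm -V line, and it will miscount paths that contain spaces.

```shell
# Hypothetical helper: group "missing" lines from `rpm -V` output (stdin)
# by the first N directory components of the path (default 3).
summarize_missing() {
    depth="${1:-3}"
    awk -v depth="$depth" '
        $1 == "missing" {
            path = $NF                      # assumes no spaces in the path
            n = split(path, parts, "/")     # parts[1] is empty for absolute paths
            prefix = ""
            # Use directory components only, never the file name itself.
            for (i = 2; i <= n - 1 && i <= depth + 1; i++)
                prefix = prefix "/" parts[i]
            if (prefix == "") prefix = "/"
            count[prefix]++
        }
        END { for (p in count) printf "%6d %s\n", count[p], p }
    ' | sort -rn
}
```

Usage: `rpm -V zenoss | summarize_missing 3`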
--
James D. Roman
Sr. Network Administrator
Science Systems and Application, Inc.
Phone: 301-867-2101