Hi
I'm having an issue with ganglia-3.0.1. The build appeared to be
successful, but when running gmetad (when gmond is running) it segfaults.
I have tried on Solaris 8 and RedHat 3 - same result on both OS's.
end of truss:
19315: brk(0x0009A690) = 0
hash_create size = 19315: write(2, " h a s h _ c r e a t e ".., 19)
= 19
5019315: write(2, " 5 0", 2) = 2
19315: write(2, "\n", 1) = 1
hash->size is 19315: write(2, " h a s h - > s i z e i".., 14) =
14
5319315: write(2, " 5 3", 2) = 2
19315: write(2, "\n", 1) = 1
19315: brk(0x0009A690) = 0
19315: brk(0x0009C690) = 0
hash_create size = 19315: write(2, " h a s h _ c r e a t e ".., 19)
= 19
5019315: write(2, " 5 0", 2) = 2
19315: write(2, "\n", 1) = 1
hash->size is 19315: write(2, " h a s h - > s i z e i".., 14) =
14
5319315: write(2, " 5 3", 2) = 2
19315: write(2, "\n", 1) = 1
Updating host 19315: write(2, " U p d a t i n g h o s".., 14) =
14
andy19315: write(2, " a n d y", 4) = 4
, metric 19315: write(2, " , m e t r i c ", 9) = 9
cpu_num19315: write(2, " c p u _ n u m", 7) = 7
19315: write(2, "\n", 1) = 1
19315: mkdir("/var/lib/ganglia/rrds/unspecified", 0755) Err#17 EEXIST
19315: mkdir("/var/lib/ganglia/rrds/unspecified/andy", 0755) Err#17
EEXIST
19315: stat("/var/lib/ganglia/rrds/unspecified/andy/cpu_num.rrd",
0xEBC05258) Err#2 ENOENT
19315: time() = 1115224171
19315: time() = 1115224171
19315: open("/usr/share/lib/zoneinfo/GB", O_RDONLY) = 9
19315: read(9, " T Z i f\0\0\0\0\0\0\0\0".., 8192) = 1323
19315: close(9) = 0
19315: brk(0x0009C690) = 0
19315: brk(0x0009E690) = 0
19315: Incurred fault #6, FLTBOUNDS %pc = 0xFF0CFA14
19315: siginfo: SIGSEGV SEGV_ACCERR addr=0x00060083
19315: Received signal #11, SIGSEGV [default]
19315: siginfo: SIGSEGV SEGV_ACCERR addr=0x00060083
19315: *** process killed ***
(gdb) run
Starting program: /usr/sbin/gmetad
[New LWP 1]
[New LWP 2]
[New LWP 3]
[New LWP 4]
Going to run as user nobody
Sources are ...
Source: [my cluster, step 15] has 1 sources
127.0.0.1
xml listening on port 8651
interactive xml listening on port 8652
cleanup thread has been started
[New LWP 5]
[New LWP 6]
[New LWP 7]
[New LWP 8]
Data thread 19 is monitoring [my cluster] data source
127.0.0.1
[New LWP 9]
[my cluster] is an OLD version
hash_create size = 1024
hash->size is 1031
hash_create size = 50
hash->size is 53
hash_create size = 50
hash->size is 53
Updating host andy, metric cpu_num
Program received signal SIGSEGV, Segmentation fault.
[Switching to LWP 7]
0xff0cfa14 in strtok_r () from /usr/lib/libc.so.1
Does anybody have an idea what I have done wrong ?
Cheers
Andy
----------------------------------------------------------------------------------------
This is a PRIVATE message. If you are not the intended recipient, please
delete without copying and kindly advise us by e-mail of the mistake in
delivery. NOTE: Regardless of content, this e-mail shall not operate to
bind CSC to any order or other contract unless pursuant to explicit
written agreement or government initiative expressly permitting the use of
e-mail for such purpose.
----------------------------------------------------------------------------------------