Re: [Ganglia-developers] gmond segfault with libpython

2014-02-09 Thread Maciej Lasyk
Ok, that's weird. Could you recheck SELinux config and/or logs in
/var/log/messages and audit.log?

On Sun, Feb 09, 2014 at 09:54:09AM -0500, Jeff Layton wrote:
 Maciej,
 
 I'm attaching a tar-gzip file with the strace logs (I hope it
 gets through the various filters).
 
 There are 11 separate strace files (I didn't combine them
 into one). The only one that has a segfault in it is 3537:
 
 access(/usr/local/lib64/ganglia/python_modules, F_OK) = 0
 access(/usr/local/lib64/ganglia/python_modules, R_OK) = 0
 --- SIGSEGV (Segmentation fault) @ 0 (0) ---
 
 
 I ran gmond as root. It appears to be something just after
 the permissions on python module directory are checked
 (my naive attempt at understand the strace output).
 
 Thanks!
 
 Jeff
 
 
 
 strace -s 1024 -ff -o strace.log gmond-c /etc/ganglia/gmond.conf
 



-- 
-- 
pozdrawiam,
Maciej Lasyk

GPG key ID: FFA8AEEC
GPG info: http://maciek.lasyk.info/gpg.txt


pgpjwSku3ALi3.pgp
Description: PGP signature
--
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231iu=/4140/ostg.clktrk___
Ganglia-developers mailing list
Ganglia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-developers


Re: [Ganglia-developers] gmond segfault with libpython

2014-02-09 Thread Jeff Layton

Maciej,

Yep, SELinux is off:


[root@home4 laytonjb]# cat /etc/sysconfig/selinux

# This file controls the state of SELinux on the system.
# SELINUX= can take one of these three values:
# enforcing - SELinux security policy is enforced.
# permissive - SELinux prints warnings instead of enforcing.
# disabled - No SELinux policy is loaded.
SELINUX=disabled
# SELINUXTYPE= can take one of these two values:
# targeted - Targeted processes are protected,
# mls - Multi Level Security protection.
SELINUXTYPE=targeted



I don't see anything in any logs. Here is the end of /var/log/messages:


Feb  9 13:14:07 home4 kernel: gmond[8866]: segfault at 8 ip 
0036a7ce6ceb sp 7fffabb234d0 error 4 in 
libpython2.6.so.1.0[36a7c0+15d000]
Feb  9 13:14:07 home4 abrt[8877]: Saved core dump of pid 8866 
(/usr/local/sbin/gmond) to /var/spool/abrt/ccpp-2014-02-09-13:14:07-8866 
(5361664 bytes)
Feb  9 13:14:07 home4 abrtd: Directory 'ccpp-2014-02-09-13:14:07-8866' 
creation detected
Feb  9 13:14:07 home4 abrtd: Executable '/usr/local/sbin/gmond' doesn't 
belong to any package and ProcessUnpackaged is set to 'no'
Feb  9 13:14:07 home4 abrtd: 'post-create' on 
'/var/spool/abrt/ccpp-2014-02-09-13:14:07-8866' exited with 1
Feb  9 13:14:07 home4 abrtd: Deleting problem directory 
'/var/spool/abrt/ccpp-2014-02-09-13:14:07-8866'



I did check audit.log and there was only one line that
pertained to anything


type=ANOM_ABEND msg=audit(1391969647.201:480): auid=500 uid=0 gid=0 
ses=1 pid=8866 comm=gmond sig=11




I built the code with almost everything enabled. I thought
about removing some packages from my system and
rebuilding. For example, I could try removing libcmemcached
but I'm not sure that will do any good.

Maybe I need to drop back to an older ganglia version?

Thanks for your help!

Jeff


Ok, that's weird. Could you recheck SELinux config and/or logs in
/var/log/messages and audit.log?

On Sun, Feb 09, 2014 at 09:54:09AM -0500, Jeff Layton wrote:

Maciej,

I'm attaching a tar-gzip file with the strace logs (I hope it
gets through the various filters).

There are 11 separate strace files (I didn't combine them
into one). The only one that has a segfault in it is 3537:

access(/usr/local/lib64/ganglia/python_modules, F_OK) = 0
access(/usr/local/lib64/ganglia/python_modules, R_OK) = 0
--- SIGSEGV (Segmentation fault) @ 0 (0) ---


I ran gmond as root. It appears to be something just after
the permissions on python module directory are checked
(my naive attempt at understand the strace output).

Thanks!

Jeff




strace -s 1024 -ff -o strace.log gmond-c /etc/ganglia/gmond.conf





--
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231iu=/4140/ostg.clktrk


___
Ganglia-developers mailing list
Ganglia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-developers


--
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231iu=/4140/ostg.clktrk___
Ganglia-developers mailing list
Ganglia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-developers


Re: [Ganglia-developers] gmond segfault with libpython

2014-02-09 Thread Jeff Layton
On 02/09/2014 02:28 PM, Maciej Lasyk wrote:
 On Sun, Feb 09, 2014 at 01:22:08PM -0500, Jeff Layton wrote:

 Maybe I need to drop back to an older ganglia version?
 What kind of repositories do you use on this Centos (yum repolist all)?
 Does 'yum update' finishes without any problems?
I apologies for the length.

repo idrepo 
name   status
C6.0-base  CentOS-6.0 - 
Base   disabled
C6.0-centosplusCentOS-6.0 - 
CentOSPlus disabled
C6.0-contrib   CentOS-6.0 - 
Contribdisabled
C6.0-extrasCentOS-6.0 - 
Extras disabled
C6.0-updates   CentOS-6.0 - 
Updatesdisabled
C6.1-base  CentOS-6.1 - 
Base   disabled
C6.1-centosplusCentOS-6.1 - 
CentOSPlus disabled
C6.1-contrib   CentOS-6.1 - 
Contribdisabled
C6.1-extrasCentOS-6.1 - 
Extras disabled
C6.1-updates   CentOS-6.1 - 
Updatesdisabled
C6.2-base  CentOS-6.2 - 
Base   disabled
C6.2-centosplusCentOS-6.2 - 
CentOSPlus disabled
C6.2-contrib   CentOS-6.2 - 
Contribdisabled
C6.2-extrasCentOS-6.2 - 
Extras disabled
C6.2-updates   CentOS-6.2 - 
Updatesdisabled
C6.3-base  CentOS-6.3 - 
Base   disabled
C6.3-centosplusCentOS-6.3 - 
CentOSPlus disabled
C6.3-contrib   CentOS-6.3 - 
Contribdisabled
C6.3-extrasCentOS-6.3 - 
Extras disabled
C6.3-updates   CentOS-6.3 - 
Updatesdisabled
C6.4-base  CentOS-6.4 - 
Base   disabled
C6.4-centosplusCentOS-6.4 - 
CentOSPlus disabled
C6.4-contrib   CentOS-6.4 - 
Contribdisabled
C6.4-extrasCentOS-6.4 - 
Extras disabled
C6.4-updates   CentOS-6.4 - 
Updatesdisabled
adobe-linux-i386   Adobe Systems 
Incorporated  enabled:  2
adobe-linux-x86_64 Adobe Systems 
Incorporated  enabled:  2
base   CentOS-6 - 
Base enabled:  6,367
c6-media   CentOS-6 - 
Mediadisabled
centosplus CentOS-6 - 
Plus disabled
contribCentOS-6 - 
Contrib  disabled
debug  CentOS-6 - 
Debuginfodisabled
elrepo ELRepo.org Community Enterprise Linux Repository 
- el6  enabled:262
elrepo-extras  ELRepo.org Community Enterprise Linux Extras 
Repository - e disabled
elrepo-kernel  ELRepo.org Community Enterprise Linux Kernel 
Repository - e disabled
elrepo-testing ELRepo.org Community Enterprise Linux Testing 
Repository -  disabled
epel   Extra Packages for Enterprise Linux 6 - 
x86_64  enabled: 10,444
epel-debuginfo Extra Packages for Enterprise Linux 6 - x86_64 - 
Debug  disabled
epel-sourceExtra Packages for Enterprise Linux 6 - x86_64 - 
Source disabled
epel-testing   Extra Packages for Enterprise Linux 6 - Testing - 
x86_64disabled
epel-testing-debuginfo Extra Packages for Enterprise Linux 6 - Testing - 
x86_64 -  disabled
epel-testing-sourceExtra Packages for Enterprise Linux 6 - Testing - 
x86_64 -  disabled
extras CentOS-6 - 
Extras   enabled: 14
updatesCentOS-6 - 
Updates  enabled:464
virtualbox Oracle Linux / RHEL / CentOS-6 / x86_64 - 
VirtualBoxenabled: 22
repolist: 17,577

Everything updates correctly (i.e. it finishes).

I built ganglia from source - I didn't use any rpm's.



 You could also try to catch on which particular check this segfault
 happens..?

Not sure how to check this. When I run gmond interactively, it
segfaults just after it says,

[root@home4 yum.repos.d]# /usr/local/sbin/gmond -d 

Re: [Ganglia-developers] gmond segfault with libpython

2014-02-09 Thread Jeff Layton
On 02/09/2014 02:48 PM, Jeff Layton wrote:
 On 02/09/2014 02:28 PM, Maciej Lasyk wrote:

 You could also try to catch on which particular check this segfault
 happens..?
 Not sure how to check this. When I run gmond interactively, it
 segfaults just after it says,

 [root@home4 yum.repos.d]# /usr/local/sbin/gmond -d 5 -c
 /etc/ganglia/gmond.conf
 loaded module: core_metrics
 loaded module: python_module
 loaded module: cpu_module
 loaded module: disk_module
 loaded module: load_module
 loaded module: mem_module
 loaded module: net_module
 loaded module: proc_module
 loaded module: sys_module
 loaded module: python_module
 Segmentation fault (core dumped)


 I'm not sure where to begin checking. I'm a very old-fashioned
 debugger - I tend to use a great deal of print statements to
 track down where things are happening. I can start doing this
 in gmond.

I tried putting fprintf's all over the gmond.c (yep - I'm that
poor of a debugger). I'm not sure but if looks like it segfaults
in the function setup_metric_callbacks on the statement,

   if (modp-init  modp-init(global_context)) {

or on the function,

   apr_pool_cleanup_register(global_context, modp,
 modular_metric_cleanup,
 apr_pool_cleanup_null);

I'm not too sure.

I apologize if I'm wasting your time with my poor debugging
skills.

Thanks!

Jeff




--
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231iu=/4140/ostg.clktrk
___
Ganglia-developers mailing list
Ganglia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-developers


Re: [Ganglia-developers] gmond segfault with libpython

2014-02-09 Thread Jeff Layton

Sure - let me run off and do this.

In the meantime, I tried rebuilding ganglia with 3.5.0 and
3.4.0. For both of these cases, I get the same segfault. This
makes me think I'm doing something wrong but I can seem
to find it :)

I'll let you know what I find out as I go back to ganglia-3.6.0.

Thanks!

Jeff


Oh I didn't think about going that lowlevel :) Could you run ldd on
gmond also?

Could you also run 'tree' command on /etc/ganglia ? It's interesting
that you have two times msg: loaded module: python_module while
starting gmond. Rechecking this with strace log shows that it looks like
double loading of those modules? http://pastebin.com/BjdCGgbj


On Sun, Feb 09, 2014 at 03:28:10PM -0500, Jeff Layton wrote:

On 02/09/2014 02:48 PM, Jeff Layton wrote:

On 02/09/2014 02:28 PM, Maciej Lasyk wrote:


You could also try to catch on which particular check this segfault
happens..?

Not sure how to check this. When I run gmond interactively, it
segfaults just after it says,

[root@home4 yum.repos.d]# /usr/local/sbin/gmond -d 5 -c
/etc/ganglia/gmond.conf
loaded module: core_metrics
loaded module: python_module
loaded module: cpu_module
loaded module: disk_module
loaded module: load_module
loaded module: mem_module
loaded module: net_module
loaded module: proc_module
loaded module: sys_module
loaded module: python_module
Segmentation fault (core dumped)


I'm not sure where to begin checking. I'm a very old-fashioned
debugger - I tend to use a great deal of print statements to
track down where things are happening. I can start doing this
in gmond.

I tried putting fprintf's all over the gmond.c (yep - I'm that
poor of a debugger). I'm not sure but if looks like it segfaults
in the function setup_metric_callbacks on the statement,

   if (modp-init  modp-init(global_context)) {

or on the function,

   apr_pool_cleanup_register(global_context, modp,
 modular_metric_cleanup,
 apr_pool_cleanup_null);

I'm not too sure.

I apologize if I'm wasting your time with my poor debugging
skills.

Thanks!

Jeff






--
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231iu=/4140/ostg.clktrk


___
Ganglia-developers mailing list
Ganglia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-developers


--
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231iu=/4140/ostg.clktrk___
Ganglia-developers mailing list
Ganglia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-developers


Re: [Ganglia-developers] gmond segfault with libpython

2014-02-09 Thread Jeff Layton

I hope this isn't too much output (I've heard about pastebin.com
but never really used it).


[root@home4 ganglia-3.6.0]# ldd /usr/local/sbin/gmond
linux-vdso.so.1 =  (0x7fff667f6000)
libapr-1.so.0 = /usr/lib64/libapr-1.so.0 (0x7f6a24049000)
libresolv.so.2 = /lib64/libresolv.so.2 (0x00337dc0)
libganglia-3.6.0.so.0 = /usr/local/lib64/libganglia-3.6.0.so.0 
(0x7f6a23e0d000)

libdl.so.2 = /lib64/libdl.so.2 (0x00337c40)
libnsl.so.1 = /lib64/libnsl.so.1 (0x003390c0)
libz.so.1 = /lib64/libz.so.1 (0x00337d00)
libpcre.so.0 = /lib64/libpcre.so.0 (0x003f7360)
libexpat.so.1 = /lib64/libexpat.so.1 (0x00337f80)
libconfuse.so.0 = /usr/lib64/libconfuse.so.0 (0x7f6a23bff000)
libpthread.so.0 = /lib64/libpthread.so.0 (0x00337c80)
libc.so.6 = /lib64/libc.so.6 (0x00337c00)
libuuid.so.1 = /lib64/libuuid.so.1 (0x00338380)
libcrypt.so.1 = /lib64/libcrypt.so.1 (0x00338c60)
/lib64/ld-linux-x86-64.so.2 (0x00337bc0)
libfreebl3.so = /usr/lib64/libfreebl3.so (0x00338ca0)


Below is the tree output:


[root@home4 ganglia-3.6.0]# tree /etc/ganglia
/etc/ganglia
??? conf.d
?   ??? procstat.pyconf
??? gmetad.conf
??? gmond.conf

1 directory, 3 files


I looked at the strace file for process 3537 and I did see two places
where gmond does an access() on the python_modules directory.
Does gmond automatically look for the python modes so I don't need
to put them the modules section of gmond.conf?

Thanks a million!

Jeff



Oh I didn't think about going that lowlevel :) Could you run ldd on
gmond also?

Could you also run 'tree' command on /etc/ganglia ? It's interesting
that you have two times msg: loaded module: python_module while
starting gmond. Rechecking this with strace log shows that it looks like
double loading of those modules? http://pastebin.com/BjdCGgbj


On Sun, Feb 09, 2014 at 03:28:10PM -0500, Jeff Layton wrote:

On 02/09/2014 02:48 PM, Jeff Layton wrote:

On 02/09/2014 02:28 PM, Maciej Lasyk wrote:


You could also try to catch on which particular check this segfault
happens..?

Not sure how to check this. When I run gmond interactively, it
segfaults just after it says,

[root@home4 yum.repos.d]# /usr/local/sbin/gmond -d 5 -c
/etc/ganglia/gmond.conf
loaded module: core_metrics
loaded module: python_module
loaded module: cpu_module
loaded module: disk_module
loaded module: load_module
loaded module: mem_module
loaded module: net_module
loaded module: proc_module
loaded module: sys_module
loaded module: python_module
Segmentation fault (core dumped)


I'm not sure where to begin checking. I'm a very old-fashioned
debugger - I tend to use a great deal of print statements to
track down where things are happening. I can start doing this
in gmond.

I tried putting fprintf's all over the gmond.c (yep - I'm that
poor of a debugger). I'm not sure but if looks like it segfaults
in the function setup_metric_callbacks on the statement,

   if (modp-init  modp-init(global_context)) {

or on the function,

   apr_pool_cleanup_register(global_context, modp,
 modular_metric_cleanup,
 apr_pool_cleanup_null);

I'm not too sure.

I apologize if I'm wasting your time with my poor debugging
skills.

Thanks!

Jeff






--
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231iu=/4140/ostg.clktrk


___
Ganglia-developers mailing list
Ganglia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-developers


--
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231iu=/4140/ostg.clktrk___
Ganglia-developers mailing list
Ganglia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-developers


Re: [Ganglia-developers] gmond segfault with libpython

2014-02-09 Thread Maciej Lasyk
Could you post here your build options (that ones you entered while
./configure) and also could you paste gmond.conf into pastebin?

Also plz strace one more time, but now with strace -s 1024 -e trace=file
and paste the output to pastebin


On Sun, Feb 09, 2014 at 04:55:14PM -0500, Jeff Layton wrote:
 I hope this isn't too much output (I've heard about pastebin.com
 but never really used it).
 
 
 [root@home4 ganglia-3.6.0]# ldd /usr/local/sbin/gmond
 linux-vdso.so.1 =  (0x7fff667f6000)
 libapr-1.so.0 = /usr/lib64/libapr-1.so.0 (0x7f6a24049000)
 libresolv.so.2 = /lib64/libresolv.so.2 (0x00337dc0)
 libganglia-3.6.0.so.0 =
 /usr/local/lib64/libganglia-3.6.0.so.0 (0x7f6a23e0d000)
 libdl.so.2 = /lib64/libdl.so.2 (0x00337c40)
 libnsl.so.1 = /lib64/libnsl.so.1 (0x003390c0)
 libz.so.1 = /lib64/libz.so.1 (0x00337d00)
 libpcre.so.0 = /lib64/libpcre.so.0 (0x003f7360)
 libexpat.so.1 = /lib64/libexpat.so.1 (0x00337f80)
 libconfuse.so.0 = /usr/lib64/libconfuse.so.0 (0x7f6a23bff000)
 libpthread.so.0 = /lib64/libpthread.so.0 (0x00337c80)
 libc.so.6 = /lib64/libc.so.6 (0x00337c00)
 libuuid.so.1 = /lib64/libuuid.so.1 (0x00338380)
 libcrypt.so.1 = /lib64/libcrypt.so.1 (0x00338c60)
 /lib64/ld-linux-x86-64.so.2 (0x00337bc0)
 libfreebl3.so = /usr/lib64/libfreebl3.so (0x00338ca0)
 
 
 Below is the tree output:
 
 
 [root@home4 ganglia-3.6.0]# tree /etc/ganglia
 /etc/ganglia
 ??? conf.d
 ?   ??? procstat.pyconf
 ??? gmetad.conf
 ??? gmond.conf
 
 1 directory, 3 files
 
 
 I looked at the strace file for process 3537 and I did see two places
 where gmond does an access() on the python_modules directory.
 Does gmond automatically look for the python modes so I don't need
 to put them the modules section of gmond.conf?
 
 Thanks a million!
 
 Jeff
 
 
 Oh I didn't think about going that lowlevel :) Could you run ldd on
 gmond also?
 
 Could you also run 'tree' command on /etc/ganglia ? It's interesting
 that you have two times msg: loaded module: python_module while
 starting gmond. Rechecking this with strace log shows that it looks like
 double loading of those modules? http://pastebin.com/BjdCGgbj
 
 
 On Sun, Feb 09, 2014 at 03:28:10PM -0500, Jeff Layton wrote:
 On 02/09/2014 02:48 PM, Jeff Layton wrote:
 On 02/09/2014 02:28 PM, Maciej Lasyk wrote:
 
 You could also try to catch on which particular check this segfault
 happens..?
 Not sure how to check this. When I run gmond interactively, it
 segfaults just after it says,
 
 [root@home4 yum.repos.d]# /usr/local/sbin/gmond -d 5 -c
 /etc/ganglia/gmond.conf
 loaded module: core_metrics
 loaded module: python_module
 loaded module: cpu_module
 loaded module: disk_module
 loaded module: load_module
 loaded module: mem_module
 loaded module: net_module
 loaded module: proc_module
 loaded module: sys_module
 loaded module: python_module
 Segmentation fault (core dumped)
 
 
 I'm not sure where to begin checking. I'm a very old-fashioned
 debugger - I tend to use a great deal of print statements to
 track down where things are happening. I can start doing this
 in gmond.
 I tried putting fprintf's all over the gmond.c (yep - I'm that
 poor of a debugger). I'm not sure but if looks like it segfaults
 in the function setup_metric_callbacks on the statement,
 
if (modp-init  modp-init(global_context)) {
 
 or on the function,
 
apr_pool_cleanup_register(global_context, modp,
  modular_metric_cleanup,
  apr_pool_cleanup_null);
 
 I'm not too sure.
 
 I apologize if I'm wasting your time with my poor debugging
 skills.
 
 Thanks!
 
 Jeff
 
 
 
 
 
 --
 Managing the Performance of Cloud-Based Applications
 Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
 Read the Whitepaper.
 http://pubads.g.doubleclick.net/gampad/clk?id=121051231iu=/4140/ostg.clktrk
 
 
 ___
 Ganglia-developers mailing list
 Ganglia-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/ganglia-developers
 

-- 
-- 
pozdrawiam,
Maciej Lasyk

GPG key ID: FFA8AEEC
GPG info: http://maciek.lasyk.info/gpg.txt


pgphnhmNENVnW.pgp
Description: PGP signature
--
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231iu=/4140/ostg.clktrk___
Ganglia-developers mailing list
Ganglia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-developers


Re: [Ganglia-developers] GSoC application started, more help needed

2014-02-09 Thread Bernard Li
Hi Daniel:

https://twitter.com/igrigorik/status/432255646306537472

Perhaps we can invite Ilya to be a mentor?  I believe he is (or at least
was) a user of Ganglia and I think he works for Google now...?

Cheers,

Bernard

On Friday, 7 February 2014, Daniel Pocock dan...@pocock.com.au wrote:


 Please feel free to add potential project ideas here:

 https://github.com/ganglia/monitor-core/wiki/GSoC-2014-project-ideas

 For an example of how the project ideas are documented in other
 organisations, see these pages:

https://wiki.debian.org/SummerOfCode2014/Projects
http://community.apache.org/gsoc.html

 For those who have a Google account, please also register at
   http://www.google-melange.com

 and ask to be added as a mentor for the Ganglia organisation

 If you are willing to be a mentor (or work as part of a joint mentoring
 team, which makes things easier for everybody), please include your name
 and a link to your blog or Github profile or something on the wiki.
 These things will help Ganglia's chance of getting selected.



 --
 Managing the Performance of Cloud-Based Applications
 Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
 Read the Whitepaper.

 http://pubads.g.doubleclick.net/gampad/clk?id=121051231iu=/4140/ostg.clktrk
 ___
 Ganglia-developers mailing list
 Ganglia-developers@lists.sourceforge.net javascript:;
 https://lists.sourceforge.net/lists/listinfo/ganglia-developers

--
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231iu=/4140/ostg.clktrk___
Ganglia-developers mailing list
Ganglia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-developers


[Ganglia-developers] How to display node title to node hostname instead of IP address?

2014-02-09 Thread jupiter
Hi,

I've installed the latest release ganglia web 3.5.11, it display the
node with IP address in each node graphic. How can I change the node
title to use node hostname not IP address?

Thank you.

-j

--
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231iu=/4140/ostg.clktrk
___
Ganglia-developers mailing list
Ganglia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-developers


Re: [Ganglia-developers] How to display node title to node hostname instead of IP address?

2014-02-09 Thread Bernard Li
Please make sure that the collector daemon has hostname entries in
/etc/hosts for all the hosts involved.  Once that's done, please
restart everything.

BTW, questions such as these should be asked in the ganglia-general
mailing-list -- this list is reserved for development discussions.

Thanks,

Bernard

On Sun, Feb 9, 2014 at 9:27 PM, jupiter jupiter@gmail.com wrote:
 Hi,

 I've installed the latest release ganglia web 3.5.11, it display the
 node with IP address in each node graphic. How can I change the node
 title to use node hostname not IP address?

 Thank you.

 -j

 --
 Managing the Performance of Cloud-Based Applications
 Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
 Read the Whitepaper.
 http://pubads.g.doubleclick.net/gampad/clk?id=121051231iu=/4140/ostg.clktrk
 ___
 Ganglia-developers mailing list
 Ganglia-developers@lists.sourceforge.net
 https://lists.sourceforge.net/lists/listinfo/ganglia-developers

--
Managing the Performance of Cloud-Based Applications
Take advantage of what the Cloud has to offer - Avoid Common Pitfalls.
Read the Whitepaper.
http://pubads.g.doubleclick.net/gampad/clk?id=121051231iu=/4140/ostg.clktrk
___
Ganglia-developers mailing list
Ganglia-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-developers


Re: [Ganglia-developers] gmond segfault with libpython

2014-02-09 Thread Bernard Li
Do we still have a maintainer for the Ganglia packages for EPEL?  If
not, should we see if somebody would like to fill that position?

Thanks,

Bernard

On Sun, Feb 9, 2014 at 6:15 PM, Vladimir Vuksan vli...@veus.hr wrote:
 Those RPMS work just fine for me

 [root@localhost ~]# uname -a
 Linux localhost.localdomain 2.6.32-431.el6.x86_64 #1 SMP Fri Nov 22 03:15:09
 UTC 2013 x86_64 x86_64 x86_64 GNU/Linux
 [root@localhost ~]# cat /etc/issue
 CentOS release 6.5 (Final)
 Kernel \r on an \m

 [root@localhost ~]# rpm -ivh
 http://vuksan.com/centos/RPMS-6/x86_64/ganglia-gmond-modules-python-3.6.0-1.x86_64.rpm
 http://vuksan.com/centos/RPMS-6/x86_64/libganglia-3.6.0-1.x86_64.rpm
 http://vuksan.com/centos/RPMS-6/x86_64/ganglia-gmond-3.6.0-1.x86_64.rpm
 http://vuksan.com/centos/RPMS-6/x86_64/libconfuse-2.6-2.el6.rf.x86_64.rpm
 Retrieving
 http://vuksan.com/centos/RPMS-6/x86_64/ganglia-gmond-modules-python-3.6.0-1.x86_64.rpm
 Retrieving
 http://vuksan.com/centos/RPMS-6/x86_64/libganglia-3.6.0-1.x86_64.rpm
 Retrieving
 http://vuksan.com/centos/RPMS-6/x86_64/ganglia-gmond-3.6.0-1.x86_64.rpm
 Retrieving
 http://vuksan.com/centos/RPMS-6/x86_64/libconfuse-2.6-2.el6.rf.x86_64.rpm
 warning: /var/tmp/rpm-tmp.JtASFF: Header V3 DSA/SHA1 Signature, key ID
 6b8d79e6: NOKEY
 Preparing...###
 [100%]
1:libconfuse ### [
 25%]
2:libganglia ### [
 50%]
3:ganglia-gmond  ### [
 75%]
4:ganglia-gmond-modules-p###
 [100%]
 [root@localhost ~]# gmond -d 2

 loaded module: core_metrics
 loaded module: cpu_module
 loaded module: disk_module
 loaded module: load_module
 loaded module: mem_module
 loaded module: net_module
 loaded module: proc_module
 loaded module: sys_module
 loaded module: python_module
 udp_recv_channel mcast_join=239.2.11.71 mcast_if=NULL port=8649
 bind=239.2.11.71 buffer=0




 On 02/09/2014 09:39 AM, Jeff Layton wrote:

 Vladimir,

 I initially tried your binaries on my 6.5 system and I could
 not get them to install and run (I think they were built with
 a 6.3 system).

 At some point I'll try building the rpm's and installing those.
 Hopefully there is no different in the build process - that
 would be very interesting if the rpm's worked and building
 from source didn't :)

 I'll let you know - but first I'm going to try Maciej's strace
 idea.

 Thanks!

 Jeff

 P.S. There are some pretty significant differences between
 6.4 and 6.5. One big one that I know of is the ntp format
 changed.

 I have not seen issues with Centos 6 however I usually build my RPM
 packages. You could do that if you type

 rpmbuild -tb ganglia-3.6.0.tar.gz

 Alternatively if you are interested to try prebuilt packages you can find
 them here.

 http://vuksan.com/centos/RPMS-6/x86_64/

 Vladimir

 On 02/08/2014 11:11 AM, Jeff Layton wrote:

 Good morning,

 I'm running a CentOS 6.5 system with ganglia 3.6.0 and
 ganglia-web 3.5.12. I'm following the general guidelines in
 this article:

 http://sachinsharm.wordpress.com/tag/installing-ganglia/

 Everything goes swimmingly and ganglia itself works fine. So I
 decide to go to the next step and try using Python  with
 gmond. I followed the general guidelines in this article:

 http://sachinsharm.wordpress.com/2013/08/19/setup-and-configure-ganglia-python-modules-on-centosrhel-6-3/

 But when I start up gmond I get a segfault as reported in
 /var/log/messages.


 Feb  5 19:58:47 home4 kernel: gmond[17992]: segfault at 8 ip
 0036a7ce6ceb sp 7fffaad46bf0 error 4 in
 libpython2.6.so.1.0[36a7c0+15d000]
 Feb  5 19:58:47 home4 abrt[18003]: Saved core dump of pid 17992
 (/usr/local/sbin/gmond) to /var/spool/abrt/ccpp-2014-02-05-19:58:47-17992
 (4284416 bytes)
 Feb  5 19:58:47 home4 abrtd: Directory 'ccpp-2014-02-05-19:58:47-17992'
 creation detected
 Feb  5 19:58:47 home4 abrtd: Executable '/usr/local/sbin/gmond' doesn't
 belong to any package and ProcessUnpackaged is set to 'no'
 Feb  5 19:58:47 home4 abrtd: 'post-create' on
 '/var/spool/abrt/ccpp-2014-02-05-19:58:47-17992' exited with 1
 Feb  5 19:58:47 home4 abrtd: Deleting problem directory
 '/var/spool/abrt/ccpp-2014-02-05-19:58:47-17992'


 I'm been trying to debug this but I have to admit that I'm
 coming up blank. Running gmond with debug doesn't give
 too much information:

 [root@home4 laytonjb]# gmond -d 5 -c /etc/ganglia/gmond.conf
 loaded module: core_metrics
 loaded module: cpu_module
 loaded module: disk_module
 loaded module: load_module
 loaded module: mem_module
 loaded module: net_module
 loaded module: proc_module
 loaded module: sys_module
 loaded module: python_module
 loaded module: python_module
 Segmentation fault (core dumped)


 I went back to the Mod_Python section of the book (p. 89) and
 tried to make things as simple as possible to trace down the