Sorry this last message has been stuck in the moderation queue for long
time as it was too large. But my problem has been resolved. Gmond could
not start on AIX 5300-06-01, this has been proven to be an OS issue.
Gmond starts fine after we upgraded OS away from this base level of
technology level 6. Thanks for everyone's help.

 

From: Li, Guosheng 
Sent: Monday, October 12, 2009 5:27 PM
To: Michael Perzl
Cc: [email protected]
Subject: Re: [Ganglia-general] gmond 3.1.2 not reporting any
performancedata on AIX 5300-06-01

 

Michael,  Thanks for your reply!

 

(/etc/ganglia)> /opt/freeware/sbin/gmond -m

loaded module: core_metrics

heartbeat       Last heartbeat (module core_metrics)

location        Location of the machine (module core_metrics)

gexec           gexec available (module core_metrics)

 

After installing apr-1.3.3-1, I got the same results:

 

(/etc/ganglia)> /etc/rc.d/init.d/gmond start

Starting GANGLIA gmond...

loaded module: core_metrics

udp_recv_channel mcast_join=NULL mcast_if=NULL port=8649 bind=NULL

tcp_accept_channel bind=NULL port=8649

udp_send_channel mcast_join=NULL mcast_if=NULL host=tsdux07 port=8649

Unable to find the metric information for 'kernel64bit'. Possible that
the module has not been loaded.

....

....

Unable to find the metric information for 'disk_free'. Possible that the
module has not been loaded.

        sending metadata for metric: heartbeat

        sent message 'heartbeat' of length 48 with 0 errors

        sending metadata for metric: location

        sent message 'location' of length 64 with 0 errors

        sending metadata for metric: gexec

        sent message 'gexec' of length 48 with 0 errors

  

From: Michael Perzl [mailto:[email protected]] 
Sent: Monday, October 12, 2009 4:40 PM
To: Li, Guosheng
Cc: Jesse Becker; [email protected]
Subject: Re: [Ganglia-general] gmond 3.1.2 not reporting any performance
data on AIX 5300-06-01

 

Guosheng,

judging from your "rpm -qa" list you seem to be using my AIX Ganglia RPM
packages. Can you please check the following two things:

Does "/opt/freeware/sbin/gmond -m" produce any output and can you please
post it ?

Can you please try to replace apr-1.3.3-2 with apr-1.3.3-1 from my web
site, just to make sure this is not caused by this apr version?

Thanks.

Regards,
Michael

Li, Guosheng wrote: 

(/etc/ganglia)> more gmond.conf
/* This configuration is as close to 2.5.x default behavior as possible
   The values closely match ./gmond/metric.h definitions in 2.5.x */
globals {
  daemonize = yes
  setuid = no
  user = nobody
  debug_level = 9
  max_udp_msg_len = 1472
  mute = no
  deaf = no
  host_dmax = 86400 /*secs */
  cleanup_threshold = 300 /*secs */
  gexec = no
  send_metadata_interval = 60  /*defaul 0, this is needed for displaying
on multiple groups */
}
...
 
-----Original Message-----
From: Jesse Becker [mailto:[email protected]] 
Sent: Monday, October 12, 2009 10:31 AM
To: Li, Guosheng
Cc: Ron Wellnitz; [email protected]
Subject: Re: [Ganglia-general] gmond 3.1.2 not reporting any performance
data on AIX 5300-06-01
 
Please post your gmond.conf file.
 
Thanks
 
On Mon, Oct 12, 2009 at 11:10, Li, Guosheng <[email protected]>
<mailto:[email protected]>  wrote:
  

        That is where I downloaded all the ganglia-related rpm packages.
The
        same version of ganglia works fine for other TLs or even the
same TL
        (06) but higher service pack levels (5300-06-03, 5300-06-05,
        5300-06-08). So I guess this might be a bug with 5300-06-01. But
I do
        not know what to check from OS side. bos.perf.libperfstat is at
        5.3.0.60, it should be fine as nmon that also uses it is working
well. I
        generated a few additional performance matrix for ethernet and
fibre
        channel adapters as well as paging space using gmetric tool,
they all
        work fine and display the graphs. Just none of the generic
matrix of
        Ganglia is being loaded. Here are the additional lines using
higher
        debug level (5 or 9), others are the same. Thanks again!
         
        Starting GANGLIA gmond...
        loaded module: core_metrics
        udp_recv_channel mcast_join=NULL mcast_if=NULL port=8649
bind=NULL
        tcp_accept_channel bind=NULL port=8649
        udp_send_channel mcast_join=NULL mcast_if=NULL host=tsdux07
port=8649
        Unable to find the metric information for 'kernel64bit'.
Possible that
        the module has not been loaded.
        .....
         
        -----Original Message-----
        From: Ron Wellnitz [mailto:[email protected]]
        Sent: Monday, October 12, 2009 9:56 AM
        To: Li, Guosheng
        Cc: [email protected]
        Subject: Re: [Ganglia-general] gmond 3.1.2 not reporting any
performance
        data on AIX 5300-06-01
         
         
         From where do you got the RPM-Packages...
        "http://www.perzl.org/ganglia/ganglia-files-v3.1.2.html";
<http://www.perzl.org/ganglia/ganglia-files-v3.1.2.html>  ? If not
please
         
        try one of this.
        I have no idea yet ;) Maybe increasing the debug level will show
more
        information.
         
        P.S.
        The  AIX-Boxes with other TL/ML  also run Ganglia 3.1.2 or a
different
        version ? In some cases the gmond.conf isn't compatible
        with a older/newer version of Ganglia.
         
        Greets Ron
         
        Li, Guosheng schrieb:
            

                Ron,
                Thanks for reminding me on setting debug_level=1. Below
is the output,
                it looks none of the modules has been loaded. Any idea
why? rpm -qa
                output is showing all the packages being installed
correctly. Thanks!
                Guosheng
                 
                Starting GANGLIA gmond...
                Unable to find the metric information for 'kernel64bit'.
Possible that
                the module has not been loaded.
                Unable to find the metric information for 'serial_num'.
Possible that
                the module has not been loaded.
                Unable to find the metric information for 'oslevel'.
Possible that the
                module has not been loaded.
                ....
                Unable to find the metric information for 'disk_total'.
Possible that
                the module has not been loaded.
                Unable to find the metric information for 'disk_free'.
Possible that
                      

                The module has not been loaded.
                 
                (/etc/ganglia)>  rpm -qa
                apr-1.3.3-2
                sudo-1.6.7p5-2
                expat-2.0.1-2
                mkisofs-1.13-4
                ganglia-lib-3.1.2-1
                ganglia-gmond-3.1.2-1
                ganglia-mod_ibmpower-3.1.2-1
                libconfuse-2.6-1
                cdrecord-1.9-4
                mtools-3.9.8-1
                openssl-0.9.7l-1
                AIX-rpm-5.3.0.50-6
                 
                 
                -----Original Message-----
                From: Ron Wellnitz [mailto:[email protected]]
                Sent: Monday, October 12, 2009 9:00 AM
                To: Li, Guosheng
                Cc: [email protected]
                Subject: Re: [Ganglia-general] gmond 3.1.2 not reporting
any
                   

        performance
            

                data on AIX 5300-06-01
                 
                Hi Guosheng,
                 
                have you tried to activate the debug mode (debug_level =
x) in your
                gmond.conf and check at the output?
                 
                Greets Ron
                 
                Li, Guosheng schrieb:

                        I have installed gmond 3.1.2 on a bunch of hosts
with AIX 5.3
                        Technology Level 6 Service Pack 1 (5300-06-01),
no problem with
                        installation and gmond process is running, but
no any graph displayed
                        and no data under /var/lib/ganglia/rrds on the
web server. "telnet

                        <hostname> 8649" only shows header lines, no
line between <CLUSTER...> and </CLUSTER>. Same results on all these
5300-06-01 servers, I have no problem with ganglia running on other AIX
servers with different

                        Technology or Service Pack levels. Is there a
known bug for gmond on
                        AIX 5300-06-01? How can I troubleshoot? Is there
a log file I can Look at?

                         
                        Thanks.
                        Guosheng

------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day 
trial. Simplify your report design, integration and deployment - and focus on 
what you do best, core application coding. Discover what's new with
Crystal Reports now.  http://p.sf.net/sfu/bobj-july
_______________________________________________
Ganglia-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Reply via email to