Thanks a lot. I just want to find Ganglia with NVIDIA GPUs.

2011/6/17 <ganglia-general-requ...@lists.sourceforge.net>

> Send Ganglia-general mailing list submissions to
>        ganglia-general@lists.sourceforge.net
>
> To subscribe or unsubscribe via the World Wide Web, visit
>        https://lists.sourceforge.net/lists/listinfo/ganglia-general
> or, via email, send a message with subject or body 'help' to
>        ganglia-general-requ...@lists.sourceforge.net
>
> You can reach the person managing the list at
>        ganglia-general-ow...@lists.sourceforge.net
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of Ganglia-general digest..."
>
>
> Today's Topics:
>
>   1. Re: ganglia web: can't select my cluster (Alex Dean)
>   2. Re: ganglia web: can't select my cluster (Daems Dirk)
>   3. Gmond sends wrong hostname (Ron Cavallo)
>   4. C api to create ganglia metrics (Indranil C)
>   5. Re: C api to create ganglia metrics (saurabh verma)
>   6. Re: C api to create ganglia metrics (Alex Dean)
>   7. Problem with gmond after change in motherboard (Mark Panning)
>   8. Gmond Python module for monitoring NVIDIA GPUs (Bernard Li)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Wed, 15 Jun 2011 07:03:36 -0400
> From: Alex Dean <a...@crackpot.org>
> Subject: Re: [Ganglia-general] ganglia web: can't select my cluster
> To: Daems Dirk <dirk.da...@vito.be>
> Cc: "ganglia-general@lists.sourceforge.net"
>        <ganglia-general@lists.sourceforge.net>
> Message-ID: <4ff321f6-d811-4b0c-9322-029cdb8c9...@crackpot.org>
> Content-Type: text/plain; charset=us-ascii
>
> On Jun 15, 2011, at 4:24 AM, Daems Dirk wrote:
>
> > Hi Rick,
> >
> > Sorry for the confusion. Correction:
> > My gmetad port is 8651. This is the one that is referred to in the
> conf.php file.
> > My gmond port is 8649.
>
> If your installation is using the default port assignments, 8651 is the
> gmetad non-interactive XML port.  You should use the interactive port
> (typically 8652) in your conf.php file.
>
> alex
>
>
> ------------------------------
>
> Message: 2
> Date: Wed, 15 Jun 2011 13:42:15 +0200
> From: Daems Dirk <dirk.da...@vito.be>
> Subject: Re: [Ganglia-general] ganglia web: can't select my cluster
> To: Alex Dean <a...@crackpot.org>
> Cc: "ganglia-general@lists.sourceforge.net"
>        <ganglia-general@lists.sourceforge.net>
> Message-ID:
>        <38dbca6e4b96984188d0bb5e2e528d0a38b469a...@vitomail3.vito.local>
> Content-Type: text/plain; charset="us-ascii"
>
> Hi Alex,
>
> Indeed, I referred to a wrong portnumber in the conf.php file.
> It's working now as expected.
>
> Thanks!
> Dirk
>
> -----Original Message-----
> From: Alex Dean [mailto:a...@crackpot.org]
> Sent: woensdag 15 juni 2011 13:04
> To: Daems Dirk
> Cc: Rick Cobb; ganglia-general@lists.sourceforge.net
> Subject: Re: [Ganglia-general] ganglia web: can't select my cluster
>
> On Jun 15, 2011, at 4:24 AM, Daems Dirk wrote:
>
> > Hi Rick,
> >
> > Sorry for the confusion. Correction:
> > My gmetad port is 8651. This is the one that is referred to in the
> conf.php file.
> > My gmond port is 8649.
>
> If your installation is using the default port assignments, 8651 is the
> gmetad non-interactive XML port.  You should use the interactive port
> (typically 8652) in your conf.php file.
>
> alex
>
>
> ---
> This e-mail, any attachments and the information it contains are
> confidential and meant only for the use of the addressee(s) only.  Access to
> this e-mail by anyone other than the addressee(s) is unauthorized.  If you
> are not the intended addressee (or responsible for delivery of the message
> to such person), you may not use, copy, distribute or deliver to anyone this
> message (or any part of its contents) or take any action in reliance on it.
>  In such case, you should destroy this message and notify the sender
> immediately.  If you have received this e-mail in error, please notify us
> immediately by e-mail or telephone and delete the e-mail from any computer.
> All reasonable precautions have been taken to ensure no viruses are present
> in this e-mail and its attachments.  As our company cannot accept
> responsibility for any loss or damage arising from the use of this e-mail or
> attachments we recommend that you subject these to your virus checking
> procedures prior to use.
>
>
>
>
>
> ------------------------------
>
> Message: 3
> Date: Wed, 15 Jun 2011 13:51:13 -0500
> From: "Ron Cavallo" <ron_cava...@s5a.com>
> Subject: [Ganglia-general] Gmond sends wrong hostname
> To: <ganglia-general@lists.sourceforge.net>
> Cc: Alex Shoyket <alex_shoy...@s5a.com>
> Message-ID:
>        <
> d9f2a5e810ef7041bdfe9433122b1ac372b...@jxn-ms-mbs29.intranet.saksroot.saksinc.com
> >
>
> Content-Type: text/plain; charset="iso-8859-1"
>
> All,
>
> I have a few servers with /etc/hosts entries that refer to names and IP's
> to enable applications on those servers to function proplerly.
>
> Sometimes, gmond reads one of these entries as the hostname of the server
> upon restarting gmond.
>
> How can I fix this so that gmond always knows the name of the server?
>
> -RC
>
> Ron Cavallo
> Sr. Director, Infrastructure
> Saks Fifth Avenue / Saks Direct
> 12 East 49th Street
> New York, NY 10017
> 212-451-3807 (O)
> 212-940-5079 (fax)
> 646-315-0119(C)
> www.saks.com <http://www.saks.com/>
>
>
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 4
> Date: 16 Jun 2011 07:34:48 -0000
> From: "Indranil C" <indran...@rediff.co.in>
> Subject: [Ganglia-general] C api to create ganglia metrics
> To: "ganglia-general " <ganglia-general@lists.sourceforge.net>
> Message-ID:
>        <20110616073448.3590.qm...@pro237-210.mxout.rediffmailpro.com>
> Content-Type: text/plain; charset="utf-8"
>
> Hi,&nbsp; Is there a C API, using which I can create ganglia metrics?
> Basically I want to avoid calliing "gmetric", from my C code, as this seems
> to be too time and resource consuming for a large number of calls on a
> regular basis. Any thoughts?&nbsp;
>
> Thanks,
> Neel
>
> Treat yourself at a restaurant, spa, resort and much more with Rediff Deal
> ho jaye!
> -------------- next part --------------
> An HTML attachment was scrubbed...
>
> ------------------------------
>
> Message: 5
> Date: Thu, 16 Jun 2011 14:30:51 +0530
> From: saurabh verma <saurabh...@directi.com>
> Subject: Re: [Ganglia-general] C api to create ganglia metrics
> To: indran...@rediff.co.in,     ganglia-general
>        <ganglia-general@lists.sourceforge.net>
> Message-ID: <32ac4734-5b37-42fc-8632-63c97c20b...@directi.com>
> Content-Type: text/plain; charset=us-ascii
>
> I would require that too . is there any ?
>
> Thanks ,
> Saurabh
>
> On 16-Jun-2011, at 1:04 PM, Indranil C wrote:
>
> > Hi,
> >   Is there a C API, using which I can create ganglia metrics? Basically I
> want to avoid calliing "gmetric", from my C code, as this seems to be too
> time and resource consuming for a large number of calls on a regular basis.
> Any thoughts?
> >
> > Thanks,
> > Neel
>
>
>
>
> ------------------------------
>
> Message: 6
> Date: Thu, 16 Jun 2011 09:15:03 -0400
> From: Alex Dean <a...@crackpot.org>
> Subject: Re: [Ganglia-general] C api to create ganglia metrics
> To: ganglia-general <ganglia-general@lists.sourceforge.net>
> Message-ID: <5c0fc7a4-0520-436a-9301-7ddba0f12...@crackpot.org>
> Content-Type: text/plain; charset=us-ascii
>
>
> On Jun 16, 2011, at 3:34 AM, Indranil C wrote:
>
> > Hi,
> >   Is there a C API, using which I can create ganglia metrics? Basically I
> want to avoid calliing "gmetric", from my C code, as this seems to be too
> time and resource consuming for a large number of calls on a regular basis.
> Any thoughts?
> >
>
> You can create gmond modules using the C API.
> http://sourceforge.net/apps/trac/ganglia/wiki/ganglia_gmond_c_modules
>
> alex
>
>
>
>
> ------------------------------
>
> Message: 7
> Date: Thu, 16 Jun 2011 16:43:57 -0400
> From: Mark Panning <mpann...@ufl.edu>
> Subject: [Ganglia-general] Problem with gmond after change in
>        motherboard
> To: ganglia-general@lists.sourceforge.net
> Message-ID: <814ca24f-6071-43f4-ae46-be30f0a8e...@ufl.edu>
> Content-Type: text/plain; charset=US-ASCII; format=flowed; delsp=yes
>
> I suspect I'm missing some key point here somewhere, but ganglia has
> been broken on my cluster since I was forced to replace the
> motherboard on the master node, while it was working just fine prior
> to the change.
>
> Some details:
> I am running ganglia 3.0.6 on RHEL5
>
> gmond is apparently running fine on all nodes, and gmetad appears to
> also be running fine on the master node, but gmond will not work.
>
> I set debug level to 100 in gmond.conf, and here's what I get when I
> attempt to restart:
>
> [root@voigt ~]# /etc/init.d/gmond restart
> Shutting down GANGLIA gmond:                               [FAILED]
> Starting GANGLIA gmond: slurpfile() open() error on file /sys/devices/
> system/cpu/cpu0/cpufreq/scaling_max_freq: No such file or directory
> udp_recv_channel mcast_join=239.2.11.71 mcast_if=eth0 port=8649
> bind=239.2.11.71
> Error creating multicast server mcast_join=239.2.11.71 port=8649
> mcast_if=eth0 family='inet4'. Exiting.
>                                                            [FAILED]
>
> It appears to be a problem with multicast.
>
> I believe multicast should be up and running.  It was working fine on
> my switch configuration in the past, and no changes were made to the
> switch, so I assume that's fine.  As for the network interfaces (which
> did change with the change in motherboard), here's the output of
> ifconfig for eth0 related things, which is the internal network for
> the cluster:
>
> [root@voigt ~]# ifconfig
> eth0      Link encap:Ethernet  HWaddr B8:AC:6F:14:20:09
>           inet6 addr: fe80::baac:6fff:fe14:2009/64 Scope:Link
>           UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
>           RX packets:30202937055 errors:0 dropped:48 overruns:0 frame:0
>           TX packets:53456872695 errors:0 dropped:0 overruns:0
> carrier:0
>           collisions:0 txqueuelen:1000
>           RX bytes:6631035455709 (6.0 TiB)  TX bytes:77118419416930
> (70.1 TiB)
>           Interrupt:185 Memory:ec000000-ec012800
>
> eth0:1    Link encap:Ethernet  HWaddr B8:AC:6F:14:20:09
>           inet addr:10.0.0.1  Bcast:10.0.0.255  Mask:255.255.255.0
>           UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
>           Interrupt:185 Memory:ec000000-ec012800
>
> I tried creating a static route with route add -host 239.2.11.71 dev
> eth0, but I still get the same error.
>
> Any hints on further troubleshooting I can do to track down this
> problem, or further information I can send out?
>
> Mark
>
>
>
>
> ------------------------------
>
> Message: 8
> Date: Thu, 16 Jun 2011 23:21:52 -0700
> From: Bernard Li <bern...@vanhpc.org>
> Subject: [Ganglia-general] Gmond Python module for monitoring NVIDIA
>        GPUs
> To: Ganglia <ganglia-general@lists.sourceforge.net>
> Message-ID: <banlktim6+mbedo0x-jok5edxs67mc8u...@mail.gmail.com>
> Content-Type: text/plain; charset=ISO-8859-1
>
> Dear all:
>
> Just a quick note letting you guys know that we now have a python
> module for monitoring NVIDIA GPUs using the newly released Python
> bindings for NVML:
>
> https://github.com/ganglia/gmond_python_modules/tree/master/gpu/nvidia
>
> If you are running a cluster with NVIDIA GPUs, please download the
> module and give it a try.
>
> The module itself is pretty much feature complete, but the GUI/reports
> still need some work.  It would be cool if we could extend it to work
> with the new gweb 2.0 as well.  Please feel free to fork the repo and
> submit pull requests.
>
> Special thanks to the team at NVIDIA for their help in implementing
> the plugin and Jeremy Enos at NCSA for providing access to a NVIDIA
> GPU cluster.
>
> Cheers,
>
> Bernard
>
>
>
> ------------------------------
>
>
> ------------------------------------------------------------------------------
> EditLive Enterprise is the world's most technically advanced content
> authoring tool. Experience the power of Track Changes, Inline Image
> Editing and ensure content is compliant with Accessibility Checking.
> http://p.sf.net/sfu/ephox-dev2dev
>
> ------------------------------
>
> _______________________________________________
> Ganglia-general mailing list
> Ganglia-general@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/ganglia-general
>
>
> End of Ganglia-general Digest, Vol 61, Issue 14
> ***********************************************
>



-- 
-------------------------------------
Yours sincerely,
Huaxing Guo
Email address: ghxand...@gmail.com
High Performance and Grids Computing Center
Sun Yat-Sen University
Canton 510000, China
------------------------------------------------------------------------------
EditLive Enterprise is the world's most technically advanced content
authoring tool. Experience the power of Track Changes, Inline Image
Editing and ensure content is compliant with Accessibility Checking.
http://p.sf.net/sfu/ephox-dev2dev
_______________________________________________
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Reply via email to