On Thu, Aug 18, 2005 at 05:05:58PM -0500, Utsav Agarwal wrote:
> The solution we came up with is to change the code to report only the
> hostname part of the FQDN. That solves our current problem. However, I
> wonder what will happen if the gmond 'collector' cannot resolve hostnames
> (dns down...which admittedly is a bad situation). The rrd files will now be
> created by the node's IP? 

Yes.  You can fix it by fixing DNS, shutting down gmetad, and then
renaming the directory to the correct name.

-- Brooks

> -----Original Message-----
> From: [EMAIL PROTECTED]
> [mailto:[EMAIL PROTECTED] On Behalf Of Jason A.
> Smith
> Sent: Wednesday, August 17, 2005 10:30 AM
> To: Utsav Agarwal
> Cc: Ganglia General
> Subject: RE: [Ganglia-general] gmond - fully qualified host lookup
> 
> I don't think you can force functions like getnameinfo(3) to use DNS
> instead of /etc/hosts, because they rely on the system configuration to
> determine how to resolve things.  For example, on Linux look at the
> glibc config file /etc/nsswitch.conf, a lot of systems will probably
> have a line like:
> 
> hosts:      files dns
> 
> which means all glibc functions, like getnameinfo, will first
> try /etc/hosts and then dns if it is not found in the local file.  If
> you want to force gmond to always use dns then you would have to replace
> getnameinfo with some dns client library function, possibly from bind or
> some other library.  This might introduce portability problems though,
> and some people might not want that behavior so it might have to be made
> configurable.
> 
> ~Jason
> 
> 
> On Wed, 2005-08-17 at 10:00 -0500, Utsav Agarwal wrote:
> > We are using udp unicast packets from our cluster to the gmond
> 'collector',
> > which gets polled by a gmetad server. My reasoning is mainly based on the
> > above scenario.
> > 
> > The reason gmond may do the lookups is that it is hierarchically closer
> > (cluster using NAT, own dns server?) in dns to the cluster, whereas the
> > Gmetad can be in a different dns zone or site. However, there could be an
> > option in the config files to state where name lookups need to be done.
> > In our case, it is not a bad idea for gmond to do the lookups, as long as
> it
> > could use a consistent scheme for lookups. Right now the library that
> gmond
> > plugs into seems to use dns for external ip's and /etc/hosts for its own
> ip.
> > 
> > I am not a programmer: While looking at the source code, I saw some
> > references to apr_getnameinfo, which calls getnameinfo. Man getnameinfo
> > gives no flag to force a FQDN response, but there is a flag to force a
> short
> > response. Perhaps FQDN response is the default, but then why would we get
> > the short version for the 'collector' gmond.
> > 
> > Snippet from manpage:
> >  The flags argument modifies the behaviour of getnameinfo(3) as follows:
> > 
> >        NI_NOFQDN
> >               If set, return only the hostname part of the FQDN for local
> > hosts.
> > 
> >        NI_NUMERICHOST
> >               If set, then the numeric form of the hostname is returned.
> > (When  not  set,  this  will
> >               still happen in case the node's name cannot be looked up.)
> > 
> >        NI_NAMEREQD
> >               If set, then a error is returned if the hostname cannot be
> > looked up.
> > 
> >        NI_NUMERICSERV
> >               If  set,  then  the service address is returned in numeric
> > form, for example by its port
> >               number.
> > 
> >        NI_DGRAM
> >               If set, then the service is datagram (UDP) based rather than
> > stream (TCP) based. This is
> >               required for the few ports (512-514) that have different
> > services for UDP and TCP.
> > 
> > Utsav.
> > 
> > 
> > -----Original Message-----
> > From: [EMAIL PROTECTED]
> > [mailto:[EMAIL PROTECTED] On Behalf Of Jason A.
> > Smith
> > Sent: Wednesday, August 17, 2005 9:16 AM
> > To: [EMAIL PROTECTED]
> > Cc: steven wagner; Ganglia General
> > Subject: Re: [Ganglia-general] gmond - fully qualified host lookup
> > 
> > I was thinking about this a little more, why does gmond need to resolve
> > IPs anyway?  Convenience when debugging and to spare gmetad and possibly
> > other third party collectors of the xml data from doing the lookups.
> > Why not just let gmond keep the IPs only and then make gmetad resolve
> > the IPs?
> > 
> > One option that Matt suggested is to modify gmond and create a config
> > section where you could statically define names for IPs which gmond
> > would use to fill its internal hash table and therefore preventing it
> > from resolving those IPs.
> > 
> > Another option I was thinking of is to make a config setting in gmond
> > where it could specify if and what fqdn it should send out for itself,
> > like the location setting.  This would also benefit multi-nic'd hosts so
> > you could specify the hostname you would like that node to be known as
> > if it happens to be different than what it is known by on its multicast
> > interface.  For example, this would be useful if you are using a private
> > network for gmond's multicast but would like to have your nodes known by
> > their public name.  The only drawback is that since gmond uses udp,
> > there is no guarantee of successful data delivery, so how do you handle
> > the case when there is no fqdn data for some nodes?  Any thoughts?
> > 
> > ~Jason
> > 
> > 
> > On Tue, 2005-08-16 at 18:29 -0400, Jason A. Smith wrote:
> > > This has come up a few times in the past I believe, once a few years ago
> > > when we had similar problems on some Solaris servers we were monitoring
> > > with a similar /etc/hosts file setup.
> > > 
> > > I believe that ganglia just uses system calls like gethostbyaddr which
> > > rely on the OS/system libs to determine how to resolve names (for glibc
> > > see /etc/nsswitch.conf).  To get consistent results you would probably
> > > have to force ganglia to always do DNS lookups (like changing it to use
> > > bind libs instead), but I think this might introduce some portability
> > > problems (or some people might not want to rely on DNS so this would
> > > have to be made a config option).  We just ended up changing the format
> > > in our /etc/hosts files instead of opening up this can of worms.
> > > 
> > > ~Jason
> > > 
> > > 
> > > On Tue, 2005-08-16 at 16:56 -0500, [EMAIL PROTECTED] wrote:
> > > > Steven,
> > > >   Thanks for the response. It will not be practical to change the
> > /etc/hosts entry in a production environment. I am going to look at the
> > gmond source code. Any other pointers will be helpful.
> > > > 
> > > > Thanks,
> > > > Utsav Agarwal
> > > > 
> > > > ------------------------------------------------
> > > > On Tue, 16 Aug 2005 14:43:45 -0700, steven wagner <[EMAIL PROTECTED]>
> > wrote:
> > > > 
> > > > > Change the order of the hosts entry to:
> > > > > 
> > > > > <IP address>  node1.domain.com  node1  any-other-aliases
> > > > > 
> > > > > That should do it...
> > > > > 
> > > > > [EMAIL PROTECTED] wrote:
> > > > > > Hello all,
> > > > > >     We have the following setup and issue:     
> > > > > >   
> > > > > > All cluster nodes send (unicast udp) data to node1 and the main
> > gmetad server collects data from node1. The XML stream obtained from node1
> > reports all nodes except itself in a fully qualified fashion. We would
> like
> > node1's gmond to report itself as fully qualified without changing
> > /etc/hosts file. 
> > > > > > 
> > > > > > On node1:
> > > > > > The command host <ip of node1> reports node1's FQDN.
> > > > > > The command hostname reports node1
> > > > > > In /etc/hosts, the entry is: 
> > > > > > <ipadress>  node1 node1.domain.com
> > > > > > 
> > > > > > Any suggestions will be helpful.
> > > > > > 
> > > > > > Thanks,
> > > > > > Utsav Agarwal
> > > > > > 
> > > > > > 
> > > > > > -------------------------------------------------------
> > > > > > SF.Net email is Sponsored by the Better Software Conference & EXPO
> > > > > > September 19-22, 2005 * San Francisco, CA * Development Lifecycle
> > Practices
> > > > > > Agile & Plan-Driven Development * Managing Projects & Teams *
> > Testing & QA
> > > > > > Security * Process Improvement & Measurement *
> > http://www.sqe.com/bsce5sf
> > > > > > _______________________________________________
> > > > > > Ganglia-general mailing list
> > > > > > Ganglia-general@lists.sourceforge.net
> > > > > > https://lists.sourceforge.net/lists/listinfo/ganglia-general
> > > > > 
> > > > > 
> > > > > 
> > > > > 
> > > > > -------------------------------------------------------
> > > > > SF.Net email is Sponsored by the Better Software Conference & EXPO
> > > > > September 19-22, 2005 * San Francisco, CA * Development Lifecycle
> > Practices
> > > > > Agile & Plan-Driven Development * Managing Projects & Teams *
> Testing
> > & QA
> > > > > Security * Process Improvement & Measurement *
> > http://www.sqe.com/bsce5sf
> > > > > _______________________________________________
> > > > > Ganglia-general mailing list
> > > > > Ganglia-general@lists.sourceforge.net
> > > > > https://lists.sourceforge.net/lists/listinfo/ganglia-general
> > > > > 
> > > > 
> > > > 
> > > > -------------------------------------------------------
> > > > SF.Net email is Sponsored by the Better Software Conference & EXPO
> > > > September 19-22, 2005 * San Francisco, CA * Development Lifecycle
> > Practices
> > > > Agile & Plan-Driven Development * Managing Projects & Teams * Testing
> &
> > QA
> > > > Security * Process Improvement & Measurement *
> > http://www.sqe.com/bsce5sf
> > > > _______________________________________________
> > > > Ganglia-general mailing list
> > > > Ganglia-general@lists.sourceforge.net
> > > > https://lists.sourceforge.net/lists/listinfo/ganglia-general
> > > > 
> > > -- 
> > > /------------------------------------------------------------------\
> > > |  Jason A. Smith                          Email:  [EMAIL PROTECTED] |
> > > |  Atlas Computing Facility, Bldg. 510M    Phone:  (631)344-4226   |
> > > |  Brookhaven National Lab, P.O. Box 5000  Fax:    (631)344-7616   |
> > > |  Upton, NY 11973-5000                                            |
> > > \------------------------------------------------------------------/
> > > 
> > > 
> > > 
> > > 
> > > -------------------------------------------------------
> > > SF.Net email is Sponsored by the Better Software Conference & EXPO
> > > September 19-22, 2005 * San Francisco, CA * Development Lifecycle
> > Practices
> > > Agile & Plan-Driven Development * Managing Projects & Teams * Testing &
> QA
> > > Security * Process Improvement & Measurement *
> http://www.sqe.com/bsce5sf
> > > _______________________________________________
> > > Ganglia-general mailing list
> > > Ganglia-general@lists.sourceforge.net
> > > https://lists.sourceforge.net/lists/listinfo/ganglia-general
> > > 
> -- 
> /------------------------------------------------------------------\
> |  Jason A. Smith                          Email:  [EMAIL PROTECTED] |
> |  Atlas Computing Facility, Bldg. 510M    Phone:  (631)344-4226   |
> |  Brookhaven National Lab, P.O. Box 5000  Fax:    (631)344-7616   |
> |  Upton, NY 11973-5000                                            |
> \------------------------------------------------------------------/
> 
> 
> 
> 
> -------------------------------------------------------
> SF.Net email is Sponsored by the Better Software Conference & EXPO
> September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
> Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
> Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
> _______________________________________________
> Ganglia-general mailing list
> Ganglia-general@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/ganglia-general
> 
> 
> 
> -------------------------------------------------------
> SF.Net email is Sponsored by the Better Software Conference & EXPO
> September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
> Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
> Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
> _______________________________________________
> Ganglia-general mailing list
> Ganglia-general@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/ganglia-general
-- 
Any statement of the form "X is the one, true Y" is FALSE.
PGP fingerprint 655D 519C 26A7 82E7 2529  9BF0 5D8E 8BE9 F238 1AD4

Attachment: pgpp24I7sT8F6.pgp
Description: PGP signature

Reply via email to