Federico Sacerdoti wrote:
To continue our interesting exchange,
I can start reading the phone book at any time, folks.
On Tuesday, September 3, 2002, at 03:37 PM, Steven Wagner wrote:
I am particularly interested in laying the groundwork for that so that
we can upgrade the monitoring core here with a vanilla RPM and not
worry about hosing any future proprietary metrics.
Definitely.
In this discussion, let's assume a node has received a metric from a host
named "subetha" that describes the size of CPU1's cache. The fully
qualified metric name is "cpu/cache/size".
The node hostnames here are 12 characters long (and the hostname
metric reports their FQDNs). I like the idea of identifying hosts
separately, via a metric transmitted sparingly, instead of tacking the
hostname onto every metric name the host transmits. Although I
definitely think hierarchical names are the right way to go as far as
naming schemes are concerned (debugging/metadata display), I believe
the hostname should be decoupled from the transmitted metric.
So the host name is actually implied. You don't need to include it in
any metric, since the source address of the XDR packet itself identifies
the originating host. The transmitted metric name only needs to locate
the metric in the host's tree hierarchy, like "cpu/cache/size".
Maintaining a separate hash, ((strlen(metricname) + 1 +
sizeof(uint32_t)) * num_of_metrics) bytes in size, with XDR metric
names as keys and internal metric indices as values, doesn't strike me
as too icky a memory/performance footprint either.
I agree that adding the hash is not difficult or slow. It would fit
right into our methodology of processing metrics. My problem is with
failure resistance. What do we do if we just crashed and don't know
about any branches?
The logic is already in the monitoring core to retransmit all metrics if a
node's "gmond_started" metric timestamp changes compared to the last value
stored in its metric hash. So the primary (oldest) node takes it upon itself
to retransmit the tree.
I suggest an extension to this: as I said a couple of ranty e-mails back, I
think that monitoring cores should wait before transmitting any metric data
beyond "gmond_started", "gmond_version", "hostname" (maybe) and "heartbeat"
... say, ten or fifteen seconds. This should provide ample time to build
up the metric tree, even if there are 85 metrics in there.
And even if there is a failure, it's only on one node. There are still n-1
nodes out there with an accurate picture of the cluster.
At some point a node is going to need each branch that contains a
metric to be described to it in detail.
Wouldn't it be easier if the metric name self-described its own
branches? Imagine that each metric name contains an absolute path
(filesystem analogy). Then the branches would not have to be described
in detail, they could be automatically created as necessary. The real
problem in my mind is shared fate. If the "branch-description" message
is separate from the "metric" message, then we can potentially receive
one without the other. If the branch description is included in the
metric packet (via a fully qualified name), then this is not a problem.
See above. I also proposed (about four emails ago) a DNS-like metric
resolver function that allows a libganglia-using client to submit a request
for description of a metric ... with answers being provided by the oldest
or second-oldest node.
Anyway, let's say that doesn't work, or your six-fig Cisco monkey shoved a
banana in a switch somewhere and the "create-branch" message arrives after
the metric itself.
At this point we have two options:
* Discard the metric data, process the create-branch data, and wait for the
next metric transmission. Straightforward, but it means a hole in the data
for up to t_max, and that'd be a bummer if it's one of those 15-minute metrics.
* Guess at adding the metric data based on the payload type of the XDR.
If you win and we have a string in there at least naming the actual metric,
then we sock it into an "uncategorized" branch and query/wait for the
branch data. After the create-branch data is received, we update the
lookup hash and the metric hash to move the guessed metric into its
rightful place. This is quite a bit more complicated, obviously. And we
can't report this metric until its rightful place is secured.
A third option springs to mind, which is really a more extreme version of
the second option:
* Freak out and send a request for the entire create_tree to be
retransmitted (or, if the freaking node is the eldest, request the
secondary node or the *new metric's originating node* to transmit its
entire metric tree). The idea being, if *one* metric isn't in the tree,
you can't be sure how many other branches are missing in the tree, so ask
the most authoritative host available.
I'm pretty sure at least half of the above is not insane.
I would further like to point out that the minimum packet size in the
Ethernet protocol is 64 bytes. As most metrics are numeric types that
require only 4 bytes or so, most of the packet will be empty. We have
plenty of room available for a fully qualified name, and that space will
otherwise be wasted.
It's the idea of having the XDR metric key be of a known, fixed length
that I like.
Have we ever investigated the idea of packing metrics into a struct or
array of some kind (possibly on a per-branch basis) and transmitting them at
the same time? If we're starting to consider logical metric grouping in
storage and data output, we might as well group them in transmission too.
Besides, atomic data sets like "cpu/idle_pct" and "cpu/user_pct" should
probably be collected *AND* transmitted as a set, don't you think? :)
If we start packing metrics like this, it might pave the way for
transmitting tabular data (such as "active processes 'ps' data", which I've
found is TOTALLY portable across our currently supported platforms, or
"individual CPU usage", "Disk partition status", "running job data for this
node according to some proprietary batch scheduler" etc. ...), which is
something I'd like to see.
Anyway...