Re: [Ganglia-developers] hierarchical metric naming (long)

Federico Sacerdoti Wed, 04 Sep 2002 10:23:23 -0700

To continue our interesting exchange,

On Tuesday, September 3, 2002, at 03:37 PM, Steven Wagner wrote:

I am particularly interested in laying the groundwork for that so that we can upgrade the monitoring core here with a vanilla RPM and not worry about hosing any future proprietary metrics.

Definitely.

In this discussion, let's assume a node has received a metric from a host named "subetha" that describes the size of its CPU1's cache. The fully qualified metric name is "cpu/cache/size".

The node hostnames here are 12 characters long (and the hostname metric reports their FQDNs). I like the idea of identifying the hosts separately using a metric transmitted sparingly instead of tacking it on to every metric name the host transmits... although I definitely think it's the right way to go as far as naming schemes go (debug/metadata display), I believe the hostname should be decoupled from the transmitted metric.

So the host name is actually implied. You don't need to include it in any metric, since the source address of the XDR packet itself identifies the originating host. The transmitted metric name only needs to locate the metric in the host's tree hierarchy, like "cpu/cache/size".

Maintaining a separate hash (of (strlen(metricname) + 1 + sizeof(uint32_t)) * num_of_metrics bytes in size) that has XDR metric names as keys and an internal metric index as values doesn't strike me as being too icky from a memory/performance footprint either.
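To put a number on that estimate, here is a rough sketch in Python (chosen for brevity; it is not gmond code). It sums the per-key cost over actual names rather than assuming one uniform strlen, and it ignores whatever overhead the hash table itself adds. The function name, the constant, and the sample metric names are made up for illustration.

```python
UINT32_SIZE = 4  # stands in for sizeof(uint32_t) in C


def hash_footprint(metric_names):
    """Approximate bytes of key/value data: each key is stored as a
    NUL-terminated string (strlen + 1) next to a uint32 metric index.
    Hash-table bookkeeping overhead is deliberately left out."""
    return sum(len(name.encode("ascii")) + 1 + UINT32_SIZE
               for name in metric_names)


# Hypothetical metric names, used only to exercise the formula.
names = ["cpu/cache/size", "cpu/1/idle_percentage", "mem/free"]
print(hash_footprint(names))
```

Even with a few hundred metrics of this length, the total stays in the low kilobytes, which is consistent with the footprint not being a concern.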

I agree that adding the hash is not difficult or slow. It would fit right into our methodology of processing metrics. My problem is with failure resistance. What do we do if we just crashed and don't know about any branches?

At some point a node is going to need each branch that contains a metric to be described to it in detail.

Wouldn't it be easier if the metric name self-described its own branches? Imagine that each metric name contains an absolute path (filesystem analogy). Then the branches would not have to be described in detail; they could be automatically created as necessary. The real problem in my mind is shared fate. If the "branch-description" message is separate from the "metric" message, then we can potentially receive one without the other. If the branch description is included in the metric packet (via a fully qualified name), then this is not a problem.
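A minimal sketch of the auto-creation idea, in Python for brevity rather than gmond's C. The nested-dict tree, the function name store_metric, and the value 512 are assumptions for illustration only. The point is that a node starting from an empty tree (say, right after a crash) can rebuild every branch from the metric packets alone.

```python
def store_metric(tree, path, value):
    """Store value at an absolute metric path such as "cpu/cache/size",
    creating any missing intermediate branches along the way."""
    parts = [p for p in path.split("/") if p]
    if not parts:
        raise ValueError("empty metric path")
    node = tree
    for branch in parts[:-1]:
        child = node.setdefault(branch, {})
        if not isinstance(child, dict):
            # A metric value already occupies this name; refuse to overwrite it.
            raise ValueError("%r is a metric, not a branch" % branch)
        node = child
    node[parts[-1]] = value
    return tree


# The host key would come from the packet's source address, not the name.
hosts = {}
store_metric(hosts.setdefault("subetha", {}), "cpu/cache/size", 512)
print(hosts)
```

No separate branch-description message is needed here, so there is nothing that can arrive without its metric.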

I would further like to point out that the minimum frame size in Ethernet is 64 bytes. As most metrics are numeric types that require only 4 bytes or so, most of the packet will be empty. We have plenty of room available for a fully qualified name, and that space will otherwise be wasted.
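A back-of-the-envelope check of how much room that leaves, sketched in Python. It assumes the metric travels over UDP on IPv4 with no IP options, and it assumes a hypothetical payload of just one XDR string (the name) plus one 4-byte value; the real gmond packet layout may carry more fields. The Ethernet, IPv4, UDP, and XDR sizes are the standard ones.

```python
ETH_MIN_FRAME = 64  # minimum Ethernet frame, header and FCS included
ETH_HEADER = 14     # destination MAC + source MAC + EtherType
ETH_FCS = 4         # frame check sequence
IPV4_HEADER = 20    # assumes no IP options
UDP_HEADER = 8


def xdr_string_size(s):
    """XDR encodes a string as a 4-byte length followed by the bytes,
    padded up to a multiple of 4."""
    n = len(s.encode("ascii"))
    return 4 + (n + 3) // 4 * 4


def free_room():
    """UDP payload bytes that fit in a minimum-size frame without growing it."""
    return ETH_MIN_FRAME - ETH_HEADER - ETH_FCS - IPV4_HEADER - UDP_HEADER


# Hypothetical payload: the metric name plus a single 4-byte numeric value.
payload = xdr_string_size("cpu/cache/size") + 4
print(free_room(), payload)
```

Under these assumptions the headers already use most of the 64 bytes, so a name like "cpu/cache/size" pushes the frame slightly past the minimum rather than fitting entirely in padding. The added cost is still only a handful of bytes per metric.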

I always liked URLs. Besides, everybody knows what a URL looks like. At least, everybody who's running gmond does, I hope ...
gmond://host[.cluster?]/cpu/1/idle_percentage
[etc.]


Excellent idea.
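For what it's worth, a name in that form is easy to take apart with a stock URL parser. The sketch below is in Python and uses the standard library's urlsplit. The "gmond" scheme is only the proposal from this thread, and treating everything after the first dot of the host part as the cluster is an assumption about what "host[.cluster?]" means; with FQDN hostnames a different rule would be needed.

```python
from urllib.parse import urlsplit


def parse_metric_url(url):
    """Split a proposed gmond:// metric URL into (host, cluster, metric path).
    cluster is None when the host part contains no dot."""
    parts = urlsplit(url)
    if parts.scheme != "gmond" or not parts.hostname:
        raise ValueError("not a gmond metric URL: %r" % url)
    # Assumption: "host.cluster" splits at the first dot.
    host, _, cluster = parts.hostname.partition(".")
    metric = parts.path.lstrip("/")
    return host, cluster or None, metric


print(parse_metric_url("gmond://subetha/cpu/1/idle_percentage"))
```

The path component that falls out is exactly the fully qualified metric name discussed earlier, so the URL form and the tree form stay interchangeable.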

-Federico

Rocks Cluster Group, Camp X-Ray, SDSC, San Diego
GPG Fingerprint: 3C5E 47E7 BDF8 C14E ED92  92BB BA86 B2E6 0390 8845
