Bernard,

  The failed directories are the ones to hold the rrds for each individual host 
like 'ganglia/grid/rrds/cluster/host/' and 
'ganglia/grid/rrds/cluster/__SummaryInfo__/'

This seems normal based on rrd_helpers.c where:

my_mkdir ( const char *dir )
{
   pthread_mutex_lock( &rrd_mutex );
   if ( mkdir ( dir, 0755 ) < 0 && errno != EEXIST)
      {
   pthread_mutex_unlock(&rrd_mutex);
         err_sys("Unable to mkdir(%s)",dir);
      }
   pthread_mutex_unlock( &rrd_mutex );
}

I assume that this is more efficient then checking for directory existence and 
then conditionally creating it.  A smaller setup on the same box also has the  
'gmetad   RET   mkdir -1 errno 17 File exists' in ktrace  but not on scale 
where it is noticeable.  

Please let me know if you would like any additional information.

Gil

----- Original Message ----
From: Bernard Li <[EMAIL PROTECTED]>
To: Gilad Raphaelli <[EMAIL PROTECTED]>; [EMAIL PROTECTED]; 
[email protected]
Sent: Friday, April 14, 2006 1:09:32 PM
Subject: RE: [Ganglia-developers] RE: [Ganglia-general] New (final?) tarball 
for ganglia-3.0.3

 <!-- DIV {margin:0px} -->  Hi Gil:
  
 I guess what would be helpful  to troubleshoot this, is figure out what 
directories failed to create and why -  that would probably help us figure out 
the problem.
  
 BTW, can you also test to see  if a smaller setup has the same issue?
  
 Cheers,
  
 Bernard

 
  From: Gilad Raphaelli  [mailto:[EMAIL PROTECTED]
Sent: Fri 14/04/2006 13:06
To:  Bernard Li; [EMAIL PROTECTED];  [email protected]
Subject: Re:  [Ganglia-developers] RE: [Ganglia-general] New (final?) tarball 
for  ganglia-3.0.3


   <!-- DIV {margin:0px} -->    I'm actually seeing 100% disk busy under both 
rhel4 and freebsd 4.11 with  just 98 nodes in 13 clusters.   My goal is to get 
gmetad running on  freebsd, rhel4 was just for comparision's sake.  A ktrace 
reveals 100s of  failed mkdirs during every writing period - traceable to 
rrd_helpers.c.   There don't seem to be any other significant events.  When the 
disk hits  100% iowait the system is unusable.

I was under the impression that a  relatively low powered system could handle 
something like this configuration -  perhaps that is the issue?  The box is a 
PIII 800 with 1.5 GB mem - the  rrds are stored on a dedicated 70 GB ide disk.

Any insight would be  appreciated.  I'm hanging out in #ganglia on freenode if 
anyone wants to  chat.

Thank you,

Gil

----- Original Message ----
From:  Bernard Li <[EMAIL PROTECTED]>
To: [EMAIL PROTECTED]; [EMAIL PROTECTED];  
[email protected]
Sent: Thursday, April 13, 2006  11:19:50 PM
Subject: [Ganglia-developers] RE: [Ganglia-general] New (final?)  tarball for 
ganglia-3.0.3

  Hi Martin:
  
 Finally had the time to test  it, here's the text in the webpage now:

Gmetad Web Frontend version  3.0.3.200604132304 Check for Updates.
Gmetad Web Backend (gmetad) version  3.0.3.200604102000 Check for Updates.
  
 Looks like it's  fixed.
  
 BTW, I tested Ganglia on  Fedora Core 5 x86 and it is working fine.
  
 Did anybody else test  3.0.3?  Somebody on IRC mentioned that he was having 
issues with gmetad  using up 99% CPU with a large number of clients (50+).
  
 Cheers,
  
 Bernard

 
  From: Martin Knoblauch  [mailto:[EMAIL PROTECTED]
Sent: Tue 11/04/2006 11:38
To:  Bernard Li; [EMAIL PROTECTED];  [email protected]
Subject: RE:  [Ganglia-general] New (final?) tarball for ganglia-3.0.3


  Bernard,

 could you please test the following patch  in "web" to solve this
really really big problem :-) You need to run  "./configure" to recreate
"web/version.php".

$diff -u -r1.9  ganglia.php
--- ganglia.php 25 Mar 2006 01:53:57  -0000      1.9
+++ ganglia.php 11 Apr 2006 18:34:31  -0000
@@ -33,7 +33,8 @@
 $version = array();

 # The web  frontend version, from conf.php.
-$version["webfrontend"] =  "$majorversion.$minorversion.$microversion";
+#$version["webfrontend"]  =
"$majorversion.$minorversion.$microversion";
+$version["webfrontend"] =  "$ganglia_version";

 # The name of our local grid.
 $self =  " ";


$diff -u -r1.1 version.php.in
---  version.php.in      10 Dec 2004 21:34:04  -0000      1.1
+++  version.php.in      11 Apr 2006 18:34:50 -0000
@@  -5,7 +5,7 @@
 $minorversion =  @GANGLIA_MINOR_VERSION@;
 $microversion =  @GANGLIA_MICRO_VERSION@;

-$ganglia_version  =
"@[EMAIL PROTECTED]@[EMAIL PROTECTED]@GANGLIA_MICRO_VERSION@";
+$ganglia_version  = "@GANGLIA_VERSION@";
 $ganglia_release_name    =  "@GANGLIA_RELEASE_NAME@";

 ?>


--- Bernard Li  <[EMAIL PROTECTED]> wrote:

> Just tested building and running on  Fedora Core 4 x86, everything
> checks out (minimal installation test) -  did notice this minor issue
> though:
> 
> Gmetad Web  Frontend version 3.0.3 Check for Updates.
> Gmetad Web Backend (gmetad)  version 3.0.3.200604102000 Check for
> Updates.
> 
>  Notice the versions are different between webfrontend and gmetad - I
>  guess they use difference sources for the version string?
> 
>  Chris, are you still planning to help us test with your  hardware?
> 
> Thanks,
> 
>  Bernard
> 
> P.S. If anybody wants the RPMs, please ping  me.
>
> ________________________________
>
> From:  [EMAIL PROTECTED] on behalf of Martin
>  Knoblauch
> Sent: Sat 08/04/2006 00:31
> To: ganglia general;  [email protected]
> Subject: [Ganglia-general] New  (final?) tarball for ganglia-3.0.3
>
>
>
>  Hi,
>
>  as promised, I have created a new pre-3.0.3 tarball.  It can be
> downloaded from:
>
> http://www.knobisoft.de/ganglia/ganglia-3.0.3.200604080900.tar.gz
>
>   Due to the release plans for OSCAR5, this could be the last snaphot
>  before a release next week.
>
>  Especially the following  problems are supposed to be solved:
>
> - truncated XML
> -  bogus "old protocol" messages in dead-host detection
> - gmetad will not  stop updating RRDs after a previous failure
> - apr-0.9.7 is now  officially in CVS
> - minor fixes to the webfrontend
> - more minor  stuff -> See the ChangeLog
>
> Cheers
>  Martin
>
>  ------------------------------------------------------
> Martin  Knoblauch
> email: k n o b i AT knobisoft DOT de
> www:    http://www.knobisoft.de
>
>
>  -------------------------------------------------------
> This SF.Net  email is sponsored by xPML, a groundbreaking scripting
> language
>  that extends applications into web and mobile media. Attend the live
>  webcast
> and join the prime developer group breaking into this new  coding
> territory!
>
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
>  _______________________________________________
> Ganglia-general mailing  list
> [EMAIL PROTECTED]
> https://lists.sourceforge.net/lists/listinfo/ganglia-general
>
>
>


------------------------------------------------------
Martin  Knoblauch
email: k n o b i AT knobisoft DOT de
www:   http://www.knobisoft.de









Reply via email to