Two things to check before concluding it's the code (though I think your points are valid):
If you have a data source that's misconfigured, with a cluster name that matches a different data source, you'll get this problem, but only on __SummaryInfo__ files. If you have a 3.0 system, and the same metric name is sent via both gmetric & gmond for the same host, you'll get this problem. I suspect that can still happen in 3.1, as long as the metric comes from two different modules. I found both these by looking at the pattern of the names of the files. If it's really happening randomly to any rrdfile, I'd suspect the code, but if it's clustering on specific ones, I'd suspect configuration. -- ReC On Sep 11, 2009, at 6:21 AM, Spike Spiegel wrote: > Hi, > > our gmetad boxes (2 of them) with 12 data sources, 6 of which are > gmetad and 6 gmonds, are spamming syslog like mad with the following > message: > > Sep 6 06:33:32 localhost.localdomain /usr/sbin/gmetad[2526]: > RRD_update (/var/lib/ganglia/rrds/...metric.rrd): illegal attempt to > update using time 1252244010 when last update time is 1252244010 > (minimum one second step) > > This happens for both metrics and summary graphs. > > Looking at the hosts every appear to be fine to me, and ntp is running > everywhere and in sync. > > Looking at the code instead both gmetad/gmetad.c and > gmetad/data_thread.c have a possibly suspicious call to sleep: > > in gmetad.c:417 > sleep_time = 10 + ((30-10)*1.0) * rand()/(RAND_MAX + 1.0); > sleep(sleep_time); > > in data_thread.c:193 > sleep_time = (d->step - 5) + (10 * (rand()/(float)RAND_MAX)) > - (end.tv_sec - start.tv_sec); > if( sleep_time > 0 ) > sleep(sleep_time); > > two observation: > - based on man 3 sleep, if any signal is sent to gmetad, the sleep > interval can be 0 > - end.tv_sec - start.tv_sec could compute to a considerably high > number that along with a short step could result in a sleep_time < = > 0. > > thoughts? > > thanks > > -- > "Behind every great man there's a great backpack" - B. > > ------------------------------------------------------------------------------ > Let Crystal Reports handle the reporting - Free Crystal Reports 2008 > 30-Day > trial. Simplify your report design, integration and deployment - and > focus on > what you do best, core application coding. Discover what's new with > Crystal Reports now. http://p.sf.net/sfu/bobj-july > _______________________________________________ > Ganglia-developers mailing list > Ganglia-developers@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/ganglia-developers ------------------------------------------------------------------------------ Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day trial. Simplify your report design, integration and deployment - and focus on what you do best, core application coding. Discover what's new with Crystal Reports now. http://p.sf.net/sfu/bobj-july _______________________________________________ Ganglia-developers mailing list Ganglia-developers@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/ganglia-developers