Hello John,
I am in the same situation that you describe. I am using 3.0.1 / 
rrdtool 1.0.41, and the gmetad is connected to 27 other gmetads. After 
double-checking, I found no duplicated "CLUSTER NAME" in the XML 
received from those 27 gmetads.
In my case the problem also occurs in the node directories, not only in 
"__SummaryInfo__", although mainly in "__SummaryInfo__".
With the gmetad debug_level enabled, the following output shows that it is 
gmetad itself that attempts to write the "disk_free" metric to the rrdtool 
database twice in the same second:
Updating host spa42sta, metric cpuRate
Updating host spa42sta, metric tcpQueueRate
Writing Summary data for source A1430_DB_42_LN, metric disk_free
Writing Summary data for source A1430_DB_42_LN, metric Data_node
Writing Summary data for source A1430_DB_42_LN, metric bytes_out
Writing Summary data for source A1430_DB_42_LN, metric proc_total
Writing Summary data for source A1430_DB_42_LN, metric NdbOperation
Writing Summary data for source A1430_DB_42_LN, metric cpuRate
Writing Summary data for source A1430_DB_42_LN, metric SFTP_BANDWIDTH_3
Writing Summary data for source A1430_DB_42_LN, metric tcpQueueRate
Writing Summary data for source A1430_DB_42_LN, metric cpu_nice
Writing Summary data for source A1430_DB_42_LN, metric pkts_in
Writing Summary data for source A1430_DB_42_LN, metric cpu_speed
Writing Summary data for source A1430_DB_42_LN, metric nfsd_v3_getattr
Writing Summary data for source A1430_DB_42_LN, metric boottime
Writing Summary data for source A1430_DB_42_LN, metric diskstat_sda_iowait_queue
Writing Summary data for source A1430_DB_42_LN, metric PdPContext
Writing Summary data for source A1430_DB_42_LN, metric PdPModel
Writing Summary data for source A1430_DB_42_LN, metric NdbTransaction
Writing Summary data for source A1430_DB_42_LN, metric nfsd_v3_setattr
Writing Summary data for source A1430_DB_42_LN, metric cpu_wio
Writing Summary data for source A1430_DB_42_LN, metric nfsd_v3_lookup
Writing Summary data for source A1430_DB_42_LN, metric Committed_AS
Writing Summary data for source A1430_DB_42_LN, metric nfsd_v3_create
Writing Summary data for source A1430_DB_42_LN, metric load_one
Writing Summary data for source A1430_DB_42_LN, metric disk_total
Writing Summary data for source A1430_DB_42_LN, metric nfsd_v3_remove
Writing Summary data for source A1430_DB_42_LN, metric cpu_user
Writing Summary data for source A1430_DB_42_LN, metric cpu_idle
Writing Summary data for source A1430_DB_42_LN, metric Questions
Writing Summary data for source A1430_DB_42_LN, metric swap_free
Writing Summary data for source A1430_DB_42_LN, metric pkts_out
Writing Summary data for source A1430_DB_42_LN, metric mem_cached
Writing Summary data for source A1430_DB_42_LN, metric load_five
Writing Summary data for source A1430_DB_42_LN, metric cpu_num
Writing Summary data for source A1430_DB_42_LN, metric load_fifteen
Writing Summary data for source A1430_DB_42_LN, metric diskstat_sda_writes
Writing Summary data for source A1430_DB_42_LN, metric mem_free
Writing Summary data for source A1430_DB_42_LN, metric diskstat_sda_reads
Writing Summary data for source A1430_DB_42_LN, metric nfsd_v3_read
Writing Summary data for source A1430_DB_42_LN, metric cpu_system
Writing Summary data for source A1430_DB_42_LN, metric proc_run
Writing Summary data for source A1430_DB_42_LN, metric nfsd_v3_write
Writing Summary data for source A1430_DB_42_LN, metric mem_total
Writing Summary data for source A1430_DB_42_LN, metric cpu_aidle
Writing Summary data for source A1430_DB_42_LN, metric bytes_in
Writing Summary data for source A1430_DB_42_LN, metric mem_buffers
Writing Summary data for source A1430_DB_42_LN, metric mem_shared
Writing Summary data for source A1430_DB_42_LN, metric Index_node
Writing Summary data for source A1430_DB_42_LN, metric nfsd_v3_access
Writing Summary data for source A1430_DB_42_LN, metric swap_total
Writing Summary data for source A1430_DB_42_LN, metric SFTP_BANDWIDTH_1
Writing Summary data for source A1430_DB_42_LN, metric part_max_used
Writing Summary data for source A1430_DB_42_LN, metric disk_free
RRD_update (/var/lib/ganglia/rrds/A1430_DB_42_LN/__SummaryInfo__/disk_free.rrd): illegal attempt to update using time 1209013018 when last update time is 1209013018 (minimum one second step)
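
A rough way to spot such double writes without reading the whole log by eye is to count the "Writing Summary data" lines per source and metric. The Python sketch below is only an illustration: the function name is made up, and it assumes the log text passed in covers a single summary pass of gmetad, in the same line format as the output above.

```python
import re
from collections import Counter

# Matches the debug lines shown above. "\s+" after "metric" also tolerates
# a metric name that was wrapped onto the next line by the mail client.
SUMMARY_LINE = re.compile(r"Writing Summary data for source (\S+), metric\s+(\S+)")


def duplicate_summary_writes(log_text):
    """Return {(source, metric): count} for pairs written more than once.

    Assumption: log_text holds the debug output of one summary pass only;
    across several passes every metric would naturally appear repeatedly.
    """
    counts = Counter(SUMMARY_LINE.findall(log_text))
    return {pair: n for pair, n in counts.items() if n > 1}


if __name__ == "__main__":
    sample = (
        "Writing Summary data for source A1430_DB_42_LN, metric disk_free\n"
        "Writing Summary data for source A1430_DB_42_LN, metric cpu_num\n"
        "Writing Summary data for source A1430_DB_42_LN, metric disk_free\n"
    )
    print(duplicate_summary_writes(sample))
```

On the output above this would flag disk_free for source A1430_DB_42_LN, the same metric named in the RRD_update error.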

After deeper investigation, I queried port 8652 of cluster 
A1430_DB_42_LN with the request "/?filter=summary" and found that the 
metric disk_free is reported twice.
My assumption is therefore that the problem lies in the gmetad(s) that send 
the XML, and not in the central gmetad.
In my case these gmetads are running release 3.0.5.
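
The same check can be scripted. The Python sketch below is a minimal illustration, not a tested tool: it sends the "/?filter=summary" request to port 8652 and lists metric names that appear more than once under the same parent element. The function names are hypothetical, and it assumes the summary metrics arrive as METRICS (or METRIC) elements carrying a NAME attribute, and that gmetad closes the connection after sending the XML.

```python
import socket
import xml.etree.ElementTree as ET
from collections import Counter


def fetch_summary_xml(host, port=8652, request="/?filter=summary\n", timeout=10.0):
    """Send the request to the gmetad port and read until the peer closes."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(request.encode("ascii"))
        chunks = []
        while True:
            data = sock.recv(65536)
            if not data:
                break
            chunks.append(data)
    return b"".join(chunks)


def duplicated_metrics(xml_bytes):
    """Return {parent name: [metric names reported more than once]}.

    Assumption: metrics are child elements tagged METRICS or METRIC
    with a NAME attribute.
    """
    root = ET.fromstring(xml_bytes)
    duplicates = {}
    for parent in root.iter():
        names = Counter(
            child.get("NAME")
            for child in parent
            if child.tag in ("METRICS", "METRIC") and child.get("NAME")
        )
        repeated = sorted(name for name, n in names.items() if n > 1)
        if repeated:
            duplicates[parent.get("NAME") or parent.tag] = repeated
    return duplicates
```

Usage would look like `duplicated_metrics(fetch_summary_xml("some-gmetad-host"))`, where the host name is a placeholder for one of the gmetads being polled.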

I have no more time at the moment to continue the investigation, but this is 
a first-level analysis.
Regards.
Christian.




----- Original Message ----- 
From: "John Swift" <[EMAIL PROTECTED]>
To: <[email protected]>
Sent: Wednesday, April 23, 2008 1:28 AM
Subject: [Ganglia-general] RRD error spamming my messages file


> Hey guys,
>
> I have the constantly occurring RRD error on one of my
> gmetad machines. I browsed through the archives for
> this mailing list but I am somewhat confused between
> all the messages I have found. I'm not exactly sure
> what I'm supposed to do to eliminate the problem.
>
> My messages all have "__SummaryInfo__" in them. From
> what I could understand from the archived messages,
> this means a cluster name is being duplicated...right?
> Someone suggested using netcat to figure this out but
> I'm not clear on exactly how. Can someone explain?
>
> Also, is there a way I can output this error to some
> other file besides /var/log/messages?
>
> My error looks like this:
> Apr 22 00:07:43 machine01 /usr/sbin/gmetad[10254]:
> RRD_update
> (/var/lib/ganglia/rrds/__SummaryInfo__/proc_total.rrd):
> illegal attempt to update using time 1208848063 when
> last update time is 1208848063 (minimum one second
> step)
>
> Note that all machines are on NTP.
>
> Thank you in advance for your help.
>
>
> 
> _______________________________________________
> Ganglia-general mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/ganglia-general
> 


