Today, Ryan Sweet wrote forth saying...

> one additional bit: running gmetad with debug on yeild lots of normal rrd 
> updates, followed by:
> 
> data_thread() couldn't parse the XML and data to RRD for [primary network]
> 
> where [primary network] is the data source I'm having trouble with....

that error message came from a failed process_xml() function in 
./gmetad/process_xml.c line 300 (although the message is in 
data_thread.c).  it would be nice to see the surrounding error messages 
too to help debug the problem.  if there are no error messages around it, 
then the only "silent" errors that i see would be caused by error'd 
write_data_to_rrd()'s (i'll make sure that err_msgs get put in there as 
they should be).  

here is a quick test to see if there are some corrupted round-robin 
databases.

jump into the rrd directory
# cd /var/lib/ganglia/rrds

list the size of all round robin databases
# find . -ls

you should only see two different fixed sizes.  if you have some files
which are not one size or the other then they are likely corrupted.

email the name of corrupted files and then delete them.  you don't need to 
restart gmetad.. it'll notice that the file is missing and make a new 
database and start filling it in.

-matt


Reply via email to