The debug build of 3.6.0 finally crashed over the weekend.  The backtrace
is:
#0  0x00007f042e4ba38c in hash_insert (key=0x7f0425bcc440,
val=0x7f0425bcc430, hash=0x7239d0) at hash.c:233
#1  0x0000000000408551 in startElement_METRIC (data=0x7f0425bcc770,
el=0x733930 "METRIC", attr=0x709270) at process_xml.c:677
#2  0x00000000004092b2 in start (data=0x7f0425bcc770, el=0x733930 "METRIC",
attr=0x709270) at process_xml.c:1036
#3  0x00007f042d55b5fb in ?? () from /lib/x86_64-linux-gnu/libexpat.so.1
#4  0x00007f042d55c84e in ?? () from /lib/x86_64-linux-gnu/libexpat.so.1
#5  0x00007f042d55e36e in ?? () from /lib/x86_64-linux-gnu/libexpat.so.1
#6  0x00007f042d55eb1b in ?? () from /lib/x86_64-linux-gnu/libexpat.so.1
#7  0x00007f042d560b5d in XML_ParseBuffer () from
/lib/x86_64-linux-gnu/libexpat.so.1
#8  0x0000000000409953 in process_xml (d=0x618900,
    buf=0x792360 "<?xml version=\"1.0\" encoding=\"ISO-8859-1\"
standalone=\"yes\"?>\n<!DOCTYPE GANGLIA_XML [\n   <!ELEMENT GANGLIA_XML
(GRID|CLUSTER|HOST)*>\n      <!ATTLIST GANGLIA_XML VERSION CDATA
#REQUIRED>\n      <!ATTLIST"...) at process_xml.c:1271
#9  0x000000000040556b in data_thread (arg=0x618900) at data_thread.c:288
#10 0x00007f042d131b50 in start_thread () from
/lib/x86_64-linux-gnu/libpthread.so.0
#11 0x00007f042ce7be6d in clone () from /lib/x86_64-linux-gnu/libc.so.6
#12 0x0000000000000000 in ?? ()

So it looks like the original backtrace I sent was the wrong one, as
opposed to the 3.3.8 debug build one

On Wed, Sep 17, 2014 at 9:25 AM, Devon H. O'Dell <devon.od...@gmail.com>
wrote:

> Regardless of whether this is 3.3.8 or 3.6.0, the offending line is:
>
> WRITE_LOCK(hash, i);
>
> I was going to guess this was 3.6.0 because it's a different
> backtrace, however the line number in process_xml.c doesn't make sense
> unless it is 3.3.8. What this implies is that the hash table is not
> properly protected by its mutex.
>
> There are 339 commits between 3.3.8 and the current master branch. I'd
> like to heavily suggest updating because I unfortunately do not have
> time to look through all the commit messages to see if this has been
> solved by work others have done.
>
> --dho
>
------------------------------------------------------------------------------
Slashdot TV.  Video for Nerds.  Stuff that Matters.
http://pubads.g.doubleclick.net/gampad/clk?id=160591471&iu=/4140/ostg.clktrk
_______________________________________________
Ganglia-general mailing list
Ganglia-general@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/ganglia-general

Reply via email to