On 5/4/2012 9:37 AM, Jan de Kruyf wrote:
> Right, back from away.
> 
> John:
>          You answered my question quite adequately. And I referred to
> PREEMPT_RT when writing RT. Just my fingers are so thick.
> 
> All:
>         I seem to miss the stacktrace leading up to the crash in the
> reports. So, sadly,  there is little help I can offer since you guys are
> all so wrapped up in the idea that there is perhaps an underlying problem
> in the kernel or in glibc. 

I haven't been thinking there's a bug in glibc or the kernel.  While the
code is crashing in malloc() or (with electric fence) in vfprintf, this
is almost certainly due to some other bug (likely memory corruption)
that happened earlier in the linuxcnc code.
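
To illustrate why a crash inside malloc() usually points at an earlier
bug rather than at malloc() itself, here is a toy sketch in Python. It is
not glibc's real heap layout and not linuxcnc code: the one-byte length
header, the 16-byte arena, and the names toy_heap_demo and walk are all
assumptions made up for the example. A write that overruns one block
silently clobbers the next block's header, and the failure only shows up
later when the block list is walked.

```python
def toy_heap_demo():
    # Hypothetical arena layout: [len][payload...][len][payload...][0]
    # A 1-byte header stores the payload length; 0 terminates the list.
    arena = bytearray(16)
    arena[0] = 4   # block A: header at 0, payload at offsets 1..4
    arena[5] = 4   # block B: header at 5, payload at offsets 6..9
    arena[10] = 0  # terminator

    def walk():
        # Follow the headers, much as an allocator walks its block list.
        offsets, pos = [], 0
        while arena[pos] != 0:
            offsets.append(pos)
            pos += 1 + arena[pos]
        return offsets

    before = walk()

    # The actual bug: 5 bytes written into block A's 4-byte payload.
    # The extra byte lands on block B's header. Nothing fails here.
    data = b"\xff" * 5
    arena[1:1 + len(data)] = data

    # The failure appears only on the next walk, far from the bad write.
    try:
        walk()
        failed_later = False
    except IndexError:
        failed_later = True
    return before, failed_later
```

Calling toy_heap_demo() returns the intact block offsets [0, 5] from
before the overflow, plus a flag showing that the later walk failed. The
overflowing write itself raises nothing, which is the same pattern as a
segfault that surfaces in malloc() long after the corrupting store.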

After reviewing some code last night, I think the segfault I was seeing
when running electric fence may simply be an earlier manifestation of the
same problem behind the later malloc() segfault. Rather than implicating
the printing routines, it could indicate partial corruption of the malloc
structures (or some other memory) that occurs before the nasty crash
that happens later.

As for the lost mutex, that seems to be due to the process crashing,
rather than any fundamental problem with the code.
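
The stuck-mutex-after-crash behavior can be reproduced in isolation. The
sketch below is Python on a POSIX system, not the HAL code: a single byte
of anonymous shared memory stands in for the hal_data->mutex bit, and the
function name crash_while_holding_lock is made up for the example. A
child process sets the flag, then dies from SIGSEGV before it can clear
it, and the parent finds the flag still set.

```python
import mmap
import os
import signal


def crash_while_holding_lock():
    # One byte of anonymous shared memory plays the role of the lock flag.
    # On Unix, mmap defaults to MAP_SHARED, so parent and child share it.
    shared = mmap.mmap(-1, 1)
    shared[0] = 0

    pid = os.fork()
    if pid == 0:
        # Child: take the "lock", then crash before releasing it.
        shared[0] = 1
        signal.signal(signal.SIGSEGV, signal.SIG_DFL)
        os.kill(os.getpid(), signal.SIGSEGV)
        os._exit(0)  # not reached

    # Parent: reap the child and inspect the flag it left behind.
    _, status = os.waitpid(pid, 0)
    killed_by = os.WTERMSIG(status) if os.WIFSIGNALED(status) else None
    return killed_by, shared[0]
```

The function returns the signal that killed the child and the final
value of the flag. Nothing in a plain shared-memory flag releases it when
its holder dies, so a stuck bit by itself says little about whether the
locking code is correct.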

A summary of symptoms when things are broken, working backwards from the
last one observed:

* The hal_data->mutex bit gets stuck

* The rtapi_app process segfaults (in malloc())

* rtapi_print_msg called from rtapi_reset_pagefault_count triggers a
segfault when running electric fence.  This could indicate an issue with
the message print routines, or be another side effect of already
corrupted memory.

...that's about as far as I've gotten for things that I'm pretty certain
should *NOT* be happening.  Backing up further, I see the
rtapi_print_msg calls causing writes to the stderr _IO_STRUCT, but this
could be normal behavior.  I was just guessing at memory to watch for
corruption, and since prints to stderr seem to cause problems, watching
the stderr _IO_STRUCT seemed like a good idea.

-- 
Charles Steinkuehler
char...@steinkuehler.net

_______________________________________________
Emc-developers mailing list
Emc-developers@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/emc-developers