On Fri, Apr 24, 2015 at 1:12 AM, Gene Cooperman <g...@ccs.neu.edu> wrote:

>     Thanks for reporting this bug.

Thanks for quick reply.  I am not sure what is a bug?  As far as 2.4
goes I am probably doing something wrong so that probably isn't a bug.

I sent this before, but I did not send to the list, and my email
address was wrong so I would not see replies you sent. Sorry
for the second near-identical copy (last paragraph has new info)

> To be honest, when DMTCP was growing
> fast, we weren't paying enough attention to issues such as ctrl-C.

It sounds like you are saying that you know already know this behavior
exists in 2.2 and 2.3?  Or should I be able to build those and have
them work as one would want with control-c?  If this is already known
to be an issue that is a bit of a relief since at least I know what it
is and it was then fixed in 2.4.

>     If 2.4.0 rc2 sometimes segfaults on restart, we'd definitely like to
> reproduce that locally here, and fix it.
>     Here are some questions that will help us to analyze this:
> 1.  Which programming language interpreter were you testing on?

I tried python, R, and ocaml.

> 2.  On the netbook, was the Atom a 32-bit CPU or 64-bit?

It is an Atom N550, that is 64bit, but it is running under a 32 bit
Fedora with all code in 32 bit.  I assumed a vanilla configure would
detect this, which it seems to have, otherwise it would not have run
at all.   But I see now in the install there is an --enable-m32 option
discussed.  I assume this is just for mixed 32/64 bit use, but I tried
building that one, and had the same result, and also without the
default first make.  (I assume all three of these in this situation
actually did the same thing).

Stracing the run always has it stop in the same place, after a call to
set_thread_area.

I can try to run some gdb's or such if you say what you could use.

> 3.  You said that on the Fedora 20 machine with 2.4.0 rc2, the
>     restart would _sometimes_ segfault.  Given a single checkpoint image,
>     would that same image sometimes checkpoint on restart?  Or would
>     the same file always segfault on restart, and a new checkpoint image
>     might always restart?  (In other words, is there sometimes a
>     "bad ckpt image" that will always segfault on restart, or is it
>     that a single ckpt image will sometimes segfault on restart and
>     sometimes not?)

It is all with a single image.  Each single image starts most of the
time, and occasionally does not.   I would not put much stock in this,
since I might have some kind of RAM problem.   I had two random
crashes a few weeks ago, and although long memtest86 runs detected
nothing, who knows?  Nothing else is causing any symptoms at current.

Also (added in second message) I have since built 2.3.1 on
the 64 bit i3 540 Fedora 20, and it works all the time with
no segfaults on anything I throw at it, but it still has the
problem from 2.2, and the problem of the 32 bit 2.3 on
the Atom processor: that is, a control-c kills the process
outright instead of letting the interpreter handle the interrupt.

------------------------------------------------------------------------------
One dashboard for servers and applications across Physical-Virtual-Cloud 
Widest out-of-the-box monitoring support with 50+ applications
Performance metrics, stats and reports that give you Actionable Insights
Deep dive visibility with transaction tracing using APM Insight.
http://ad.doubleclick.net/ddm/clk/290420510;117567292;y
_______________________________________________
Dmtcp-forum mailing list
Dmtcp-forum@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum

Reply via email to