Basically you need to get the core dump of the segfault.
Make sure the core size of your system is unlimited:

ulimit -c will show you the size limit of the core file, and
ulimit -c unlimited sets it to be unlimited.

You probably need to set the following environment variable:

export DMTCP_ABORT_ON_FAILED_ASSERT=1
which tells dmtcp to generate a core when an assertion fails.

Once you have the core file, you can use gdb to get the backtrace:

gdb -c <core file>, or
gdb <your original binary> <core file>

On Tue, Jul 25, 2017 at 03:56:09PM -0400, Stas Vernon wrote:
> Hi Jiajun,
> 
> Its my own c++ code, thats using MPI library and some other libraries (e.g.
> trillinos). It requires installation of several libraries for run.
> Could you please advise how can I get backtrace of the segfault?
> Thank you.
> 
> On Mon, Jul 24, 2017 at 1:12 PM, Jiajun Cao <jia...@ccs.neu.edu> wrote:
> 
> > What application were you running?  If it is difficult to share the
> > information of the binary, can you send us the backtrace of the
> > segfault?
> >
> > Best,
> > Jiajun
> >
> > On Mon, Jul 24, 2017 at 12:31:57AM -0400, Stas Vernon wrote:
> > > Hi guys,
> > > I'm getting 'Segmentation fault (core dumped)' error when running (in the
> > > very beginning):
> > > ./dmtcp_launch mycode
> > > Tried both in Virtualbox linux (with master version from zip file) and in
> > > interactive session on our cluster (i.e. no scheduler, just from
> > terminal,
> > > with 2.5.1 version) - same error. In both cases I didn't use mpirun that
> > is
> > > without MPI. Tried with mpirun also - same error.
> > > Tried for test:
> > > ./dmtcp_launch ls
> > > - works.
> > > Is there a way to troubleshoot? Thinking of Valgrind, though not sure
> > what
> > > should be the order of valgrind and dmtp_launch in command line.
> > > Thank you.
> > >
> > > On Fri, Jul 21, 2017 at 10:28 PM, Rohan Garg <rohg...@ccs.neu.edu>
> > wrote:
> > >
> > > > The PR 454 did make it to the 2.5.0 release but the PR 510 didn't. It
> > > > should be a part of the next release -- 2.5.1.
> > > >
> > > > You could try the 2.5.0 release to see if it works. PR 510 was for
> > > > a bug that someone had reported with their specific PBS environment;
> > > > it shouldn't affect most common PBS setups. In order to get the
> > > > patch corresponding to PR 510, you'd have to get the source directly
> > > > from the 2.5 git branch. In either case, if see any issues, please
> > > > feel free to write back to us.
> > > >
> > > > On Fri, Jul 21, 2017 at 10:18:56PM -0400, Stas Vernon wrote:
> > > > > Thank you guys for reply!
> > > > > From Gene's email it seems that current stable release 2.5.0 doesn't
> > > > > contain the code fixing bug that Rohan describe, is this correct?
> > > > > I installed 2.5.0 stable release only. Should I install the developer
> > > > > version instead or wait for next stable release?
> > > > >
> > > > > On Fri, Jul 21, 2017 at 9:23 PM, Gene Cooperman <g...@ccs.neu.edu>
> > > > wrote:
> > > > >
> > > > > > Hi Rohan,
> > > > > >     Thanks for your reply.
> > > > > >     Should we add your code to the upcoming release of DMTCP 2.5.1
> > > > > > with a label "experimental" in the README, and then say in the FAQ
> > > > > > that we now support PBS on an experimental basis, with comments
> > > > > > welcome?
> > > > > >     We've been delayed in getting the 2.5.1 release out, just due
> > > > > > to lack of our time, but we definitely owe people the enhancements
> > > > > > and bug fixes.  In one week, I will have more time available.
> > > > > >
> > > > > > Best,
> > > > > > - Gene
> > > > > >
> > > > > > ----- Original Message -----
> > > > > > From: "Rohan Garg" <rohg...@ccs.neu.edu>
> > > > > > To: "Stas Vernon" <stasvern...@gmail.com>
> > > > > > Cc: dmtcp-forum@lists.sourceforge.net
> > > > > > Sent: Friday, July 21, 2017 6:29:17 PM
> > > > > > Subject: Re: [Dmtcp-forum] PBS scheduler support
> > > > > >
> > > > > > Hi Stas,
> > > > > >
> > > > > > You should be able to use DMTCP with PBS. We have worked with the
> > > > > > PBS developers in the past to squash a few bugs related to
> > DMTCP/PBS
> > > > > > interaction (for example, see: https://github.com/dmtcp/
> > dmtcp/pull/454
> > > > ,
> > > > > > and https://github.com/dmtcp/dmtcp/pull/510). The reason we don't
> > list
> > > > > > it on the website is that the DMTCP/PBS integration has been tested
> > > > > > well enough.
> > > > > >
> > > > > > You can view my PBS scripts here:
> > > > > >
> > > > > >  - https://github.com/rohgarg/dmtcp-job-scripts/blob/master/
> > > > > > dmtcp_pbs_launch.job
> > > > > >  - https://github.com/rohgarg/dmtcp-job-scripts/blob/master/
> > > > > > dmtcp_pbs_restart.job
> > > > > >
> > > > > > Please let us know, if you face any issues.
> > > > > >
> > > > > > Thanks,
> > > > > > Rohan
> > > > > >
> > > > > > On Thu, Jul 20, 2017 at 10:02:11PM -0400, Stas Vernon wrote:
> > > > > > > Hello,
> > > > > > > I'm new to DMTCP. Our cluster uses PBS scheduler, my code
> > (written in
> > > > > > C++)
> > > > > > > uses MPI. A job is usually ran on several nodes.
> > > > > > > FAQ says 'DMTCP supports SLURM and Torque', but PBS  not
> > mentioned.
> > > > > > > Does that mean I cannot use it with PBS?
> > > > > > > Or if still possible, could smbd post a simple example script for
> > > > PBS?
> > > > > > > Thank you very much!
> > > > > >
> > > > > > > ------------------------------------------------------------
> > > > > > ------------------
> > > > > > > Check out the vibrant tech community on one of the world's most
> > > > > > > engaging tech sites, Slashdot.org! http://sdm.link/slashdot
> > > > > >
> > > > > > > _______________________________________________
> > > > > > > Dmtcp-forum mailing list
> > > > > > > Dmtcp-forum@lists.sourceforge.net
> > > > > > > https://lists.sourceforge.net/lists/listinfo/dmtcp-forum
> > > > > >
> > > > > >
> > > > > > ------------------------------------------------------------
> > > > > > ------------------
> > > > > > Check out the vibrant tech community on one of the world's most
> > > > > > engaging tech sites, Slashdot.org! http://sdm.link/slashdot
> > > > > > _______________________________________________
> > > > > > Dmtcp-forum mailing list
> > > > > > Dmtcp-forum@lists.sourceforge.net
> > > > > > https://lists.sourceforge.net/lists/listinfo/dmtcp-forum
> > > > > >
> > > >
> >
> > > ------------------------------------------------------------
> > ------------------
> > > Check out the vibrant tech community on one of the world's most
> > > engaging tech sites, Slashdot.org! http://sdm.link/slashdot
> >
> > > _______________________________________________
> > > Dmtcp-forum mailing list
> > > Dmtcp-forum@lists.sourceforge.net
> > > https://lists.sourceforge.net/lists/listinfo/dmtcp-forum
> >
> >

------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Dmtcp-forum mailing list
Dmtcp-forum@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum

Reply via email to