Ok, this is the backtrace of the running processes. There are two processes
running:
0 S becsekba 54451 54421 0 80 0 - 76108 futex_ 12:39 pts/92 00:00:00
/opt/slurm/16.05.8/bin/srun -n 8 whale-dbg -i IMP/RunImpact2D.i
1 S becsekba 54477 54451 0 80 0 - 24908 pipe_w 12:39 pts/92 00
I think these also belong to the execution:
0 S becsekba 54421 52395 0 80 0 - 4438 wait 12:39 pts/92 00:00:00
/bin/bash /apps/daint/UES/xalt/0.7.6/bin/srun -n 8 whale-dbg -i
IMP/RunImpact2D.i
(gdb) bt
#0 0x2b641c273cec in waitpid () from /lib64/libc.so.6
#1 0x004297aa in
On Fri, 13 Jan 2017, Barna Becsek wrote:
> I think these also belong to the execution:
I think you've posted backtraces for every process *except* the
important ones. :-D
Shouldn't there be child processes of those srun processes? The
children would be the ones running *your* code; that's whe
Hmm, you are right but I cannot find a process that would correspond to the
children of any of these. These are all the processes for my username:
ps -elf | grep becsekba
4 S root 12129 17897 0 80 0 - 24867 SyS_po 14:35 ?00:00:00
sshd: becsekba [priv]
4 S becsekba 12131 1
On Fri, 13 Jan 2017, Barna Becsek wrote:
> Hmm, you are right but I cannot find a process that would correspond to the
> children of any of these.
Is this a cluster, by any chance? Where slurm is running on the login
node but the actual applications are running on separate compute
nodes?
If I