Hi Rohan,

Thank you for the reply. Somehow the issue resolved itself without my intervention. When I retry the same steps now, the Python error no longer appears, i.e. I am able to restart the Python shell. The reason is that the access rights on /dev/ipmi0 have changed:

$ ls -lah /dev/ipmi0
crw-rw-rw- 1 root root 245, 0 Nov  5 10:48 /dev/ipmi0
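
The same check can be scripted. This is a minimal sketch, assuming Python 3 on Linux; the helper name describe_access is made up for illustration and is not part of DMTCP. It reports the mode string that ls shows and whether the current user could open the path for reading and writing.

```python
import os
import stat


def describe_access(path):
    """Return (mode string, True if the current user may read and write path)."""
    st = os.stat(path)
    mode = stat.filemode(st.st_mode)  # same format as the first column of `ls -l`
    can_open_rw = os.access(path, os.R_OK | os.W_OK)
    return mode, can_open_rw


if __name__ == "__main__":
    # /dev/ipmi0 is the device from this thread; it may not exist elsewhere.
    path = "/dev/ipmi0"
    if os.path.exists(path):
        print(path, *describe_access(path))
    else:
        print(path, "is not present on this machine")
```

With the old mode (crw-rw---- root root) an unprivileged user outside the root group would get False here, and with the new mode (crw-rw-rw-) True.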

For the sake of completeness, I will still answer your questions in case they are useful.

1. I never run Python as the root user on this machine, simply because I have no root access.

2. The DMTCP version is 2.4.4.

3. I think this is some Bull Linux, but the system release file reports the following:
$ cat /etc/system-release
Red Hat Enterprise Linux Server release 6.4 (Santiago)

Open file descriptors before checkpoint:

$ ls -l /proc/$(ps x | grep -e python | grep -v grep | awk '{print $1}')/fd
total 0
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 0 -> /dev/pts/8
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 1 -> /dev/pts/8
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 2 -> /dev/pts/8
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 5 -> /dev/ipmi0
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 7 -> /dev/ipmi0
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 821 -> socket:[34528195]
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 827 -> /dev/pts/8
l-wx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 828 -> /tmp/dmtcp-s9951545@taurusi5591/jassertlog.4b3242428f3a397f-40000-56d5522b_python
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 831 -> /tmp/dmtcp-s9951545@taurusi5591/dmtcpSharedArea.4b3242428f3a397f-40000-56d5522b.56d5522b9

After checkpoint:

$ ls -l /proc/$(ps x | grep -e python | grep -v grep | awk '{print $1}')/fd
total 0
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 0 -> /dev/pts/8
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 1 -> /dev/pts/8
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 2 -> /dev/pts/8
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 5 -> /dev/ipmi0
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 7 -> /dev/ipmi0
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 821 -> socket:[34528195]
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 827 -> /dev/pts/8
l-wx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 828 -> /tmp/dmtcp-s9951545@taurusi5591/jassertlog.4b3242428f3a397f-40000-56d5522b_python
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 831 -> /tmp/dmtcp-s9951545@taurusi5591/dmtcpSharedArea.4b3242428f3a397f-40000-56d5522b.56d5522b9

And after restart:

$ ls -l /proc/$(ps x | grep -e python | grep -v grep | awk '{print $1}')/fd
total 0
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 0 -> /dev/pts/6
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 1 -> /dev/pts/6
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 2 -> /dev/pts/6
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 5 -> /dev/ipmi0
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 7 -> /dev/ipmi0
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 821 -> socket:[34528397]
lrwx------ 1 s9951545 p_ffmk 64 Mar  1 09:26 827 -> /dev/pts/6
lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 831 -> /tmp/dmtcp-s9951545@taurusi5591/dmtcpSharedArea.4b3242428f3a397f-40000-56d5522b.56d552486
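
Comparing these listings by eye is error-prone, so here is a rough sketch of how the comparison could be automated. It assumes Python 3 on Linux with /proc mounted; snapshot_fds and diff_fds are hypothetical helper names, not DMTCP tools.

```python
import os


def snapshot_fds(pid):
    """Map each open descriptor number of `pid` to its target, read from /proc."""
    fd_dir = f"/proc/{pid}/fd"
    targets = {}
    for name in os.listdir(fd_dir):
        try:
            targets[int(name)] = os.readlink(os.path.join(fd_dir, name))
        except OSError:
            # The descriptor was closed between listdir() and readlink().
            continue
    return targets


def diff_fds(before, after):
    """Return descriptors that were closed, opened, or retargeted between snapshots."""
    closed = {fd: before[fd] for fd in before.keys() - after.keys()}
    opened = {fd: after[fd] for fd in after.keys() - before.keys()}
    changed = {
        fd: (before[fd], after[fd])
        for fd in before.keys() & after.keys()
        if before[fd] != after[fd]
    }
    return closed, opened, changed
```

Applied to the listings above, the before and after-checkpoint snapshots are identical. After restart, descriptors 0, 1, 2 and 827 point to /dev/pts/6 instead of /dev/pts/8, the socket on 821 has a new inode, the jassertlog descriptor 828 is gone, and the dmtcpSharedArea file on 831 has a different suffix. Descriptors 5 and 7 still point to /dev/ipmi0.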



On 02/29/2016 06:13 PM, Rohan Garg wrote:
Hi Maksym,

This looks like a strange issue. I have some questions about your setup.

  - Do you launch your python interpreter with sudo privileges or as the
    root user?
  - What python version are you using? What DMTCP version are you using?
  - What distro are you using?

At restart time, DMTCP tries to restore file connections that the
process had opened at checkpoint time. I'm not sure why it's trying
to open '/dev/ipmi0' on restart.  Can you share the output of the
following command: ls -l /proc/<PID>/fd prior to checkpointing?
(Here PID is the process id of the python interpreter that you launch
under DMTCP.) This will help us identify if for some strange reason
the python interpreter opens /dev/ipmi0 on your setup.

Thanks,
Rohan

On Feb 1, 2016, at 3:39 AM, Maksym Planeta <mplan...@os.inf.tu-dresden.de> wrote:

Hello,

I'm trying to set up DMTCP. I installed it and launched the coordinator. Then I 
launched a Python interpreter, created a variable, switched to the coordinator, 
initiated a checkpoint, and killed all coordinator clients with the "k" command.

After this, the Python interpreter was terminated and several new files appeared 
in the directory where the coordinator was running.

Next I wanted to restart the interpreter. I still had my coordinator open, so I 
decided to use dmtcp_restart to launch Python again:

dmtcp_restart ckpt_*.dmtcp

But this resulted in the following error report:

[40000] ERROR at fileconnection.cpp:863 in openFile; REASON='JASSERT(fd != -1) failed'
     _path = /dev/ipmi0
     (strerror((*__errno_location ()))) = Permission denied

I have this file:

$ ls /dev/ipmi0  -lah
crw-rw---- 1 root root 245, 0 Nov  4 10:22 /dev/ipmi0

But I don't have root permissions to change the access rights on the file. 
Could you tell me what I can do about this? And why does DMTCP try to access a 
file that the interpreter was never allowed to access?
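
One possible explanation for the last question, offered as an assumption and not confirmed anywhere in this thread: on Linux, permissions are checked when a path is opened, not on each later use of the descriptor. A process can therefore hold a descriptor for a path it could not open itself, for example one inherited from a parent process that had the rights. Reopening that path later would then fail with "Permission denied". The sketch below uses a temporary file as a stand-in for the device; fd_survives_chmod is a hypothetical name.

```python
import os
import tempfile


def fd_survives_chmod():
    """Show that an open descriptor keeps working after the path becomes unopenable."""
    fd, path = tempfile.mkstemp()          # opened read/write by the current user
    try:
        os.write(fd, b"data")
        os.chmod(path, 0o000)              # nobody may open the path any more
        os.lseek(fd, 0, os.SEEK_SET)
        still_readable = os.read(fd, 4) == b"data"  # the existing fd is unaffected
        try:
            os.close(os.open(path, os.O_RDONLY))
            reopen_ok = True               # expected only for a privileged user
        except PermissionError:
            reopen_ok = False
        return still_readable, reopen_ok
    finally:
        os.close(fd)
        os.unlink(path)
```

For an unprivileged user, the read through the existing descriptor succeeds while the reopen by path fails.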

--
Regards,
Maksym Planeta

Dmtcp-forum mailing list
Dmtcp-forum@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum


--
Regards,
Maksym Planeta
