Hi Rohan,thank you for the reply. Somehow the issue was solved without my intervention. When I retry the same thing now, the error with python vanishes, i. e. I'm able to restart python shell. And the reason for that is that the access right to /dev/ipmi0 have changed:
$ ls -lah /dev/ipmi0 crw-rw-rw- 1 root root 245, 0 Nov 5 10:48 /dev/ipmi0But for the sake of completeness I answer your questions, for the case if it still may be useful.
1. I never run python as root user on this machine, simply because I have no root access.
2. The dmtcp version is 2.4.4. 3. I think this is some Bull Linux, but system release reports following: $ cat /etc/system-release Red Hat Enterprise Linux Server release 6.4 (Santiago) Open file descriptors before checkpoint: $ ls -l /proc/$(ps x | grep -e python | grep -v grep | awk '{print $1}')/fd total 0 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 0 -> /dev/pts/8 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 1 -> /dev/pts/8 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 2 -> /dev/pts/8 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 5 -> /dev/ipmi0 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 7 -> /dev/ipmi0 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 821 -> socket:[34528195] lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 827 -> /dev/pts/8l-wx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 828 -> /tmp/dmtcp-s9951545@taurusi5591/jassertlog.4b3242428f3a397f-40000-56d5522b_python lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 831 -> /tmp/dmtcp-s9951545@taurusi5591/dmtcpSharedArea.4b3242428f3a397f-40000-56d5522b.56d5522b9
After checkpoint: $ ls -l /proc/$(ps x | grep -e python | grep -v grep | awk '{print $1}')/fd total 0 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 0 -> /dev/pts/8 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 1 -> /dev/pts/8 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 2 -> /dev/pts/8 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 5 -> /dev/ipmi0 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 7 -> /dev/ipmi0 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 821 -> socket:[34528195] lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 827 -> /dev/pts/8l-wx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 828 -> /tmp/dmtcp-s9951545@taurusi5591/jassertlog.4b3242428f3a397f-40000-56d5522b_python lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 831 -> /tmp/dmtcp-s9951545@taurusi5591/dmtcpSharedArea.4b3242428f3a397f-40000-56d5522b.56d5522b9
And after restart: $ ls -l /proc/$(ps x | grep -e python | grep -v grep | awk '{print $1}')/fd total 0 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 0 -> /dev/pts/6 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 1 -> /dev/pts/6 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 2 -> /dev/pts/6 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 5 -> /dev/ipmi0 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 7 -> /dev/ipmi0 lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 821 -> socket:[34528397] lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 827 -> /dev/pts/6lrwx------ 1 s9951545 p_ffmk 64 Mar 1 09:26 831 -> /tmp/dmtcp-s9951545@taurusi5591/dmtcpSharedArea.4b3242428f3a397f-40000-56d5522b.56d552486
On 02/29/2016 06:13 PM, Rohan Garg wrote:
Hi Maksym, This looks like a strange issue. I have some questions about your setup. - Do you launch your python interpreter with sudo privileges or as the root user? - What python version are you using? What DMTCP version are you using? - What distro are you using? At restart time, DMTCP tries to restore file connections that the process had opened at checkpoint time. I'm not sure why it's trying to open '/dev/ipmi0' on restart. Can you share the output of the following command: ls -l /proc/<PID>/fd prior to checkpointing? (Here PID is the process id of the python interpreter that you launch under DMTCP.) This will help us identify if for some strange reason the python interpreter opens /dev/ipmi0 on your setup. Thanks, RohanOn Feb 1, 2016, at 3:39 AM, Maksym Planeta <mplan...@os.inf.tu-dresden.de> wrote: Hello, I'm trying to setup DMTCP. I installed it and launch coordinator. Then I launched python interpreter, created a variable, switched to coordinator, initiated checkpoint, and killed all coordinator clients with "k" command. After this python interpreter was terminated and several new files appeared in the directory where coordinator was running. Next I wanted to restart the interpreter. I still had my coordinator open, so I decided to use dmtcp_restart to launch python again: dmtcp_restart ckpt_*.dmtcp But this resulted in following error report: [40000] ERROR at fileconnection.cpp:863 in openFile; REASON='JASSERT(fd != -1) failed' _path = /dev/ipmi0 (strerror((*__errno_location ()))) = Permission denied I have this file: $ ls /dev/ipmi0 -lah crw-rw---- 1 root root 245, 0 Nov 4 10:22 /dev/ipmi0 But I don't have root permissions to manipulate access rights over the file. Could you tell me what can I do about this? And why DMTCP tries to access a file which the interpreter was never allowed to access? -- Regards, Maksym Planeta ------------------------------------------------------------------------------ Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=267308311&iu=/4140_______________________________________________ Dmtcp-forum mailing list Dmtcp-forum@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dmtcp-forum
-- Regards, Maksym Planeta
smime.p7s
Description: S/MIME Cryptographic Signature
------------------------------------------------------------------------------ Site24x7 APM Insight: Get Deep Visibility into Application Performance APM + Mobile APM + RUM: Monitor 3 App instances at just $35/Month Monitor end-to-end web transactions and take corrective actions now Troubleshoot faster and improve end-user experience. Signup Now! http://pubads.g.doubleclick.net/gampad/clk?id=272487151&iu=/4140
_______________________________________________ Dmtcp-forum mailing list Dmtcp-forum@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/dmtcp-forum