Hi Folks,
  We are using DMTCP 2.5.0.  We have an intermittent problem where the restart 
hangs.  I suspect there may be a critical difference between the machine/OS 
performing the save and the machine/OS performing the restart.  I have enabled 
-DDEBUG and get the following output.  We iterate, apparently endlessly, over 
socketwrappers.cpp:59 in socket; REASON='socket created', and 
socketconnection.cpp:227 in TcpConnection; REASON='Creating TcpConnection.'.  
It appears that the socket number is getting incremented each time.  Does this 
look familiar?

Regards,
   Drew



[53734] TRACE at threadlist.cpp:360 in checkpointhread; REASON='before callbackSleepBetweenCheckpoint(0)'
[53734] TRACE at dmtcpworker.cpp:465 in waitForStage1Suspend; REASON='running'
[53734] TRACE at dmtcpworker.cpp:424 in waitForCoordinatorMsg; REASON='waiting for SUSPEND message'
[53734] TRACE at execwrappers.cpp:686 in execl; REASON='execl() wrapper'
     path = /bin/sh
[53734] TRACE at execwrappers.cpp:585 in execv; REASON='execv() wrapper, calling execve with environ'
     path = /bin/sh

…some tool-related output, then

Checking license…
[51839] TRACE at socketwrappers.cpp:59 in socket; REASON='socket created'
     ret = 22
     domain = 2
     type = 2
     protocol = 0
[51839] TRACE at socketconnection.cpp:227 in TcpConnection; REASON='Creating TcpConnection.'
     id() = 5979407538cdd414-51839-ad1c9812b2c75(101182)
     domain = 2
     type = 2
     protocol = 0
[51839] TRACE at socketwrappers.cpp:59 in socket; REASON='socket created'
     ret = 22
     domain = 2
     type = 2
     protocol = 0
[51839] TRACE at socketconnection.cpp:227 in TcpConnection; REASON='Creating TcpConnection.'
     id() = 5979407538cdd414-51839-ad1c9812b2c75(101183)
     domain = 2
     type = 2
     protocol = 0
[51839] TRACE at socketwrappers.cpp:59 in socket; REASON='socket created'
     ret = 23
     domain = 2
     type = 2
     protocol = 0
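
To check whether the descriptor really grows on every iteration, the trace can be summarised with a small script. This is only a sketch: the regular expressions are an assumption about the debug line format, inferred from the output above, and socket_fds is a hypothetical helper name, not part of DMTCP.

```python
import re

# Assumed format of a trace header, inferred from the output above, e.g.
# "[51839] TRACE at socketwrappers.cpp:59 in socket; REASON='socket created'"
HEADER = re.compile(r"^\[(\d+)\] TRACE at (\S+):(\d+) in (\w+); REASON='(.*)")
# Assumed format of an indented "name = value" detail line.
FIELD = re.compile(r"^\s+(\S+) = (.*)$")


def socket_fds(lines):
    """Return the 'ret' value of every 'socket created' entry, in order."""
    fds = []
    in_socket_entry = False
    for line in lines:
        header = HEADER.match(line)
        if header:
            # Only detail lines that follow a 'socket created' header count.
            in_socket_entry = header.group(5).startswith("socket created")
            continue
        field = FIELD.match(line)
        if in_socket_entry and field and field.group(1) == "ret":
            fds.append(int(field.group(2)))
    return fds


# Excerpt copied from the trace above, including the mail-wrapped lines.
SAMPLE = """\
[51839] TRACE at socketwrappers.cpp:59 in socket; REASON='socket created'
     ret = 22
     domain = 2
     type = 2
     protocol = 0
[51839] TRACE at socketconnection.cpp:227 in TcpConnection; REASON='Creating 
TcpConnection.'
     id() = 5979407538cdd414-51839-ad1c9812b2c75(101182)
     domain = 2
     type = 2
     protocol = 0
[51839] TRACE at socketwrappers.cpp:59 in socket; REASON='socket created'
     ret = 22
     domain = 2
     type = 2
     protocol = 0
[51839] TRACE at socketconnection.cpp:227 in TcpConnection; REASON='Creating 
TcpConnection.'
     id() = 5979407538cdd414-51839-ad1c9812b2c75(101183)
     domain = 2
     type = 2
     protocol = 0
[51839] TRACE at socketwrappers.cpp:59 in socket; REASON='socket created'
     ret = 23
     domain = 2
     type = 2
     protocol = 0
"""

if __name__ == "__main__":
    print(socket_fds(SAMPLE.splitlines()))
```

Running the same function over the full log file (for example, socket_fds(open(path)) with path pointing at the saved trace) would show how quickly the descriptor numbers climb over the whole run.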



Andrew T. Lynch     |    Software Architect
T: 408.914.6875   M: 408.832.1045    www.cadence.com<http://www.cadence.com/>

_______________________________________________
Dmtcp-forum mailing list
Dmtcp-forum@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dmtcp-forum
