-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Dear Roger,
apart from Francois' suggestion using strace you can also check the space available in /tmp, the amount of memory available on these machines (using the command 'free') and run memcheck to test the memories health. Regards, Tim On 03/07/2013 06:47 PM, Roger Rowlett wrote: > OK, here is a strange one: I have 4 client machines (3 absolutely > identical hardware) all running ccp4i from a central server and 2 > of the 4 machines (always the same 2) will fail when running a > certain scala job with a "child killed" error. I can run the same > job on the 2 "good" machines. All the machines call the same ccp4 > source file in .tcshrc. I'm stumped. It's probably some sort of > cryptic permissions error or something like that. Anyone seen > anything like this and have some clue as to the origin of the > crash? AFAIK, it only happens running scala in 6.3.0. The home > directories for users are served via NFS, but that should be the > same for all machines. > > Log file follows: > > > <!-- CCP4 HTML LOGFILE --> <pre>#CCP4I VERSION CCP4Interface 2.2.0 > #CCP4I SCRIPT LOG scala #CCP4I DATE 05 Mar 2013 14:28:22 #CCP4I > USER xrdcamp4 #CCP4I PROJECT chem385 #CCP4I JOB_ID 4 #CCP4I SCRATCH > /tmp/xrdcamp4 #CCP4I HOSTNAME malagueta #CCP4I PID 18904 </pre> > > *************************************************************************** > > * Information from CCP4Interface script > *************************************************************************** > > The program run with command: /usr/local/xtal/ccp4-6.3.0/bin/scala HKLIN > "/home/xrdcamp4/chem385/Chem385-2013Srefinaloutput.mtz" HKLOUT > "/tmp/xrdcamp4/chem385_4_1_mtz.tmp" SCALES > "/home/xrdcamp4/chem385/chem385_4.scala" ROGUES > "/home/xrdcamp4/chem385/chem385_4_rogues.log" NORMPLOT > "/home/xrdcamp4/chem385/chem385_4_normplot.xmgr" ANOMPLOT > "/home/xrdcamp4/chem385/chem385_4_anomplot.xmgr" PLOT > "/home/xrdcamp4/chem385/chem385_4_surface_plot.plt" CORRELPLOT > "/home/xrdcamp4/chem385/chem385_4_correlplot.xmgr" ROGUEPLOT > "/home/xrdcamp4/chem385/chem385_4_rogueplot.xmgr" has failed with > error message child killed: kill signal > *************************************************************************** > > > > #CCP4I TERMINATION STATUS 0 "child killed: kill signal" #CCP4I > TERMINATION TIME 05 Mar 2013 14:28:22 #CCP4I MESSAGE Task failed > > - -- - -- Dr Tim Gruene Institut fuer anorganische Chemie Tammannstr. 4 D-37077 Goettingen GPG Key ID = A46BEE1A -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.12 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iD8DBQFROc0bUxlJ7aRr7hoRApxTAJ48mJWrv6x5f4/6AEVjWXFoUjpjEQCg2rlO SDsullZAVEGfjRbIoJuVOkw= =euL4 -----END PGP SIGNATURE-----
