On 9 May 2017, at 05:02, Robert Kopp <[email protected]> wrote:

> I get the impression that my attachments did not come through, so I am 
> including them here as inline text, which I hope is readable. I would like 
> to understand why I can't run the Simplified Tutorial with more than one 
> thread. 
> 
> static_tov/SIMFACTORY/run/RunScript
> #! /bin/bash
> 
> echo "Preparing:"
> set -x                          # Output commands
> set -e                          # Abort on errors
> 
> cd @RUNDIR@-active
> 
> echo "Checking:"
> pwd
> hostname
> date
> 
> echo "Environment:"
> export CACTUS_NUM_PROCS=@NUM_PROCS@
> export CACTUS_NUM_THREADS=@NUM_THREADS@
> export GMON_OUT_PREFIX=gmon.out
> export OMP_NUM_THREADS=@NUM_THREADS@
> env > SIMFACTORY/ENVIRONMENT
> 
> echo "Starting:"
> export CACTUS_STARTTIME=$(date +%s)
> #mpd --daemon
> mpirun -np @NUM_PROCS@ @EXECUTABLE@ -L 3 @PARFILE@
> 
> echo "Stopping:"
> date
> 
> echo "Done."
> 
> static_tov/output-0002/SIMFACTORY/RunScript
> #! /bin/bash
> 
> echo "Preparing:"
> set -x                          # Output commands
> set -e                          # Abort on errors
> 
> cd /home/robert/simulations/static_tov/output-0002-active
> (Exports the wrong stuff. Changes to machine file had already been made.)
> 
> echo "Checking:"
> pwd
> hostname
> date
> 
> echo "Environment:"
> export CACTUS_NUM_PROCS=1
> export CACTUS_NUM_THREADS=1
> export GMON_OUT_PREFIX=gmon.out
> export OMP_NUM_THREADS=1
> env > SIMFACTORY/ENVIRONMENT
> 
> echo "Starting:"
> export CACTUS_STARTTIME=$(date +%s)
> #mpd --daemon
> mpirun -np 1 /home/robert/simulations/static_tov/SIMFACTORY/exe/cactus_sim -L 3 /home/robert/simulations/static_tov/output-0002/static_tov_small.par
> 
> echo "Stopping:"
> date
> 
> echo "Done."
> 
> static_tov/SIMFACTORY/properties.ini
> [properties]
> machine         = robert-desktop
> simulationid    = simulation-static_tov-robert-desktop-robert-desktop-robert-2017.05.01-22.30.19-17921
> sourcedir       = /home/robert/Cactus
> configuration   = sim
> configid        = config-sim-robert-desktop-home-robert-Cactus
> buildid         = build-sim-robert-desktop-robert-2017.05.02-05.04.57-3349
> testsuite       = False
> executable      = /home/robert/simulations/static_tov/SIMFACTORY/exe/cactus_sim
> optionlist      = /home/robert/simulations/static_tov/SIMFACTORY/cfg/OptionList
> submitscript    = /home/robert/simulations/static_tov/SIMFACTORY/run/SubmitScript
> runscript       = /home/robert/simulations/static_tov/SIMFACTORY/run/RunScript
> parfile         = /home/robert/simulations/static_tov/SIMFACTORY/par/static_tov_small.par
> 
> 
> 
> machine.ini
> # Machine description (Changes made)
> nickname        = robert-desktop
> name            = robert-desktop
> location        = somewhere
> description     = Whatever
> status          = personal
> 
> # Access to this machine
> hostname        = robert-desktop
> aliaspattern    = ^generic\.some\.where$
> 
> # Source tree management
> sourcebasedir   = /home/robert
> optionlist      = /home/robert/Cactus/repos/simfactory2/mdb/optionlists/ubuntu.cfg
> submitscript    = generic.sub
> runscript       = /home/robert/Cactus/repos/simfactory2/mdb/runscripts/debian.sh
> make            = make
> basedir         = /home/robert/simulations
> ppn             = 4
> max-num-threads = 4
> num-threads     = 4
> nodes           = 1
> submit          = exec @SCRIPTFILE@ < /dev/null > /dev/null 2> /dev/null & echo $!
> getstatus       = ps @JOB_ID@
> stop            = kill @JOB_ID@
> submitpattern   = (.*)
> statuspattern   = "^ *@JOB_ID@ "
> queuedpattern   = $^
> runningpattern  = ^
> holdingpattern  = $^
> exechost        = echo localhost
> exechostpattern = (.*)
> stdout          = cat @SIMULATION_NAME@.out
> stderr          = cat @SIMULATION_NAME@.err
> stdout-follow   = tail -n 100 -f @SIMULATION_NAME@.out @SIMULATION_NAME@.err
> 
> 
> # modify parameter file for smaller memory footprint using sed
>  sed '/CoordBase::d[xyz]/s/8/12/' <par/static_tov.par >par/static_tov_small.par
>  # start simulation, watch log output
>  ./simfactory/bin/sim submit static_tov --parfile=par/static_tov_small.par --procs=4 --walltime=8:0:0
>  ./simfactory/bin/sim show-output --follow static_tov
> 
> Here I changed procs from 1 to 4, which gives an error message.
> _______________________________________________
> Users mailing list
> [email protected]
> http://cactuscode.org/mailman/listinfo/users


Hi Robert,

I notice that the files you sent are from output-0002.  Does it work if you 
delete the whole static_tov simulation and try again?  Submitting the same 
simulation repeatedly is meant for checkpoint/recovery, not for trying 
different versions of the simulation (many people do this, but I don't think 
it's a good idea).  I don't see the machine definition being stored in the 
top-level simulation directory, so logically this shouldn't make any 
difference, but software can surprise you!
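
A minimal sketch of what starting over could look like, assuming the `purge` subcommand is available in your SimFactory version (check `./simfactory/bin/sim --help`) and reusing the simulation name, parameter file and options from the quoted commands. The helper name `fresh_submit_commands` is hypothetical; it only prints the commands so that they can be reviewed before anything is deleted.

```shell
#!/bin/bash
# Sketch only: print the commands for a clean restart instead of running them.
# `purge` is assumed to delete the whole simulation, including every
# output-NNNN restart, so review the printed lines before executing them.

# Path of the sim script as used in the tutorial; override SIM if yours differs.
SIM=${SIM:-./simfactory/bin/sim}

# Hypothetical helper: simulation name, parameter file and total core count.
fresh_submit_commands() {
    local name=$1 parfile=$2 procs=$3
    echo "$SIM purge $name"
    echo "$SIM submit $name --parfile=$parfile --procs=$procs --walltime=8:0:0"
    echo "$SIM show-output --follow $name"
}

fresh_submit_commands static_tov par/static_tov_small.par 4
```

Printing rather than executing is a deliberate choice here: the first command is destructive, and the exact options may need adjusting for your installation.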

On the surface, what you observe appears to be a bug, because you have ppn=4 
and --procs=4, which should require only one node.  But I would like to check 
that it still happens if you start from a fresh simulation.
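
The one-node expectation is plain ceiling division: the cores requested divided by the cores per node, rounded up. The sketch below illustrates only that arithmetic; `nodes_needed` is a hypothetical helper, not SimFactory's actual allocation code, which may take further settings such as num-threads into account.

```shell
#!/bin/bash
# Hypothetical helper, not part of SimFactory: smallest number of nodes that
# can host `procs` cores when every node provides `ppn` cores.
nodes_needed() {
    local procs=$1 ppn=$2
    echo $(( (procs + ppn - 1) / ppn ))   # integer ceiling of procs / ppn
}

# Values from this thread: --procs=4 on a machine with ppn = 4.
nodes_needed 4 4   # prints 1
```

By the same arithmetic, a request for 5 cores with ppn = 4 would need two nodes, which is more than the nodes = 1 given in the machine file.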

I just tried it myself, and it works for me.

-- 
Ian Hinder
http://members.aei.mpg.de/ianhin
