Hi everybody,

We have an issue trying to run FSL through Condor. The thing is that the jobs don't start ever. Here is the output of condor_status:

Name OpSys Arch State Activity LoadAv Mem ActvtyTime

slot10@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:18 slot11@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:19 slot12@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:20 slot13@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:21 slot14@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:22 slot15@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:23 slot16@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:16 slot17@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:17 slot18@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:18 slot19@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:19 slot1@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+00:08:36 slot20@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:20 slot21@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:21 slot22@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:22 slot23@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:23 slot24@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:16 slot25@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:17 slot26@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:18 slot27@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:19 slot28@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:20 slot29@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:21 slot2@srvulx01 LINUX X86_64 Unclaimed Idle 1.000 7436 0+19:25:18 slot30@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:22 slot31@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:23 slot32@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:16 slot3@srvulx01 LINUX X86_64 Unclaimed Idle 0.950 7436 0+19:25:19 slot4@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:20 slot5@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:21 slot6@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:22 slot7@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:23 slot8@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:16 slot9@srvulx01 LINUX X86_64 Unclaimed Idle 0.000 7436 0+19:25:17 Total Owner Claimed Unclaimed Matched Preempting Backfill

        X86_64/LINUX    32     0       0        32       0 0        0

               Total    32     0       0        32       0 0        0


Here's the output of condor_q:

-- Submitter: srvulx01 : <127.0.0.1:42724> : srvulx01
 ID      OWNER            SUBMITTED     RUN_TIME ST PRI SIZE CMD

0 jobs; 0 completed, 0 removed, 0 idle, 0 running, 0 held, 0 suspended

This one shows that it should run even when the computer is not idle:

 condor_config_val START
TRUE


And here is the error output:

fmri.feat/log$ more design.e0
/usr/share/fsl/5.0/bin/fsl_sub -T 10 -l logs -N feat0_init /usr/share/fsl/5.0/bin/feat /tmp/feeds-oxford-jalapeno_linux_64-gcc4.1/feeds/results/fmri+.feat/design.fsf -D /tmp/feeds-oxford-jalapeno_linux_64-gcc4.1/feeds/results/fmri+.feat -I 1 -init
    while executing
"fsl:exec "${FSLDIR}/bin/feat ${fsfroot}.fsf -D $FD -I $session -init" -b 10 -N feat0_init -l logs "
    invoked from within
"if { $done_something == 0 } {

    if { ! $fmri(inmelodic) } {
    if { $fmri(level) == 1 } {
        #{{{ FEAT first-level analysis

for { set session 1 } ..."
    (file "/usr/share/fsl/5.0/bin/feat" line 207)

Any idea of what's going on here? One last thing, here is the script I use for submitting:

#!/bin/bash

unset FSLPARALLEL  # parallelization is not possible for submitted jobs

onm=allfsf.submit  # submit file for condor
memusg=4000        # expected memory usage for a single analysis

cdir=$(pwd)        # get the path to current working directory
#fsflst=`ls -1 $fsfdir/*.fsf`
fsflst=`ls -1 *.fsf`

if [ ! -d $cdir/log ] # create directory for condor log files
then
    mkdir $cdir/log
fi

# create header for the condor submit file
echo "Executable = $FSLDIR/bin/feat
Universe = vanilla
initialdir = $cdir
request_cpus = 1
request_memory = $memusg
getenv = True
" > $onm

# create a queue with each fsf file found in the current directory
for cfsf in $fsflst
do
    cstem=`basename "$cfsf" | sed -e 's/.fsf//g'`

    echo "arguments = $cfsf" >> $onm
    echo "error  = $cdir/log/$cstem.e\$(Process)" >> $onm
    echo "output = $cdir/log/$cstem.o\$(Process)" >> $onm
    echo "Queue" >> $onm
done

condor_submit $onm # this will submit and run the analyses

Thanks!!!


--

Eneko Perez
IT Manager
*BCAM -* Basque Center for Applied Mathematics
Alameda de Mazarredo, 14
E-48009 Bilbao, Basque Country - Spain
Tel. +34 946 567 842
[email protected] <mailto:[email protected]> | www.bcamath.org/perez <http://www.bcamath.org/perez>
*/
/*
*/(/*///matematika mugaz bestalde *)*/
_______________________________________________
Neurodebian-users mailing list
[email protected]
http://lists.alioth.debian.org/cgi-bin/mailman/listinfo/neurodebian-users

Reply via email to