Hi Guchun

From what you are saying, you've compiled for 64 bits and you're trying to run on 32 bits. You need to make your build and execute environments consistent.
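One way to confirm the mismatch is to read the ELF header of the moses binary and compare it with what each exec node reports. The following is only a minimal sketch in Python, not something that ships with Moses or SGE; the helper names elf_info and describe are made up for illustration, it assumes a Linux ELF binary, and it only recognises a handful of machine codes.

```python
import platform
import struct

ELF_MAGIC = b"\x7fELF"
# e_ident[EI_CLASS]: 1 = 32-bit objects, 2 = 64-bit objects
ELF_CLASSES = {1: 32, 2: 64}
# A few common e_machine values from the ELF specification
ELF_MACHINES = {3: "x86 (i386)", 62: "x86-64", 40: "ARM", 183: "AArch64"}


def elf_info(header: bytes):
    """Return (bits, machine_name) for an ELF header, or None if not ELF."""
    if len(header) < 20 or header[:4] != ELF_MAGIC:
        return None
    bits = ELF_CLASSES.get(header[4])
    # e_ident[EI_DATA]: 1 = little-endian, 2 = big-endian
    endian = "<" if header[5] == 1 else ">"
    # e_machine is a 16-bit field at offset 18 in both ELF32 and ELF64
    (machine,) = struct.unpack_from(endian + "H", header, 18)
    return bits, ELF_MACHINES.get(machine, f"unknown ({machine})")


def describe(path: str) -> str:
    """Summarise a binary's ELF class next to the local machine type."""
    with open(path, "rb") as f:
        info = elf_info(f.read(20))
    if info is None:
        return f"{path}: not an ELF binary"
    bits, machine = info
    host = platform.machine()  # e.g. "i686" or "x86_64"
    return f"{path}: {bits}-bit ELF for {machine}; this host reports {host}"
```

Running print(describe("/home/guchun/Work/mosesdecoder/moses-cmd/src/moses")) on the head node and on each exec node should show whether a 64-bit binary is being handed to a node that reports i686. The standard file command gives the same information if it is installed.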
cheers - Barry

On Wednesday 16 November 2011 15:27:28 Guchun Zhang wrote:
> Hi Miles,
>
> The gcc versions on two exec nodes are different. bunix-server has 4.4.5
> and the other 4.6.1. Both are 32 bits Ubuntu Desktop, one 10.10 and the
> other 11.10. The headnode is installed with 64 bits Ubuntu Server 11.10.
> The headnode has also 4.6.1 gcc.
>
> I know Moses is written for 32 bits systems and can be compiled on 64 bits
> systems. I don't know whether this will change the code to 64 bits.
>
> I just rechecked the out.job12017-aa/b files. They are a bit different. I
> put them all together here for comparison.
>
> out.job12017-aa:
> Linux bunix-server 2.6.35-30-generic #60-Ubuntu SMP Mon Sep 19 20:45:08 UTC 2011 i686 GNU/Linux
> ulimit: Command not found.
> /home/guchun/Work/mosesdecoder/moses-cmd/src/moses: Exec format error. Wrong Architecture.
> Newline in variable name.
>
> bunix-server is Ubuntu 10.10.
>
> out.job12017-ab:
> Warning: no access to tty (Bad file descriptor).
> Thus no job control in this shell.
> Linux guchun-VirtualBox 3.0.0-12-generic #20-Ubuntu SMP Fri Oct 7 14:50:42 UTC 2011 i686 athlon i386 GNU/Linux
> ulimit: Command not found.
> /home/guchun/Work/mosesdecoder/moses-cmd/src/moses: Exec format error. Binary file not executable.
> exit status 1
> mv: cannot stat `/home/guchun/Work/tasks/ro-en/tuning-sge/tmp12017/run1.best100.out.split12017-ab': No such file or directory
> exit status 1
> exit status 0
>
> guchun-VirtualBox is Ubuntu 11.10, installed in a VirtualBox on
> bunix-server.
>
> The error messages are different. Is this caused by the different version
> of Ubuntu? Or something more profound?
>
> Cheers,
>
> Guchun
>
> On 16 November 2011 12:21, Miles Osborne <[email protected]> wrote:
> > check the gcc version on the slaves.
> > it looks like eg you are running
> > 64 code on a 32 bit machine
> >
> > Miles
> >
> > On 16 November 2011 12:15, Guchun Zhang <[email protected]> wrote:
> > > Hi Barry,
> > > In out.job12017-aa,
> > > Linux bunix-server 2.6.35-30-generic #60-Ubuntu SMP Mon Sep 19 20:45:08 UTC 2011 i686 GNU/Linux
> > > ulimit: Command not found.
> > > /home/guchun/Work/mosesdecoder/moses-cmd/src/moses: Exec format error. Wrong Architecture.
> > > Newline in variable name.
> > > bunix-server is the hostname of the execution node. Complaints are similar
> > > in out.job12017-ab (run on another node), too.
> > > Cheers,
> > > Guchun
> > >
> > > On 16 November 2011 09:21, Barry Haddow <[email protected]> wrote:
> > >> Hi Guchun
> > >>
> > >> The mert.out file doesn't help that much. Is there any more
> > >> information in the err and out files?
> > >> eg
> > >> /home/guchun/Work/tasks/ro-en/tuning-sge/out.job12017-aa
> > >> /home/guchun/Work/tasks/ro-en/tuning-sge/err.job12017-aa
> > >>
> > >> cheers - Barry
> > >>
> > >> On Tuesday 15 Nov 2011 22:01:41 Guchun Zhang wrote:
> > >> > Hi there,
> > >> >
> > >> > I am trying to tune on a SGE cluster.
> > >> > I ran the following command on the
> > >> > head node,
> > >> >
> > >> > /home/guchun/Work/moses-scripts/scripts-20111111-1703/training/mert-moses.pl \
> > >> > /home/guchun/Work/tasks/ro-en/corpus/euparl.lc.ro \
> > >> > /home/guchun/Work/tasks/ro-en/corpus/euparl.lc.en \
> > >> > /home/guchun/Work/mosesdecoder/moses-cmd/src/moses \
> > >> > /home/guchun/Work/tasks/ro-en/trained/model/moses.ini \
> > >> > --mertdir /home/guchun/Work/mosesdecoder/mert/ \
> > >> > --rootdir /home/guchun/Work/moses-scripts/scripts-20111111-1703/ \
> > >> > --working-dir /home/guchun/Work/tasks/ro-en/tuning-sge/ \
> > >> > --jobs 2 --decoder-flag "-v 0" >& /home/guchun/Work/tasks/ro-en/tuning-sge/mert.out &
> > >> >
> > >> > I got the following error,
> > >> >
> > >> > check_exit_status
> > >> > check_exit_status of job -aa
> > >> > check_exit_status of job -ab
> > >> > *wc: euparl.lc.ro.split12017-aa.trans: No such file or directory*
> > >> > *Split (-aa) were not entirely translated*
> > >> > outputN= inputN=11966
> > >> > outputfile=euparl.lc.ro.split12017-aa.trans
> > >> > inputfile=euparl.lc.ro.split12017-aa
> > >> > *Split (-ab) were not entirely translated*
> > >> > outputN=0 inputN=11966
> > >> > outputfile=euparl.lc.ro.split12017-ab.trans
> > >> > inputfile=euparl.lc.ro.split12017-ab
> > >> > *everything crashed, not trying to resubmit jobs*
> > >> > *Got interrupt or something failed.*
> > >> > kill_all_and_quit
> > >> > qdel 56
> > >> > Executing: qdel 56
> > >> > Exit code: 1
> > >> > qdel 57
> > >> > Executing: qdel 57
> > >> > Exit code: 1
> > >> > Translation was not performed correctly
> > >> > or some of the submitted jobs died.
> > >> > qdel function was called for all submitted jobs
> > >> > Exit code: 1
> > >> > The decoder died. CONFIG WAS -w -0.322581 -lm 0.161290 -d 0.193548 -tm 0.064516 0.064516 0.064516 0.064516 0.064516
> > >> >
> > >> > Any clue what may cause the problem?
> > >> > I have also attached the output file
> > >> > (mert.out) for full inspection.
> > >> >
> > >> > Everything runs fine in serial execution (without --job 2).
> > >> >
> > >> > I wonder if this can attribute to my SGE configuration. So if possible,
> > >> > could you please also give some advice on the parameter configuration of
> > >> > SGE?
> > >> >
> > >> > Many thanks in advance,
> > >> >
> > >> > Guchun
> > >
> > > --
> > >
> > > Guchun Zhang
> > >
> > > Localization Engineer
> > > Alpha CRC Ltd | Cambridge, UK
> > > Direct: +44 1223 431035
> > > [email protected]

--
The University of Edinburgh is a charitable body, registered in
Scotland, with registration number SC005336.
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
