Hi Barry,

It could well be the problem. I will have a go with all-x86 later.

Cheers,

Guchun

On 16 November 2011 16:07, Barry Haddow <[email protected]> wrote:

> Hi Guchun
>
> From what you are saying, you've compiled for 64 bits and you're trying to
> run on 32 bits. You need to make your build and execute environments
> consistent,
>
> cheers - Barry
>
> On Wednesday 16 November 2011 15:27:28 Guchun Zhang wrote:
> > Hi Miles,
> >
> > The gcc versions on two exec nodes are different. bunix-server has 4.4.5
> > and the other 4.6.1. Both are 32 bits Ubuntu Desktop, one 10.10 and the
> > other 11.10. The headnode is installed with 64 bits Ubuntu Server 11.10.
> > The headnode has also 4.6.1 gcc.
> >
> > I know Moses is written for 32 bits systems and can be compiled on 64 bits
> > systems. I don't know whether this will change the code to 64 bits.
> >
> > I just rechecked the out.job12017-aa/b files. They are a bit different. I
> > put them all together here for comparison.
> >
> > out.job12017-aa:
> > Linux bunix-server 2.6.35-30-generic #60-Ubuntu SMP Mon Sep 19 20:45:08 UTC 2011 i686 GNU/Linux
> > ulimit: Command not found.
> > /home/guchun/Work/mosesdecoder/moses-cmd/src/moses: Exec format error. Wrong Architecture.
> > Newline in variable name.
> >
> > bunix-server is Ubuntu 10.10.
> >
> > out.job12017-ab:
> > Warning: no access to tty (Bad file descriptor).
> > Thus no job control in this shell.
> > Linux guchun-VirtualBox 3.0.0-12-generic #20-Ubuntu SMP Fri Oct 7 14:50:42 UTC 2011 i686 athlon i386 GNU/Linux
> > ulimit: Command not found.
> > /home/guchun/Work/mosesdecoder/moses-cmd/src/moses: Exec format error. Binary file not executable.
> > exit status 1
> > mv: cannot stat `/home/guchun/Work/tasks/ro-en/tuning-sge/tmp12017/run1.best100.out.split12017-ab': No such file or directory
> > exit status 1
> > exit status 0
> >
> > guchun-VirtualBox is Ubuntu 11.10, installed in a VirtualBox on
> > bunix-server.
> >
> > The error messages are different. Is this caused by the different version
> > of Ubuntu? Or something more profound?
> >
> > Cheers,
> >
> > Guchun
> >
> > On 16 November 2011 12:21, Miles Osborne <[email protected]> wrote:
> > > check the gcc version on the slaves. it looks like eg you are running
> > > 64 code on a 32 bit machine
> > >
> > > Miles
> > >
> > > On 16 November 2011 12:15, Guchun Zhang <[email protected]> wrote:
> > > > Hi Barry,
> > > > In out.job12017-aa,
> > > > Linux bunix-server 2.6.35-30-generic #60-Ubuntu SMP Mon Sep 19 20:45:08 UTC 2011 i686 GNU/Linux
> > > > ulimit: Command not found.
> > > > /home/guchun/Work/mosesdecoder/moses-cmd/src/moses: Exec format error. Wrong Architecture.
> > > > Newline in variable name.
> > > > bunix-server is the hostname of the execution node. Complaints are
> > > > similar in out.job12017-ab (run on another node), too.
> > > > Cheers,
> > > > Guchun
> > > >
> > > > On 16 November 2011 09:21, Barry Haddow <[email protected]> wrote:
> > > >> Hi Guchun
> > > >>
> > > >> The mert.out file doesn't help that much. Is there any more
> > > >> information in the err and out files?
> > > >> eg
> > > >> /home/guchun/Work/tasks/ro-en/tuning-sge/out.job12017-aa
> > > >> /home/guchun/Work/tasks/ro-en/tuning-sge/err.job12017-aa
> > > >>
> > > >> cheers - Barry
> > > >>
> > > >> On Tuesday 15 Nov 2011 22:01:41 Guchun Zhang wrote:
> > > >> > Hi there,
> > > >> >
> > > >> > I am trying to tune on a SGE cluster. I ran the following command
> > > >> > on the head node,
> > > >> >
> > > >> > /home/guchun/Work/moses-scripts/scripts-20111111-1703/training/mert-moses.pl \
> > > >> > /home/guchun/Work/tasks/ro-en/corpus/euparl.lc.ro \
> > > >> > /home/guchun/Work/tasks/ro-en/corpus/euparl.lc.en \
> > > >> > /home/guchun/Work/mosesdecoder/moses-cmd/src/moses \
> > > >> > /home/guchun/Work/tasks/ro-en/trained/model/moses.ini \
> > > >> > --mertdir /home/guchun/Work/mosesdecoder/mert/ \
> > > >> > --rootdir /home/guchun/Work/moses-scripts/scripts-20111111-1703/ \
> > > >> > --working-dir /home/guchun/Work/tasks/ro-en/tuning-sge/ \
> > > >> > --jobs 2 --decoder-flag "-v 0" >& /home/guchun/Work/tasks/ro-en/tuning-sge/mert.out &
> > > >> >
> > > >> > I got the following error,
> > > >> >
> > > >> > check_exit_status
> > > >> > check_exit_status of job -aa
> > > >> > check_exit_status of job -ab
> > > >> > *wc: euparl.lc.ro.split12017-aa.trans: No such file or directory*
> > > >> > *Split (-aa) were not entirely translated*
> > > >> > outputN= inputN=11966
> > > >> > outputfile=euparl.lc.ro.split12017-aa.trans
> > > >> > inputfile=euparl.lc.ro.split12017-aa
> > > >> > *Split (-ab) were not entirely translated*
> > > >> > outputN=0 inputN=11966
> > > >> > outputfile=euparl.lc.ro.split12017-ab.trans
> > > >> > inputfile=euparl.lc.ro.split12017-ab
> > > >> > *everything crashed, not trying to resubmit jobs*
> > > >> > *Got interrupt or something failed.*
> > > >> > kill_all_and_quit
> > > >> > qdel 56
> > > >> > Executing: qdel 56
> > > >> > Exit code: 1
> > > >> > qdel 57
> > > >> > Executing: qdel 57
> > > >> > Exit code: 1
> > > >> > Translation was not performed correctly
> > > >> > or some of the submitted jobs died.
> > > >> > qdel function was called for all submitted jobs
> > > >> > Exit code: 1
> > > >> > The decoder died. CONFIG WAS -w -0.322581 -lm 0.161290 -d 0.193548 -tm 0.064516 0.064516 0.064516 0.064516 0.064516
> > > >> >
> > > >> > Any clue what may cause the problem? I have also attached the
> > > >> > output file (mert.out) for full inspection.
> > > >> >
> > > >> > Everything runs fine in serial execution (without --job 2).
> > > >> >
> > > >> > I wonder if this can attribute to my SGE configuration. So if
> > > >> > possible, could you please also give some advice on the parameter
> > > >> > configuration of SGE?
> > > >> >
> > > >> > Many thanks in advance,
> > > >> >
> > > >> > Guchun
> > > >
> > > > --
> > > >
> > > > Guchun Zhang
> > > >
> > > > Localization Engineer
> > > > Alpha CRC Ltd | Cambridge, UK
> > > > Direct: +44 1223 431035
> > > > [email protected]
> > > >
> > > > _______________________________________________
> > > > Moses-support mailing list
> > > > [email protected]
> > > > http://mailman.mit.edu/mailman/listinfo/moses-support
> > >
> > > --
> > > The University of Edinburgh is a charitable body, registered in
> > > Scotland, with registration number SC005336.
>
> --
> The University of Edinburgh is a charitable body, registered in
> Scotland, with registration number SC005336.

--
*Guchun Zhang*
Localization Engineer
Alpha CRC Ltd | Cambridge, UK
Direct: +44 1223 431035
[email protected]
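A note on the diagnosis in this thread: "Exec format error" is what a 32-bit kernel typically reports when asked to run a 64-bit binary, which fits a decoder built on the 64-bit head node and dispatched to the i686 execution nodes. One way to confirm this before submitting jobs is to read the ELF header of the binary and compare it with the architecture string each node prints (the `i686` in the `out.job` files above). The following is a minimal sketch in Python, not part of Moses or SGE. The function names `describe_elf`, `describe_file` and `runs_on`, and the set of host strings treated as 64-bit, are assumptions made for illustration.

```python
import struct

ELF_MAGIC = b"\x7fELF"
# EI_CLASS byte (offset 4) of the ELF identification: 1 = 32-bit, 2 = 64-bit.
ELF_CLASS_BITS = {1: 32, 2: 64}
# Two e_machine values from the ELF specification; others are reported as numbers.
ELF_MACHINES = {3: "x86", 62: "x86-64"}
# Assumption: host strings (as printed by `uname -m`) that denote a 64-bit x86 kernel.
HOSTS_64BIT = {"x86_64", "amd64"}


def describe_elf(header: bytes):
    """Return (bits, machine) from the first 20 bytes of an ELF file."""
    if len(header) < 20 or header[:4] != ELF_MAGIC:
        raise ValueError("not an ELF file")
    bits = ELF_CLASS_BITS.get(header[4])
    if bits is None:
        raise ValueError("unknown ELF class")
    # EI_DATA byte (offset 5): 1 = little-endian, 2 = big-endian.
    byte_order = "<" if header[5] == 1 else ">"
    # e_machine is a 2-byte field at offset 18 in both 32- and 64-bit headers.
    (machine,) = struct.unpack_from(byte_order + "H", header, 18)
    return bits, ELF_MACHINES.get(machine, "unknown (%d)" % machine)


def describe_file(path):
    """Read the ELF header of the file at `path` and describe it."""
    with open(path, "rb") as handle:
        return describe_elf(handle.read(20))


def runs_on(binary_bits, host_machine):
    """Rough check: a 64-bit binary needs a 64-bit kernel.

    A 32-bit binary is treated as runnable everywhere, although on a
    64-bit host it still needs 32-bit libraries to be installed.
    """
    return binary_bits == 32 or host_machine in HOSTS_64BIT
```

For example, calling `describe_file` on the decoder path from the thread on the head node, and then `runs_on(bits, "i686")` with the result, would show whether that build can execute on the 32-bit nodes at all.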
