Re: [Moses-support] Training scripts/executables under Win32 ?

Hubert Crépy Fri, 22 Feb 2008 08:46:42 -0800

J C Read a écrit :

According to wikipedia http://en.wikipedia.org/wiki/SIGSEGV signal 11 indicates
an invalid memory reference.

Yes, definitely, what we also call a "coredump" under AIX.

I eventually figured out that this was because of the data I was using.

That's often the case, an unfortunate data condition that is unexpectedand unaccounted for in error recovery. That's usually hard to track,though...

Things to check:


Is the data sentence aligned?

Yes, europarl.lowercased.0-0.fr has 73835 lines:
   reprise de la session
   je dÃ©clare reprise la session du parlement europÃ©en qui avait (...)
   (...)
   des paroles , pas d' action .
   en attendant , deux mille personnes ont perdu la vie inutilement , (...)
and europarl.lowercased.0-0.en has 73835 lines:
   resumption of the session

i declare resumed the session of the european parliament adjournedon (...)

   (...)
   more talk . no action .
   meanwhile , two thousand people in the last year have needlessly (...)

Has the data been cleaned with the clean script? (try using sentences of min 1
and max 100)

Yes, it went through the script, with the recommended parameters:

|bin/moses-scripts/scripts-||/YYYYMMDD-HHMM/||/training/clean-corpus-n.perlworking-dir/corpus/europarl.tok fr en working-dir/corpus/europarl.clean1 40|


which reduced the number of sentences from the initial 100K to 73835.

Any other suggestions?

Say, it could not be that the very smallness of my training data (only73K sentences) could be causing unexpected underflows or whatever inGIZA, could it?Does it not make sense to try and run the whole process on a smalldataset to start with (I don't have access to powerful machines at themoment, running this on my personal laptop...) ?


Thanks for your support, much appreciated.

--
Hubert Crépy

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Re: [Moses-support] Training scripts/executables under Win32 ?

Reply via email to