Hi Dave

NB. Please subscribe to the mailing list before posting to it. You can 
subscribe here:
    http://mailman.mit.edu/mailman/listinfo/moses-support

I've noticed recently that virtual machine disks tends to be REALLY 
slow. Since binarization is all IO bound, that may severely affect the 
speed. Try binarizing on a real pc and see how it goes.

Also, check that you haven't been beaten by this VMWare bug:
http://kb.vmware.com/selfservice/microsites/search.do?language=en_US&cmd=displayKC&externalId=51306

For comparison, the de-en in the example models I created
http://www.statmt.org/moses/RELEASE-0.91/de-en/model/phrase-table.1.bin/
took 10 hours, and that while running about 20 binarizations simultaneously.


On 28/12/2012 02:02, David Wilson-Parr wrote:
> Hi all,
>
> I am a new member going through the 'build baseline system' section of 
> the website using the Europarl Swedish-English V6 set. Training took 
> not so long maybe 2 days although I swapped laptops halfway through so 
> its hard to tell.  I am running moses on Mint (Ubuntu type) linux on 
> VMWare under Windows 7.  I map the working (train/model) directory to 
> the host computer harddisk because I want to keep the VM image smaller.
>
> Anyway cutting to the chase.  I was training the Swedish/English Pair 
> of Europarl.  Training took a while, I would estimate 2 days but I 
> wasn't using mgiza++ just giza++ , incidentally I can't get mgiza++ to 
> compile.  I then tried to run the decoder and it took a while to start 
> up but when it started, it would immediately say
>
> **Killed
>
> event though I didn't kill it.  So I decided to binarise the phrase 
> table an re-ordering models but it has taken far longer than I 
> expected.  The 'build a baseline system tutorial' generally indicates  
> when something is a time-consuming process but this was taking longer 
> than the initial training.
>
> processPhraseTable - took 2 days+
> processLexicalTable - 3 days and still running
>
> Machine has 32gb of Ram, Intel I7 3630-QM 2.40 Ghz cpu (4/8 cores) .  
> SSD drive Sata III.  VMware Image is set to use 4 cores and 29Gb memory.
>
> I really appreciate some help,
>
> Dave

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to