Hello, 

We are trying to compile and train Moses to translate a large number of 
documents. 

We are following the steps described in 
    http://www.statmt.org/moses_steps.html  (Moses Installation and Training 
Run-Through) 

but we have changed the corpus and are using the Europarl corpus for a 
couple of language pairs. 

I would greatly appreciate it if you could answer some of our questions: 
1.  Is it possible to achieve something similar to the online demo with a 
4-core machine (6 GB RAM)?

2.  Is it necessary to train with the full Europarl corpus? 

3.  We plan to translate large amounts of text. How fast is Moses on large 
amounts of text?

4.  Does anybody have pre-trained model files, so we can achieve good quality 
without having to retrain on the whole corpus? Any repositories, even private 
ones, would be of great help.  


5.  The documentation explains that we need four preprocessing steps for the 
Europarl corpus:
      tokenizing, lowercasing, removing XML tags and stripping empty lines. 
      I have removed the XML tags and stripped the empty lines with a custom 
script, because I haven't found any script for this in Moses. 
      Are such scripts available somewhere? 
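
      For reference, the tag removal and empty-line stripping can be sketched 
roughly as follows in Python. This is only an illustration and not a script 
shipped with Moses: the function name clean_parallel and the tag pattern are 
assumptions, and it assumes the two sides of the corpus are already aligned 
line by line. A sentence pair is dropped when either side is empty after tag 
removal, so the two sides stay aligned.

```python
import re

# Assumption: anything of the form <...> is markup, e.g. <SPEAKER ID=1>.
TAG_RE = re.compile(r"<[^>]+>")


def clean_parallel(src_lines, tgt_lines):
    """Strip tags from both sides and drop pairs where either side is empty.

    Assumes src_lines and tgt_lines are aligned line by line. zip() stops at
    the shorter input, so a length mismatch is silently truncated.
    """
    cleaned = []
    for src, tgt in zip(src_lines, tgt_lines):
        src = TAG_RE.sub("", src).strip()
        tgt = TAG_RE.sub("", tgt).strip()
        # Keep the pair only if both sides still contain text.
        if src and tgt:
            cleaned.append((src, tgt))
    return cleaned


if __name__ == "__main__":
    # Tiny in-memory example; real input would be read from the corpus files.
    source = ["<CHAPTER ID=1>", "Resumption of the session", ""]
    target = ["<CHAPTER ID=1>", "Reanudación del período de sesiones", ""]
    for src, tgt in clean_parallel(source, target):
        print(src, "|||", tgt)
```

Tokenizing and lowercasing are not covered by this sketch.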
    
Could you please help us by answering these questions? 

Any help will be very much appreciated. 

                                          
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
