Hello,
We are trying to compile and train moses to translate a huge amount of
documents.
We follow the steps described in
http://www.statmt.org/moses_steps.html ( Moses Installation and Training
Run-Through )
but we have change the corpus and use the corpus available in europral for a
couple of languages.
I would highly appreciate if you could answer some of the questions we have:
1. Is it possible to achieve something similar to the online demo with a
4-core machine (6gb RAM) ?
2. Is it necessary to train with the full europarl corpus?
3. We plan to translate big amounts of text... How fast moses goes for big
amounts of text?
4. Does anybody have trained files so we can achieve a good quality without
having to retrain the whole corpus? Some repositories, private, anything would
be of great help.
5. The documentation explains that we need to do 4 preprocess steps for
europarl corpus:
tokenizer, lowercase, take xml takes off and strip empty lines.
I have taken the xml tags off and stripped the empty lines with an script
done for me, because I haven't found any script in moses.
Are these scripts available somewhere?
Could you please help us by answering these questions?
Any help will be very much appreciated.
_________________________________________________________________
Escucha a quienes ya han probado Windows 7 ¡Hazlo aquí!
http://www.sietesunpueblodeexpertos.com/index_windows7.html_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support