Hi,
Have some questions regarding transliteration training using Moses.
There is minimal documentation.
../mosesdecoder/scripts/Transliteration/train-transliteration-module.pl \
--corpus-f <foreign text> --corpus-e <target text> \ --alignment <path to
aligned text> \ --moses-src-dir <moses decoder path> --external-bin-dir
<external tools> \ --input-extension <input extension>--output-extension
<output-extension> \ --srilm-dir <sri lm binary path> --out-dir <path to
generate output files>
Is the lm trained over words or characters?
A parallel corpus of source language word and destination language word
will suffice?
Should the corpus be character segmented for training?
Regards,
T. Venkata Sai Nikhil,
International Institute of Information Technology -Hyderabad.
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support