Hi Nikhil,

The command you mentioned is meant for mining transliteration pairs from
a parallel translation corpus. If you already have a parallel
transliteration corpus, you can skip that command and train the model with
the standard Moses pipeline. In that case the parallel transliteration
corpus should be character-segmented, and the LM should be trained over
characters rather than words.
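
As a rough illustration of what "character segmented" means here, the
sketch below turns each word into space-separated characters so that the
usual tools treat every character as a token. It is only a sketch under
assumptions: the function names, the "_" marker for spaces inside
multi-word entries, and the one-entry-per-line file layout are all
illustrative and not part of Moses. It also splits on Unicode code points,
so combining marks (common in Indic scripts) become separate tokens, which
may or may not be what you want.

```python
def char_segment(word, space_marker="_"):
    """Return the word as space-separated characters.

    Whitespace inside a multi-word entry is replaced by space_marker
    (a hypothetical convention) so it survives as its own token.
    """
    tokens = [space_marker if ch.isspace() else ch for ch in word.strip()]
    return " ".join(tokens)


def segment_file(src_path, dst_path):
    """Character-segment a file with one entry per line.

    Blank lines are kept as blank lines, so the source and target
    sides of the corpus stay line-aligned.
    """
    with open(src_path, encoding="utf-8") as src, \
         open(dst_path, "w", encoding="utf-8") as dst:
        for line in src:
            dst.write(char_segment(line) + "\n")


if __name__ == "__main__":
    print(char_segment("moses"))  # m o s e s
```

Running segment_file once on each side of the corpus gives two
line-aligned files that can stand in for word-tokenized text in the
standard training and LM steps.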

Anoop.

On Wed 31 Jan, 2018, 15:08 nikhil t.v.s, <[email protected]> wrote:

> Hi,
>       I have some questions regarding transliteration training using
> Moses. There is minimal documentation.
>
>
> ../mosesdecoder/scripts/Transliteration/train-transliteration-module.pl \
>   --corpus-f <foreign text> --corpus-e <target text> \
>   --alignment <path to aligned text> \
>   --moses-src-dir <moses decoder path> --external-bin-dir <external tools> \
>   --input-extension <input extension> --output-extension <output-extension> \
>   --srilm-dir <sri lm binary path> --out-dir <path to generate output files>
>
>
> Is the LM trained over words or characters?
> Will a parallel corpus of source-language words and target-language
> words suffice?
> Should the corpus be character-segmented for training?
>
>
> Regards,
> T. Venkata Sai Nikhil,
> International Institute of Information Technology -Hyderabad.
> _______________________________________________
> Moses-support mailing list
> [email protected]
> http://mailman.mit.edu/mailman/listinfo/moses-support
>