mt-list  

Re: [Mt-list] MT for languages with multiple scripts

Vadim Berman
Fri, 04 Sep 2009 21:50:34 -0700

Hi Jeff,

An interesting question.

In our system, we tend to store the script variants in one lexicon, distinguished by different stylistic tags.

Obviously, the grammar, etc. must be reused across the scripts.
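
The single-lexicon idea can be sketched roughly as follows. This is only an illustration under stated assumptions, not the actual system described above: the class names, the tag names ("latin", "cyrillic"), and the Serbian sample entry are all hypothetical. Each entry carries one lemma and part of speech, with its surface forms keyed by a script tag, so anything attached to the lemma (such as grammar rules) is shared by every script.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class LexiconEntry:
    """One lexical entry shared by all scripts (hypothetical structure)."""
    lemma_id: str
    pos: str
    # Surface forms keyed by a script tag; the tag names are assumptions.
    forms: Dict[str, str] = field(default_factory=dict)


class Lexicon:
    """A single lexicon in which script variants are tagged forms of one entry."""

    def __init__(self) -> None:
        self._entries: Dict[str, LexiconEntry] = {}
        self._by_surface: Dict[str, str] = {}  # surface form -> lemma_id

    def add(self, entry: LexiconEntry) -> None:
        self._entries[entry.lemma_id] = entry
        # Index every script variant so analysis works for any input script.
        for surface in entry.forms.values():
            self._by_surface[surface] = entry.lemma_id

    def lookup(self, surface: str) -> Optional[LexiconEntry]:
        """Source side: find the entry regardless of which script was used."""
        lemma_id = self._by_surface.get(surface)
        return self._entries.get(lemma_id) if lemma_id is not None else None

    def render(self, lemma_id: str, script: str) -> str:
        """Target side: pick the surface form for the requested script tag."""
        return self._entries[lemma_id].forms[script]


lexicon = Lexicon()
lexicon.add(LexiconEntry("book", "noun", {"latin": "knjiga", "cyrillic": "књига"}))
```

With a layout like this, the output script becomes a parameter chosen at generation time (for example `lexicon.render("book", "cyrillic")`) rather than a reason to build a separate system.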

Best regards,
Vadim

----- Original Message -----
From: "Jeff Allen" <jeff.al...@free.fr>
To: <mt-list@eamt.org>
Sent: Friday, September 04, 2009 11:54 PM
Subject: [Mt-list] MT for languages with multiple scripts


Hi listers,

What is your experience with creating MT systems for languages with multiple scripts?

- Choose only one script
- Separate system per script
- Toggle between variants after system launch
- etc.

What factors have influenced how you chose to implement multiple scripts for a given language, as source language and/or as target language?

Examples of languages:

* Chinese (Traditional, Simplified, and handling Kanji)
* Bosnian/Croatian/Serbian (Latin alphabet, Cyrillic alphabet)
* Mongolian (Classic script, Cyrillic script)

Regards,

Jeff
_______________________________________________
Mt-list mailing list


