Re: [Moses-support] english-japanese translation

2016-04-07 Thread Vito Mandorino
Hi, concerning the building of a Japanese-English translation engine, I have some doubts about the handling of Japanese numbers and digits, notably the portability of the placeholder approach. It seems that Japanese has several ways of expressing numbers, and both detection and conversion (to
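To make the number-handling concern concrete, here is a minimal sketch of what detection-free normalization of Japanese numerals could look like before any placeholder step. It is not part of Moses and not the method discussed in the thread; the function names `normalize_digits` and `kanji_to_int` are hypothetical, and the sketch assumes only plain kanji numerals (no formal variants such as 壱 or 弐, no decimals, no mixed kanji and Arabic forms such as 3万).

```python
import unicodedata

# Assumed coverage: everyday kanji numerals only (hypothetical helper tables).
KANJI_DIGITS = {"〇": 0, "零": 0, "一": 1, "二": 2, "三": 3, "四": 4,
                "五": 5, "六": 6, "七": 7, "八": 8, "九": 9}
SMALL_UNITS = {"十": 10, "百": 100, "千": 1000}
LARGE_UNITS = {"万": 10**4, "億": 10**8, "兆": 10**12}


def normalize_digits(text):
    """Map full-width digits (e.g. '２０１６') to ASCII digits.

    NFKC also folds other compatibility characters (half-width katakana,
    full-width Latin letters), which may or may not be wanted.
    """
    return unicodedata.normalize("NFKC", text)


def kanji_to_int(numeral):
    """Convert a kanji numeral string to an int.

    Handles both unit-based forms (二千十六) and positional forms (二〇一六).
    """
    total = 0    # sum of completed 万/億/兆 groups
    section = 0  # value accumulated inside the current group (below 10^4)
    digit = 0    # pending digit(s) not yet attached to a unit
    for ch in numeral:
        if ch in KANJI_DIGITS:
            digit = digit * 10 + KANJI_DIGITS[ch]
        elif ch in SMALL_UNITS:
            # A bare unit such as 十 means 1 x 10.
            section += (digit or 1) * SMALL_UNITS[ch]
            digit = 0
        elif ch in LARGE_UNITS:
            total += ((section + digit) or 1) * LARGE_UNITS[ch]
            section = 0
            digit = 0
        else:
            raise ValueError(f"not a kanji numeral character: {ch!r}")
    return total + section + digit
```

For instance, `kanji_to_int("二千十六")` and `kanji_to_int("二〇一六")` both yield 2016, and `normalize_digits("２０１６年")` yields `"2016年"`. Deciding which spans of a sentence are numerals in the first place (for example, 一 inside an ordinary word) is the harder detection problem and is not addressed here.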

Re: [Moses-support] english-japanese translation

2016-03-29 Thread Vito Mandorino
Thank you all for your comments and resources. I will go through them and let you know if I stumble upon something interesting during the process. Vito Mandorino 2016-03-28 9:47 GMT+02:00 Graham Neubig: > Hello Vishal, > > Yes, that's what pre-ordering means. Specifically

Re: [Moses-support] english-japanese translation

2016-03-28 Thread Graham Neubig
Hello Vishal, Yes, that's what pre-ordering means. Specifically, it means re-ordering the source side. Graham On Mon, Mar 28, 2016 at 2:16 PM, Vishal Goyal (विशाल गोयल) <vishal@gmail.com> wrote: > Dear Graham, > Greetings. > Please clarify that Pre-Ordering in your reply means, that making

Re: [Moses-support] english-japanese translation

2016-03-27 Thread विशाल गोयल
Dear Graham, Greetings. Please clarify whether pre-ordering in your reply means making the word order of the source-language and target-language sentences similar before training, so that it becomes similar to the scenario of closely related

Re: [Moses-support] english-japanese translation

2016-03-25 Thread Graham Neubig
Hi Vito, English-Japanese and Japanese-English translation are very difficult due to the grammatical differences between the languages. You have a couple of options to overcome this problem: 1) If you want to use phrase-based Moses, you will have to perform some variety of pre-ordering, in which

[Moses-support] english-japanese translation

2016-03-25 Thread Vito Mandorino
Dear all, has anyone ever run experiments on English-Japanese and Japanese-English translation? Do you know of useful resources for this language pair, or specific gotchas one should be aware of? More specifically, what is the best policy for dealing with alphabets? Do you think