Are you sure the input and the training data are both encoded as UTF-8?
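
One quick way to check the encoding, sketched in Python: try to decode each line as UTF-8 and report the lines that fail. The function name `find_non_utf8_lines` and the file name in the usage note are made up for illustration; nothing here is part of Moses.

```python
def find_non_utf8_lines(lines):
    """Return the 1-based numbers of the lines that are not valid UTF-8.

    `lines` is any iterable of bytes objects, for example a file
    opened in binary ("rb") mode.
    """
    bad = []
    for number, raw in enumerate(lines, start=1):
        try:
            raw.decode("utf-8")
        except UnicodeDecodeError:
            # This line contains a byte sequence that UTF-8 does not allow.
            bad.append(number)
    return bad
```

For a file, something like `with open("input.txt", "rb") as f: print(find_non_utf8_lines(f))` should work. An empty list only shows that every line is valid UTF-8; running the same check on both the training corpus and the decoder input tells you whether the two at least agree on that.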

Also, was the input tokenized exactly the same way as the training data? The
tokenization scripts change from time to time.
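
A rough way to spot a tokenization or casing mismatch is to measure how many input tokens never occur in the tokenized training corpus. The sketch below assumes both sides are already tokenized and whitespace-separated; `build_vocab` and `unknown_rate` are hypothetical helper names, not Moses tools.

```python
def build_vocab(training_lines):
    """Collect every whitespace-separated token seen in the training text."""
    vocab = set()
    for line in training_lines:
        vocab.update(line.split())
    return vocab


def unknown_rate(sentence, vocab):
    """Return the fraction of tokens in `sentence` that are not in `vocab`."""
    tokens = sentence.split()
    if not tokens:
        return 0.0
    unknown = sum(1 for token in tokens if token not in vocab)
    return unknown / len(tokens)
```

If the rate is close to 1.0 on sentences that look like the training data, the likely cause is a preprocessing or encoding difference rather than genuinely new words.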

Hieu Hoang
http://www.hoang.co.uk/hieu

On 26 December 2015 at 13:53, Sehrob Ibrohimov <iseh...@gmail.com> wrote:

> Hello everyone!
>
> I have come across some strange behaviour in the Moses decoder and
> can't figure out what the problem is.
> I trained translation models about a year ago with Moses (version 3.0)
> and got pretty good results. The translations were pretty impressive.
> But a few days ago I got a new machine (CentOS 7), built Moses from
> source and gave it the translation models I had, and now I can't get
> any translations. These models were giving good results with the
> previous Moses build, which was on Ubuntu 14.0.
> I give it input like: "президент республики подписал указ о
> проведении праздника навруз".
> It gives me:  президент|UNK|UNK|UNK республики|UNK|UNK|UNK
> подписал|UNK|UNK|UNK указ|UNK|UNK|UNK о|UNK|UNK|UNK
> проведении|UNK|UNK|UNK праздника|UNK|UNK|UNK навруз|UNK|UNK|UNK
> [11111111]  [total=-798.292]
>
> core=(-800.000,-8.000,8.000,0.000,0.000,0.000,0.000,0.000,0.000,0.000,0.000,0.000,0.000,0.000,0.000,-25.136)
> Then I thought there might be some changes in the newer Moses, so I
> retrained my models on the data I had using EMS, but nothing changed:
> the decoder gives me the same results.
> Please help me figure out what the problem is!
>
> Sincerely yours
> Sehrob
>
> _______________________________________________
> Moses-support mailing list
> Moses-support@mit.edu
> http://mailman.mit.edu/mailman/listinfo/moses-support
>