Hi,
  I'm new to Moses, have several questions, but I'll start with asking what
happens to apostrophes.
  What I see is that the tokenizer transforms words with apostrophes like
this :

it's --> it ' s
c'est une belle journée --> c ' est une belle journée

How does this work in translating the meaning of the word it's (English) /
c'est (French)?
I'm using the baseline model which I trained according to the tutorial.
I assume I need to send text tokenized in the same way to be translated
(it's --> it ' s; Otherwise any word that has an apostrophe is not
translated.)
I assumed it ' s (or c ' est) would be considered a phrase, however, I
notice that only the word est is translated (and this is just one example,
in l'équipe only the word equipe is translated an so forth...)

Am I doing sometihng wrong or misunderstanding something? Or do I need to
change the tokenizer to accept ' in the middle of words?

Thanks in advance,
Shani
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to