Hi Kamel,

I think you have a tokenization problem.
It's strange that your Arabic tokenizer splits the word "البدائية" into "ال بدائية" while the word "قوتهم" is not split into "قوت +هم". You can use MADA (http://www1.cs.columbia.edu/~rambow/software-downloads/MADA_Distribution.html) to pre-process your Arabic train, dev and test data, so that all three are segmented the same way.

Also, you have another problem (not related to the initial question) on the French side of your training data: the token " d'" is originally " d' " and after tokenization you should have " d ' ". I think it's due to a problem with the cleaning and normalisation of the web-crawled data.

-Haithem

On 28 February 2016 at 09:30, Rajen Chatterjee <[email protected]> wrote:

> Hi Kamel,
>
> Untranslated words occur in Moses output because the phrase table does not
> have any translation option for these source words.
> You can use the switch "-drop-unknown" or a transliteration model, please
> look here: http://www.statmt.org/moses/?n=Advanced.OOVs
>
> On Fri, Feb 26, 2016 at 2:18 PM, kamel Bouzidi <[email protected]>
> wrote:
>
>> Hi,
>> I have a problem with the Moses output. I found some untranslated words
>> in the Moses output and I don't understand why. These words are not OOV,
>> because they exist in my language model.
>> Can you help me?
>> Thank you.
>
> --
> -Regards,
> Rajen Chatterjee.
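A quick way to check the point made in this thread is to compare the tokens of the input you send to Moses against the source side of the training data. The sketch below is only an approximation, not what Moses does internally: the training vocabulary stands in for the phrase table (which can lose words through alignment or pruning), the function names are made up for illustration, and the two toy sentences are hypothetical. It assumes whitespace-tokenized text. Note that the language model plays no role here, since it only covers the target language.

```python
from collections import Counter


def source_vocabulary(train_lines):
    """Collect every whitespace-separated token from the source side of the training data."""
    vocab = set()
    for line in train_lines:
        vocab.update(line.split())
    return vocab


def unknown_tokens(input_lines, vocab):
    """Count input tokens that never occur in the training source vocabulary."""
    counts = Counter()
    for line in input_lines:
        for token in line.split():
            if token not in vocab:
                counts[token] += 1
    return counts


# Hypothetical toy data: the training side was segmented, the input was not.
train_source = ["ال بدائية قوت +هم"]
test_source = ["البدائية قوتهم"]

vocab = source_vocabulary(train_source)
oov = unknown_tokens(test_source, vocab)

# Print each unseen token with its frequency, most frequent first.
for token, count in oov.most_common():
    print(token, count)
```

If many of the reported tokens are words that do occur in the training data in a different segmentation, the train, dev and test sets were probably not pre-processed in the same way.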
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support
