[Moses-support] Tokenization problem

Ihab Ramadan Mon, 05 Jan 2015 00:10:28 -0800

Dears,

Using the tokenizer on the training files replaces the apostrophes with
"&apos; s" (with space) but if I use the same script to tokenize a sentence
it makes the apostrophes to be "&apos;s" (without a space)


This problem confuse the decoder while translation 

How to solve this peoblem

Thanks  

 

Best Regards

Ihab Ramadan| Senior Developer|  <http://www.saudisoft.com/> Saudisoft -
Egypt | Tel  +2 02 330 320 37  Ext- 0 | Mob+201007570826 | Fax+20233032036 |
Follow us on
<http://www.linkedin.com/company/77017?trk=vsrp_companies_res_name&trkInfo=V
SRPsearchId%3A1489659901402995947155%2CVSRPtargetId%3A77017%2CVSRPcmpt%3Apri
mary> linked |
<https://www.facebook.com/pages/Saudisoft-Co-Ltd/289968997768973?ref_type=bo
okmark> ZA102637861 |  <https://twitter.com/Saudisoft> ZA102637858

_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

[Moses-support] Tokenization problem

Reply via email to