Hi,

On Fri, Mar 23, 2012 at 2:10 AM, Hieu Hoang <[email protected]>
wrote:
> i think philipp committed  some scripts that escape these characters, and
> others, a few days ago.

Yes, escaping of these characters is now standard in the tokenizer,
and there is also a specific script to escape them
(escape-special-chars.perl)
which should be called after any tokenizer you may use in place of
the Moses one.

For the record, the following characters are replaced:
& -> &amp;
| -> &bar;
< -> &lt;
> -> &gt;
[ -> &bra;
] -> &ket;

-phi
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to