Hello,

We intend to add XML tags to our corpus but we are not sure how the Moses
decoder and SRILM uses these tags in training and decoding phase.

For example if we tag 19 in main corpus like this:
19  ---> <num>19</num>

How does LM must be made on this tagged corpus using SRILM?
Does SRILM consider whether <num>  or <num>19</num> as a token?

Also in decoding phase:
How does moses pass the tagged tokens to the LM?
For example if test is tagged like this:
<num>19</num>
Does it pass just <num> or whole of it as <num>19</num>


---------------------
Best Regards,
S.Bakhshaei

After All you will come ....
And will spread light on the dark desolate world!
O' Kind Father! We will be waiting for your affectionate hands ...
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to