Hi all Just wanted to know a few things about applying linguistic inputs to improve a baseline SMT system.
1. Should we tag both the source and the target language for training? And when we are tuning/testing should that data be tagged as well? 2. Also, can language model be tagged and will it make any improvements. 3. I have the enhanced Brill's tagger for english are there any other good ones? Finally, when applying linguistic inputs to both source and target language should they have the same no. of tags like pos, any other morphological information. Thanks in advance. Regards, Vineet _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
