SMT systems such as Moses do not guarantee that they can reproduce the training set. For example, phrases might be pruned due to frequencies being too low, not all words might be aligned, the decoder might discard the true translation during etc etc.
This doesn't really have much to do with Indian languages per se; instead, it is the way that systems are built in general. Miles > Can anyone please tell me about why we got low BLEU score on a testset we get from training set for sparse resourced languages like Indian languages. -- The University of Edinburgh is a charitable body, registered in Scotland, with registration number SC005336. _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
