Hi all I would like to know is the data used for both tuning and testing the same ?
also how long would it take to tune say 5000 sentences using mert? can someone recommend a nice tool for sentence alignment ? i am currently using Microsoft's bilingual sentence aligner which seems to be very accurate but becomes slower for large number of sentences as it does a lot of iterations? also with respect to sentence alignment, there is something called as the probability threshold which i dont understand the importance of other than a value between 0 and 1 is chosen also how to interpret a bleu score of say 15 or 20 in terms accuracy in percentage? Thanks Vineet _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
