Re: [Moses-support] Data collection

2016-04-19 Thread Philipp Koehn
Hi, the common training pipeline limits sentences to at most 80 words. This is due to limitations in GIZA++. There can be any mix of sentence lengths - long sentences, short sentences, single words. There is a good chance for the system to translate "I eat an apple" correctly, if it a training
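As a rough illustration of the 80-word limit mentioned above, here is a minimal Python sketch of a length filter for a parallel corpus. It is not the Moses pipeline's own code. The function name `filter_by_length`, the whitespace tokenization, and the minimum length of 1 are assumptions made for this example. Only the 80-word maximum comes from the message.

```python
def filter_by_length(pairs, min_len=1, max_len=80):
    """Keep sentence pairs where both sides have min_len..max_len tokens.

    Hypothetical helper for illustration: tokens are counted by splitting
    on whitespace, and max_len=80 mirrors the limit stated in the message.
    """
    kept = []
    for src, tgt in pairs:
        n_src = len(src.split())
        n_tgt = len(tgt.split())
        # Drop the pair if either side is empty or too long.
        if min_len <= n_src <= max_len and min_len <= n_tgt <= max_len:
            kept.append((src, tgt))
    return kept


# Illustrative data only: a short sentence, a single word, and an
# over-long source sentence of 81 tokens.
corpus = [
    ("I eat an apple", "Ich esse einen Apfel"),
    ("apple", "Apfel"),
    (" ".join(["word"] * 81), "kurz"),
]
filtered = filter_by_length(corpus)
```

In this sketch the short sentence and the single-word entry are both kept, which matches the point that any mix of lengths is acceptable, while the 81-token pair is dropped.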

[Moses-support] Data collection

2016-04-19 Thread Sanjanashree Palanivel
Hi, how should the data be collected for training Moses? I would like to know how long and how short the sentences can be for training Moses. What happens if simple sentences like "I eat an apple" are given for training together with longer sentences? And what if I give a word as a