During the IWSLT 2011 evaluation campaign, a sentence-aligned corpus from Multi-UN <http://iwslt2011.org/doku.php?id=06_evaluation#download_of_training_data> (1.6GB) was released, but the download link <http://www.euromatrixplus.eu/downloads/35> is now broken. I think I will have to align the sentences myself, unless someone who has the file can kindly give me a download link :) Thanks.
All the best,
Saeed

On Fri, Aug 10, 2012 at 9:13 PM, Philipp Koehn <[email protected]> wrote:

> Hi,
>
> it is true that the sentence alignment is not provided for these corpora,
> except for fr-en and es-en as part of the WMT evaluation campaign.
>
> You will have to use some sentence aligner to align these corpora.
> I'd be interested to hear what others have done in this regard, since
> it seems to be a common problem.
>
> -phi
>
> On Fri, Aug 10, 2012 at 11:30 AM, saeed smith <[email protected]> wrote:
>
>> Hi,
>>
>> I appreciate it if anyone send me a link or some information about
>> Multi-UN corpora with sentence alignments. I downloaded some files from
>> EuroMatrixPlus <http://www.euromatrixplus.net/multi-un/> website but it
>> seems that sentence alignments are not included.
>>
>> All the best,
>> Saeed
>>
>> _______________________________________________
>> Moses-support mailing list
>> [email protected]
>> http://mailman.mit.edu/mailman/listinfo/moses-support
