During the IWSLT 2011 evaluation campaign, a sentence aligned corpus from
Multi-UN<http://iwslt2011.org/doku.php?id=06_evaluation#download_of_training_data>(1.6GB)
released but now the
link is broken <http://www.euromatrixplus.eu/downloads/35>. I think I have
to align the sentences myself except someone who has the file kindly gives
me a download link :) Thanks.

All the best,
Saeed



On Fri, Aug 10, 2012 at 9:13 PM, Philipp Koehn <[email protected]> wrote:

> Hi,
>
> it is true that the sentence alignment is not provided for these corpora,
> except for fr-en and es-en as part of the WMT evaluation campaign.
>
> You will have to use some sentence aligner to align these corpora.
> I'd be interested to hear what others have done in this regard, since
> it seems to be a common problem.
>
> -phi
>
> On Fri, Aug 10, 2012 at 11:30 AM, saeed smith <[email protected]>wrote:
>
>> Hi,
>>
>> I appreciate it if anyone send me a link or some information about
>> Multi-UN corpora with sentence alignments. I downloaded some files from
>> EuroMatrixPlus <http://www.euromatrixplus.net/multi-un/> website but it
>> seems that sentence alignments are not included.
>>
>> All the best,
>> Saeed
>>
>> _______________________________________________
>> Moses-support mailing list
>> [email protected]
>> http://mailman.mit.edu/mailman/listinfo/moses-support
>>
>>
>
_______________________________________________
Moses-support mailing list
[email protected]
http://mailman.mit.edu/mailman/listinfo/moses-support

Reply via email to