Great! Thank you Joao... In addition, I also collected en-fr from the Hansard corpus. I am not sure who created the manually aligned files. I also do not have them available on a website. If anyone is interested, email me...
-----Original Message----- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of [EMAIL PROTECTED] Sent: donderdag 24 juli 2008 18:09 To: [email protected] Subject: Moses-support Digest, Vol 21, Issue 21 Send Moses-support mailing list submissions to [email protected] To subscribe or unsubscribe via the World Wide Web, visit http://mailman.mit.edu/mailman/listinfo/moses-support or, via email, send a message with subject or body 'help' to [EMAIL PROTECTED] You can reach the person managing the list at [EMAIL PROTECTED] When replying, please edit your Subject line so it is more specific than "Re: Contents of Moses-support digest..." Today's Topics: 1. Manual Word Alignments ( Jo?o Gra?a ) ---------------------------------------------------------------------- Message: 1 Date: Thu, 24 Jul 2008 12:54:16 +0100 From: " Jo?o Gra?a " <[EMAIL PROTECTED]> Subject: [Moses-support] Manual Word Alignments To: [email protected] Message-ID: <[EMAIL PROTECTED]> Content-Type: text/plain; charset="utf-8" In reply to the post about manual word alignments i want to inform all that we have made some word alignments for 6 europarl language pairs. They correspond to the first 100 sentences from the common test set of the europarl for: Portuguese - English Portuguese - French Portuguese - Spanish Spanish - French Spanish - English English-French We intend to continue increasing the number of alignments, but they are useful anyway to test your algorithms or tune some parameters. The link for them is: https://www.l2f.inesc-id.pt/wiki/index.php/Word_Alignments Hope you can find this resources useful, Jo?o Gra?a Message: 6 > Date: Thu, 24 Jul 2008 10:46:59 +0100 > From: Patrik Lambert <[EMAIL PROTECTED]> > Subject: Re: [Moses-support] Moses-support Digest, Vol 21, Issue 14 > To: [email protected] > Message-ID: <[EMAIL PROTECTED]> > Content-Type: text/plain; charset=ISO-8859-1; format=flowed > > Hi Sane, > > Rada Mihalcea's alignment page > (http://www.cs.unt.edu/~rada/wa/#data<http://www.cs.unt.edu/%7Erada/wa/#data >) > has links to human-aligned > alignment files, such as those used in the ACL 2005 workshop on parallel > text, the NAACL 2003 workshop on Parallel Text or the Europarl corpus > word alignment trial and test data. Note that the widely used > French-English Hansards reference contains many ambiguous (i.e. P) links > and thus favours high precision computed alignments, as explained in > "Patrik Lambert, Adri? de Gispert, Rafael Banchs and Jos? B. Mari?o. > Guidelines for Word Alignment Evaluation and Manual Alignment. Language > Resources and Evaluation, 39 (4) pp. 267-285. Springer" > > Patrik > > > [EMAIL PROTECTED] wrote: > > Message: 2 > > Date: Thu, 17 Jul 2008 15:10:29 +0200 > > From: "Sanne Korzec" <[EMAIL PROTECTED]> > > Subject: [Moses-support] recaser > > To: <[email protected]> > > Message-ID: > > <[EMAIL PROTECTED]> > > > > 3) Finally and a little off topic, does anybody in our research > field > > have word-based human-aligned alignment files for some test set > available? > > > > > > -------------- next part -------------- An HTML attachment was scrubbed... URL: http://mailman.mit.edu/pipermail/moses-support/attachments/20080724/25322431 /attachment-0001.htm ------------------------------ _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support End of Moses-support Digest, Vol 21, Issue 21 ********************************************* _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
