You can use this one: http://code.google.com/p/geppetto/
It uses the word alignments with posterior constraints that work much better than Giza for small corpus and as well as large corpus (http://aclweb.org/anthology-new/J/J10/J10-3007.pdf). João Graça On Fri, Apr 20, 2012 at 3:53 PM, Joerg Tiedemann <[email protected]> wrote: > Maybe you can find something useful here: > http://opus.lingfil.uu.se/letsmt-trac/wiki/WordAligner > > Jörg > > > On Fri, Apr 20, 2012 at 2:01 PM, Andrei Popescu-Belis > <[email protected]> wrote: >> Dear all, >> >> Do you have any suggestions for software to perform word alignment? >> >> I am looking for a solution to word align several pairs of sentences (one is >> an imperfect translation of the other), possibly using knowledge obtained >> from a larger parallel corpus. >> >> Here are some ideas that do not seem entirely satisfactory: >> 1. Build a translation model with GIZA++ over the data to align. But there >> is not enough data. Maybe adding the large corpus data? (Slow.) >> 2. If Moses is used, it can output word alignment info, but our >> translations are not necessarily done with Moses. >> 3. There were research systems for the shared task at the 2003/2005 >> Parallel Texts workshops, but is any piece of software available? >> >> Thank you and best regards, >> Andrei Popescu-Belis >> >> Idiap Research Institute | tel: (41 27) 721 7729 >> Centre du Parc, CP 592 | fax: (41 27) 721 7712 >> CH-1920 Martigny | [email protected] >> Switzerland | www.idiap.ch/~apbelis >> _______________________________________________ >> Mt-list mailing list > > > > -- > ********************************************************************************** > Jörg Tiedemann [email protected] > Dep. of Linguistics and Philology http://stp.lingfil.uu.se/~joerg/ > Uppsala University tel: +46 (0)18 - 471 > 1412 > Box 635, SE-751 26 Uppsala/SWEDEN fax: +46 (0)18 - 471 1094 > _______________________________________________ > Mt-list mailing list _______________________________________________ Mt-list mailing list
