My apologies to abuse the moses-list for sending these announcements but the following might be interesting to subscribers of this list:
There are several new parallel resources available from OPUS. Please have a look at http://opus.lingfil.uu.se/ Some highlights: - several sub-corpora from various domains - over 20 billion tokens in total in more than 90 languages - sentence-aligned for all possible language pairs (> 3500) (more than 100 language pairs with > 100M tokens) - available in XML/XCES, TMX and plain text (Moses format) - online query tools - partially machine-annotated (POS, lemmas, chunks) Feedback is very welcome! There is also a new related book available on creating parallel resources: Bitext Alignment Synthesis Lecture on HLT, Morgan & Claypool Publishers http://dx.doi.org/10.2200/S00367ED1V01Y201106HLT014 -- ********************************************************************************** Jörg Tiedemann [email protected] Dep. of Linguistics and Philology http://stp.lingfil.uu.se/~joerg/ Uppsala University tel: +46 (0)18 - 471 1412 Box 635, SE-751 26 Uppsala/SWEDEN fax: +46 (0)18 - 471 1094 _______________________________________________ Moses-support mailing list [email protected] http://mailman.mit.edu/mailman/listinfo/moses-support
