My apologies for using the Moses list for this announcement, but the
following might be of interest to subscribers of this list:


There are several new parallel resources available from OPUS. Please
have a look at
http://opus.lingfil.uu.se/

Some highlights:
- several sub-corpora from various domains
- over 20 billion tokens in total in more than 90 languages
- sentence-aligned for all possible language pairs (> 3500)
  (more than 100 language pairs with > 100M tokens)
- available in XML/XCES, TMX and plain text (Moses format)
- online query tools
- partially machine-annotated (POS, lemmas, chunks)

Feedback is very welcome!


There is also a new related book available on creating parallel resources:

Bitext Alignment
Synthesis Lectures on HLT, Morgan & Claypool Publishers
http://dx.doi.org/10.2200/S00367ED1V01Y201106HLT014



-- 
**********************************************************************************
 Jörg Tiedemann                                     [email protected]
 Dep. of Linguistics and Philology
http://stp.lingfil.uu.se/~joerg/
 Uppsala University                                  tel:  +46 (0)18 - 471 1412
 Box 635, SE-751 26 Uppsala/SWEDEN   fax: +46 (0)18 - 471 1094
