[Mt-list] ELRA - Language Resources Catalogue - Update

ELRA ELDA Information Wed, 01 Oct 2014 06:46:11 -0700

Our apologies if you have received multiple copies of this announcement.


*****************************************************************
ELRA - Language Resources Catalogue - Update
*****************************************************************

We are happy to announce that 1 new Speech Resource and 3 new Written Corpora are now available in our catalogue.


*ELRA-S0371 PortMedia French and Italian corpus*

This corpus contains 700 transcribed dialogues from about 140 French speakers and 604 transcribed dialogues from about 150 Italian speakers (several dialogues per speaker). The method chosen for the corpus construction process is that of a 'Wizard of Oz' (WoZ) system. This consists of simulating a natural language man-machine dialogue. The scenario was built in the domain of touristic information and reservation. A manual transcription and semantic annotation of the corpus are provided with corresponding wave files.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1224&language=en


*ELRA-W0078 NE3L named entities Arabic corpus*

The Arabic corpus contains 103,363 words coming from articles extracted from "Le Monde Diplomatique" newspaper, and published in 2004. 2 named entity categories were taken into account: Time and Amount.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1226&language=en


*ELRA-W0079 NE3L named entities Chinese corpus*

The Chinese corpus contains 79,302 words coming from articles extracted from "Le Monde Diplomatique" newspaper, and published in 2001. 3 named entity categories were taken into account: Person, Place and Organisation.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1227&language=en


*ELRA-W0080 NE3L named entities Russian corpus*

The Russian corpus contains 75,784 words coming from articles extracted from "Izvestia" newspaper, and published in 1995. 2 named entity categories were taken into account: Time and Amount.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1228&language=en

For more information on the catalogue, please contact Valérie Mapelli mailto:[email protected]


Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info

Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/LRs-Announcements.html

_______________________________________________
Mt-list site list
[email protected]
http://lists.eamt.org/mailman/listinfo/mt-list
