Our apologies if you have received multiple copies of this announcement.

*****************************************************************
ELRA - Language Resources Catalogue - Update
*****************************************************************

We are happy to announce that 1 new Speech Resource and 3 new Written Corpora are now available in our catalogue.

*ELRA-S0371 PortMedia French and Italian corpus*
This corpus contains 700 transcribed dialogues from about 140 French speakers and 604 transcribed dialogues from about 150 Italian speakers (several dialogues per speaker). The method chosen for the corpus construction process is that of a 'Wizard of Oz' (WoZ) system. This consists of simulating a natural language man-machine dialogue. The scenario was built in the domain of touristic information and reservation. A manual transcription and semantic annotation of the corpus are provided with corresponding wave files. For more information, see: http://catalog.elra.info/product_info.php?products_id=1224&language=en

*ELRA-W0078 NE3L named entities Arabic corpus*
The Arabic corpus contains 103,363 words coming from articles extracted from "Le Monde Diplomatique" newspaper, and published in 2004. 2 named entity categories were taken into account: Time and Amount. For more information, see: http://catalog.elra.info/product_info.php?products_id=1226 <http://catalog.elra.info/product_info.php?products_id=1226&language=en>&language=en <http://catalog.elra.info/product_info.php?products_id=1226&language=en>

*ELRA-W0079 NE3L named entities Chinese corpus*
The Chinese corpus contains 79,302 words coming from articles extracted from "Le Monde Diplomatique" newspaper, and published in 2001. 3 named entity categories were taken into account: Person, Place and Organisation. For more information, see: http://catalog.elra.info/product_info.php?products_id=1227 <http://catalog.elra.info/product_info.php?products_id=1227&language=en>&language=en <http://catalog.elra.info/product_info.php?products_id=1227&language=en>

*ELRA-W0080 NE3L named entities Russian corpus*
The Russian corpus contains 75,784 words coming from articles extracted from "Izvestia" newspaper, and published in 1995. 2 named entity categories were taken into account: Time and Amount. For more information, see: http://catalog.elra.info/product_info.php?products_id=1228 <http://catalog.elra.info/product_info.php?products_id=1228&language=en>&language=en <http://catalog.elra.info/product_info.php?products_id=1228&language=en>


For more information on the catalogue, please contact ValĂ©rie Mapelli mailto:[email protected]

Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info
Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/LRs-Announcements.html
_______________________________________________
Mt-list site list
[email protected]
http://lists.eamt.org/mailman/listinfo/mt-list

Reply via email to