[Mt-list] ELRA - Language Resources Catalogue - Update

ELRA ELDA Information Thu, 18 Jan 2018 03:09:05 -0800

[Our apologies if you have received multiple copies of this announcement.]

We are happy to announce that 1 new Monolingual Lexicon, 1 new Written Corpus and 2 new Speech resources are now available in our catalogue.


*ELRA-L0100 French dictionary of definitions (SYNAPSE)*
*ISLRN: 357-949-964-163-0 <http://islrn.org/resources/357-949-964-163-0/>*

The French dictionary of definitions (SYNAPSE) consists of 216,835 entries (147,378 nouns, 80,552 adjectives, 24,001 verbs, 4,677 adverbs, 1,560 prefixes, 107 prepositions, 614 interjections, 147 pronouns, 42 conjunctions, 27 articles), 309,078 definitions and 7,395 phraseological units (phrases). Grammatical information for each entry consists of: grammatical category, gender, number, inflected forms. This dictionary is provided in XML format together with its DTD.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1315


*ELRA-W0124 English-Vietnamese Parallel Corpus*
*ISLRN: 838-483-738-912-8 <http://islrn.org/resources/838-483-738-912-8/>*

This is a corpus of 500,000 English-Vietnamese sentence pairs. The parallel corpus contains English documents translated by professional translators into Vietnamese. The source texts include books, dictionaries, newspapers, online news. The texts are provided in TEI format.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1316


*ELRA-S0394 Metalogue Multi-Issue Bargaining Dialogue*
*ISLRN: 217-906-813-531-9 <http://islrn.org/resources/217-906-813-531-9/>*

This corpus consists of approximately 2.5 hours of semantically annotated English dialogue data that includes speech and transcripts. Six unique subjects (undergraduates between 19 and 25 years of age) participated in the collection. The dialogue speech was captured with two headset microphones and saved in 16 kHz, 16-bit mono linear PCM FLAC format. Transcripts were produced semi-automatically, using an automatic speech recognizer followed by manual correction. All text is presented in UTF-8 as either plain text or XML.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1317

*ELRA-S0395 Nautilus Speaker Characterization (NSC) Corpus*
*ISLRN: 157-037-166-491-1 <http://islrn.org/resources/157-037-166-491-1/>*

This corpus comprises clean microphone recordings of conversational speech from 300 German speakers (126 males and 174 females) aged 18 to 35 years, with no marked dialect/accent. The recordings were performed in an acoustically-isolated room in 2016/2017. Four scripted and four semi-spontaneous dialogs were elicited from the speakers, simulating telephone call inquiries. Additionally, spontaneous neutral and emotional speech utterances and questions were produced. All labels are provided, together with the speech recordings and the speakers' metadata.
For more information, see: http://catalog.elra.info/product_info.php?products_id=1318

For more information on the catalogue, please contact Valérie Mapelli mailto:[email protected]

If you would like to enquire about having your resources distributed by ELRA, please do not hesitate to contact us.


Visit our On-line Catalogue: http://catalog.elra.info
Visit the Universal Catalogue: http://universal.elra.info

Archives of ELRA Language Resources Catalogue Updates: http://www.elra.info/en/catalogues/language-resources-announcements/

_______________________________________________
Mt-list site list
[email protected]
http://lists.eamt.org/mailman/listinfo/mt-list
