[Apologies for multiple postings]
/Press Release - Immediate/
/Paris, France, October 2, 2024/
*ELDA releases the ALLIES Corpus for the evaluation of Broadcast News
across time diarization systems*
ELDA is pleased to announce the release of the ALLIES Corpus, as a
result from the collaboration between the University of Le Mans (France)
and ELDA.Overall, the ALLIES Corpus contains about 900 hours of news
broadcast, including orthographic transcriptions, speaker annotations
and segmentation. This corpus is based on the material that was used for
the ESTER, REPERE and ETAPE evaluation packages (see ELRA Catalogue:
http://catalogue.elra.info <http://catalogue.elra.info>for respective
packages).
The ALLIES corpus was built as an extension of the previous produced
corpora. It contains corrected annotations from the previous evaluation
materials as well as new audio data with corresponding transcriptions.
Corrections include corrected names of speakers and re-segmentation.
The dataset can be found in the ELRA Catalogue of Language Resources
under the following links and references: ALLIES Corpus
<http://catalog.elra.info/en-us/repository/browse/ELRA-S0486/>, *ISLRN:
*397-116-696-859-2 <http://www.islrn.org/resources/397-116-696-859-2>
For more information and/or questions, please write to cont...@elda.org
<mailto:cont...@elda.org>.
*
*
*About ALLIES project*
The ALLIES Corpus was produced within the European CHIST-Era project
ALLIES (Autonomous Lifelong learnIng intelLigent Systems). The ALLIES
project enabled to carry out a campaign for the evaluation of Broadcast
News across time diarization systems using French data. This project is
an extension of the previous ESTER, REPERE and ETAPE evaluation
campaigns that were carried out for the French language in this field.
The University of Le Mans (France) and ELDA collaborated on the
production of this new corpus.
To find out more about ALLIES project, please visit:
https://www.chistera.eu/projects/allies
<https://www.chistera.eu/projects/allies>
*About ELDA*
The Evaluation and Language resources Distribution Agency (ELDA)
identifies, collects, markets, and distributes language resources, along
with the dissemination of general information in the field of Human
Language Technologies (HLT). ELDA has considerable knowledge and skills
in HLT applications. ELDA is part of major French, European and
international projects in the field of HLT.
To find out more about ELDA, please visit our web site:
http://www.elda.org/ <http://www.elda.org/>