[Apologies for cross-postings]
Press Release - Immediate
Paris, France, January 24, 2018
*ELRA and LDC partner on a joint distribution of Language Resources *
ELRA and LDC are pleased to announce the release of
1) three corpora from the 2007 CoNLL Shared Task, consisting of
dependency treebanks, used as part of the CoNLL 2007 shared task on
multi-lingual dependency parsing and the domain adaptation track. The
source data in the treebanks consists for the most part of various texts
(e.g., textbooks, news, literature) annotated in dependency format, and
2) one TRAD Chinese-French Parallel Text, which includes French
translations of a subset of approximately 10,000 Chinese words from
LDC’s GALE Phase 1 Chinese Blog Parallel.
These packages can be found in the ELRA and LDC catalogues under the
following references:
2007 CoNLL Shared Task - Basque, Catalan, Czech & Turkish
ISLRN: 769-620-932-723-2 <http://www.islrn.org/resources/769-620-932-723-2/>
ELRA ID: ELRA-W012
<http://catalog.elra.info/product_info.php?products_id=1312>1
<http://catalog.elra.info/product_info.php?products_id=1312>
LDC ID: LDC2018T06 <https://catalog.ldc.upenn.edu/LDC2018T06>
2007 CoNLL Shared Task - Greek, Hungarian and Italian
ISLRN: 270-733-242-642-3 <http://www.islrn.org/resources/270-733-242-642-3/>
ELRA ID: ELRA-W0122
<http://catalog.elra.info/product_info.php?products_id=1313>
LDC ID: LDC2018T07 <https://catalog.ldc.upenn.edu/LDC2018T07>
2007 CoNLL Shared Task – Arabic & English
ISLRN: 505-782-255-628-8 <http://www.islrn.org/resources/505-782-255-628-8/>
ELRA ID: ELRA-W0123
<http://catalog.elra.info/product_info.php?products_id=1314>
LDC ID: LDC2018T08 <https://catalog.ldc.upenn.edu/LDC2018T08>
TRAD Chinese-French Parallel Text – Blog
ISLRN: 713-266-631-883-0 <http://www.islrn.org/resources/713-266-631-883-0/>
ELRA ID: ELRA-W0125
<http://catalog.elra.info/product_info.php?products_id=1319>
LDC ID: LDC2018T02 <https://catalog.ldc.upenn.edu/LDC2018T02>
***About CoNLL and the 2007 shared task***
The Conference on Computational Natural Language Learning (CoNLL) is
accompanied every year by a shared task intended to promote NLP
applications and evaluate them in a standard setting. In 2007, the
shared task was devoted to the parsing of syntactic dependencies using
corpora from up to thirteen languages. The task aimed to define and
extend the then-current state of the art in dependency parsing, a
technology that complemented previous tasks by producing a different
kind of syntactic description of input text. The 2007 shared task also
included a domain adaptation track for English.
http://www.signll.org/conll/ and http://www.conll.org/previous-tasks/.
***About PEA TRAD project ***
PEA-TRAD (Translation as a Support for Document Analysis) is a project
supported by the French Ministry of Defense (DGA) whose purpose was to
develop speech-to-speech translation technology for multiple languages
(e.g., Arabic, Chinese, Pashto) from a variety of domains.
More on the PEA TRAD project:
http://www.elra.info/en/projects/archived-projects/pea-trad/.
*** About ELRA ***
The European Language Resources Association (ELRA) is a non-profit
making organisation founded by the European Commission in 1995, with the
mission of providing a clearing house for language resources and
promoting Human Language Technologies (HLT).
To find out more about ELRA and its respective catalogue, please visit:
http://www.elra.info and http://catalogue.elra.info
*** About LDC ***
The Linguistic Data Consortium (LDC) is an open consortium of
universities, libraries, corporations and research laboratories that
creates and distributes linguistic resources for language-related
education, research and technology development.
To find out more about LDC and its respective catalogue, please visit:
https://www.ldc.upenn.edu and https://catalog.ldc.upenn.edu
_______________________________________________
Mt-list site list
[email protected]
http://lists.eamt.org/mailman/listinfo/mt-list