2nd CFP: Third Workshop on Language Technologies for Historical and Ancient 
LAnguages (LT4HALA 2024)

Website: https://circse.github.io/LT4HALA/2024/
Submission page: https://softconf.com/lrec-coling2024/lt4hala2024/
Place: co-located with LREC-COLING 2024, Torino, Italy
Date: Saturday, May 25 2024

DESCRIPTION
LT4HALA 2024 is a one-day workshop that seeks to bring together scholars who 
are developing and/or are using Language Technologies (LTs) for historically 
attested languages, so to foster cross-fertilization between the Computational 
Linguistics community and the areas in the Humanities dealing with historical 
linguistic data, e.g. historians, philologists, linguists, archaeologists and 
literary scholars. LT4HALA 2024 follows LT4HALA 2020 and 2022 that was 
organized in the context of LREC 2022 and LREC 2022, respectively. Despite the 
current availability of large collections of digitized texts written in 
historical languages, such interdisciplinary collaboration is still hampered by 
the limited availability of annotated linguistic resources for most of the 
historical languages. Creating such resources is a challenge and an obligation 
for LTs, both to support historical linguistic research with the most updated 
technologies and to preserve those precious linguistic data that survived from 
past times.

Relevant topics for the workshop include, but are not limited to:
- handling spelling variation,
- detection and correction of OCR errors,
- creation and annotation of linguistic resources,
- deciphering,
- morphological/syntactic/semantic analysis of textual data,
- adaptation of tools to address diachronic/diatopic/diastratic variation in 
texts,
- teaching ancient languages with LTs,
- NLP-driven theoretical studies in historical linguistics,
- NLP-driven analysis of literary ancient texts,
- evaluation of LTs designed for historical and ancient languages,
- Large Language Models for the automatic analysis of ancient texts.

LT4HALA 2024 will host:
- the third edition of EvaLatin 
(https://circse.github.io/LT4HALA/2024/EvaLatin), an evaluation campaign 
entirely devoted to the evaluation of NLP tools for Latin, focusing on 
dependency parsing and emotion polarity detection;
- the third edition of EvaHan (https://circse.github.io/LT4HALA/2024/EvaHan), 
the evaluation campaign for the evaluation of NLP tools for Ancient Chinese, 
focusing on sentence segmentation and punctuation prediction.

SUBMISSIONS
Submissions of three forms of papers will be considered:
- Regular long papers – up to eight (8) pages maximum*, presenting substantial, 
original, completed, and unpublished work.
- Short papers – up to four (4) pages*, describing a small focused 
contribution, negative results, system demonstrations, etc.
- Position papers – up to eight (8) pages*, discussing key hot topics, 
challenges and open issues, as well as cross-fertilization between 
computational linguistics and other disciplines.
*Excluding any number of additional pages for references, ethical 
consideration, conflict-of-interest, as well as data, and code availability 
statements.
We encourage the authors of papers reporting experimental results to make their 
results reproducible and the entire process of analysis replicable, by making 
the data and the tools they used available. The form of the presentation may be 
oral or poster, whereas in the proceedings there is no difference between the 
accepted papers. The submission is anonymous. The LREC-COLING 2024 official 
format is requested. Each paper will be reviewed by three independent reviewers.

IMPORTANT DATES
Workshop
- 26 February 2024: submission due
- 18 March 2024: reviews due
- 22 March 2024: notifications to authors
- 5 April 2024: camera-ready (PDF) due

SHARED TASKS
EvaLatin
- 22 December 2023: guidelines available
- Evaluation Window I - Task: Dependency Parsing
- 1 February 2024: test data available
- 8 February 2024: system results due to organizers
- Evaluation Window II - Task: Emotion Polarity Detection
- 12 February 2024: test data available
- 19 February 2024: system results due to organizers
- 11 March 2024: reports due to organizers
- 22 March 2024: short report review deadline
- 5 April 2024: camera ready version of reports due to organizers

EvaHan
- 22 December 2023: training data available
- Evaluation Window
- 12 February 2024: test data available
- 19 February 2024: system results due to organizers
- 11 March 2024: reports due to organizers
- 22 March 2024: short report review deadline
- 5 April 2024: camera ready version of reports due to organizers

Identify, Describe and Share your LRs!
When submitting a paper from the START page, authors will be asked to provide 
essential information about resources (in a broad sense, i.e. also 
technologies, standards, evaluation kits, etc.) that have been used for the 
work described in the paper or are a new result of your research. Moreover, 
ELRA encourages all LREC-COLING authors to share the described LRs (data, 
tools, services, etc.) to enable their reuse and replicability of experiments 
(including evaluation ones).

WORKSHOP ORGANIZERS
Marco Passarotti, Università Cattolica del Sacro Cuore di Milan, Italy
Rachele Sprugnoli, Università di Parma, Italy

PROGRAMME COMMITTEE
Adam Anderson, FactGrid Cuneiform Project, USA
Yannis Assael, Google DeepMind
Monica Berti, University of Leipzig, Germany
Luca Brigada Villa, Università di Bergamo, Italy
Flavio Massimiliano Cecchini, University of Leuven, Belgium
Margherita Fantoli, University of Leuven, Belgium
Shai Gordin, Ariel University, Israel
Federica Iurescia, Università Cattolica del Sacro Cuore di Milano, Italy
Bin Li, School of Chinese Language and Literature at Nanjing Normal University, 
P.R. China
Eleonora Litta, Università Cattolica del Sacro Cuore di Milano, Italy
Yudong Liu, Western Washington University
Barbara McGillivray, Turing Institute, UK
Beáta  Megyesi, Uppsala University, Sweden
Chiara Palladino, Furman University, USA
John Pavlopoulos, Athens University of Economics and Business, Greece
Eva Pettersson, Uppsala University, Sweden
Sophie Prévost, Laboratoire Lattice, France
Thea Sommerschield, Ca’ Foscari University of Venice, Italy
James Tauber, Eldarion, USA
Toon Van Hal, Katholieke Universiteit Leuven, Belgium
Tariq Yousef, University of Southern Denmark, Denmark

CONTACT
rachele.sprugnoli[AT]unipr.it<http://unipr.it>
Please, write “LT4HALA” or “EvaLatin” in the subject of your e-mail.
For more information on EvaHan, please write to 
libin.njnu[AT]gmail.com<http://gmail.com> writing “EvaHan” in the subject of 
the e-mail.


Prof. Marco C. Passarotti
Computational Linguistics
Index Thomisticus Treebank https://itreebank.marginalia.it/
ERC Grantee, P.I. LiLa https://lila-erc.eu/ (Grant Agreement No. 769994)
CIRCSE Research Centre https://centridiricerca.unicatt.it/circse_index.html

[cid:38DBA4B0-3169-48DD-B59A-4F3A679F9DD9@lan]   
[cid:D415BF3A-E244-4BC4-9FB5-064066B300AD@lan]  
[cid:13BA173A-59CB-4F2D-9B90-DE302E870A50@lan]


Università Cattolica del Sacro Cuore
Largo Gemelli, 1
20123 Milan, Italy
[email protected]<mailto:[email protected]>
tel. +39-02-72342380

[http://static.unicatt.it/ext-portale/5xmille_firma_mail_2023.jpg] 
<https://www.unicatt.it/uc/5xmille>

_______________________________________________
Corpora mailing list -- [email protected]
https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
To unsubscribe send an email to [email protected]

Reply via email to