We are inviting your submissions to the 5th Workshop on Research in 
Computational Linguistic Typology and Multilingual NLP (SIGTYP 2023) which will 
be held at EACL 2023 (May 2 or 6, 2023 Dubrovnik, Croatia). The submission 
deadline is February 13.
For more information, see details below or refer to: 
https://sigtyp.github.io/workshop.html. This year’s edition will include a 
shared task on “Cognate and Derivative Detection for Low-Resourced Languages", 
more details about the shared task can be found here: 
https://github.com/sigtyp/ST2023.

We are looking forward to your contributions.
Best regards,
Lisa Beinborn (on behalf of the SIGTYP organization committee)


Workshop description

The aim of the 5th edition of SIGTYP workshop is to act as a platform and a 
forum for the exchange of information between typology-related research, 
multilingual NLP, and other research areas that can lead to the development of 
truly multilingual NLP methods. The workshop is specifically aimed at raising 
awareness of linguistic typology and its potential in supporting and widening 
the global reach of multilingual NLP, as well as at introducing computational 
approaches to linguistic typology. It will foster research and discussion on 
open problems, not only within the active community working on cross- and 
multilingual NLP but also by inviting input from leading researchers in 
linguistic typology. In 2023, we would like to continue following this 
direction of research with a special focus on bringing technology to foster 
documentation of under-described languages.

SIGTYP is the first dedicated venue for typology-related research and its 
integration into multilingual NLP. Appropriate topics include (but are not 
limited to) the following as they relate to the areas of the workshop: :


  *   Integration of typological features in language transfer and joint 
multilingual learning. In addition to established techniques such as “selective 
sharing”, are there alternative ways to encode heterogeneous external knowledge 
in machine learning algorithms?

  *   Development of unified taxonomy and resources. Building universal 
databases and models to facilitate understanding and processing of diverse 
languages.

  *   Automatic inference of typological features. The pros and cons of 
existing techniques (e.g. heuristics derived from morphosyntactic annotation, 
propagation from features of other languages, supervised Bayesian and neural 
models) and discussion on emerging ones.

  *   Typology and interpretability. The use of typological knowledge for 
interpretation of hidden representations of multilingual neural models, 
multilingual data generation and selection, and typological annotation of texts.

  *   Improvement and completion of typological databases. Combining linguistic 
knowledge and automatic data-driven methods towards the joint goal of improving 
the knowledge on cross-linguistic variation and universals.

  *   Linguistic diversity and universals. Challenges of cross-lingual 
annotation. Which linguistic phenomena or categories should be considered 
universal? How should they be annotated?

  *   Bringing technology to document under-described languages. Improving 
model performance and documentation of under-resourced languages using 
typological databases, multilingual models, and data from high-resource 
languages.

  *   Cognate and Derivative Detection for Low-Resourced Languages.

Important Dates (all deadlines are 23:59 AoE)

   — February 13, 2023:  Paper submission deadline
   — March 13, 2023:  Notification of acceptance
   — March 27, 2023:  Camera-ready deadline
   — May 2 or 6, 2023:  Workshop

Submissions

We invite both extended abstract submissions (non-archival) and general paper 
submissions (archival). The accepted submissions will be presented at the 
workshop, providing new insights and ideas. Extended abstracts should describe 
already published work or work in progress and should not exceed two (2) pages. 
This way, we will not discourage researchers from preferring main conference 
proceedings, at the same time ensuring that interesting and thought-provoking 
research is presented at the workshop. For general (archival) submissions we 
accept both long and short papers. Short papers should not exceed four (4) 
pages, long papers should not exceed eight (8) pages papers. Unlimited 
additional pages are allowed for the references section in all submission types.
Submissions should be anonymous, without authors or an acknowledgment section; 
self-citations should appear in the third person.

Submissions must follow the EACL 2023 stylesheet 
https://2023.eacl.org/calls/styles/; both long and short paper submissions must 
follow the two-column format of ACL proceedings. All submissions must be in PDF 
format.
Contributions should be submitted via OpenReview: 
https://openreview.net/group?id=eacl.org/EACL/2023/Workshop/SIGTYP


Shared Task
This year’s edition will include a shared task on “Cognate and Derivative 
Detection for Low-Resourced Languages", more details can be found here: 
https://github.com/sigtyp/ST2023.

Organizing Committee
Koustava Goswami, Alexey Sorokin, Ritesh Kumar, Andrey Shcherbakov, Edoardo M. 
Ponti, Saliha Muradoğlu, Lisa Beinborn, Ryan Cotterell, Kat Vylomova

Anti-harassment policy
The workshop follows the ACL anti-harassment policy: 
https://www.aclweb.org/adminwiki/index.php?title=Anti-Harassment_Policy

Contact
For any inquiries regarding the workshop, please send an email to the 
Organizing Committee at [email protected]



------------------------------------------------------------------------

Lisa Beinborn

Assistant Professor for Natural Language Processing

Vrije Universiteit Amsterdam

https://beinborn.eu/

------------------------------------------------------------------------
_______________________________________________
Corpora mailing list -- [email protected]
https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
To unsubscribe send an email to [email protected]

Reply via email to