[Corpora-List] Webinar by Sebastian Ruder (Meta)

HiTZ zentroa via Corpora Fri, 31 Jan 2025 02:32:32 -0800

**** We apologize for the multiple copies of this email. If you are already registered for the next webinar, you do not need to register again. ****


------------------------------------------------------------------------
Dear colleague,

We are happy to announce the next webinar in the Language Technology webinar series organized by the HiTZ Chair of AI&LT (https://hitz.eus). You can check the videos of previous webinars and the schedule for upcoming webinars here: http://www.hitz.eus/webinars


Next webinar:

*Speaker:* Sebastian Ruder (Meta)
*Title:* Multilingual LLM Evaluation in Practical Settings
*Date:* Thursday, February 6, 2025 - 15:00 CET

*Summary:* Large language models (LLMs) are increasingly used in a variety of applications across the globe but do not provide equal utility across languages. In this talk, I will discuss multilingual evaluation of LLMs in two practical settings: conversational instruction-following and usage of quantized models. For the first part, I will focus on a specific aspect of multilingual conversational ability where errors result in a jarring user experience: generating text in the user’s desired language. I will describe a new benchmark and evaluation of a range of LLMs. We find that even the strongest models exhibit language confusion, i.e., they fail to consistently respond in the correct language. I will discuss what affects language confusion, how to mitigate it, and potential extensions. In the second part, I will discuss the first evaluation study of quantized multilingual LLMs across languages. We find that automatic metrics severely underestimate the negative impact of quantization and that human evaluation, which has been neglected by prior studies, is key to revealing harmful effects. Overall, I highlight limitations of multilingual LLMs and challenges of real-world multilingual evaluation.

*Bio:* Sebastian Ruder is a research scientist at Meta based in Berlin, Germany where he works on improving evaluation and benchmarking of large language models (LLMs). He previously led the Multilinguality team at Cohere with the objective to improve the multilingual capabilities of Cohere's LLMs. Before that he was a research scientist at Google DeepMind. He completed his PhD in Natural Language Processing (NLP) at the Insight Research Centre for Data Analytics, while working as a research scientist at Dublin-based text analytics startup AYLIEN. Previously, he studied Computational Linguistics at the University of Heidelberg, Germany and at Trinity College, Dublin.

*Upcoming webinars:*
· Christian Herff (Thursday, March 6, 2025)
· Emanuele Bugliarello (Thursday, April 3, 2025)
· André F. T. Martins (Thursday, May 8, 2025)

If you are interested in participating, please complete this registration form: http://www.hitz.eus/webinar_izenematea

If you cannot attend this webinar but want to be informed of upcoming HiTZ webinars, please complete this registration form instead: http://www.hitz.eus/webinar_info


Best wishes,

HiTZ Zentroa

P.S.: HiTZ will not grant any type of certificate for attendance at these webinars.

_______________________________________________
Corpora mailing list -- [email protected]
https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
To unsubscribe send an email to [email protected]
