[Corpora-List] PhD in ML/NLP – Fairness and self-supervised learning for speech processing

François Portet via Corpora Thu, 01 Jun 2023 00:54:51 -0700

PhD in ML/NLP – Fairness and self-supervised learning for speech processing
Starting date: October 1st, 2023 (flexible)
Application deadline: June 9th, 2023
Interviews (tentative): June 14th, 2023


Salary: ~2000€ gross/month (social security included)


Mission: research-oriented (teaching possible but not mandatory)

*Keywords:* speech processing, fairness, bias, self-supervised learning, evaluation metrics



*CONTEXT*

This thesis is part of the ANR project E-SSL (Efficient Self-Supervised Learning for Inclusive and Innovative Speech Technologies). Self-supervised learning (SSL) has recently emerged as one of the most promising artificial intelligence (AI) methods, as it is now feasible to take advantage of the colossal amounts of existing unlabeled data to significantly improve performance on various speech processing tasks.


*PROJECT OBJECTIVES*

Speech technologies are widely used in our daily lives and are expanding the scope of our actions through decision-making systems, including in critical areas such as health and law. In these societal applications, the use of these tools raises the issue of possible discrimination against people according to criteria for which society requires equal treatment, such as gender, origin, religion or disability. Recently, the machine learning community has been confronted with the need to address the possible biases of algorithms, and many studies have shown that the search for the best performance is not the only goal to pursue [1]. For instance, recent evaluations of ASR systems have shown that performance can vary according to gender, but these variations depend both on the data used for training and on the models [2]. Such systems are therefore increasingly scrutinized for bias, while trustworthy speech technologies represent a crucial expectation.

Both the question of bias and the concept of fairness have now become important aspects of AI, and we now have to find the right balance between accuracy and measured fairness. Unfortunately, these notions of fairness and bias are challenging to define, and their meanings can differ greatly [3].


The goals of this PhD position are threefold:

- First, survey the many definitions of robustness, fairness and bias, with the aim of coming up with definitions and metrics fit for speech SSL models.

- Then, gather speech datasets with a large amount of well-described metadata.

- Finally, set up an evaluation protocol for SSL models and analyze the results.

*SKILLS*

 * Master 2 in Natural Language Processing, Speech Processing, computer
   science or data science.
 * Good command of Python programming and deep learning frameworks.
 * Previous experience with bias in machine learning would be a plus.
 * Very good communication skills in English.
 * Good command of French would be a plus but is not mandatory.

*SCIENTIFIC ENVIRONMENT*

The PhD position will be co-supervised by Alexandre Allauzen (Dauphine Université PSL, Paris) and Solange Rossato and François Portet (Université Grenoble Alpes). Joint meetings are planned on a regular basis and the student is expected to spend time in both places. Two other PhD positions are open in this project, and the students will collaborate closely with each other and with the partners. For instance, specific SSL models along with evaluation criteria will be developed by the other PhD students. The PhD student will also collaborate with several team members involved in the project, in particular the partners from LIA, LIG and Dauphine Université PSL, Paris. The means to carry out the PhD will be provided, both in terms of missions in France and abroad and in terms of equipment. The candidate will have access to the GPU clusters of both LIG and Dauphine Université PSL. Furthermore, access to the national supercomputer Jean-Zay will make it possible to run large-scale experiments.


*INSTRUCTIONS FOR APPLYING*

Applications must contain a CV, a letter/message of motivation and master's grade transcripts; candidates should also be ready to provide letter(s) of recommendation. Applications should be addressed to Alexandre Allauzen ([email protected]), Solange Rossato ([email protected]) and François Portet ([email protected]). We celebrate diversity and are committed to creating an inclusive environment for all employees.


*REFERENCES:*

[1] Mengesha, Z., Heldreth, C., Lahav, M., Sublewski, J. & Tuennerman, E. “I don’t Think These Devices are Very Culturally Sensitive.”—Impact of Automated Speech Recognition Errors on African Americans. Frontiers in Artificial Intelligence 4. issn: 2624-8212. https://www.frontiersin.org/article/10.3389/frai.2021.725911 (2021).

[2] Garnerin, M., Rossato, S. & Besacier, L. Investigating the Impact of Gender Representation in ASR Training Data: a Case Study on Librispeech in Proceedings of the 3rd Workshop on Gender Bias in Natural Language Processing (2021), 86–92.

[3] Mehrabi, N., Morstatter, F., Saxena, N., Lerman, K. & Galstyan, A. A Survey on Bias and Fairness in Machine Learning. ACM Comput. Surv. 54. issn: 0360-0300. https://doi.org/10.1145/3457607 (July 2021).


--
François PORTET
Professeur - Univ Grenoble Alpes
Laboratoire d'Informatique de Grenoble - Équipe GETALP
Bâtiment IMAG - Office 333
700 avenue Centrale
Domaine Universitaire - 38401 St Martin d'Hères
FRANCE

Phone:  +33 (0)4 57 42 15 44
Email: [email protected]
www: http://membres-liglab.imag.fr/portet/

_______________________________________________
Corpora mailing list -- [email protected]
https://list.elra.info/mailman3/postorius/lists/corpora.list.elra.info/
To unsubscribe send an email to [email protected]
