We are looking to fill a traineeship position in the field of 'Multilingual
Text Analysis'.

 

If you are interested, please follow the instructions provided at the URL
listed below.

 

 

URL of call:          http://recruitment.jrc.ec.europa.eu/

Call number:      2016-IPR-G-000-6713

Title of call:         Multilingual Text Analysis

Deadline:             21 March 2016 (Brussels time)

Starting date:     1 June 2016

Duration:             5 months

Eligibility requirements:

 
https://ec.europa.eu/jrc/en/working-with-us/jobs/temporary-positions/jrc-tra
inees

 

 

The European Commission's Joint Research Centre (JRC) in Ispra, Italy, is
looking for a trainee to support the JRC's Europe Media Monitor (EMM) team
with a variety of Language Technology-related tasks. EMM gathers and
analyses reports from traditional and social media in dozens of languages by
clustering related news items; categorising them; extracting information
such as entities (persons, organisations, locations), events (who did what
to whom, where and when), quotations by and about people; identifying
sentiment; as well as linking related news clusters over time and across
languages. Methods used are mostly hybrid: machine learning tools are used
to gather evidence, learn vocabulary and rules, but the results are usually
controlled and optimised through human intervention. EMM is used by European
Institutions, by national authorities in EU Member States, by international
organisations and by the public. The public EMM applications can be accessed
at the URLs http://emm.newsbrief.eu/overview.html and
http://emm.newsexplorer.eu. The EMM team also currently contributes to a
United Nations effort of updating the UNISDR Terminology on Disaster Risk
Reduction (DRR) by analysing term usage in English, French and Spanish
DRR-related document collections. For more information, see
http://www.preventionweb.net/files/45462_backgoundpaperonterminologyaugust20
.pdf. 

 

The successful trainee will carry out any of the following tasks: 

 

a)    use third-party software to carry out a terminology use study, which
includes comparing occurrences of terms and their variants in English,
French and Spanish; 

b)    gather existing definitions of important terms from the internet; 

c)    improve the JRC's existing entity-oriented sentiment analysis tools,
then analyse large quantities of sentiment data and its change over time
with the purpose of identifying opinion change patterns and trends; 

d)    contribute to the semi-automatic classification of entities in JRC's
multilingual entity database; 

e)    contribute to improving the recognition of multilingual organisation
names; 

f)    annotation of linguistic data and/or evaluation of automatic text
analysis results; 

g)    contribute to writing a scientific publication.

 

 

Qualifications:

 

Essential:

 

.         A degree (or an almost completed degree) in computational
linguistics, computer science or related areas; 

.         Programming skills; 

.         Good command of oral and written English (level B2).

 

Advantage:

 

.         Knowledge of further foreign languages; 

.         Proven advanced programming skills, especially in Java; 

.         Good knowledge of Language Technology-related tools and methods; 

.         The proven ability to work independently and as part of a team.

 

Please do not send your applications by email. They will not be valid.

 

 

Ralf Steinberger <https://ec.europa.eu/jrc/en/person/ralf-steinberger>  
European Commission - Joint Research Centre (JRC)
21027 Ispra (VA), Italy

URL - Applications:  <http://emm.newsbrief.eu/overview.html>
http://emm.newsbrief.eu/overview.html + http://emm.newsexplorer.eu 

URL - Resources: https://ec.europa.eu/jrc/en/language-technologies   

URL - Publications: http://langtech.jrc.it/JRC_Publications.html 

 

_______________________________________________
Mt-list site list
[email protected]
http://lists.eamt.org/mailman/listinfo/mt-list

Reply via email to