-------- Forwarded Message --------
Subject:        RBMT from Kazakh to Turkish
Date:   Wed, 13 Mar 2019 19:07:42 +0000
From:   Daniyar Nariman <n.dani...@innopolis.ru>
To:     il...@selimcan.org <il...@selimcan.org>



Dear Ilnar Salimzianov,


My name is Nariman. I am a third-year bachelor student at
Innopolis University(Russia, Tatarstan). I am studying Data Science and
really interested in disciplines such as machine learning, natural
language processing, information retrieval etc. 


Recently I read your paper, RBMT from Kazakh to Turkish, which was
published in EAMT 2018. It was really interesting to read. The thing is,
I am applying to GSoC(Google Summer of Code) this year to Apertium, but
I am still thinking on the topic which I would like to deal with. One of
the topics was to bring the defined language pair to state-of-the-art
quality and I would like to deal with Kazakh-Turkish pair as the
Kazakh language my mother tongue and I studied the Turkish language in
the high school for 5 years. 


I would like to ask If there any restrictions on how to increase the
quality of this pair? 

Excluding adding a large number of rules or by expanding the
dictionary(taken for granted). For instance by optimizing the algorithms
given in the pipeline. I am asking this question because by modifying
some part of the pipeline, we can increase accuracy on our pair of
languages, but decrease on another pair and constructing a different
pipeline for different pairs is not a good idea in my opinion.



Thanks in advance!


Best Regards, 

Daniyar Nariman





_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to