-------- Forwarded Message --------
Subject: RBMT from Kazakh to Turkish
Date: Wed, 13 Mar 2019 19:07:42 +0000
From: Daniyar Nariman <n.dani...@innopolis.ru>
To: il...@selimcan.org <il...@selimcan.org>
Dear Ilnar Salimzianov,
My name is Nariman. I am a third-year bachelor student at
Innopolis University(Russia, Tatarstan). I am studying Data Science and
really interested in disciplines such as machine learning, natural
language processing, information retrieval etc.
Recently I read your paper, RBMT from Kazakh to Turkish, which was
published in EAMT 2018. It was really interesting to read. The thing is,
I am applying to GSoC(Google Summer of Code) this year to Apertium, but
I am still thinking on the topic which I would like to deal with. One of
the topics was to bring the defined language pair to state-of-the-art
quality and I would like to deal with Kazakh-Turkish pair as the
Kazakh language my mother tongue and I studied the Turkish language in
the high school for 5 years.
I would like to ask If there any restrictions on how to increase the
quality of this pair?
Excluding adding a large number of rules or by expanding the
dictionary(taken for granted). For instance by optimizing the algorithms
given in the pipeline. I am asking this question because by modifying
some part of the pipeline, we can increase accuracy on our pair of
languages, but decrease on another pair and constructing a different
pipeline for different pairs is not a good idea in my opinion.
Thanks in advance!
Best Regards,
Daniyar Nariman
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff