Hi, Maharaj,

Bodo seems to me to be an excellent language for Apertium. Unfortunately,
we have no morphological parser for it, even embryonic. That's why  to
develop one seems to me much more realistic than to think of an automatic
translator project between Bodo and another language (and even more so, if
it belongs to another language family).

It would therefore be a matter of doing this type of project:
https://wiki.apertium.org/wiki/Ideas_for_Google_Summer_of_Code#Develop_a_morphological_analyser


I've been looking a bit at the morphology of the language, and lexd [2]
seems the best choice for developing a parser (e.g. because of the
possessive prefixes).

It would now be a matter of making the coding challenge of the project.
There are about ten days left before the deadline. That is not a lot of
time.

I would suggest joining IRC to get faster help, if needrf:
https://wiki.apertium.org/wiki/IRC

Best regards,
Hèctor

[1] https://github.com/apertium/lexd

Missatge de Maharaj Brahma via Apertium-stuff <
apertium-stuff@lists.sourceforge.net> del dia dv., 24 de març 2023 a les
15:54:

> Dear Apertium folks,
>
> I'm writing to express my interest in participating in the GSoC with
> Apertium. I'm a first-year Ph.D. student in the NLP domain. I'm excited to
> work with the Apertium community on developing translation technology for
> low-resource languages, particularly Bodo. Bodo is a low-resource language
> primarily spoken in the Northeastern region of India. As an NLP researcher
> and native speaker of Bodo, I'm committed to building technology to
> preserve and promote indigenous languages like Bodo.
>
> I'm interested in adding a new language English-Bodo to the Apertium
> platform through this GSoC program or otherwise. I believe this can
> potentially impact the translation technology for low resources.
> Additionally, due to the following points:
> (i) There is no existing publicly available translation technology for
> English-Bodo.
> (ii) Bodo is a low-resource language (potentially ample contribution space
> remains).
> (iii) Belongs to the Sino-Tibetan language family, unlike other Indian
> languages like Hindi and Assamese.
>
> I would like to know if this is a potential project. If so, I would like
> to interact with potential mentors.
>
> Any comments on the English-Bodo language pair are most welcome.
>
> With Regards,
> Maharaj Brahma
> Research Scholar
> Dept. of Computer Science & Engineering
> CS23RESCH01004
>
> Disclaimer:- This footer text is to convey that this email is sent by one
> of the users of IITH. So, do not mark it as SPAM.
> _______________________________________________
> Apertium-stuff mailing list
> Apertium-stuff@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/apertium-stuff
>
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to