Dear apertiumers,
I've an idea for GSOC about subtitles. Before putting it on the wiki, I
wanted to check with /yous/, as it seems that there has been previous
related work [1].
In a nutshell, it is about translating subtitles from OpenSubtitles with
Apertium, for closely-related pairs of languages A and B such that (i)
there are mature A-->B systems in Apertium and (ii) on OpenSubtiles
there are many subtitles for A and very few for B. An example is A=ES,
B=CA. There could be 2 tasks:
1. Development of a tool to translate subtitles. Given a translation
direction A-->B:
1.1. Use OpenSubtitles' API to find subtitles S in A not translated
yet into B
1.2. Translate S from A to B using Apertium's API.
1.3. Upload the translated subtitles to OpenSubtitles. These
subtitles could have a preamble such as "Warning: this subtitle is
machine translated! Powered by Apertium.org"
2. Evaluation of Apertium's quality for subtitles (for 1 or 2
translation directions) and improvements/modifications in Apertium's
systems for those directions based on that evaluation.
I'd be happy to hear opinions, experiences from related previous work,
criticism, etc :)
Thanks! Best,
Antonio
[1] http://wiki.apertium.org/wiki/Translating_subtitles_intro
------------------------------------------------------------------------------
Check out the vibrant tech community on one of the world's most
engaging tech sites, Slashdot.org! http://sdm.link/slashdot
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff