Hello,

Jaume Ortolà (contributing mainly to apertium-spa-cat) and I are preparing
a project to create a free ("libre") English-Catalan dictionary for
Softcatalà [1]. This is part of a call by Fundació.cat [2] for projects
promoting the Catalan language in technology, which will receive funding if
selected.

Our goal is to compile data from other sources (including terminology from
Termcat [3] and entries from DACCO [4]) and revise it. In addition, we have
been specially thinking about Apertium's English-Catalan pair as a
potential source and destination of data, given our involvement in the
project. It could be a great opportunity to expand the pair's bilingual
dictionary with a big amount of high-quality entries.

However, we are not sure about the potential licensing limitations this
could pose. Apertium is licensed under GPLv3, yet the other sources we have
found so far are licensed under CC-BY-SA. We know, for example, that
CC-BY-SA is one-way compatible with GPLv3 since version 4.0, which would
allow us to later include the data in Apertium, but not the other way. We
have no specific license in mind for the project yet; we want to release
the data and source for free for everyone to use it and reuse it, but the
fact that this project involves data from different sources with different
licenses makes everything a bit convoluted.

Does anyone know which options do we have to be able to reuse Apertium data
in such a project?

Thank you very much in advance,

*Marc Riera*

[1] https://www.softcatala.org
[2] https://convocatoria.fundacio.cat
[3] http://www.termcat.cat
[4] http://www.catalandictionary.org
_______________________________________________
Apertium-stuff mailing list
Apertium-stuff@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/apertium-stuff

Reply via email to