Re: [Wikidata] Machine translation efforts for underserved languages

2018-09-05 Thread mathieu lovato stumpf guntz
Hi Olya, Sorry for the late reply, but I just wondered if you were aware of Wikitrans[1], which "provides machine-translated versions of Wikipedia articles, completely linked and searchable in the target language, as well as cross language simultaneous Wikipedia searches". It doesn't use Wik

Re: [Wikidata] Machine translation efforts for underserved languages

2018-06-22 Thread David Cuenca Tudela
Hi Olya, There is also another topic to consider with translations. Normally a text reflects the reality of a speaker, which doesn't mean that that reality is interesting for a speaker of another language who might have different circumstances. For instance the translation of an article about met

Re: [Wikidata] Machine translation efforts for underserved languages

2018-06-22 Thread Olya Irzak
Dear Gerard, Scott, Lucie and Amir & everyone, Thank you for the helpful responses! Gerard - great to hear about your work and thank you for the reference on Cebuano Wikipedia. We weren't familiar with that case, but had similar fears. We are putting our machine translated content on a separate s

Re: [Wikidata] Machine translation efforts for underserved languages

2018-06-18 Thread Info WorldUniversity
Hi Olya, Lucie, and Wikidatans, Very interesting projects. And thanks for publishing, Lucie - very helpful! With regard to Swahili, Arabic (both African languages!) and Esperanto, and leveraging Google Translate / GNMT, I've been looking at this Google GNMT gif image - https://1.bp.blogspot.com/

Re: [Wikidata] Machine translation efforts for underserved languages

2018-06-18 Thread Gerard Meijssen
Hoi, On average there is little or no support for subjects that have to do with Africa. When I check the articles for politicians, for instance, I find that even current presidents, let alone ministers, are missing in African Wikipedias. So it is wonderful that there have been projects that deal with

Re: [Wikidata] Machine translation efforts for underserved languages

2018-06-18 Thread Amir E. Aharoni
2018-06-18 2:12 GMT+03:00 Olya Irzak:
> Dear Wikidata community,
>
> We're working on a project called Wikibabel to machine-translate parts of
> Wikipedia into underserved languages, starting with Swahili.
>
> In hopes that some of our ideas can be helpful to machine translation
> projects, we

Re: [Wikidata] Machine translation efforts for underserved languages

2018-06-18 Thread Gerard Meijssen
Hoi Lucie, I would really love to work with you on content that is of particular relevance for African Wikipedias. At this time I have added many statements for African politicians; I started with African awards and African geography. The structure from wards to country for Tanzania. What I am re

Re: [Wikidata] Machine translation efforts for underserved languages

2018-06-18 Thread Lucie Kaffee
Hello Olya and everyone, Very interesting project! I am working on underserved languages in Wikipedia as well, mainly as part of my research. In our most recent work we experimented with generating Wikipedia summaries from Wikidata facts in underserved languages, which worked quite well [1][2]. Th

Re: [Wikidata] Machine translation efforts for underserved languages

2018-06-18 Thread Gerard Meijssen
Hoi, There is no explicit link between the data and the lexicographic data. As a consequence it will not be easy to make use of the existing labels for automated translation services. This has been an explicit architectural decision. For me it will be interesting to learn how these links will be r

Re: [Wikidata] Machine translation efforts for underserved languages

2018-06-17 Thread Info WorldUniversity
Hi Irene, Wikibabel, Gerd, and Wikidatans, How does Wikidata's new lexicographical project work with regard to Swahili (since it is a Wikipedia / Wikidata language) and Google Translate / GNMT re your "Our approach leverages Google Translate to make Engli

Re: [Wikidata] Machine translation efforts for underserved languages

2018-06-17 Thread Gerard Meijssen
Hoi, I am giving a lot of attention to content that deals with Africa. In doing so I also target the Swahili Wikipedia [1] (I have not filled in all the red links yet). At this moment I am adding information in Wikidata about Tanzanian wards based on sw.wikipedia categories and templates. Many of the

[Wikidata] Machine translation efforts for underserved languages

2018-06-17 Thread Olya Irzak
Dear Wikidata community, We're working on a project called Wikibabel to machine-translate parts of Wikipedia into underserved languages, starting with Swahili. In hopes that some of our ideas can be helpful to machine translation projects, we wrote a blogpost about how we prioritized which pages