That sounds to me as very useful indeed, Amir.
I'm a newby to wikidata but as soon as I am back from my holiday (as of
5/5) I'd like to help.
I know how to transliterate Armenian, Georgian, Ukrainian and Russian to
internationally standardized (ISO-standards) forms (and to English and
maybe French and German too). Problem could be that these standardized
forms lead to diacritic characters (e.g. hacek on 'c' & 's'; Чайковский
---> Čajkovskij and Щедрин ---> Ščedrin). Is wikidata able to deal with
these?

best regards,
Eric van Balkum ('Eric de Muziekbibliothecaris')
(former) Music Librarian, managing authority database at www.mcomb.nl
_'Music was my first love, and it will be my last'_

Amir Ladsgroup schreef op 2015-04-22 14:48: 

> Hello, 
> I started bot of auto-transliterating names of humans, initially with Persian 
> and English (as a pair) since I know both and I can debug. After some 
> modifications, In the last check, In more than several hundreds of edits I 
> checked, I couldn't find any errors, I want to expand this bot for other 
> languages but before, I need opinions of people who know rules of 
> transliterating names in these languages, I tried to realize rules and this 
> is my result but I need someone familiar to confirm 
> *Chinese: Instead of space it uses "·" character (it's not dot) but order is 
> the same. e.g Alan Turing is: "艾伦·图灵" [2] which "艾伦" means Alan and "图灵" 
> means Turing 
> *Japanese: it's the same but different separator: "・", e.g. "アラン・チューリング" [3] 
> *Russian: The separator is space character but order is like "FamilyName, 
> GivenName" e.g. "Тьюринг, Алан" [4] is "Turing, Alan". Handling names with 
> more than two words would be pretty complicated (I skip them) 
> *I checked Hebrew and Greek and both are simple languages like Persian, same 
> order, space as separator. 
> 
> If you can help me, it would make a great difference in number of labels in 
> your language. Things you can help are: 
> 1- Confirm or correct rules of these languages and add other rules if needed. 
> 2- Suggest more languages. I thought about Sanskrit, Hindi, and Telugu but I 
> don't know anyone who can check the rules, if you do, please help me. 
> 3- For any language I will do an initial run just to test, if you can check 
> edits of the bot (which is pretty easy, e.g. see this [5]) it would be 
> awesome. 
> 
> Thanks, 
> Best 
> 
> _______________________________________________
> Wikidata-l mailing list
> [email protected]
> https://lists.wikimedia.org/mailman/listinfo/wikidata-l [1]

 

Links:
------
[1] https://lists.wikimedia.org/mailman/listinfo/wikidata-l
[2]
https://zh.wikipedia.org/wiki/%E8%89%BE%E4%BC%A6%C2%B7%E5%9B%BE%E7%81%B5
[3]
https://ja.wikipedia.org/wiki/%E3%82%A2%E3%83%A9%E3%83%B3%E3%83%BB%E3%83%81%E3%83%A5%E3%83%BC%E3%83%AA%E3%83%B3%E3%82%B0
[4]
https://ru.wikipedia.org/wiki/%D0%A2%D1%8C%D1%8E%D1%80%D0%B8%D0%BD%D0%B3,_%D0%90%D0%BB%D0%B0%D0%BD
[5]
https://www.wikidata.org/w/index.php?title=Special:Contributions/Dexbot&dir=prev&offset=20150422000329&target=Dexbot
_______________________________________________
Wikidata-l mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-l

Reply via email to