Re: [Wikidata-l] watching Wikidata changes that affect my wiki

2012-08-14 Thread Amir E. Aharoni
2012/8/14 Nikola Smolenski smole...@eunet.rs:
 I believe it should be possible to alleviate this problem to an extent by
 introducing automatic transcription between languages and specifying what
 language the mayor's default name is in. If automatic transcription gets
 it wrong, it could still be overriden when someone enters the name in
 another language.

It is guaranteed to be profoundly broken. The above-mentioned Hebrew
names will be transliterated as 'mrm mcn' (the apostrophes are part
of the transliteration!) and ywnh yhb. The same problem applies to
Arabic, Punjabi and many other languages. Without manual maintenance
it will perpetuate horrendously wrong transliteration.

Some very limited auto-transliteration is OK, but just as a
suggestion. I was actually going to write an email about that. But it
must not be automatic all the way and propagate to all wikis.

--
Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
http://aharoni.wordpress.com
‪“We're living in pieces,
I want to live in peace.” – T. Moore‬

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] watching Wikidata changes that affect my wiki

2012-08-14 Thread Amir E. Aharoni
2012/8/14 Nikola Smolenski smole...@eunet.rs:
 On 14/08/12 08:57, Amir E. Aharoni wrote:

 2012/8/14 Nikola Smolenskismole...@eunet.rs:

 I believe it should be possible to alleviate this problem to an extent by
 introducing automatic transcription between languages and specifying what
 language the mayor's default name is in. If automatic transcription
 gets
 it wrong, it could still be overriden when someone enters the name in
 another language.


 It is guaranteed to be profoundly broken. The above-mentioned Hebrew
 names will be transliterated as'mrm mcn'  (the apostrophes are part


 Would it? How many Hebrew names are there that are spelled עמרם? If the
 transliteration software knows it's a human name it can transliterate it as
 Amram.

What you say is kinda true, but in practice it's much more
complicated. I worked for a few years in a company that makes software
that does this and I was the lead developer. There are two software
packages that do it for Hebrew, they are proprietary and very
expensive. It's not that making a Free package is impossible, but you
need a team for every language that has such problems, you need
several full time people to maintain the words, and what's worst is
that most words have six or so possible pronunciations. Sure,
crowdsourcing in Wikidata may change that, but it's too early to talk
about this.

AFAIK the situation is even worse in Arabic, which is a much bigger
language than Hebrew.

What I'm getting at is, again, that some limited helping
transliteration may be OK, but it must not be automatically
propagated. Naïve people may think that that's how the name is
actually written, and in such matters most people are very naïve.

--
Amir

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] watching Wikidata changes that affect my wiki

2012-08-14 Thread Gregor Hagedorn
I think the topic is relevant for the Wikidata editing UI.

At the hackathon in Berlin we had discussions about a chain of
fallback languages. Have reworked and added some potential
user-interface behaviour to

http://meta.wikimedia.org/wiki/Wikidata/Notes/Language_fallback

Gregor

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] watching Wikidata changes that affect my wiki

2012-08-14 Thread Nikola Smolenski

On 14/08/12 09:28, Amir E. Aharoni wrote:

2012/8/14 Nikola Smolenskismole...@eunet.rs:

On 14/08/12 08:57, Amir E. Aharoni wrote:

2012/8/14 Nikola Smolenskismole...@eunet.rs:

I believe it should be possible to alleviate this problem to an extent by
introducing automatic transcription between languages and specifying what
language the mayor's default name is in. If automatic transcription
gets
it wrong, it could still be overriden when someone enters the name in
another language.


It is guaranteed to be profoundly broken. The above-mentioned Hebrew
names will be transliterated as'mrm mcn'   (the apostrophes are part


Would it? How many Hebrew names are there that are spelled עמרם? If the
transliteration software knows it's a human name it can transliterate it as
Amram.


What you say is kinda true, but in practice it's much more
complicated. I worked for a few years in a company that makes software
that does this and I was the lead developer. There are two software
packages that do it for Hebrew, they are proprietary and very
expensive. It's not that making a Free package is impossible, but you
need a team for every language that has such problems, you need
several full time people to maintain the words, and what's worst is
that most words have six or so possible pronunciations. Sure,
crowdsourcing in Wikidata may change that, but it's too early to talk
about this.

AFAIK the situation is even worse in Arabic, which is a much bigger
language than Hebrew.

What I'm getting at is, again, that some limited helping
transliteration may be OK, but it must not be automatically
propagated. Naïve people may think that that's how the name is


For Hebrew, Arabic and a few similar cases. In a large number of 
language combinations we will not have such problems.




smime.p7s
Description: S/MIME Cryptographic Signature
___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] watching Wikidata changes that affect my wiki

2012-08-14 Thread dr . cueppers . gmbh
 
In cases of changing an old name by a new name 
a second problem will be the fact: For the old name 
may exist an article in some of the wikis - what means, there 
the old name is [[Adam]], but the new has no article and 
must be Eva, but not [[Eva]]: To change this correct, 
always human intervention is necessary. 

Dr.cueppers


Dr. Cüppers GmbH
Hübnerstraße 2
33104 Paderborn
05254 935700

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] watching Wikidata changes that affect my wiki

2012-08-14 Thread Denny Vrandečić
It is not. Wikidata would know if an article on Eva exists in the
Wikipedia of a given language (if it is appropriately connected), and
thus would be able to automatically offer the link or not.

Cheers,
Denny

2012/8/14  dr.cueppers.g...@arcor.de:

 In cases of changing an old name by a new name
 a second problem will be the fact: For the old name
 may exist an article in some of the wikis - what means, there
 the old name is [[Adam]], but the new has no article and
 must be Eva, but not [[Eva]]: To change this correct,
 always human intervention is necessary.

 Dr.cueppers


 Dr. Cüppers GmbH
 Hübnerstraße 2
 33104 Paderborn
 05254 935700

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



-- 
Project director Wikidata
Wikimedia Deutschland e.V. | Obentrautstr. 72 | 10963 Berlin
Tel. +49-30-219 158 26-0 | http://wikimedia.de

Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e.V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
unter der Nummer 23855 B. Als gemeinnützig anerkannt durch das
Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] watching Wikidata changes that affect my wiki

2012-08-14 Thread Amir E. Aharoni
2012/8/14 Denny Vrandečić denny.vrande...@wikimedia.de:
 In general I am a strong believer of let's start with the simple
 thing, which is to let editors add transliterations (that is why we
 have a label field for every entity in every language).

 I may see a use case for a transliteration-bot that does some of the
 transliterations (semi?)automatically, but I actually would think that
 this is probably something that should be left to the community.

 There might be some simple cases for language fallbacks (including
 transliterations) but we have not touched that development item yet.
 We have to see how this works out.

 But in short, I am wary of automatic systems and rather would count on
 the knowledge of the editors.

 I hope that makes sense,

This makes perfect sense and I agree.

--
Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
http://aharoni.wordpress.com
‪“We're living in pieces,
I want to live in peace.” – T. Moore‬

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] watching Wikidata changes that affect my wiki

2012-08-14 Thread dr . cueppers . gmbh
 
And if there are 10 articles Eva with [[Eva (abc)]], [[Eva (def]], [[Eva 
(ghi]]   ?
Wikidata will find out the correct one in every case? 

- Original Nachricht 
Von: Denny Vrande?i? denny.vrande...@wikimedia.de
An:  Discussion list for the Wikidata project. 
wikidata-l@lists.wikimedia.org
Datum:   14.08.2012 16:30
Betreff: Re: [Wikidata-l] watching Wikidata changes that affect my wiki

 It is not. Wikidata would know if an article on Eva exists in the
 Wikipedia of a given language (if it is appropriately connected), and
 thus would be able to automatically offer the link or not.
 
 Cheers,
 Denny
 
 2012/8/14  dr.cueppers.g...@arcor.de:
 
  In cases of changing an old name by a new name
  a second problem will be the fact: For the old name
  may exist an article in some of the wikis - what means, there
  the old name is [[Adam]], but the new has no article and
  must be Eva, but not [[Eva]]: To change this correct,
  always human intervention is necessary.
 
  Dr.cueppers
 
 
  Dr. Cüppers GmbH
  Hübnerstraße 2
  33104 Paderborn
  05254 935700
 
  ___
  Wikidata-l mailing list
  Wikidata-l@lists.wikimedia.org
  https://lists.wikimedia.org/mailman/listinfo/wikidata-l
 
 
 
 -- 
 Project director Wikidata
 Wikimedia Deutschland e.V. | Obentrautstr. 72 | 10963 Berlin
 Tel. +49-30-219 158 26-0 | http://wikimedia.de
 
 Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e.V.
 Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
 unter der Nummer 23855 B. Als gemeinnützig anerkannt durch das
 Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.
 
 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l
 

Dr. Cüppers GmbH
Hübnerstraße 2
33104 Paderborn
05254 935700

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] watching Wikidata changes that affect my wiki

2012-08-14 Thread Marco Fleckinger


On 08/14/2012 05:29 PM, dr.cueppers.g...@arcor.de wrote:


And if there are 10 articles Eva with [[Eva (abc)]], [[Eva (def]], [[Eva 
(ghi]]   ?
Wikidata will find out the correct one in every case?


As I unterstood, we have articles and we have WikiData-Items. Linking 
articles will be done by people not by machines. Of course we already 
have interwiki links between article [[de:Berlin]] and [[en:Berlin]]. 
Once anybody linked one of those to the item Berlin' a bot can add the 
other one to this item as well.


Marco

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l


Re: [Wikidata-l] watching Wikidata changes that affect my wiki

2012-08-14 Thread Denny Vrandečić
The editors would need to make the connections explicitly between the
article. The software would not be guessing it, if that is what you
are assuming. And the editors should be good at getting the correct
one.

Together with Marco's answer, does it make it clearer?

2012/8/14  dr.cueppers.g...@arcor.de:

 And if there are 10 articles Eva with [[Eva (abc)]], [[Eva (def]], [[Eva 
 (ghi]]   ?
 Wikidata will find out the correct one in every case?

 - Original Nachricht 
 Von: Denny Vrande?i? denny.vrande...@wikimedia.de
 An:  Discussion list for the Wikidata project. 
 wikidata-l@lists.wikimedia.org
 Datum:   14.08.2012 16:30
 Betreff: Re: [Wikidata-l] watching Wikidata changes that affect my wiki

 It is not. Wikidata would know if an article on Eva exists in the
 Wikipedia of a given language (if it is appropriately connected), and
 thus would be able to automatically offer the link or not.

 Cheers,
 Denny

 2012/8/14  dr.cueppers.g...@arcor.de:
 
  In cases of changing an old name by a new name
  a second problem will be the fact: For the old name
  may exist an article in some of the wikis - what means, there
  the old name is [[Adam]], but the new has no article and
  must be Eva, but not [[Eva]]: To change this correct,
  always human intervention is necessary.
 
  Dr.cueppers
 
 
  Dr. Cüppers GmbH
  Hübnerstraße 2
  33104 Paderborn
  05254 935700
 
  ___
  Wikidata-l mailing list
  Wikidata-l@lists.wikimedia.org
  https://lists.wikimedia.org/mailman/listinfo/wikidata-l



 --
 Project director Wikidata
 Wikimedia Deutschland e.V. | Obentrautstr. 72 | 10963 Berlin
 Tel. +49-30-219 158 26-0 | http://wikimedia.de

 Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e.V.
 Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
 unter der Nummer 23855 B. Als gemeinnützig anerkannt durch das
 Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l


 Dr. Cüppers GmbH
 Hübnerstraße 2
 33104 Paderborn
 05254 935700

 ___
 Wikidata-l mailing list
 Wikidata-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikidata-l



-- 
Project director Wikidata
Wikimedia Deutschland e.V. | Obentrautstr. 72 | 10963 Berlin
Tel. +49-30-219 158 26-0 | http://wikimedia.de

Wikimedia Deutschland - Gesellschaft zur Förderung Freien Wissens e.V.
Eingetragen im Vereinsregister des Amtsgerichts Berlin-Charlottenburg
unter der Nummer 23855 B. Als gemeinnützig anerkannt durch das
Finanzamt für Körperschaften I Berlin, Steuernummer 27/681/51985.

___
Wikidata-l mailing list
Wikidata-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l