Esc3300 added a comment.

  In T267636#7196372 <https://phabricator.wikimedia.org/T267636#7196372>, 
@Amire80 wrote:
  
  > In T267636#7195695 <https://phabricator.wikimedia.org/T267636#7195695>, 
@Esc3300 wrote:
  >
  >> The linked page lists several samples now archived at Wikidata 
<https://www.wikidata.org/wiki/Property_talk:P6375/Archives/P969#%22und%22_or_%22und-latn%22>.
 These have been converted to P6375 statements:
  >>
  >> - using "uk" 
<https://www.wikidata.org/w/index.php?title=Q12114478&diff=1324896030&oldid=1199047662>,
 using "uk" 
<https://www.wikidata.org/w/index.php?title=Q12114479&diff=1324894331&oldid=1199047679>,
  >> - using "und" 
<https://www.wikidata.org/w/index.php?title=Q27919920&diff=1324924605&oldid=1290785531>,
 using "und" 
<https://www.wikidata.org/w/index.php?title=Q137550&diff=1324932897&oldid=1258628875>,
  >> - using "ru" 
<https://www.wikidata.org/w/index.php?title=Q932748&diff=1324879339&oldid=1301361959>,
  >>
  >> So, actually, not only "und" was used.
  >
  > All of these are just wrong, and "und" is not necessary in any of them. 
It's supposed to be Ukrainian, Russian, Tajik, Japanese. In all these cases, a 
dedicated code would just perpetuate data that is sloppy and easily fixable.
  
  I think the sloppiness was caused by the lack of adequate language codes. 
"und-latn" would have been that and still could be (but it's not harder to 
apply).  As @Lydia_Pintscher mentioned, it's not a life or death situation, but 
inaction and delays in the addition of the IETF language codes to Wikidata can 
lead to a deterioration of data quality at Wikidata.
  
  Not sure where you want to go with "It's supposed to be Ukrainian, Russian, 
Tajik, Japanese":
  
  - technically it would be correct to use "ru" or "uk" for Latin script text 
in these languages, but I don't think this is desirable at Wikidata. AFAIK, 
it's generally not being used that way in Wikidata.
  - if you think that Wikidata shouldn't store structured data for the samples 
given above, that is something you should propose and discuss as a Wikidata 
contributor in the adequate forum (e.g. Project chat). Here we try to determine 
the appropriate language code for the sample texts with help of a review by 
langcom.

TASK DETAIL
  https://phabricator.wikimedia.org/T267636

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: Esc3300
Cc: Esc3300, Mbch331, jhsoby, Amire80, Aklapper, Mohammed_Sadat_WMDE, 
Lydia_Pintscher, Lea_Lacroix_WMDE, Invadibot, maantietaja, Akuckartz, Nandana, 
Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, 
rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude
_______________________________________________
Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org
To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org

Reply via email to