Esc3300 added a comment.
In T267636#7196372 <https://phabricator.wikimedia.org/T267636#7196372>, @Amire80 wrote: > In T267636#7195695 <https://phabricator.wikimedia.org/T267636#7195695>, @Esc3300 wrote: > >> The linked page lists several samples now archived at Wikidata <https://www.wikidata.org/wiki/Property_talk:P6375/Archives/P969#%22und%22_or_%22und-latn%22>. These have been converted to P6375 statements: >> >> - using "uk" <https://www.wikidata.org/w/index.php?title=Q12114478&diff=1324896030&oldid=1199047662>, using "uk" <https://www.wikidata.org/w/index.php?title=Q12114479&diff=1324894331&oldid=1199047679>, >> - using "und" <https://www.wikidata.org/w/index.php?title=Q27919920&diff=1324924605&oldid=1290785531>, using "und" <https://www.wikidata.org/w/index.php?title=Q137550&diff=1324932897&oldid=1258628875>, >> - using "ru" <https://www.wikidata.org/w/index.php?title=Q932748&diff=1324879339&oldid=1301361959>, >> >> So, actually, not only "und" was used. > > All of these are just wrong, and "und" is not necessary in any of them. It's supposed to be Ukrainian, Russian, Tajik, Japanese. In all these cases, a dedicated code would just perpetuate data that is sloppy and easily fixable. I think the sloppiness was caused by the lack of adequate language codes. "und-latn" would have been that and still could be (but it's not harder to apply). As @Lydia_Pintscher mentioned, it's not a life or death situation, but inaction and delays in the addition of the IETF language codes to Wikidata can lead to a deterioration of data quality at Wikidata. Not sure where you want to go with "It's supposed to be Ukrainian, Russian, Tajik, Japanese": - technically it would be correct to use "ru" or "uk" for Latin script text in these languages, but I don't think this is desirable at Wikidata. AFAIK, it's generally not being used that way in Wikidata. - if you think that Wikidata shouldn't store structured data for the samples given above, that is something you should propose and discuss as a Wikidata contributor in the adequate forum (e.g. Project chat). Here we try to determine the appropriate language code for the sample texts with help of a review by langcom. TASK DETAIL https://phabricator.wikimedia.org/T267636 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Esc3300 Cc: Esc3300, Mbch331, jhsoby, Amire80, Aklapper, Mohammed_Sadat_WMDE, Lydia_Pintscher, Lea_Lacroix_WMDE, Invadibot, maantietaja, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude
_______________________________________________ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org