Nikki added a comment.
In T230833#6103004 <https://phabricator.wikimedia.org/T230833#6103004>, @daniel wrote: > I dimly recall a similar discussion from years ago. IIRC, IETF is extensible, and we came up with a way to encode item IDs in language tages, something like `qid-36163` (by fortunate coincidence, "qid" lies within the range for private use tags, between "qaa" and "qtz"), or `und-x-wikidata-Q36163` (the "mis" code should not be used, according to BCP47). Isn't Wikibase using this kind of encoding somewhere already? It uses it for lexemes - people can add `-x-qid` to an existing code (which doesn't always produce a valid tag but that's a separate issue :)) > This would be my solution for determining a language tag for Items that do not specify one. I don't understand the use case well enough to tell whether this would actually solve the problem at hand. There are some languages which don't have any usable language tags, but the cases being discussed here do. Inventing tags would be better than nothing, since it would provide a way of distinguishing all the languages currently lumped under `und`, but only doing that wouldn't be a proper solution because we would be inventing tags for languages which already have them, e.g. Wikidata normally uses the assigned code `sco` for Scots, it would be weird and inconsistent for wbsearchentities to return `und-x-q14549` instead. TASK DETAIL https://phabricator.wikimedia.org/T230833 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Nikki Cc: So9q, daniel, Addshore, Lydia_Pintscher, Nikki, LucasWerkmeister, darthmon_wmde, Nandana, Mringgaard, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Jonas, Wikidata-bugs, aude, Mbch331
_______________________________________________ Wikidata-bugs mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
