Smalyshev added a comment.
> The sitelinks are distinguishable by the URL (https://simple.wikipedia.org/)
URL is the different triple than the data, and matching URLs means that the
client should maintain own database which says which Wiki URL matches which
language and do pattern matching on
Smalyshev added a comment.
> This language code should be used for the sitelinks. In HTML and in the RDF
> export
That would lead to the situation where links to Simple English wiki and to
English wiki are indistinguishable. Which is not good.
> If you want to change this to en-x-simple then
Smalyshev added a comment.
@Fomafix we're not talking about user interface languages here. We're talking
about language specification in the RDF export - which should follow BCP 47 and
common accepted language codes, otherwise third-party tools would not be able
to understand in which language
gerritbot added a subscriber: gerritbot.
gerritbot added a comment.
Change 225518 had a related patch set uploaded (by Smalyshev):
https://phabricator.wikimedia.org/T105430: canonicalize language codes
https://gerrit.wikimedia.org/r/225518
TASK DETAIL
https://phabricator.wikimedia.org/T105430