mxn added a comment.
In T236593#8017255 <https://phabricator.wikimedia.org/T236593#8017255>, @mxn wrote: > In T236593#8015993 <https://phabricator.wikimedia.org/T236593#8015993>, @AGutman-WMF wrote: > >> The ideal solution would be to allow (in the language code validator) arbitrary language codes including a rank identifier. For instance, for Viatnamese one should be able to use codes such as vi-x-Q8201-1, vi-x-Q8201-2 etc. Currently this doesn't pass the validation as one gets the error //Invalid Item ID "Q8201-1"//. > > It sounds like representations need the ability to have qualifiers… To elaborate, each //Nôm// character needs a different set of Han character in this lexeme <https://www.wikidata.org/wiki/Property:P5425> statements (multiple statements for compound words), different sources, probably other things that aren’t coming to mind. It’s not that I don’t want to give the multiple-representation approach a try, but how else would hủy bỏ/huỷ bỏ <https://www.wikidata.org/wiki/Lexeme:L679211> and ký hiệu/kí hiệu <https://www.wikidata.org/wiki/Lexeme:L679212> be modeled but to keep the characters in separate forms? In principle, each character should even get its own lexeme, but since each //Nôm// character is an alternative form of a //quốc ngữ// word, the various spellings of that word would need to be duplicated as lemmas of each such lexeme. It ends up being a lot of redundancy and room for error. I had tried this approach at one point, with very redundant lexemes for phở <https://www.wikidata.org/wiki/Special:PermanentLink/1560865877>, 𬖾 <https://www.wikidata.org/wiki/Special:PermanentLink/1560864869>, and 頗 <https://www.wikidata.org/wiki/Special:PermanentLink/1560865170>, but it seemed like needless complication for both editors and data consumers. TASK DETAIL https://phabricator.wikimedia.org/T236593 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mxn Cc: AGutman-WMF, mxn, So9q, Ijon, daniel, Asaf, Mahir256, Danmichaelo, Fnielsen, Lucas_Werkmeister_WMDE, Denny, Lydia_Pintscher, jeblad, jhsoby, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Bodhisattwa, Scott_WUaS, Wikidata-bugs, aude, Mbch331
_______________________________________________ Wikidata-bugs mailing list -- [email protected] To unsubscribe send an email to [email protected]
