mxn added a comment.

  In T236593#8017255 <https://phabricator.wikimedia.org/T236593#8017255>, @mxn 
wrote:
  
  > In T236593#8015993 <https://phabricator.wikimedia.org/T236593#8015993>, 
@AGutman-WMF wrote:
  >
  >> The ideal solution would be to allow (in the language code validator) 
arbitrary language codes including a rank identifier. For instance, for 
Viatnamese one should be able to use codes such as vi-x-Q8201-1, vi-x-Q8201-2 
etc. Currently this doesn't pass the validation as one gets the error //Invalid 
Item ID "Q8201-1"//.
  >
  > It sounds like representations need the ability to have qualifiers…
  
  To elaborate, each //Nôm// character needs a different set of Han character 
in this lexeme <https://www.wikidata.org/wiki/Property:P5425> statements 
(multiple statements for compound words), different sources, probably other 
things that aren’t coming to mind. It’s not that I don’t want to give the 
multiple-representation approach a try, but how else would hủy bỏ/huỷ bỏ 
<https://www.wikidata.org/wiki/Lexeme:L679211> and ký hiệu/kí hiệu 
<https://www.wikidata.org/wiki/Lexeme:L679212> be modeled but to keep the 
characters in separate forms?
  
  In principle, each character should even get its own lexeme, but since each 
//Nôm// character is an alternative form of a //quốc ngữ// word, the various 
spellings of that word would need to be duplicated as lemmas of each such 
lexeme. It ends up being a lot of redundancy and room for error. I had tried 
this approach at one point, with very redundant lexemes for phở 
<https://www.wikidata.org/wiki/Special:PermanentLink/1560865877>, 𬖾 
<https://www.wikidata.org/wiki/Special:PermanentLink/1560864869>, and 頗 
<https://www.wikidata.org/wiki/Special:PermanentLink/1560865170>, but it seemed 
like needless complication for both editors and data consumers.

TASK DETAIL
  https://phabricator.wikimedia.org/T236593

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: mxn
Cc: AGutman-WMF, mxn, So9q, Ijon, daniel, Asaf, Mahir256, Danmichaelo, 
Fnielsen, Lucas_Werkmeister_WMDE, Denny, Lydia_Pintscher, jeblad, jhsoby, 
Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, 
Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, 
rosalieper, Bodhisattwa, Scott_WUaS, Wikidata-bugs, aude, Mbch331
_______________________________________________
Wikidata-bugs mailing list -- [email protected]
To unsubscribe send an email to [email protected]

Reply via email to