mxn added a comment.

  The approach I was forced to take with Vietnamese (separate lexemes per word 
per writing system, “translations” from one writing system to another) has some 
downsides. For one thing, the criteria for a translation between vi and vi-Hani 
must be stricter than the criteria for a translation between vi and en; 
otherwise there would be no way to distinguish these transcriptions from 
translations more generally. In principle, it would follow that every 
simplified Chinese character should also have a separate lexeme from the 
corresponding traditional character(s), as on Wiktionary, and we could even 
take this to the extreme that “colour” is the en-GB “translation” of “color” in 
en-US.
  
  On a practical level, this separate lexeme approach means any Wiktionary 
template similar to https://en.wiktionary.org/wiki/Template:vi-readings would 
need to look up translations, while a template generating a table of 
translations of an English sense would need to know to ignore vi-Hani 
statements or merge them with vi statements. In a Vietnamese dictionary, it’s 
also normal to list the other words represented by the same characters. 
Currently, such a template on Wiktionary requires a series of expensive calls 
to look up second-order lexemes. (A rejected property proposal 
<https://www.wikidata.org/wiki/Wikidata:Property_proposal/ch%E1%BB%AF_N%C3%B4m> 
would streamline that somewhat.)
  
  It would be nice to be able to more strongly link representations in the two 
Vietnamese writing systems, but allowing multiple representations to have the 
same language code would only be a partial solution anyways. A full solution 
would be able to limit some statements to certain representations of a form. 
Otherwise, how would one indicate that one representation is now rare, having 
been supplanted by the other, independently of any broader linguistic shift, or 
that two sources disagree about whether that change has even occurred?

TASK DETAIL
  https://phabricator.wikimedia.org/T236593

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: mxn
Cc: mrephabricator, LucasWerkmeister, C933103, AGutman-WMF, mxn, So9q, Ijon, 
daniel, Asaf, Mahir256, Danmichaelo, Fnielsen, Lucas_Werkmeister_WMDE, Denny, 
Lydia_Pintscher, jeblad, jhsoby, Astuthiodit_1, karapayneWMDE, Invadibot, 
maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, 
QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, 
Mbch331
_______________________________________________
Wikidata-bugs mailing list -- [email protected]
To unsubscribe send an email to [email protected]

Reply via email to