mxn added a comment.
The approach I was forced to take with Vietnamese (separate lexemes per word per writing system, “translations” from one writing system to another) has some downsides. For one thing, the criteria for a translation between vi and vi-Hani must be stricter than the criteria for a translation between vi and en; otherwise there would be no way to distinguish these transcriptions from translations more generally. In principle, it would follow that every simplified Chinese character should also have a separate lexeme from the corresponding traditional character(s), as on Wiktionary, and we could even take this to the extreme that “colour” is the en-GB “translation” of “color” in en-US. On a practical level, this separate lexeme approach means any Wiktionary template similar to https://en.wiktionary.org/wiki/Template:vi-readings would need to look up translations, while a template generating a table of translations of an English sense would need to know to ignore vi-Hani statements or merge them with vi statements. In a Vietnamese dictionary, it’s also normal to list the other words represented by the same characters. Currently, such a template on Wiktionary requires a series of expensive calls to look up second-order lexemes. (A rejected property proposal <https://www.wikidata.org/wiki/Wikidata:Property_proposal/ch%E1%BB%AF_N%C3%B4m> would streamline that somewhat.) It would be nice to be able to more strongly link representations in the two Vietnamese writing systems, but allowing multiple representations to have the same language code would only be a partial solution anyways. A full solution would be able to limit some statements to certain representations of a form. Otherwise, how would one indicate that one representation is now rare, having been supplanted by the other, independently of any broader linguistic shift, or that two sources disagree about whether that change has even occurred? TASK DETAIL https://phabricator.wikimedia.org/T236593 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mxn Cc: mrephabricator, LucasWerkmeister, C933103, AGutman-WMF, mxn, So9q, Ijon, daniel, Asaf, Mahir256, Danmichaelo, Fnielsen, Lucas_Werkmeister_WMDE, Denny, Lydia_Pintscher, jeblad, jhsoby, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
_______________________________________________ Wikidata-bugs mailing list -- [email protected] To unsubscribe send an email to [email protected]
