[Wikidata-bugs] [Maniphest] T364631: Request for script codes: mr-modi, mr-knda (lexemes and monolingual text)
mrephabricator created this task. mrephabricator added projects: Wikidata Lexicographical data, Language codes, Wikidata. TASK DESCRIPTION This is a request for codes to tag lexemes and monolingual text for Marathi in the Modi and Kannada scripts. The Modi script is specific to Marathi; some examples on lexemes can be seen here: https://www.wikidata.org/wiki/Lexeme:L723583 https://www.wikidata.org/wiki/Lexeme:L1121483 https://www.wikidata.org/wiki/Lexeme:L295761 The Kannada script has been used in some historical inscriptions of Marathi and would also be useful to tag and document its use. (Source: Sheldon Pollock (2009). The Language of the Gods in the World of Men. Chapter 8) TASK DETAIL https://phabricator.wikimedia.org/T364631 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Danny_Benjafield_WMDE, S8321414, Astuthiodit_1, MaryMunyoki, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Dringsim, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, srishakatux, KimKelting, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331, Anoop ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T357205: Request for regional language codes: ps-af, ps-pk (lexemes and monolingual text)
mrephabricator updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T357205 To: mrephabricator Cc: mrephabricator, Danny_Benjafield_WMDE, Astuthiodit_1, MaryMunyoki, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, srishakatux, KimKelting, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T313782: Allow support for terms (label, description, aliases) for bal
mrephabricator added a comment. Thank you, I have updated the description of this task with some information about subtags that would be useful to have with this language code. TASK DETAIL https://phabricator.wikimedia.org/T313782 To: mrephabricator Cc: hoo, mrephabricator, Aklapper, Danny_Benjafield_WMDE, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, srishakatux, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T313782: Allow support for terms (label, description, aliases) for bal
mrephabricator updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T313782 To: mrephabricator Cc: hoo, mrephabricator, Aklapper, Danny_Benjafield_WMDE, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, srishakatux, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T345177: Constraint warnings not being shown on many Basque lexemes
mrephabricator added a comment. It seems like you are exactly right about the 50 entity check limit being related to this issue: this lexeme is just shy of 50 entities total, with 6 senses and 38 forms, and still shows constraint violations. https://www.wikidata.org/wiki/Lexeme:L1092922 F37668405: image.png <https://phabricator.wikimedia.org/F37668405> TASK DETAIL https://phabricator.wikimedia.org/T345177 To: mrephabricator Cc: mrephabricator, Nikki, Danny_Benjafield_WMDE, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Eihel, Nandana, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Mahir256, QZanden, Esc3300, LawExplorer, _jensen, rosalieper, Agabi10, Scott_WUaS, abian, Wikidata-bugs, aude, Mbch331
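The arithmetic behind "just shy of 50 entities" in the comment above can be sketched roughly as follows. This is only an illustration: the helper names are hypothetical, and the counting rule (the lexeme itself plus one entity per sense and per form, compared against a limit of 50) is an assumption drawn from the comment, not from the constraint checker's source.

```python
# Hypothetical sketch; the counting rule is an assumption based on the thread,
# not the actual constraint-check implementation.
ENTITY_CHECK_LIMIT = 50  # the "50 entity check limit" mentioned in the comment


def total_entities(senses: int, forms: int) -> int:
    """Count the lexeme itself plus one entity per sense and per form."""
    return 1 + senses + forms


def within_check_limit(senses: int, forms: int, limit: int = ENTITY_CHECK_LIMIT) -> bool:
    """Return True if the lexeme would stay at or under the assumed limit."""
    return total_entities(senses, forms) <= limit


if __name__ == "__main__":
    # L1092922 as described in the comment: 6 senses and 38 forms.
    print(total_entities(6, 38), within_check_limit(6, 38))
```

Under this assumption, a lexeme with 6 senses and 38 forms counts as 45 entities, which would match the observation that it still shows constraint violations.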
[Wikidata-bugs] [Maniphest] T345177: Constraint warnings not being shown on many Basque lexemes
mrephabricator added a comment. For what it is worth, I have definitely noticed that there is some limit on lexeme size beyond which constraint violations no longer appear. I typically do not expect them to appear on Hindustani or Punjabi verbs. On the first sense of a lexeme, adding a gloss quote usually results in a constraint violation, but if I add one now to this lexeme with over 50 senses, no constraint violation is shown (the lexeme is https://www.wikidata.org/wiki/Lexeme:L33485 but see the screenshot, since I do not want to actually leave this statement unreferenced). F37668396: image.png <https://phabricator.wikimedia.org/F37668396> (To be honest, it did not occur to me to report this, because if these violations did keep coming up it would significantly slow down the load time of the page.) TASK DETAIL https://phabricator.wikimedia.org/T345177 To: mrephabricator Cc: mrephabricator, Nikki, Danny_Benjafield_WMDE, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Eihel, Nandana, Lahi, Gq86, Lucas_Werkmeister_WMDE, GoranSMilovanovic, Mahir256, QZanden, Esc3300, LawExplorer, _jensen, rosalieper, Agabi10, Scott_WUaS, abian, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T322945: Special:NewLexeme does not allow non-Latin script input in the language code field
mrephabricator renamed this task from "Special:NewLexeme does allow non-Latin script input in the language code field" to "Special:NewLexeme does not allow non-Latin script input in the language code field". TASK DETAIL https://phabricator.wikimedia.org/T322945 To: mrephabricator Cc: mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T317161: Add lexeme language code "pks" for Pakistan Sign Language
mrephabricator reopened this task as "Open". mrephabricator added a comment. As far as I can tell this is still not resolved: the language code was not available for monolingual text when I tried. TASK DETAIL https://phabricator.wikimedia.org/T317161 To: noarave, mrephabricator Cc: Nikki, jhsoby, Amire80, Arian_Bozorg, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T332714: add new language codes for Wikidata - Spring 2023
mrephabricator reopened subtask T317161: Add lexeme language code pks for Pakistan Sign Language as Open. TASK DETAIL https://phabricator.wikimedia.org/T332714 To: Arian_Bozorg, mrephabricator Cc: Arian_Bozorg, mrephabricator, Marsupium, Aklapper, Lydia_Pintscher, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T317161: Add lexeme language code "pks" for Pakistan Sign Language
mrephabricator reopened this task as "Open". mrephabricator added a comment. @Arian_Bozorg I could have made it clearer, but this was not just for lexemes. I gave a specific example where it would be used for monolingual text. I was also under the impression that all lexeme languages are supposed to be available in monolingual text. TASK DETAIL https://phabricator.wikimedia.org/T317161 To: noarave, mrephabricator Cc: Nikki, jhsoby, Amire80, Arian_Bozorg, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T332714: add new language codes for Wikidata - Spring 2023
mrephabricator reopened subtask T317161: Add lexeme language code pks for Pakistan Sign Language as Open. TASK DETAIL https://phabricator.wikimedia.org/T332714 To: Arian_Bozorg, mrephabricator Cc: Arian_Bozorg, mrephabricator, Marsupium, Aklapper, Lydia_Pintscher, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T327796: Request for language codes in lexemes + monolingual text for historical Middle Indic languages: pra, psu, pgd
mrephabricator added subscribers: Lydia_Pintscher, jhsoby. mrephabricator added a comment. Any thoughts on this? @Lydia_Pintscher @jhsoby I have been adding an increasing number of Prakrit lexeme forms in different scripts, and it would be preferable to have the proper language codes available so that thousands of language codes don't have to be changed later. The lack of a dedicated code can also cause confusion with Hindi or Sanskrit lexemes if someone is not aware that no language code is available for Prakrit. (This has happened at least once so far.) TASK DETAIL https://phabricator.wikimedia.org/T327796 To: mrephabricator Cc: jhsoby, Lydia_Pintscher, mrephabricator, Amire80, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T317161: Add lexeme language code "pks" for Pakistan Sign Language
mrephabricator updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T317161 To: mrephabricator Cc: jhsoby, Amire80, Arian_Bozorg, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T317161: Add lexeme language code "pks" for Pakistan Sign Language
mrephabricator added a comment. I have also updated the autonym since finding some additional information for this. TASK DETAIL https://phabricator.wikimedia.org/T317161 To: mrephabricator Cc: Arian_Bozorg, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T317161: Add lexeme language code "pks" for Pakistan Sign Language
mrephabricator updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T317161 To: mrephabricator Cc: Arian_Bozorg, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T320713: Add item termbox, monolingual text, and lexeme support for Hindko
mrephabricator closed this task as "Resolved". mrephabricator claimed this task. mrephabricator added a comment. This can now be closed as the code was made available today via the interface translations. TASK DETAIL https://phabricator.wikimedia.org/T320713 To: mrephabricator Cc: Lydia_Pintscher, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T316001: Add item termbox label, lexeme label, and monolingual text code support for Shina (scl)
mrephabricator added a project: Language codes. TASK DETAIL https://phabricator.wikimedia.org/T316001 To: mrephabricator Cc: mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T316004: Add item termbox label, lexeme label, and monolingual text code support for Rajasthani (raj)
mrephabricator added a comment. Dingal is a Rajasthani language that lacks a language code of its own: https://www.wikidata.org/wiki/Q5278158 Example Dingal lexeme: https://www.wikidata.org/wiki/Lexeme:L1039048 I have been using mrw for it (Marwari being a widely spoken Rajasthani language). Adding this code would allow using raj-x-Q5278158 for Dingal lexemes instead. TASK DETAIL https://phabricator.wikimedia.org/T316004 To: mrephabricator Cc: Lydia_Pintscher, jhsoby, Amire80, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
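For readers unfamiliar with the shape of the codes requested in these tasks (raj-x-Q5278158, bhd-takr, ps-pk and so on), here is a minimal sketch of how such a tag breaks down. The function name `parse_tag` and the regular expression are hypothetical illustrations that cover only a simplified subset of language tags (language, optional four-letter script, optional two-letter region, optional private-use Wikidata item); this is not how Wikibase itself validates codes.

```python
import re
from typing import Optional

# Assumed, simplified shape: language[-script][-region][-x-Qnnn].
# Real language tags allow more subtags than this sketch handles.
TAG_RE = re.compile(
    r"^(?P<language>[a-z]{2,3})"
    r"(?:-(?P<script>[a-z]{4}))?"
    r"(?:-(?P<region>[a-z]{2}))?"
    r"(?:-x-(?P<item>q[1-9][0-9]*))?$",
    re.IGNORECASE,
)


def parse_tag(tag: str) -> Optional[dict]:
    """Split a tag into its parts, or return None if it does not fit the assumed shape."""
    match = TAG_RE.match(tag)
    if match is None:
        return None
    parts = match.groupdict()
    # Normalise case: subtags lower-case, the Wikidata item ID upper-case.
    return {
        "language": parts["language"].lower(),
        "script": parts["script"].lower() if parts["script"] else None,
        "region": parts["region"].lower() if parts["region"] else None,
        "item": parts["item"].upper() if parts["item"] else None,
    }


if __name__ == "__main__":
    for example in ("raj-x-Q5278158", "bhd-takr", "ps-pk"):
        print(example, parse_tag(example))
```

In this reading, raj-x-Q5278158 is the Rajasthani code raj followed by a private-use part that points at the Wikidata item for Dingal.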
[Wikidata-bugs] [Maniphest] T317161: Add lexeme language code "pks" for Pakistan Sign Language
mrephabricator added a comment. Done TASK DETAIL https://phabricator.wikimedia.org/T317161 To: mrephabricator Cc: Arian_Bozorg, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T317161: Add lexeme language code "pks" for Pakistan Sign Language
mrephabricator updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T317161 To: mrephabricator Cc: Arian_Bozorg, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T316040: Add item termbox label, lexeme label, and monolingual text code support for Dogri (doi)
mrephabricator added a comment. Done TASK DETAIL https://phabricator.wikimedia.org/T316040 To: mrephabricator Cc: Lydia_Pintscher, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, Biaoo, Philoserf, ItamarWMDE, Akuckartz, Ironie, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, Hydriz, aude, Mbch331, Jay8g
[Wikidata-bugs] [Maniphest] T316040: Add item termbox label, lexeme label, and monolingual text code support for Dogri (doi)
mrephabricator renamed this task from "Add item termbox label, lexeme label, and monolingual text code support for Dogra (doi)" to "Add item termbox label, lexeme label, and monolingual text code support for Dogri (doi)". mrephabricator updated the task description. Restricted Application added a project: Internet-Archive. TASK DETAIL https://phabricator.wikimedia.org/T316040 To: mrephabricator Cc: Lydia_Pintscher, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, Biaoo, Philoserf, ItamarWMDE, Akuckartz, Ironie, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, Hydriz, aude, Mbch331, Jay8g
[Wikidata-bugs] [Maniphest] T315999: Add item termbox label, lexeme label, and monolingual text code support for Brahui Brolikva (brh-latn)
mrephabricator added a comment. I have added items for a number of scholarly works written in Brahui, such as https://www.wikidata.org/wiki/Q113302350 It should be noted that the Latin script is very rarely used for Brahui and was contrived in 2008 at the same university that continues to publish papers in the Arabic script like the one above. However, the translator for Brahui on TranslateWiki is insistent that Brahui should only be written in the Latin script, and as such that code has been set to left-to-right and used for a mix of scripts. TASK DETAIL https://phabricator.wikimedia.org/T315999 To: mrephabricator Cc: Lydia_Pintscher, jhsoby, Amire80, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T315959: Add item termbox label, lexeme label, and monolingual text code support for kls (2 scripts, latin and perso-arabic script)
mrephabricator added a comment. I have added items for a number of scholarly works written in Brahui, such as https://www.wikidata.org/wiki/Q113302350 It should be noted that the Latin script is very rarely used for Brahui and was contrived in 2008 at the same university that continues to publish papers in the Arabic script like the one above. However, the translator for Brahui on TranslateWiki is insistent that Brahui should only be written in the Latin script, and as such that code has been set to left-to-right and used for a mix of scripts. TASK DETAIL https://phabricator.wikimedia.org/T315959 To: mrephabricator Cc: Lydia_Pintscher, jhsoby, Amire80, alaa, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T320714: Add item termbox, monolingual text, lexeme support for Bagri language
mrephabricator added a comment. Done TASK DETAIL https://phabricator.wikimedia.org/T320714 To: mrephabricator Cc: Lydia_Pintscher, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T320714: Add item termbox, monolingual text, lexeme support for Bagri language
mrephabricator updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T320714 To: mrephabricator Cc: Lydia_Pintscher, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T320715: Add termbox label, monolingualtext, and lexeme support for Gujari
mrephabricator added a comment. Done TASK DETAIL https://phabricator.wikimedia.org/T320715 To: mrephabricator Cc: Lydia_Pintscher, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T320715: Add termbox label, monolingualtext, and lexeme support for Gujari
mrephabricator updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T320715 To: mrephabricator Cc: Lydia_Pintscher, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T320713: Add item termbox, monolingual text, and lexeme support for Hindko
mrephabricator added a comment. Done TASK DETAIL https://phabricator.wikimedia.org/T320713 To: mrephabricator Cc: Lydia_Pintscher, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T320713: Add item termbox, monolingual text, and lexeme support for Hindko
mrephabricator updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T320713 To: mrephabricator Cc: Lydia_Pintscher, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T328890: Language code request: Wolastoqey
mrephabricator added a comment. In T328890#8664641 <https://phabricator.wikimedia.org/T328890#8664641>, @Fjjulien wrote: > I was excited to see this feature request being moved into a "task". However, nothing happened with it for the last three weeks. Is this normal? There is a backlog of requests going back nearly a year (see for example T308062 <https://phabricator.wikimedia.org/T308062>). It is unknown when new language codes will be added again. TASK DETAIL https://phabricator.wikimedia.org/T328890 To: mrephabricator Cc: mrephabricator, Aklapper, Fjjulien, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T332425: Add language code support for Bhadrawahi (bhd-deva, bhd-takr) for use in monolingual text, lexemes, and sense glosses
mrephabricator created this task. mrephabricator added projects: Language codes, Wikidata, Wikidata Lexicographical data. TASK DESCRIPTION An example Bhadrawahi lexeme: https://www.wikidata.org/wiki/Lexeme:L1014179 Two script codes are needed: - bhd-deva Devanagari - bhd-takr Takri TASK DETAIL https://phabricator.wikimedia.org/T332425 WORKBOARD https://phabricator.wikimedia.org/project/board/4981/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T332424: Add language code support for Mahasu (bfz-deva, bfz-takr) in monolingual text, lexemes, and sense glosses
mrephabricator created this task. mrephabricator added projects: Language codes, Wikidata, Wikidata Lexicographical data. TASK DESCRIPTION An example Mahasu lexeme: https://www.wikidata.org/wiki/Lexeme:L983576 Two script codes are needed: - bfz-deva Devanagari - bfz-takr Takri TASK DETAIL https://phabricator.wikimedia.org/T332424 WORKBOARD https://phabricator.wikimedia.org/project/board/4981/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T332423: Add language code for Gaddi (gbk-deva, gbk-takr) for use in monolingual text, lexemes, and sense glosses
mrephabricator created this task. mrephabricator added projects: Language codes, Wikidata, Wikidata Lexicographical data. TASK DESCRIPTION An example Gaddi lexeme: https://www.wikidata.org/wiki/Lexeme:L991699 Two script codes are needed: - gbk-deva Devanagari - gbk-takr Takri TASK DETAIL https://phabricator.wikimedia.org/T332423 WORKBOARD https://phabricator.wikimedia.org/project/board/4981/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T332422: Add language code for Kundal Shahi language (shd) to use in monolingual text, lexemes, and sense glosses
mrephabricator created this task. mrephabricator added projects: Language codes, Wikidata, Wikidata Lexicographical data. TASK DESCRIPTION Here is an example lexeme in this language: https://www.wikidata.org/wiki/Lexeme:L1079941 It is written with the Perso-Arabic script, with right-to-left text direction. TASK DETAIL https://phabricator.wikimedia.org/T332422 WORKBOARD https://phabricator.wikimedia.org/project/board/4981/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T317161: Add lexeme language code "pks" for Pakistan Sign Language
mrephabricator added a comment. Just adding a note here that we now have a property for Pakistan Sign Language lexemes on Wikidata: https://www.wikidata.org/wiki/Property:P11652 At the moment it is not possible to add statements containing text in this language, or glosses on senses, with the specific language code. TASK DETAIL https://phabricator.wikimedia.org/T317161 To: mrephabricator Cc: mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331
[Wikidata-bugs] [Maniphest] T314458: Add item termbox label, lexeme label, and monolingual text code support for trw
mrephabricator added a comment. Just adding a note here that we now have a property for Torwali lexemes in Wikidata — https://www.wikidata.org/wiki/Property:P11301 At the moment any statements on these containing Torwali text or sense glosses in the language cannot be added with the language specific code TASK DETAIL https://phabricator.wikimedia.org/T314458 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: Aklapper, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T258391: Lemma not shown when linking senses unless the sense has a definition in the current language
mrephabricator added a comment. It would be great if the links simply showed gloss(es) in the language of the lemma if available. As a matter of practicality, I am not in the habit of glossing senses in languages other than the language of the lexeme—each gloss takes a certain amount of time to add, and if I want to minimize the amount of time I am spending writing glosses this is optimal. I don TASK DETAIL https://phabricator.wikimedia.org/T258391 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Lucas_Werkmeister_WMDE, So9q, Nikki, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T267636: add monolingual code "und-latn"
mrephabricator added a comment. This would be useful for quoting mis- or poorly transcribed transcriptions of text, as well as the titles of works which consist of invented words. TASK DETAIL https://phabricator.wikimedia.org/T267636 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Esc3300, Mbch331, jhsoby, Amire80, Aklapper, Mohammed_Sadat_WMDE, Lydia_Pintscher, Lea_Lacroix_WMDE, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T327796: Request for language codes in lexemes + monolingual text for historical Middle Indic languages: pra, psu, pgd
mrephabricator created this task. mrephabricator added projects: Wikidata Lexicographical data, Wikidata, Language codes.

TASK DESCRIPTION

**Feature summary** (what you would like to be able to do and where):
Use language codes for Prakrit (generally), Sauraseni, and Gandhari in lexemes and in monolingual text for the titles of works and names of historical entities on Wikidata items. The specific code + script combinations I am requesting are as follows:

Prakrit:
- pra-deva
- pra-guru
- pra-arab
- pra-brah

Sauraseni:
- psu-deva
- psu-guru
- psu-arab
- psu-brah

Gandhari:
- pgd-khar
- pgd-deva
- pgd-arab

**Use case(s)** (list the steps that you performed to discover that problem, and describe the actual underlying problem which you want to solve. Do not describe only a solution):
I have been adding a number of lexemes in Prakrit and related languages for historical interest in the etymologies of modern Indic languages. As these words are often indirectly attested--for example, written after the time they originated as oral literature, or known from transcriptions of original works which are lost--they may be written in a variety of scripts. Further, there is no clear line to be drawn where Prakrit ended and the modern languages started; Punjabi and Hindustani are examples of languages which may be considered modern continuations of Prakrit. In texts written in modern languages, Prakrit words are typically transcribed in the script used for the modern descendant. The Urdu Lughat dictionary transcribes Prakrit in the Perso-Arabic script, and Punjabi dictionaries published in India transcribe it in Gurmukhi. This is why I have also been representing Prakrit lexemes in these scripts.

**Benefits** (why should this be implemented?):
This would help Wikidata become a more valuable resource for collecting data about the origins and connections between modern Indic languages.
There are other codes which may be used for the Prakrit varieties which preceded other Indic languages, but for now I am focusing this ticket on those that I have been adding and am more familiar with. Pinging @Amire80 since I had asked about the possibility of getting some of these historical language codes added TASK DETAIL https://phabricator.wikimedia.org/T327796 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Amire80, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T326234: Request for new language codes for Kangri in lexemes and monolingual text: xnr-deva and xnr-takr
mrephabricator created this task. mrephabricator added projects: Wikidata Lexicographical data, Wikidata, Language codes. TASK DESCRIPTION There are two writing systems for Kangri to be represented; Devanagari (Deva) and Takri (Takr). TASK DETAIL https://phabricator.wikimedia.org/T326234 WORKBOARD https://phabricator.wikimedia.org/project/board/2292/ EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T326231: Request for language codes for Middle Persian (Pahlavi) to use in lexemes + monolingual text: pal-phli, pal-phlp, pal-phlv
mrephabricator created this task. mrephabricator added projects: Wikidata Lexicographical data, Wikidata, Language codes. Restricted Application added a subscriber: Huji. TASK DESCRIPTION There are three attested writing systems for Middle Persian. These would be represented as: - pal-phli = Inscriptional Pahlavi - pal-phlp = Psalter Pahlavi - pal-phlv = Book Pahlavi TASK DETAIL https://phabricator.wikimedia.org/T326231 WORKBOARD https://phabricator.wikimedia.org/project/board/2292/ EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: Huji, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T322946: Special:NewLexeme shows raw wiki markup
mrephabricator closed this task as "Resolved". mrephabricator claimed this task. TASK DETAIL https://phabricator.wikimedia.org/T322946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: Lucas_Werkmeister_WMDE, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T322946: Special:NewLexeme shows raw wiki markup
mrephabricator added a comment. OK, the message at the bottom of the new item page is the same but is being resolved differently. I figured out the issue by process of elimination, looking at the other translations and the new lexeme page source code: the license name variable is hard-coded for Special:NewLexeme and substituted in English; something else is happening on Special:NewLexeme. I think it will resolve if I allow it to insert the English string. It was possible to translate the whole thing on the old page. I can make a pull request to remove the italic text. TASK DETAIL https://phabricator.wikimedia.org/T322946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: Lucas_Werkmeister_WMDE, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T322946: Special:NewLexeme shows raw wiki markup
mrephabricator added a comment. The brackets are in the correct direction - TranslateWiki will actually not let you publish translations without brackets in the correct direction. The markup also resolves in the Translatewiki interface. Using FSI/PDI that way just forces the characters to render that way. They do not behave like quotation marks which can make things confusing. TASK DETAIL https://phabricator.wikimedia.org/T322946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: Lucas_Werkmeister_WMDE, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T322946: Special:NewLexeme shows raw wiki markup
mrephabricator added a comment. The message(s) on the new item page are different as far as I can tell. The messages are also not legible at all due to being rendered in Italics, which breaks the glyph rendering. The translation is more for other people using TranslateWiki than anything. I do not understand why italics are being used here if users' interpretation of the message is a concern. TASK DETAIL https://phabricator.wikimedia.org/T322946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: Lucas_Werkmeister_WMDE, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T322946: Special:NewLexeme shows raw wiki markup
mrephabricator added a comment. Yes, it is strange - it seems specific to this string, in this version of Special:NewLexeme, in this locale. I initially added that commented line when the Special:NewLexeme (old) page and Special:NewLexemeAlpha page were both online, where the comment would be hidden on the first. (The comment saying something along the lines of, if you are seeing this message, it is because there is an error preventing this wikitext and substitutions from resolving.) I think most of what is on the New Item page is translated through the on-wiki instance of Content Translation, which has not had this issue from what I have seen. It seems like the two places where an issue might be introduced are: 1) the delivery of the string from TranslateWiki to Special:NewLexeme or 2) the handling of the string by the surrounding elements of the Special:NewLexeme front end. Since Special:NewLexeme is now updated in full, I have just updated the message to have no wikitext besides the substitution variables and a more complete explanation of the issue. 
تسیں ”بݨا“ بٹن کریئے، ورتݨ دیاں شرطاں نال تسیں راضی ہندے اے، اتے تہاڈے کم لئی کریاٹیو کامنز زیرو لائیسنس ورتوگے سی۔ کوئی کجھ لیݨ دیݨ، عاوم یا نجی دی ورتوں لئی ایہتھوں اِجازت دتی اے۔ وِکیڈیٹے توں غلطی ہوگئی اے جو لوڑے جوڑ نہیں لگادے۔ ایہہ مُلاں ”$1“، ”[[$2]]“ تے ”[$3 $4]“ لئی لائسنس خاص پتے ویکھا چاہیدے۔ ایس غلطی بارے «phabricator.wikimedia.org» تے ”T322946 <https://phabricator.wikimedia.org/T322946>“ مسئلہ لبھ جا سکدے او۔ TASK DETAIL https://phabricator.wikimedia.org/T322946 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: Lucas_Werkmeister_WMDE, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T318543: Support for transliteration variants
mrephabricator added a comment. You could try sux-x-QID as I don't think that format is limited to mis. The advantage of using a QID is that it may be localised; what would be nice is to render a label based on the QID TASK DETAIL https://phabricator.wikimedia.org/T318543 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Situxx, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
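The `sux-x-QID` suggestion in T318543 above follows the private-use pattern, in which everything after `-x-` is a locally defined subtag. Below is a minimal Python sketch of building and splitting such a code. The helper names, the regular expression, and the item ID `Q12345` are made up for illustration, and this is not Wikibase's own validation logic.

```python
import re
from typing import Tuple

# Assumed shape, for illustration only: a 2-3 letter base language code,
# then "-x-", then a Wikidata item ID such as Q12345.
_PRIVATE_USE = re.compile(r"^(?P<base>[a-z]{2,3})-x-(?P<qid>Q[1-9][0-9]*)$", re.IGNORECASE)


def build_variant_code(base: str, qid: str) -> str:
    """Join a base language code and an item ID into a private-use code."""
    code = f"{base.lower()}-x-{qid.upper()}"
    if not _PRIVATE_USE.match(code):
        raise ValueError(f"not a valid base/QID pair: {base!r}, {qid!r}")
    return code


def split_variant_code(code: str) -> Tuple[str, str]:
    """Split a private-use code back into (base language code, item ID)."""
    match = _PRIVATE_USE.match(code)
    if match is None:
        raise ValueError(f"not a private-use variant code: {code!r}")
    return match.group("base").lower(), match.group("qid").upper()


if __name__ == "__main__":
    # Q12345 is a placeholder ID, not the item for any real transliteration scheme.
    print(build_variant_code("sux", "Q12345"))
    print(split_variant_code("mis-x-Q12345"))
```

Keeping the item ID as a separate field is what would let an interface look up and render a localised label for it, as the comment proposes.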
[Wikidata-bugs] [Maniphest] T322947: Entering a lemma in Devanagari script on Special:NewLexeme does not work anymore
mrephabricator closed this task as "Resolved". mrephabricator claimed this task. mrephabricator added a comment. I think that's exactly what it is actually. There are problems with all of the Indic ULS keyboards I have tried. I was not able to replicate this TASK DETAIL https://phabricator.wikimedia.org/T322947 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: Mahir256, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T322686: tab behaviour issue on Special:NewLexeme
mrephabricator added a comment. I will point out that the terms and conditions section does not actually render as intended in every locale. For pnb it shows raw wiki markup and the links do not resolve. TASK DETAIL https://phabricator.wikimedia.org/T322686 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Sarai-WMDE, Nikki, Lydia_Pintscher, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T322151: Glosses and monolingual text as in usage examples should force a linebreak after for mul and mis
mrephabricator added a comment. An example of what I mean may be seen here in the usage example statement: https://wikidata.org/wiki/Lexeme:L722046 The issue is apparent at certain window sizes/dimensions in both the mobile and desktop views. F35699888: Screenshot_20221102-231451.png <https://phabricator.wikimedia.org/F35699888> The reason I have chosen to include two text directions in contexts like this is because otherwise there is no way to indicate directly that these are equivalent. It also makes it easier to give examples attested in speech rather than writing. Putting them together seems functionally equivalent to how representations of a lemma are rendered in statements. TASK DETAIL https://phabricator.wikimedia.org/T322151 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, Prufkick, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, Muhammed4IT, LawExplorer, _jensen, rosalieper, Scott_WUaS, Srdjan, MuhammadShuaib, LNDDYL, Psychoslave, Wikidata-bugs, aude, Huji, Amire80, Gryllida, Shizhao, Arrbee, Mbch331, Jay8g ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T322151: Glosses and monolingual text as in usage examples should force a linebreak after for mul and mis
mrephabricator changed the subtype of this task from "Task" to "Bug Report". TASK DETAIL https://phabricator.wikimedia.org/T322151 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, Prufkick, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, Muhammed4IT, LawExplorer, _jensen, rosalieper, Scott_WUaS, Srdjan, MuhammadShuaib, LNDDYL, Psychoslave, Wikidata-bugs, aude, Huji, Amire80, Gryllida, Shizhao, Arrbee, Mbch331, Jay8g ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T322151: Glosses and monolingual text as in usage examples should force a linebreak after for mul and mis
mrephabricator created this task. mrephabricator added projects: Wikidata Lexicographical data, RTL, Wikidata, I18n. Restricted Application added a project: wdwb-tech. TASK DESCRIPTION mul and mis text can be in any or multiple text directions and the label for the language "name" in these cases is not properly isolated from the string. Without using Unicode control characters, we get something like: [LTR text] (multiple languages) / [RTL text] (simulated using just LTR text to avoid unintentional confusion) In these instances, the language label actually shows up in the middle of the string which is undesirable. With Unicode control characters (POP Isolate pairs around each differing direction portion), we see: multiple) [LTR text] / [RTL (languages text] when the string gets too wide as it often does. This is slightly better but it is also confusing to look at. TASK DETAIL https://phabricator.wikimedia.org/T322151 WORKBOARD https://phabricator.wikimedia.org/project/board/2292/ EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, Prufkick, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, Muhammed4IT, LawExplorer, _jensen, rosalieper, Scott_WUaS, Srdjan, MuhammadShuaib, LNDDYL, Psychoslave, Wikidata-bugs, aude, Huji, Amire80, Gryllida, Shizhao, Arrbee, Mbch331, Jay8g ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
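The isolate wrapping described in T322151 above can be sketched in Python as follows. This assumes "Isolate pairs" means U+2068 FIRST STRONG ISOLATE paired with U+2069 POP DIRECTIONAL ISOLATE; the function names are hypothetical. It only shows how the control characters are placed around each directional run and around the whole value, and it does not address the line-wrapping problem the task reports.

```python
FSI = "\u2068"  # FIRST STRONG ISOLATE: direction taken from the first strong character
PDI = "\u2069"  # POP DIRECTIONAL ISOLATE: closes the most recent isolate


def isolate(text: str) -> str:
    """Wrap one run of text so its direction cannot affect its neighbours."""
    return f"{FSI}{text}{PDI}"


def join_mixed(segments, label, separator=" / "):
    """Isolate each segment, then isolate the whole value from the language label."""
    body = separator.join(isolate(segment) for segment in segments)
    return f"{isolate(body)} ({label})"


if __name__ == "__main__":
    value = join_mixed(["kade", "\u06a9\u062f\u06d2"], "multiple languages")
    # Show the code points so the invisible controls are visible in the output.
    print(" ".join(f"U+{ord(c):04X}" for c in value))
```

In HTML output the same isolation is usually expressed with markup (for example a `bdi` element or a `dir` attribute) rather than with raw control characters.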
[Wikidata-bugs] [Maniphest] T300261: Figure out correct HTML attributes for formatting lexeme ID values using lemmas
mrephabricator added a comment. Restricted Application added a project: wdwb-tech. As a rule, the direction should be set to "ltr" or "rtl" as appropriate if known for the language, and "auto" if the language is unknown. The main reason for this is that a lemma can include ambiguous characters and without the direction tags, lemmas with ambiguous characters at the beginning or end currently re-orient them according to the interface language. The alignment of the text to the left or right overall is not important by comparison. (Personally, I actually prefer that they all align to the same direction, left or right. As lemma representations are unordered, this means that for languages which use both RTL and LTR scripts, you end up with "zig-zag" shaped lists of statements where each entry is aligned to one side depending on which representation happened to be entered first.) Here is an example: If we have کدے… The ellipsis is the final character and should be shown to the left of the word, as: کدے… (Simulated here followed by the Arabic Letter Mark control character.) We can only get this behaviour if the lemma is tightly wrapped in dir="rtl" or dir="auto", or immediately followed by a control character (Right to Left Mark should be used for Hebrew, and Arabic Letter Mark for all other RTL scripts including non-Arabic ones like Divehi). Without this, the ambiguous character can end up anywhere. Since my interface is set to a RTL locale, this means that for example, a lemma like "-ed" shows up as "ed-" in various places, particularly if it is a lemma without a standard code. Every Proto-Indo-European lemma shows the asterisks and hyphens in the wrong position, but this would be fixed with dir="auto". 
TASK DETAIL https://phabricator.wikimedia.org/T300261 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Lucas_Werkmeister_WMDE, Jakob_WMDE, Astuthiodit_1, Prufkick, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, Muhammed4IT, LawExplorer, _jensen, rosalieper, Scott_WUaS, Srdjan, MuhammadShuaib, LNDDYL, Psychoslave, Wikidata-bugs, aude, Huji, Amire80, Gryllida, Shizhao, Arrbee, Mbch331, Jay8g ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
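The rule stated in the T300261 comment above (use "ltr" or "rtl" when the direction is known, "auto" otherwise, or else append a directional mark) can be sketched in Python. The function names and the use of a `span` element are assumptions for illustration, not the markup Wikibase actually emits; the script codes are ISO 15924 codes.

```python
import html
from typing import Optional

ALM = "\u061c"  # ARABIC LETTER MARK
RLM = "\u200f"  # RIGHT-TO-LEFT MARK


def lemma_span(lemma: str, direction: Optional[str] = None) -> str:
    """Wrap a lemma tightly in an element carrying an explicit dir attribute.

    direction is "ltr" or "rtl" when known for the language; anything else
    (including None) falls back to "auto".
    """
    dir_value = direction if direction in ("ltr", "rtl") else "auto"
    return f'<span dir="{dir_value}">{html.escape(lemma)}</span>'


def with_trailing_mark(lemma: str, script: str) -> str:
    """Plain-text fallback for RTL lemmas: append RLM for Hebrew, ALM otherwise."""
    mark = RLM if script == "Hebr" else ALM
    return lemma + mark


if __name__ == "__main__":
    print(lemma_span("-ed"))                         # language unknown, so dir="auto"
    print(lemma_span("\u06a9\u062f\u06d2\u2026", "rtl"))
    print(repr(with_trailing_mark("\u06a9\u062f\u06d2\u2026", "Arab")))
```

Wrapping only the lemma, rather than a larger container, is what keeps a leading hyphen or asterisk attached to the correct side regardless of the interface language.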
[Wikidata-bugs] [Maniphest] T316979: A long x-scroll bar appear when clicking on any edit link on any Wikidata item/property page with RTL languages
mrephabricator added a comment. Restricted Application added a project: wdwb-tech. This is likely the same issue as T321441 <https://phabricator.wikimedia.org/T321441> , where a solution has been posted. I did not spot this in checking to see if a ticket existed TASK DETAIL https://phabricator.wikimedia.org/T316979 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Lucas_Werkmeister_WMDE, Sakura_emad, Aklapper, AramBakir, Astuthiodit_1, Prufkick, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, Muhammed4IT, LawExplorer, _jensen, rosalieper, Scott_WUaS, Srdjan, MuhammadShuaib, LNDDYL, Psychoslave, Wikidata-bugs, aude, Huji, Amire80, Gryllida, Shizhao, Arrbee, Mbch331, Jay8g ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T321441: Adding a statement to entity on Wikidata now causes extremely long horizontal scroll
mrephabricator added a subscriber: Nikki. mrephabricator added a comment. @Nikki was able to help identify that this problem is caused by a `textarea` with `inset: -px auto -px auto` set as an HTML style attribute. Upon adding the following to my user CSS, it now works beautifully again: textarea { inset: auto auto auto auto !important; } The `textarea` also problematically has no `id` or `class`, and can be inserted at any position, so overriding for all `textarea` elements is necessary. TASK DETAIL https://phabricator.wikimedia.org/T321441 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: Nikki, Aklapper, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, Manishagoenka, maantietaja, cristiana023, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, JGirault, Scott_WUaS, Wikidata-bugs, aude, Mbch331, Jay8g ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T321441: Adding a statement to entity on Wikidata now causes extremely long horizontal scroll
mrephabricator added a comment. Changing the labels on items also causes this problem. TASK DETAIL https://phabricator.wikimedia.org/T321441 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: Aklapper, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, Manishagoenka, maantietaja, cristiana023, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, JGirault, Scott_WUaS, Wikidata-bugs, aude, Mbch331, Jay8g ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T321441: Adding a statement to entity on Wikidata now causes extremely long horizontal scroll
mrephabricator added projects: Design, Browser-Support-Firefox. TASK DETAIL https://phabricator.wikimedia.org/T321441 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: Aklapper, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, Manishagoenka, maantietaja, cristiana023, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, JGirault, Scott_WUaS, Wikidata-bugs, aude, Mbch331, Jay8g ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T321441: Adding a statement to entity on Wikidata now causes extremely long horizontal scroll
mrephabricator created this task. mrephabricator added a project: Wikidata. Restricted Application added a subscriber: Aklapper. TASK DESCRIPTION **Steps to replicate the issue** (include links if applicable): - Using interface language pnb (Punjabi Shahmukhi, right-to-left Perso-Arabic script) - You can test for example on this sandbox entity in the same interface language here https://www.wikidata.org/wiki/Lexeme:L123?uselang=pnb and try adding a statement such as P8530 <https://phabricator.wikimedia.org/P8530> https://www.wikidata.org/wiki/Property:P8530?uselang=pnb - This happens in the latest stable version of Firefox on desktop. I have not seen this on mobile Chromium (browsing Wikidata in "Desktop mode") **What happens?**: An extremely long horizontal scroll is added to the page as soon as a new statement is added. This has been happening for the last couple of weeks; it did not use to happen. **What should have happened instead?**: No horizontal scrolling. **Software version** (skip for WMF-hosted wikis like Wikipedia): **Other information** (browser name/version, screenshots, etc.): latest stable version of Firefox on Windows desktop TASK DETAIL https://phabricator.wikimedia.org/T321441 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: Aklapper, mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T317037: Punjabi Gurmukhi nukta / bindi character NFC normalization should be turned off
mrephabricator added a comment. See also T206188 <https://phabricator.wikimedia.org/T206188> , similar task related to nukta letter error in Bengali TASK DETAIL https://phabricator.wikimedia.org/T317037 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, Prufkick, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, Af420, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, JJMC89, _jensen, rosalieper, Scott_WUaS, Srdjan, MuhammadShuaib, LNDDYL, Psychoslave, Nirmos, Cwek, Wikidata-bugs, aude, Dinoguy1000, Gryllida, Shizhao, Arrbee, KartikMistry, Arlolra, Jackmcbarn, Mbch331, Jay8g ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T317037: Punjabi Gurmukhi nukta / bindi character NFC normalization should be turned off
mrephabricator created this task. mrephabricator added projects: MediaWiki-Internationalization, MediaWiki-Parser, Wikidata, Wikidata Lexicographical data, I18n, MediaWiki-libs-utfnormal. Restricted Application added a project: wdwb-tech. TASK DESCRIPTION These nukta / bindi characters of the Gurmukhi Unicode block have precomposed forms, which the Unicode NFC normalization specification has exceptions for to decompose them to the "parent" character + nukta / bindi attaching character. ਸ਼ ਖ਼ ਲ਼ ਗ਼ ਫ਼ ਜ਼ This apparently seems to be for purported backwards compatibility issues, but the current situation on the web is that the precomposed characters are preferred by most websites and databases which use Punjabi Gurmukhi. This is understandable, as these letters represent one single consonant each, and it is quite annoying for users to have to press backspace twice for them while not having to for others. Keyboard layouts tend to use the precomposed characters. The use of precomposed characters in URLs makes many Punjabi websites and external identifiers unlinkable from Wikimedia sites. For example, you can see here https://www.wikidata.org/wiki/Lexeme:L697770 the Sri Granth ID link which does exist is broken. Entering escape sequence manually in the property does not work either. This is a problem for the lexeme data itself as well, for reconciling against other databases, for transliteration to Shahmukhi (Perso-Arabic script), and for use with newer fonts which tend to operate under the assumption that people are using the preferred precomposed characters. I am not sure where the most effective and least controversial place to change this is. Would Unicode ever change this? Could this be changed in the NFC normalization library itself, or should it be changed on a case by case basis for inputs in Wikimedia projects where an override would be particularly warranted? 
Maybe someone here knows TASK DETAIL https://phabricator.wikimedia.org/T317037 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, Prufkick, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, Af420, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, JJMC89, _jensen, rosalieper, Scott_WUaS, Srdjan, MuhammadShuaib, LNDDYL, Psychoslave, Nirmos, Cwek, Wikidata-bugs, aude, Dinoguy1000, Gryllida, Shizhao, Arrbee, KartikMistry, Arlolra, Jackmcbarn, Mbch331, Jay8g ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
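The decomposition described in T317037 above can be reproduced with Python's standard `unicodedata` module. This is a small sketch for checking the behaviour, not the code path MediaWiki uses; the dictionary and function names are made up for illustration. It lists the six precomposed Gurmukhi letters by code point and shows that NFC turns each one into a base letter followed by U+0A3C (the Gurmukhi nukta sign), which also changes the percent-encoded form of a URL containing it.

```python
import unicodedata
from urllib.parse import quote

# The six precomposed Gurmukhi letters named in the task, by code point.
PRECOMPOSED = {
    "SHA": "\u0a36",
    "KHHA": "\u0a59",
    "LLA": "\u0a33",
    "GHHA": "\u0a5a",
    "FA": "\u0a5e",
    "ZA": "\u0a5b",
}


def nfc_codepoints(text: str) -> list:
    """Return the code points of text after NFC normalization."""
    return [f"U+{ord(c):04X}" for c in unicodedata.normalize("NFC", text)]


def url_forms(text: str) -> tuple:
    """Percent-encode text as entered and after NFC normalization."""
    return quote(text), quote(unicodedata.normalize("NFC", text))


if __name__ == "__main__":
    for name, letter in PRECOMPOSED.items():
        raw, normalized = url_forms(letter)
        print(name, f"U+{ord(letter):04X}", "->", " ".join(nfc_codepoints(letter)))
        print("  URL as entered:", raw, "| after NFC:", normalized)
```

A site that stores the precomposed letter in its URLs will see a different byte sequence once the identifier has passed through NFC, which is consistent with the broken external link the task reports.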
[Wikidata-bugs] [Maniphest] T316053: Gurmukhi language labels to add to the CLDR Extension on Gerrit
mrephabricator claimed this task. mrephabricator added a comment. Assigning to myself as I figured out how to submit patches on Gerrit TASK DETAIL https://phabricator.wikimedia.org/T316053 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, Af420, GoranSMilovanovic, Mahir256, QZanden, Esc3300, LawExplorer, _jensen, rosalieper, Scott_WUaS, MuhammadShuaib, LNDDYL, Psychoslave, Wikidata-bugs, aude, Nemo_bis, Raymond, Arrbee, KartikMistry, Mbch331, Jay8g ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T316053: Gurmukhi language labels to add to the CLDR Extension on Gerrit
mrephabricator created this task. mrephabricator added projects: Wikidata, Wikidata Lexicographical data, MediaWiki-extensions-CLDR, MediaWiki-Internationalization. TASK DESCRIPTION I haven't been able to figure out how to make a patch for this on Gerrit so here is a post in case somebody can push these through. These are languages in the termbox list on a Wikidata item which either have the wrong label or show an English fallback when using the site with Punjabi Gurmukhi set as the locale. (Cloning the repository results in a "no permission" error even with an account + SSH key set up on Gerrit.)
The file would be LocalNames/LocalNamesPa.php.
The languages:
- `cdo`: ਚੀਨੀ (ਮਿਨ ਡੌਂਗ)
- `cr`: ਕਰੀ
- `bzl`: ਬਰਾਜ਼ੀਲੀ ਸਾਈਨ ਭਾਸ਼ਾ
- `crh`: ਕਰੀਮੀਨ ਟਾਟਾਰ
- `crh-latin`: ਕਰੀਮੀਨ ਟਾਟਾਰ (ਲਾਤੀਨੀ ਲਿੱਪੀ)
- `csb`: ਕਾਸ਼ੂਬੀਅਨ
- `dag`: ਡਗਬਾਨੀ
- `dty`: ਡੋਟੇਲੀ
- `eml`: ਈਮਿਲੀਅਨੋ-ਰੋਮਾਗਨੋਲੋ
- `ext`: ਐਕਸਟਰੀਮਾਡੂਰਨ
- `frp`: ਅਰਪਿਟਨ
- `frr`: ਉੱਤਰੀ ਫਰਿਸੀਅਨ
- `gcr`: ਗੂਈਆਨੀਸ ਕਰੀਓਲ ਫਰੈਂਚ
- `glk`: ਗਿਲਾਕੀ
- `gom`: ਗੋਆਂ ਕੋਂਕਣੀ
- `got`: ਗੋਥਿਕ
- `guw`: ਗੰਨ
- `hyw`: ਪੱਛਮੀ ਆਰਮੇਨੀਆਈ
- `ie`: ਇੰਟਰਲਿੰਗੂਆ
- `ik`: ਇਨੂਪੂਆਕ
- `ilo`: ਇਲੋਕਾਨੋ
- `jam`: ਜਾਮਾਈਕਨ ਕਰੀਓਲ ਅੰਗਰੇਜ਼ੀ
- `kaa`: ਕਰਾਕਲਪਾਕ
- `kbp`: ਕਾਬੀਏ
- `kg`: ਕੌਂਗੋ
- `ku-latn`: ਕੁਰਦਿਸ਼ (ਲਾਤੀਨੀ ਲਿੱਪੀ)
- `lbe`: ਲਾਕ
- `lfn`: ਲਿੰਗੂਆ ਫਰਾਂਸਾ ਨੋਵਾ
- `lij`: ਲਿਗੂਰੀਅਨ
- `lld`: ਲਾਡੀਨੋ
- `lmo`: ਲੌਂਬਾਰਡ
- `ltg`: ਇਟਾਲੀਅਨ
- `lzh`: ਚੀਨੀ (ਰਵਾਇਤੀ)
- `map-bms`: ਬਨਯੂਮਾਸਨ
- `mhr`: ਪੂਰਬੀ ਮੈਰੀ
- `mnw`: ਮੋਨ
- `mrj`: ਪੱਛਮੀ ਮਾਰੀ
- `ms-arab`: ਮਲਯ (ਜਾਵੀ ਲਿੱਪੀ)
- `nah`: ਨਾਵਾਚ
- `ng`: ਓਸ਼ਿਵਾਮਬੋ
- `nov`: ਨੋਵੀਅਲ
- `nrm`: ਨੌਰਮਨ
- `olo`: ਲਿਵੀ-ਕਾਰੇਲੀਅਨ
- `pcd`: ਪਿਕਾਰਡ
- `pdc`: ਪੈੱਨਸਿਲਵੇਨੀਆ ਜਰਮਨ
- `pfl`: ਪਫਾਈਲਜ਼ਿਸਚ
- `pih`: ਪਿਟਕਰਨ-ਨੋਰਫਕ
- `pms`: ਪੀਦਮੋਨਟੀਸ
- `rmf`: ਕਾਲੋ ਫ਼ਿਨੀ
- `roa-taro`: ਟੈਰੇਨਟੀਨੋ
- `rue`: ਰੁਸਿਨ
- `bat-smg`: ਸਾਮੋਗਿਤਿਆਂ
- `hr`: ਸਰਬੋ-ਕ੍ਰੋਏਸ਼ੀਅਨ
- `stq`: ਸਾਤਰਲੇਨਡੀ
- `tcy`: ਤੁਲੁ
- `tg-cyrl`: ਤਾਜਿਕ (ਸਿਰਿਲਿਕ ਲਿਪੀ)
- `tl`: ਤਗਾਲੋਗ
- `tt`: ਟਾਟਾਰ (ਸਿਰਿਲਿਕ ਲਿਪੀ)
- `tt-latn`: ਟਾਟਾਰ (ਲਾਤੀਨੀ ਲਿੱਪੀ)
- `vec`: ਵੇਨੇਸ਼ੀਅਨ
- `vep`: ਵੇਪਸ
- `vls`: ਪੱਛਮੀ ਫਲੇਮਿਸ਼
- `fiu-vro`: Võro ਵੋਰੋ
- `za`: ਜ਼ੁਆਂਗ
- `zea`: ਜ਼ੀਲੈਂਡੀ
- `zh-cn`: ਚੀਨੀ (ਚੀਨ)
- `zh-tw`: ਚੀਨੀ (ਤਾਈਵਾਨ)
- `zh-hk`: ਚੀਨੀ (ਹਾਂਗ ਕਾਂਗ)
Any help on this would be appreciated.
TASK DETAIL https://phabricator.wikimedia.org/T316053 WORKBOARD https://phabricator.wikimedia.org/project/board/71/ EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, Af420, GoranSMilovanovic, Mahir256, QZanden, Esc3300, LawExplorer, _jensen, rosalieper, Scott_WUaS, MuhammadShuaib, LNDDYL, Psychoslave, Wikidata-bugs, aude, Nemo_bis, Raymond, Arrbee, KartikMistry, Mbch331, Jay8g ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
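For anyone turning the list from T316053 into a patch, a small helper can generate the entries rather than typing them by hand. This is a sketch only: it assumes that LocalNames/LocalNamesPa.php stores names as PHP associative-array entries of the shape `'code' => 'label',`, which should be checked against the actual file before use. The function name `php_array_entries` and the three sample entries (copied from the list) are illustrative.

```python
def php_array_entries(labels):
    """Format {code: label} pairs as PHP array lines, sorted by language code.

    Assumes single-quoted PHP strings, so backslashes and single quotes
    inside a label are escaped.
    """
    lines = []
    for code, label in sorted(labels.items()):
        escaped = label.replace("\\", "\\\\").replace("'", "\\'")
        lines.append(f"\t'{code}' => '{escaped}',")
    return "\n".join(lines)


# A few entries from the task description, used as sample input.
SAMPLE = {
    "zh-tw": "ਚੀਨੀ (ਤਾਈਵਾਨ)",
    "cr": "ਕਰੀ",
    "got": "ਗੋਥਿਕ",
}

print(php_array_entries(SAMPLE))
```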
[Wikidata-bugs] [Maniphest] T316000: Add item termbox label, lexeme label, and monolingual text code support for Balti Yige (bft-tibt)
mrephabricator renamed this task from "Add item termbox label, lexeme label, and monolingual text code support for Balti Yige" to "Add item termbox label, lexeme label, and monolingual text code support for Balti Yige (bft-tibt)". mrephabricator updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T316000 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T315999: Add item termbox label, lexeme label, and monolingual text code support for Brahui Brolikva (brh-latn)
mrephabricator renamed this task from "Add item termbox label, lexeme label, and monolingual text code support for Brahui Brolikva" to "Add item termbox label, lexeme label, and monolingual text code support for Brahui Brolikva (brh-latn)". mrephabricator updated the task description. TASK DETAIL https://phabricator.wikimedia.org/T315999 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T315957: Special:NewLexeme allows creating lexemes with spelling variant skr-arab
mrephabricator added a project: Language codes. TASK DETAIL https://phabricator.wikimedia.org/T315957 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T316040: Add item termbox label, lexeme label, and monolingual text code support for Dogra (doi)
mrephabricator created this task. mrephabricator added projects: Wikidata Lexicographical data, Wikidata, Language codes. TASK DESCRIPTION There are three scripts:
- Devanagari (doi-deva)
- Dogra (doi-dogr)
- Perso-Arabic (doi-arab)
TASK DETAIL https://phabricator.wikimedia.org/T316040 WORKBOARD https://phabricator.wikimedia.org/project/board/2292/ EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T306918: Prohibit duplication of mul labels in other languages
mrephabricator added a comment. Another example to consider - the dinosaur Changdusaurus (en) was first described in Chinese sources as 昌都龍. In Vietnamese, this dinosaur is called Changtusaurus, having been transliterated from Chinese using Vietnamese Latin script. (That other languages have duplicated the English name is likely incidental - there is no reason to prefer one over the other, and like many dinosaur names, this represents a genus but not one with a Latin taxon name.) If a different dinosaur name derived the same way in Vietnamese and English happened to match, that would not mean they have the same name in each language, since the shared letters don't represent the same sound. Should that "duplicate" get removed we could say that it would not matter because a query would return the same fallback anyway, but the same would be true for dinosaurs which never had a Vietnamese name entered to begin with. The information about which labels exactly would be homographic between which languages would be gone, and a certain amount of unrecoverable data would be gone. This would make working with data within a given language harder as there would be no way to tell between mul (fallback added for English and Swedish) and mul (differently pronounced English and Hawaiian words happened to be written the same way) further skewing the data quality outside of a handful of popular languages. At least ensuring that "mul" is understood as meaning "multiple languages" and not "Latin script" could prevent some of this from happening. I think it would be fitting that preference be given to labels which would not fit anywhere else but would be legible in other languages. For example, if the Balti name of a town in Gilgit-Baltistan is added to mul in absence of a bft Balti code, it would likely be legible to Urdu readers or Kashmiri readers and so on. 
Then if readers of those uncoded languages are using Urdu or English as a locale, they would still be able to get these names as a fallback. TASK DETAIL https://phabricator.wikimedia.org/T306918 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Lucas_Werkmeister_WMDE, Lydia_Pintscher, Nikki, Mahir256, Manuel, Aklapper, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T306918: Prohibit duplication of mul labels in other languages
mrephabricator added a comment. It's entirely possible that duplicate labels are not a real problem. There has been heated debate about this same thing on OpenStreetMap for years at this point, but the consensus has always been to keep the "duplicates", as they really contain information that data consumers can't do without. Many of the detractors allege that Wikidata would be able to store this information should it be removed from OpenStreetMap, but if that becomes no longer true, it seems like that could damage Wikidata's credibility as a useful tool for interlingual labels, as so far it has been discussed as a way to store more of that kind of information rather than less of it. TASK DETAIL https://phabricator.wikimedia.org/T306918 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Lucas_Werkmeister_WMDE, Lydia_Pintscher, Nikki, Mahir256, Manuel, Aklapper, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T306918: Prohibit duplication of mul labels in other languages
mrephabricator added a comment. This should not be done. ک in Urdu is ڪ in Sindhi, but Sindhi still has ک but uses it for a different sound. It is exceptional in this regard, so it would not be surprising for the "mul" label to be read as using ک to represent what it does more commonly. This would mean that a label in Sindhi could be identical to an Urdu one while representing a word that is meant to be pronounced distinctly from the Urdu one. This likely extends to most scripts. "W" and "v" are homophonous sounds to many users of Latin scripts. For example with Latin script, if we look at this item: https://www.wikidata.org/wiki/Q113450202 I have labeled this in English as "Waddi Punjabi Lughat" as this is how many South Asian English speakers and users of Latin script would be inclined to spell it. However, Vaddi Punjabi Lughat is the label I have used for Canadian, American, and British English because to speakers of these English dialects, the sound they would associate with "V" would be a closer match to the correct pronunciation. If I were to duplicate the label across dialects, this would be indicating the useful information that the "W" would be understood as a typical spelling in all of them, meaning that it would be reasonable for an American to pronounce "Waddi" like "water" even if this is not the "original" pronunciation. That makes duplicating the label an indicator of useful information which would not be clear otherwise. I think it is quite likely that people will use homoglyph letters as substitutes to get around this, or even unintentionally. For example, ڻ and ٹ are different letters which are associated with different sounds. However, they look identical in middle and initial positions. So if we have ڻڻڻ and ٹٹٹ, you would have a hard time telling what the first two letters are. There are lots of things we can fudge like this in various scripts and have it go unnoticed. 
Hawaii in the native language Hawaiian, which uses the Latin script, is spelled Hawaiʻi. If we write this as Hawai'i, using an apostrophe rather than the ʻokina character used for Polynesian languages in Latin script, we have now "duplicated" the string without using the same characters. Many would do this entirely unintentionally not knowing ʻokina is a different character, and then if someone wanted to correct the character in the termbox it is in, it would give an error. TASK DETAIL https://phabricator.wikimedia.org/T306918 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Lucas_Werkmeister_WMDE, Lydia_Pintscher, Nikki, Mahir256, Manuel, Aklapper, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
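The homoglyph point in the comment above can be checked with a minimal sketch, assuming Python's standard `unicodedata` module. The code points are an assumption based on the letters named in the comment: U+02BB for the ʻokina, U+0027 for the ASCII apostrophe, U+06BB for ڻ and U+0679 for ٹ. The helper names are illustrative.

```python
import unicodedata


def describe(text):
    """List each character as 'U+XXXX NAME' so look-alikes can be told apart."""
    return [f"U+{ord(ch):04X} {unicodedata.name(ch, '?')}" for ch in text]


def same_after_normalization(a, b, form="NFKC"):
    """True if two strings become identical under the given normalization form."""
    return unicodedata.normalize(form, a) == unicodedata.normalize(form, b)


OKINA_SPELLING = "Hawai\u02BBi"       # spelled with the ʻokina
APOSTROPHE_SPELLING = "Hawai\u0027i"  # spelled with an ASCII apostrophe

# Letters that look alike in initial and medial positions but are distinct.
RNOON_RUN = "\u06BB" * 3
TTEH_RUN = "\u0679" * 3

for left, right in [(OKINA_SPELLING, APOSTROPHE_SPELLING), (RNOON_RUN, TTEH_RUN)]:
    print(same_after_normalization(left, right))
    for a, b in zip(describe(left), describe(right)):
        if a != b:
            print("  ", a, "vs", b)
```

Neither pair is unified by Unicode normalization, so a string-equality check on labels would treat them as different labels even though many readers would see the same text.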
[Wikidata-bugs] [Maniphest] T312097: [EPIC] Enable language code mul on Wikidata
mrephabricator added a comment. I know "mul-arab" has been brought up in previous threads and was not included at the time, but I would like to strongly recommend reconsidering, and that at least 3 "mul" termbox labels be made to minimize confusion. These would be:
- mul for standalone numbers and glyphs
- mul for left to right
- mul for right to left
The exact names or codes do not matter much, I think, so long as they are usable for these purposes. The problem with a single "mul" is that left-to-right and right-to-left scripts, when rendered together in a line or using the same code, often result in messy rendering in browsers, with unpredictable positions of text or things like letters appearing out of order.
I mention standalone numbers and glyphs to account for the fact that there are no actual Arabic "Arabic numerals": the digits used in most writing systems, regardless of overall direction, are variations of the same Indic numerals, which are always read left to right. So 1, ⠁, ١, ੧, and so on may be rendered the same way for items with standalone labels like this in any writing system.
For some right-to-left examples, let's say we have left-to-right "Pakistan"; we may write this as پاکستان in:
- Brahui*
- Persian
- Uyghur*
- Saraiki*
- Pashto
- Punjabi
- Luri**
- Kashmiri
- Sorani/Central Kurdish
- Balochi**
- Azerbaijani
- Urdu
Here * indicates a language which currently shares both Arabic-based and Latin-based scripts in one box, as they have not benefited from separate codes, and ** indicates languages which seem to have boxes only for a specific dialect, and which likely do not indicate anything dialect-specific in the absence of a main label for the language or other dialect codes. The languages that have a different label here are Arabic, Malay, Sindhi, and Mazanderani. There are several languages missing from the termbox labels which could use the same label when/if added.
پ is the letter that prevents Arabic from sharing a label here - for strings which only use characters shared among a greater set of languages, the list would be longer. This could be done for basically any place name or person's name in South Asia for example, and the additional advantage of having a right-to-left label like this is that there would be at least something to display that is legible within various languages which Wikidata does not support yet. Until there is a code for Khowar, we would be able to put the name for a town where most people speak Khowar in the mul right-to-left box in the mean time. TASK DETAIL https://phabricator.wikimedia.org/T312097 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Aklapper, Manuel, Astuthiodit_1, BeautifulBold, Suran38, karapayneWMDE, Invadibot, maantietaja, Peteosx1x, NavinRizwi, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Dinoguy1000, Lydia_Pintscher, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
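The three proposed "mul" boxes can be illustrated with a minimal sketch that sorts a label into one of them, assuming Python's standard `unicodedata` module. The bucket names and the function `label_bucket` are hypothetical, and the rule used here (all decimal digits, otherwise any strongly right-to-left character, otherwise any strongly left-to-right character) is only one possible reading of the proposal.

```python
import unicodedata


def label_bucket(label):
    """Classify a label as 'number', 'rtl', 'ltr' or 'neutral' (hypothetical buckets)."""
    chars = [ch for ch in label if not ch.isspace()]
    # Standalone digits in any script: general category Nd (decimal digit).
    if chars and all(unicodedata.category(ch) == "Nd" for ch in chars):
        return "number"
    classes = {unicodedata.bidirectional(ch) for ch in chars}
    # 'R' and 'AL' are the strongly right-to-left bidirectional classes.
    if classes & {"R", "AL"}:
        return "rtl"
    if "L" in classes:
        return "ltr"
    return "neutral"


PAKISTAN_ARABIC_SCRIPT = "\u067E\u0627\u06A9\u0633\u062A\u0627\u0646"  # پاکستان

for sample in ["Pakistan", PAKISTAN_ARABIC_SCRIPT, "1", "\u0661", "\u0A67"]:
    print(repr(sample), "->", label_bucket(sample))
```

Non-digit glyphs such as Braille cells are not covered by the digit rule in this sketch and fall through to the direction check, so a real implementation would need an explicit rule for them.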
[Wikidata-bugs] [Maniphest] T284882: Creating a new lexeme always asks for spelling variant for languages without an ISO 639-1 code
mrephabricator added a comment. I think it should always ask for the spelling variant. Many languages use multiple scripts and have multiple codes, and picking one automatically makes it seem like there is only one "preferred" variant. One should have the option to add a "tg" representation of a Persian lexeme first, since Tajik is a dialect of Persian with a Cyrillic orthography; yet if a Tajik writer/typist wishes to add a lexeme this way, they have to change the code manually after creating the lexeme. This may be a contributing factor to why Persian has yet to be successfully merged for lexemes like other multi-script languages have been. Having to select a variant for single-script languages is mildly annoying at worst, and generally still requires less typing than adding a multi-script lexeme. TASK DETAIL https://phabricator.wikimedia.org/T284882 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Manuel, daniel, Denny, Mahir256, Lucas_Werkmeister_WMDE, Lydia_Pintscher, Bugreporter, waldyrious, Nikki, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T290858: Senses and their statements should not be tied to a language, but instead shared between languages
mrephabricator added a comment. This is not a good idea. Each language has its own semantic framework, and as such two senses which are glossed the same way in at least one language do not necessarily mean exactly the same thing. An example:
- English makes a semantic distinction between a rock and a stone. A rock in English is a naturally occurring feature, whereas a stone can broadly include man-made objects such as milestones, gemstones, headstones and so on.
- Most languages do not have the rock/stone distinction, and speakers of those languages would likely find it intuitive to gloss both English words the same way.
- German makes a completely different semantic distinction that most English speakers would never consider. The word Fels is used for rocks, unless the rock is specifically a glacial erratic, in which case it is a Findling. Findlings are not instances of Fels in German.
- A German speaker unaware of this may gloss the English word simply as "Fels," not knowing that the English word for rock is not semantically equivalent to any German word. A rock in English means Fels *and* Findling.
Now it may sound ridiculous out of context to say that Germans don't have a word for rocks, but this is technically correct. Senses exist to describe rocks in every language, but those senses are rarely exact matches to each other. In this example of apples, how would we know if one language treats apples and pears as the same fruit? (If this sounds odd, consider how odd "pineapple" looks to non-English speakers.) Senses are ultimately language-specific and should be treated as such. A sense doesn't need glosses in every language. Any sense with "item for this sense" linking to the same "apple" item can be linked to any of the numerous interlingual labels on the item itself.
TASK DETAIL https://phabricator.wikimedia.org/T290858 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, So9q, Ivi104, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T236593: Cannot enter multiple forms for the same language variant
mrephabricator added a comment. This may be verging on pedantry, but I will say that the principle of "one form per combination of grammatical features" does not sound broadly applicable enough to follow for each language. Maybe I am missing something and this is just a convention for certain languages. In any case, here are some examples which illustrate where this would not be a helpful model. In Punjabi, an alternate form with identical grammatical features could represent any combination of the following:
- An alternative pronunciation of the same form, represented by mutual "alternative form" property links without mutual "homophone form" links.
- An alternative spelling of the same form in any or all of the spelling variants/orthographies represented, represented by mutual "alternative form" links and mutual "homophone form" links.
- If the spelling varies only for one representation (which is actually not as common as I initially expected), the other representation(s) are duplicated exactly. This may seem somewhat tedious, but for the time being it is an effective way to store the useful information that where spelling varies in one writing system, only one spelling is accepted in the other.
- Dialectal or regional variants of the same form, most often simply indicated with "variety of form" set to "unknown value," as usually no empirical evidence exists to assign the form to a specific named dialect or say anything more specific than "this form will vary depending on who you talk to."
- Shortened or contracted variants of the same form, indicated with mutual "alternative form" property links and "short form" as a grammatical feature on the shorter form.
- Versions of forms which are only for use in spoken language / dialogue, as opposed to versions of forms which are only used in writing.
For example, for some forms on a Punjabi verb, the form will get inflected twice for grammatical number and/or person, once on an infixed part of the form, and once on the suffixed ending of the form, but in spoken/colloquial language it is acceptable to use a form which is only inflected once.
Notably, all of the above will only apply to particular inflections of a given lexeme. If we take this verb for example, https://www.wikidata.org/wiki/Lexeme:L688582 , there are so far 30 forms with "alternate forms" that share grammatical features with another form, out of the 99 forms documented. If we were to create 30 separate lexemes to represent this 1 word, how would we represent the rest of the context that is important for understanding what these inflections represent, or indicate for example that ਹਸਾਏਂਗੀ and ਹਸਾਵੇਂਗੀ are interchangeable spelling + pronunciation options for second person + feminine + singular + additive + causative + subjunctive + definite, but that only ਹਸਾਵਾਂਗੀ is acceptable as a spelling + pronunciation option for first person + feminine + singular + additive + causative + subjunctive + definite? On other lexemes, the same grammatical feature combination may permit variation. (This is ultimately governed by the final phoneme of the root in a verb which only ever applies to the gender-inflected, written/formal first person subjunctive definite forms.) That would be an unsustainable model.
I am relatively conservative about what constitutes a separate lexeme; I tend to base it primarily on a combination of part of speech + mode of derivation rather than pronunciation or spelling variation, especially since the latter factors generally don't have any bearing on how and where a lexeme can be used according to the internal logic of the language. I am inclined to agree that the specific purpose of the numbered Q-item language code patch is hard to discern.
I think what may be the case here is that each of the concerns brought up in this thread have different solutions. Theoretically, there is no upper limit on the number of variations a form can have, and it could become confusing if languages started to have long vertical strips of representations, some of which are governed by a consistent heuristic, and some of which are arbitrary. What may be productive is the addition of various properties for use on lexeme forms which offer more nuanced ways to model the different languages discussed here. TASK DETAIL https://phabricator.wikimedia.org/T236593 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, LucasWerkmeister, C933103, AGutman-WMF, mxn, So9q, Ijon, daniel, Asaf, Mahir256, Danmichaelo, Fnielsen, Lucas_Werkmeister_WMDE, Denny, Lydia_Pintscher, jeblad, jhsoby, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_
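The situation described in the T236593 comment, where several forms of one lexeme share an identical set of grammatical features, can be sketched as a small data structure. This is an illustration only, not Wikibase's actual data model: the `Form` class and `group_by_features` function are hypothetical, and the three forms and their feature names are the ones quoted in the comment.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Form:
    """One form of a lexeme: a written representation plus its grammatical features."""
    representation: str
    features: frozenset


# Features common to all three forms quoted in the comment.
SHARED = {"feminine", "singular", "additive", "causative", "subjunctive", "definite"}

FORMS = [
    Form("ਹਸਾਏਂਗੀ", frozenset(SHARED | {"second person"})),
    Form("ਹਸਾਵੇਂਗੀ", frozenset(SHARED | {"second person"})),
    Form("ਹਸਾਵਾਂਗੀ", frozenset(SHARED | {"first person"})),
]


def group_by_features(forms):
    """Map each distinct feature set to every representation that carries it."""
    groups = defaultdict(list)
    for form in forms:
        groups[form.features].append(form.representation)
    return dict(groups)


for features, representations in group_by_features(FORMS).items():
    print(sorted(features), "->", representations)
```

Keeping all three forms on one lexeme preserves the fact that the second-person combination allows two interchangeable forms while the first-person combination allows only one, which would be lost if each variant became its own lexeme.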
[Wikidata-bugs] [Maniphest] T314512: Retire "Western Punjabi" as a name for Punjabi Shahmukhi Wikipedia, and use the labels "Punjabi Gurmukhi" and "Punjabi Shahmukhi" for language codes pa and pnb
mrephabricator added a comment. See also:
- Proposal to merge `pa` and `pnb` wikis: https://phabricator.wikimedia.org/T97884
  - I think this is possible, but requires certain technology to be available first. I am working on a project which may help with this at the moment, but it will be a while before it is ready.
- Related: https://phabricator.wikimedia.org/T278372
TASK DETAIL https://phabricator.wikimedia.org/T314512 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, LennardHofmann, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Dcljr, jeblad, Mbch331, Jay8g ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T314512: Retire "Western Punjabi" as a name for Punjabi Shahmukhi Wikipedia, and use the labels "Punjabi Gurmukhi" and "Punjabi Shahmukhi" for language codes pa and pnb
mrephabricator created this task. mrephabricator added projects: Wikidata, Wikidata Lexicographical data, Language codes, Wikimedia-Language-setup. TASK DESCRIPTION Punjabi is written in two scripts, Gurmukhi in India, and Shahmukhi in Pakistan. Wikimedia has always used `pa` for the former and `pnb` for the latter. However, the way this is set up now, `pnb` is labeled in English as "Western Punjabi" while `pa` is labeled as simply "Punjabi." This is a problem for a few reasons:
- The majority of Punjabi speakers live in Pakistan and use Shahmukhi if they write in the language. Using the label "Punjabi" unqualified for the orthographic variant that fewer native speakers can read seems like an unreasonable bias.
- The original request and creation of Shahmukhi Wikipedia did not involve calling it "Western Punjabi"; see for example https://phabricator.wikimedia.org/T7010
- There are dialect variations in the Punjabi language which can roughly be split across a west/east axis, but this is *not* coincident with the present-day border of India and Pakistan. The Gurmukhi script now used in India was refined and standardized in a part of Western Punjab which is now in Pakistan. As a result, the Gurmukhi script includes provisions for characters which represent pronunciations which are only still commonly used in the far west/south of Punjab (now in Pakistan). ਵੰਞਣਾ represents a word associated with western dialects of Punjabi, and with Saraiki, which cannot be written phonetically in Shahmukhi. Under the implication that these language labels have to do with regions rather than writing systems, this word would be interpreted as being part of "eastern Punjabi," which is not correct.
- When adding monolingual text properties to Wikidata items, I keep having to add a "writing system" qualifier to each `pa` and `pnb` statement even though these language codes already de facto represent writing systems, because their labels do not match their use.
Using "Punjabi Gurmukhi" for `pa` and "Punjabi Shahmukhi" for `pnb` would offer more clarity on what these codes actually represent, and would be more fair with respect to the fact that Punjabi written in Gurmukhi is not "default" Punjabi. TASK DETAIL https://phabricator.wikimedia.org/T314512 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, LennardHofmann, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Dcljr, jeblad, Mbch331, Jay8g ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T314459: Add item termbox label, lexeme label, and monolingual text code support for bha
mrephabricator renamed this task from "Add monolingual text code bha" to "Add item termbox label, lexeme label, and monolingual text code support for bha". mrephabricator added projects: Wikidata Lexicographical data, Wikidata. TASK DETAIL https://phabricator.wikimedia.org/T314459 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Aklapper, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, Mahir256, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Nikki, Wikidata-bugs, aude, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T312845: [Process] Add new language codes to Wikidata
mrephabricator added a comment. Also pending:
- https://phabricator.wikimedia.org/T311666
- https://phabricator.wikimedia.org/T313782
- https://phabricator.wikimedia.org/T314458
- https://phabricator.wikimedia.org/T314459
TASK DETAIL https://phabricator.wikimedia.org/T312845 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: mrephabricator Cc: mrephabricator, Aklapper, Manuel, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Nandana, Lahi, Gq86, GoranSMilovanovic, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Lydia_Pintscher, Mbch331 ___ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org
[Wikidata-bugs] [Maniphest] T311666: Add item termbox label, lexeme label, and monolingual text code support for ess
mrephabricator renamed this task from "Add monolingual language code ess" to "Add item termbox label, lexeme label, and monolingual text code support for ess". mrephabricator added a project: Wikidata Lexicographical data. TASK DETAIL https://phabricator.wikimedia.org/T311666
[Wikidata-bugs] [Maniphest] T313782: Add item termbox label, lexeme label, and monolingual text code support for bal
mrephabricator renamed this task from "Add monolingual language code bal" to "Add item termbox label, lexeme label, and monolingual text code support for bal". mrephabricator added a project: Wikidata Lexicographical data. TASK DETAIL https://phabricator.wikimedia.org/T313782
[Wikidata-bugs] [Maniphest] T314458: Add item termbox label, lexeme label, and monolingual text code support for trw
mrephabricator renamed this task from "Add monolingual language code trw" to "Add item termbox label, lexeme label, and monolingual text code support for trw". mrephabricator added projects: Wikidata, Wikidata Lexicographical data. TASK DETAIL https://phabricator.wikimedia.org/T314458
[Wikidata-bugs] [Maniphest] T313476: allow overwriting wikit's default font
mrephabricator added a comment. This may require its own ticket, but may I request that the open source Noto Nastaliq Urdu font be hosted as a web font and then used for all Urdu and Punjabi Shahmukhi text by default? This font includes characters that most fonts don't render. I have a custom stylesheet for this on desktop, but on my phone I can't see certain letters. TASK DETAIL https://phabricator.wikimedia.org/T313476
[Wikidata-bugs] [Maniphest] T313782: Add monolingual language code bal
mrephabricator created this task. mrephabricator added projects: Language codes, Wikidata. Restricted Application added a subscriber: Aklapper.
TASK DESCRIPTION
Please add the language code bal to the list of language codes supported for monolingual text values. Balochi is one of the major languages of Pakistan, particularly in Balochistan province, and is also a major language in Iran and Afghanistan. It has about 6 million speakers in Pakistan alone, and a few million more when speakers worldwide are counted. There is a lot of referenceable content in Balochi which can be added to Wikidata and other Wikimedia projects. (For example, see this Balochi glossary/dictionary: https://dsal.uchicago.edu/dictionaries/mumtaz/)
Codes already exist in the system for some specific dialects of Balochi, but these are of limited utility: most resources do not specify a dialect, and in parts of Wikidata such as lexemes, the dialect codes are likely to be used just to create duplicate entries for words that are not dialect-specific and would be better placed under the main language.
Usage examples:
- On the language item: https://www.wikidata.org/wiki/Q33049
- On the Nushki, Balochistan item: https://www.wikidata.org/wiki/Q2977118
- On the item for the Wikimedia multilingual page for Balochi (linked to Balochi Wikisource): https://www.wikidata.org/wiki/Q105659170
Please allow this to be used in the Wikidata termbox and lexemes as well; as I understand it, this is possible if added correctly.
TASK DETAIL https://phabricator.wikimedia.org/T313782
[Wikidata-bugs] [Maniphest] T311666: Add monolingual language code ess
mrephabricator created this task. mrephabricator added projects: Wikidata, Language codes. Restricted Application added a subscriber: Aklapper.
TASK DESCRIPTION
Please add the language code ess to the list of language codes supported for monolingual text values. The language is known on English Wikipedia as Central Siberian Yupik (not to be confused with Central Yupik), although the State of Alaska government calls it St. Lawrence Island Yupik. It is an official language of Alaska, and as such there is quite a bit of documentation in and about the language that would be good to represent on Wikimedia projects. There are also some speakers in Russia, on the other side of the Bering Strait.
Usage examples:
- On the language item: https://www.wikidata.org/wiki/Q27993
- On the Alaska Poppy item: https://www.wikidata.org/wiki/Q21384059
- On the lingonberry item: https://www.wikidata.org/wiki/Q93235
Please allow this to be used in the Wikidata termbox and lexemes as well; as I understand it, this is possible if added correctly.
TASK DETAIL https://phabricator.wikimedia.org/T311666
[Wikidata-bugs] [Maniphest] T308062: Carolinian language (ISO code: cal) and Tanapag (ISO code: tpv) label support on Wikidata
mrephabricator added a comment. Is there existing documentation on this process, or anything I can help with here? There is quite a backlog of languages for which termbox support would be useful at this point. TASK DETAIL https://phabricator.wikimedia.org/T308062
[Wikidata-bugs] [Maniphest] T308062: Carolinian language (ISO code: cal) and Tanapag (ISO code: tpv) label support on Wikidata
mrephabricator added a comment. Thank you for placing this in the correct tags/categories and confirming that this is possible. How long does it typically take for a termbox to be added? I am thinking of going through the official languages of other countries and territories to see if there are other notable omissions. Those would be "low-hanging fruit" for multilingual Wikidata contributions, since any language with official status is going to have written supporting documentation in that language as well as translations, dictionaries, etc. TASK DETAIL https://phabricator.wikimedia.org/T308062
[Wikidata-bugs] [Maniphest] T308062: Carolinian language (ISO code: cal) and Tanapag (ISO code: tpv) label support on Wikidata
mrephabricator created this task. mrephabricator added projects: Language-Team (Language-2022-April-June), Wikidata, Wikidata Lexicographical data.
TASK DESCRIPTION
**Feature summary** (what you would like to be able to do and where): Add Carolinian and Tanapag language labels to items on Wikidata. At the moment it is only possible to specify Carolinian as the language for a string value corresponding to a property. (Based on my searching, this feature was added in 2020 following a ticket here.)
**Use case(s)** (list the steps that you performed to discover that problem, and describe the actual underlying problem which you want to solve. Do not describe only a solution): Carolinian is an official language of the Northern Mariana Islands, and I have been adding Carolinian translations and etymologies to place names and related terms. There is a good open source/public domain Carolinian-English dictionary that has been informative for this. Tanapag is a related language which is endangered but still spoken, and Carolinian has a lot of Tanapag loan words for which it would be useful to specify where the word comes from. When I tried adding these translations to labels of Wikidata items, I found no way to do this.
**Benefits** (why should this be implemented?): The main benefit is more complete coverage of the Northern Mariana Islands and related topics on Wikidata, which would in turn help link it to any complementary data about the territory available through other channels. At the moment, coverage of this topic is held back because these languages cannot be specified in the way the languages of other regions of the world typically are, or at least are supposed to be. (I am aware language support limitations have long been an issue in the MediaWiki ecosystem, but I am not knowledgeable on the technical details of it, or how feasible it is to add specific languages until a more comprehensive system for specifying language is introduced.)
Best
TASK DETAIL https://phabricator.wikimedia.org/T308062