Lucas_Werkmeister_WMDE added a comment.
In T327514#8632710 <https://phabricator.wikimedia.org/T327514#8632710>, @Michael wrote:

> Could we maybe go the opposite way? Having an allow-list of characters in `Cf` that we explicitly decode? Then we could maybe start with ZWJ/ZWNJ and add further chars as needed. I imagine that this would feel safer and more understandable to me when reading the code.

Maybe, though I felt like some of the other characters in that Cf list <https://www.fileformat.info/info/unicode/category/Cf/list.htm> also looked like they might theoretically be useful (some of them are even printable). But we could also start with ZW(N)J, sure.

TASK DETAIL
https://phabricator.wikimedia.org/T327514
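
For illustration only, the two options discussed above (treating the whole `Cf` category as decodable versus an explicit allow-list that starts with ZWJ/ZWNJ) could be sketched roughly like this. This is not the actual Wikibase code; the function names and the `ALLOWED_CF` set are hypothetical, and what "decode" means in the real change is left abstract here.

```python
import unicodedata

# Hypothetical allow-list: start with ZWNJ (U+200C) and ZWJ (U+200D) only,
# and add further Cf characters as the need arises.
ALLOWED_CF = frozenset({"\u200c", "\u200d"})


def is_format_char(ch: str) -> bool:
    """True if ch belongs to Unicode general category Cf (Format)."""
    return unicodedata.category(ch) == "Cf"


def decode_whole_category(ch: str) -> bool:
    """Option 1 (assumed): every Cf character is eligible for decoding."""
    return is_format_char(ch)


def decode_allow_listed(ch: str) -> bool:
    """Option 2 (assumed): only explicitly listed Cf characters are eligible."""
    return ch in ALLOWED_CF
```

With this sketch, ZWJ (U+200D) is eligible under both options, while a soft hyphen (U+00AD, also in `Cf`) is eligible only under the whole-category option. The allow-list is easier to audit when reading the code, at the cost of having to extend it whenever another `Cf` character turns out to be useful.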
