daniel created this task.
daniel added a subscriber: daniel.
daniel added a project: Wikidata.
Herald added subscribers: StudiesWorld, Aklapper.

TASK DESCRIPTION
  When entering "Alasehir", items with the label "Alaşehir" should be found, 
without an aliases being explicitly defined. Similarly, "Munchen" should find 
"München".  This should work at least for roman-based diacritics. Not sure if 
such "simplification" is applicable for other scripts.
  
  Implementation notes:
  * a rough-and-dirty implementation can be based on converting to DNF, and 
then stripping all non-ascii characters. This will turn Ä into A, and so fort. 
It will however break non-roman scripts completely, so we need a whitelist of 
language codes (or character ranges).
  * Perhaps Elastic/Lucene already provides this, so we'd get it for free with 
Cirrus integration.

TASK DETAIL
  https://phabricator.wikimedia.org/T121863

EMAIL PREFERENCES
  https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: daniel
Cc: Aklapper, StudiesWorld, daniel, Wikidata-bugs, aude, Mbch331



_______________________________________________
Wikidata-bugs mailing list
[email protected]
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to