dcausse created this task.
dcausse triaged this task as "Normal" priority.
dcausse added projects: Wikidata, Discovery-Search (Current work), Discovery, User-Smalyshev.

TASK DESCRIPTION

Wikidata uses array fields, it's likely that popular items gets more aliases, the all field is affected as well.
We know that array fields may cause troubles with length normalization causing popular items to have low score.
The plan would be to tune the BM25 b param for:

  • all
  • labels.*

and proceed as follow:
1/ reindex relforge to setup different similarities for these field
2/ tune the b param (once the similarity is set we can close the index to tune it and iterate like that)
3/ submit a patch to wmf-config to add a wikidata profile in wgCirrusSearchSimilarityProfiles


TASK DETAIL
https://phabricator.wikimedia.org/T182293

EMAIL PREFERENCES
https://phabricator.wikimedia.org/settings/panel/emailpreferences/

To: dcausse
Cc: Smalyshev, Aklapper, EBernhardson, daniel, gerritbot, debt, dcausse, Lahi, Gq86, GoranSMilovanovic, QZanden, EBjune, Avner, Gehel, Jdrewniak, FloNight, Wikidata-bugs, aude, Mbch331
_______________________________________________
Wikidata-bugs mailing list
Wikidata-bugs@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs

Reply via email to