dcausse created this task. dcausse triaged this task as "Normal" priority. dcausse added projects: Wikidata, Discovery-Search (Current work), Discovery, User-Smalyshev.
TASK DESCRIPTION
Wikidata uses array fields, and popular items are likely to have more aliases; the all field is affected as well.
We know that array fields can cause trouble with length normalization, giving popular items a low score.
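To illustrate the length-normalization concern, here is a minimal sketch of the term-frequency part of the standard BM25 formula. It assumes that the length of an array field is the total token count across all of its values, and it uses the common defaults k1=1.2 and b=0.75. The function name and the sample lengths are made up for the example.

```python
def bm25_tf_norm(freq, field_len, avg_len, k1=1.2, b=0.75):
    """Term-frequency component of BM25 for a single term.

    b controls how strongly the field length (relative to the average
    length) penalizes the score: b=0 disables length normalization,
    b=1 applies it fully.
    """
    length_factor = 1.0 - b + b * (field_len / avg_len)
    return freq * (k1 + 1.0) / (freq + k1 * length_factor)


if __name__ == "__main__":
    avg_len = 10  # hypothetical average number of tokens in the field
    for b in (0.75, 0.3, 0.0):
        few_aliases = bm25_tf_norm(1, field_len=5, avg_len=avg_len, b=b)
        many_aliases = bm25_tf_norm(1, field_len=40, avg_len=avg_len, b=b)
        print(f"b={b}: few aliases={few_aliases:.3f}, many aliases={many_aliases:.3f}")
```

With one match in each, the item with many aliases scores lower than the item with few aliases whenever b > 0, and the gap shrinks as b approaches 0.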
The plan would be to tune the BM25 b param for:
- all
- labels.*
and proceed as follows:
1/ reindex relforge to set up different similarities for these fields
2/ tune the b param (once the similarity is set, we can close the index to adjust it and iterate that way)
3/ submit a patch to wmf-config to add a wikidata profile in wgCirrusSearchSimilarityProfiles
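One iteration of step 2/ could look roughly like the sketch below. It only builds the sequence of Elasticsearch REST requests (close the index, update the similarity settings, reopen the index) and does not send them. The index name and the similarity name are hypothetical placeholders, and the shape of the final wgCirrusSearchSimilarityProfiles entry is not shown because it is not specified here.

```python
import json


def similarity_tuning_requests(index, similarity_name, b, k1=1.2):
    """Return (method, path, body) tuples for one tuning iteration.

    Similarity settings can only be changed while the index is closed,
    so the update is wrapped in a close/open pair.
    """
    settings = {
        "index": {
            "similarity": {
                similarity_name: {"type": "BM25", "k1": k1, "b": b}
            }
        }
    }
    return [
        ("POST", f"/{index}/_close", None),
        ("PUT", f"/{index}/_settings", settings),
        ("POST", f"/{index}/_open", None),
    ]


if __name__ == "__main__":
    # "wikidata_relforge_test" and "wikidata_all" are made-up names.
    for method, path, body in similarity_tuning_requests(
        "wikidata_relforge_test", "wikidata_all", b=0.3
    ):
        print(method, path)
        if body is not None:
            print(json.dumps(body, indent=2))
```

Each candidate value of b would go through the same three requests, followed by a relevance evaluation run before the next value is tried.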
To: dcausse
Cc: Smalyshev, Aklapper, EBernhardson, daniel, gerritbot, debt, dcausse, Lahi, Gq86, GoranSMilovanovic, QZanden, EBjune, Avner, Gehel, Jdrewniak, FloNight, Wikidata-bugs, aude, Mbch331
_______________________________________________ Wikidata-bugs mailing list Wikidata-bugs@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs