ChristianKl added a comment.

Scientific articles are one class of entities where this problem exists; because their titles contain more words and we have more than 10 million of them, they are the most important case. However, I remember similar issues with the old search for songs and for geographic items derived from GeoNames.
If we were to import those 2 million German companies, they would likely also produce a lot of hits.

I'm okay with fixing it by deranking scientific articles specifically, but I would expect that even if we derank every class of entries that is problematic at the moment, sooner or later a bot will add a new big dataset whose items show up in search results and would be better not outranking human-created items.


TASK DETAIL
https://phabricator.wikimedia.org/T183243

To: ChristianKl
Cc: Lydia_Pintscher, Sjoerddebruin, Smalyshev, Aklapper, ChristianKl, Lahi, Gq86, GoranSMilovanovic, QZanden, EBjune, LawExplorer, Avner, Gehel, FloNight, Wikidata-bugs, aude, Mbch331