| ChristianKl added a comment. |
Scientific articles are one class of entities where this problem exists and given that they have more words in the title and we have >10 million of them they are the most important. In the past with the old search I however remember similar issues with songs and geonames derived geographic items as well.
If we would import those 2 million German companies, they would likely also produce a lot of hits.
I'm okay, with fixing it by deranking scientific articles specifically but I would expect that even if we derank any class of entries that are problematic at the moment sooner or later we will add a new big dataset by bot that brings up search results that would be better to not outrank human created items.
Cc: Lydia_Pintscher, Sjoerddebruin, Smalyshev, Aklapper, ChristianKl, Lahi, Gq86, GoranSMilovanovic, QZanden, EBjune, LawExplorer, Avner, Gehel, FloNight, Wikidata-bugs, aude, Mbch331
_______________________________________________ Wikidata-bugs mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
