Spinster added a comment.
In T244847#8502964 <https://phabricator.wikimedia.org/T244847#8502964>, @Pintoch wrote: > Here is the current status of this issue: > > The team at TIB (headed by @Loz.ross) is now maintaining the current wrapper and has contributed improvements to deploy it on other Wikibase instances: > https://gitlab.com/nfdi4culture/ta1-data-enrichment/openrefine-wikibase > Thank you to them! So the Wikidata reconciliation service currently falls under the responsibility of a German research group... I know TIB does amazing work for the Wikibase ecosystem (thank you indeed @Loz.ross and colleagues) but for Wikidata, this surely is a weird situation. If I'd have the money and the people, I'd jump to help, but I don't. > Prompted by the bad performance of this endpoint, Ontotext offers alternate endpoints for certain entity types, powered by their own search index over Wikidata. This video explains their approach <https://www.ontotext.com/knowledgehub/videos/kgf21-talks-reconciliation-server-demonstration-against-wikidata/>. The endpoints are: > > - https://reconcile.ontotext.com/people > - https://reconcile.ontotext.com/organizations > - https://reconcile.ontotext.com/locations > > Thank you Ontotext (@VladimirAlexiev among others). I, too, have been very enthusiastic seeing this initiative. However, I've tried the Ontotext reconciliation service several times, and I've had quite a few issues with it: - it produces *a lot* of wrong matches for me. Up to 30 to 40% of the "100% confident" matches are just plain wrong (see screenshot below - I just tried it again) - the service provides only 3 suggested matches when it's uncertain, and these tend to be wrong as well, even if a correct value is present (see second screenshot) F36869693: image.png <https://phabricator.wikimedia.org/F36869693> F36869695: image.png <https://phabricator.wikimedia.org/F36869695> I would currently not use the Ontotext alternatives for any real world application for that reason, and I have discouraged trainees from the GLAM sector from using them. (The 'old' Wikidata reconciliation service works much better and is still extremely workable when it's up, in my experience.) @VladimirAlexiev can such issues be reported somewhere, and if so where? Is Ontotext willing and able to improve upon the above services? TASK DETAIL https://phabricator.wikimedia.org/T244847 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Spinster Cc: Sj, VladimirAlexiev, Tarrow, Michael, GFontenelle_WMF, jdtoy, SandraF_WMF, JeanFred, Fuzheado, Eugene233, PabloCastellano, Loz.ross, RShigapov, Spinster, Samantha_Alipio_WMDE, WMDE-leszek, Addshore, Lydia_Pintscher, Abbe98, tfmorris, David_Haskiya_WMSE, Lokal_Profil, Mvolz, Alicia_Fagerving_WMSE, Regisrob, Pintoch, Aklapper, Astuthiodit_1, karapayneWMDE, Invadibot, maantietaja, ItamarWMDE, Akuckartz, Thadguidry, Tore_Danielsson_WMSE, Nandana, Lahi, Gq86, GoranSMilovanovic, Nattes, QZanden, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Nikerabbit, Mbch331
_______________________________________________ Wikidata-bugs mailing list -- wikidata-bugs@lists.wikimedia.org To unsubscribe send an email to wikidata-bugs-le...@lists.wikimedia.org