tfmorris added a comment.
It would seem like the 2018-03-13 spreadsheet should be adequate to call this task complete. I would recommend including some qualitative understanding of the source of the Freebase data in addition to just pure curation ratio when making judgements about how to use which data. Things like MusicBrainz IDs and ISFDB IDs went through a heavily QA'd reconciliation process and are going to be high quality. Films, and to a lesser extent TV shows, were an area of focus for the Freebase team, so will generally be both high quality and relatively complete. Also many of the quality issues with the initial data set didn't have anything to do with the Freebase data itself, but the junky "evidence" URLs that Google produced after the fact to satisfy the Wikidata call for evidence. These tend to be of much, much lower quality than the data itself. Of course, after so many years, much of the value of the data has been squandered, but I bet there are still some areas where it could be used to significantly improve Wikidata. TASK DETAIL https://phabricator.wikimedia.org/T188715 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: Tpt, tfmorris Cc: tfmorris, Aklapper, Hjfocs, Jingbiao95, darthmon_wmde, Nandana, Lahi, Gq86, GoranSMilovanovic, Kiailandi, QZanden, dachary, LawExplorer, _jensen, rosalieper, Scott_WUaS, Wikidata-bugs, aude, Ricordisamoa, Tacsipacsi, Sjoerddebruin, Tpt, Mbch331
_______________________________________________ Wikidata-bugs mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
