On Sun, Jan 18, 2015 at 9:11 AM, Maarten Dammers <[email protected]> wrote:
> Hi Tom, > > Tom Morris schreef op 17-1-2015 om 17:17: > > 2015-01-17 4:27 GMT-05:00 Lydia Pintscher <[email protected]>: > >> >> The log is at >> https://meta.wikimedia.org/wiki/IRC_office_hours/Office_hours_2015-01-16 >> for anyone who couldn't make it. > > > Denny discusses importing all missing VIAF keys from Freebase using > "multichill" (unclear what that is from the context) on the assumption that > the error rate is low. It would be worth checking assumptions like that > with folks who are familiar with the Freebase data before acting on them. > > I guess you are refereing to "18:57:55 <vrandecic> If you ask me, I am > happy with just letting multichill to upload the VIAFs that are still > missing" > > That would be me. VIAF is a very good starting point for getting more > authority data. If you have viaf, you can add other authority control data > based on that. So getting more links to viaf would be nice. Not sure how > many are still missing. I recently did that for ULAN and NTA ( > https://www.wikidata.org/w/index.php?title=Q120609&diff=182583270&oldid=180304990 > / > https://www.wikidata.org/w/index.php?title=Q1610938&diff=182636686&oldid=182321267 > ). Was able to add over 100.000 new links. I still have to do this for > other types of authority control. The more tightly connected things get, > the easier it gets to find problems or duplicates. > > Maarten > Hello All, It was me that originally imported the about 400,000 VIAF links into Wikidata. The way that they were matched was using by using a name-and-date-of-birth-matching algorithm with an English Wikipedia dump, and it was done by a team that works for VIAF.org . Then then those matches were imported into English Wikipedia. Additionally other languages like Italian Wikipedia and Commons had done some manual matching. After about a year of manual correcting in Wikipedias, I took the authority control from (I think about 9) different Wikis, and imported it into Wikidata. I also later did some of what Maarten/multichill is doing now which is to do some lookups on VIAF and import subsequent data over, like sex/gender and alternative names. I'm going to ping the people at VIAF (which is part of OCLC, for whom I used to work, but no longer) about this to see if they have any thoughts to add as well. Make a great day, Max Klein ‽ http://notconfusing.com/ > > _______________________________________________ > Wikidata-l mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/wikidata-l > >
_______________________________________________ Wikidata-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-l
