OK, now I get that too. No idea what I did different before but... nevermind.
This has got to be a bug. The same item has been entered who knows how many times, and at least some of the IDs are consecutive. http://openlibrary.org/authors/OL4602522A http://openlibrary.org/authors/OL4602523A http://openlibrary.org/authors/OL4602524A Ditto the European law one. http://openlibrary.org/authors/OL4791619A http://openlibrary.org/authors/OL4791620A etc. I think there was another case like this before. kc On 5/9/12 2:57 PM, Ben Companjen wrote: > I am, yes. I loaded the ~6.9 million author records from April's dump > into MySQL, did a "GROUP BY slug" (where slug is the author name in > lower case, without spaces and punctuation) and got > "shirleyinstitute/wirajointconference1977manchestereng": 10047. > > I then searched for Shirley institute 1977 as an author on the website > and got 10,047 hits. And I still do: > http://openlibrary.org/search/authors?q=shirley+institute+1977 > > Second in the list of slugs is colloquyoneuropeanlaw1981messinaitaly: 2368 > http://openlibrary.org/search/authors?q=colloquy+1981+messina > > Ben > > On 9 May 2012 23:44, Karen Coyle<[email protected]> wrote: >> This is rather odd. When I look up Shirley institute as an author and >> find the 1977 joint conference I get 2 work titles, each that has only 1 >> edition. Ben, are you working with the dump? >> >> kc >> >> On 5/9/12 6:05 AM, Ben Companjen wrote: >>> Hi, >>> >>> Although I found 341 duplicates of President Clinton a lot yesterday, >>> there is still the "author" that goes by the name "Shirley >>> Institute/Wira Joint Conference (1977 Manchester, Eng.)". There are a >>> whopping 10,047 authors with that name! Merging those manually is only >>> for those who desperately need an extremely boring task :) >>> >>> Looking at the subject and book titles in the search results, I think >>> one MARC record was imported many times without duplicate detection, >>> so merging the authors would still leave some 10000 duplicate >>> works/editions. >>> >>> Any idea how to best solve this? >>> >>> Ben >>> _______________________________________________ >>> Ol-discuss mailing list >>> [email protected] >>> http://mail.archive.org/cgi-bin/mailman/listinfo/ol-discuss >>> To unsubscribe from this mailing list, send email to >>> [email protected] >>> >> >> -- >> Karen Coyle >> [email protected] http://kcoyle.net >> ph: 1-510-540-7596 >> m: 1-510-435-8234 >> skype: kcoylenet >> _______________________________________________ >> Ol-discuss mailing list >> [email protected] >> http://mail.archive.org/cgi-bin/mailman/listinfo/ol-discuss >> To unsubscribe from this mailing list, send email to >> [email protected] > -- Karen Coyle [email protected] http://kcoyle.net ph: 1-510-540-7596 m: 1-510-435-8234 skype: kcoylenet _______________________________________________ Ol-discuss mailing list [email protected] http://mail.archive.org/cgi-bin/mailman/listinfo/ol-discuss To unsubscribe from this mailing list, send email to [email protected]
