OK, now I get that too. No idea what I did different before but... 
nevermind.

This has got to be a bug. The same item has been entered who knows how 
many times, and at least some of the IDs are consecutive.

http://openlibrary.org/authors/OL4602522A
http://openlibrary.org/authors/OL4602523A
http://openlibrary.org/authors/OL4602524A


Ditto the European law one.

http://openlibrary.org/authors/OL4791619A
http://openlibrary.org/authors/OL4791620A
etc.

I think there was another case like this before.

kc

On 5/9/12 2:57 PM, Ben Companjen wrote:
> I am, yes. I loaded the ~6.9 million author records from April's dump
> into MySQL, did a "GROUP BY slug" (where slug is the author name in
> lower case, without spaces and punctuation) and got
> "shirleyinstitute/wirajointconference1977manchestereng": 10047.
>
> I then searched for Shirley institute 1977 as an author on the website
> and got 10,047 hits. And I still do:
> http://openlibrary.org/search/authors?q=shirley+institute+1977
>
> Second in the list of slugs is colloquyoneuropeanlaw1981messinaitaly: 2368
> http://openlibrary.org/search/authors?q=colloquy+1981+messina
>
> Ben
>
> On 9 May 2012 23:44, Karen Coyle<[email protected]>  wrote:
>> This is rather odd. When I look up Shirley institute as an author and
>> find the 1977 joint conference I get 2 work titles, each that has only 1
>> edition. Ben, are you working with the dump?
>>
>> kc
>>
>> On 5/9/12 6:05 AM, Ben Companjen wrote:
>>> Hi,
>>>
>>> Although I found 341 duplicates of President Clinton a lot yesterday,
>>> there is still the "author" that goes by the name "Shirley
>>> Institute/Wira Joint Conference (1977 Manchester, Eng.)". There are a
>>> whopping 10,047 authors with that name! Merging those manually is only
>>> for those who desperately need an extremely boring task :)
>>>
>>> Looking at the subject and book titles in the search results, I think
>>> one MARC record was imported many times without duplicate detection,
>>> so merging the authors would still leave some 10000 duplicate
>>> works/editions.
>>>
>>> Any idea how to best solve this?
>>>
>>> Ben
>>> _______________________________________________
>>> Ol-discuss mailing list
>>> [email protected]
>>> http://mail.archive.org/cgi-bin/mailman/listinfo/ol-discuss
>>> To unsubscribe from this mailing list, send email to 
>>> [email protected]
>>>
>>
>> --
>> Karen Coyle
>> [email protected] http://kcoyle.net
>> ph: 1-510-540-7596
>> m: 1-510-435-8234
>> skype: kcoylenet
>> _______________________________________________
>> Ol-discuss mailing list
>> [email protected]
>> http://mail.archive.org/cgi-bin/mailman/listinfo/ol-discuss
>> To unsubscribe from this mailing list, send email to 
>> [email protected]
>

-- 
Karen Coyle
[email protected] http://kcoyle.net
ph: 1-510-540-7596
m: 1-510-435-8234
skype: kcoylenet
_______________________________________________
Ol-discuss mailing list
[email protected]
http://mail.archive.org/cgi-bin/mailman/listinfo/ol-discuss
To unsubscribe from this mailing list, send email to 
[email protected]

Reply via email to