Hi all,

I have one more list of duplicates, this time it's work records.
(Links to lists of duplicate authors were in a mail to OL-discuss
[1].)

http://companjen.name/ol/dupe_works.html

The author with the most duplicate work records (counting only the
works with title slug, subtitle slug and first author the same in at
least 5 records), is Plutarch.

http://openlibrary.org/authors/OL58120A/Plutarch : 7,881 works, of
which at least 7100 are duplicate, i.e. are very similar in title and
subtitle to another work.

Before any attempt to merge these records, more information about the
works is needed. Tropical Snow is also in this list, of which it is
known that the duplicate work records are a bad import. I don't know
what needs to happen with those records, but merging is probably not a
good idea.
Oh, I also don't know whether some duplicate works are actually
multiple volumes of the same work/edition. The works of "United
States. Immigration and Naturalization Service" appear to be volumes
rather than true duplicates (although the discussion on multivolume
works wasn't conclusive about what to do with them, I believe). This
could be checked when counting duplicate Editions.

Regards,

Ben

[1] http://www.mail-archive.com/[email protected]/msg00668.html

P.S. http://companjen.name/ol/dupe_works.csv has the same data, but
without the notes.
_______________________________________________
Ol-tech mailing list
[email protected]
http://mail.archive.org/cgi-bin/mailman/listinfo/ol-tech
To unsubscribe from this mailing list, send email to 
[email protected]

Reply via email to