Hi all, I have one more list of duplicates, this time of work records. (Links to lists of duplicate authors were in a mail to OL-discuss [1].)
http://companjen.name/ol/dupe_works.html

The author with the most duplicate work records (counting only works whose title slug, subtitle slug and first author are the same in at least 5 records) is Plutarch, http://openlibrary.org/authors/OL58120A/Plutarch : 7,881 works, of which at least 7,100 are duplicates, i.e. very similar in title and subtitle to another work. Before any attempt to merge these records, more information about the works is needed.

Tropical Snow is also in this list; its duplicate work records are known to be the result of a bad import. I don't know what needs to happen with those records, but merging is probably not a good idea.

I also don't know whether some duplicate works are actually multiple volumes of the same work/edition. The works of "United States. Immigration and Naturalization Service" appear to be volumes rather than true duplicates (although I believe the discussion on multivolume works wasn't conclusive about what to do with them). This could be checked when counting duplicate Editions.

Regards,
Ben

[1] http://www.mail-archive.com/[email protected]/msg00668.html

P.S. http://companjen.name/ol/dupe_works.csv has the same data, but without the notes.
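For anyone who wants to experiment with the counting criterion described above (same title slug, subtitle slug and first author in at least 5 records), here is a minimal sketch. It is not the script used to build the list. The record layout (`key`, `title`, `subtitle`, `authors`), the `slugify` rule and the function names are all assumptions made for illustration; Open Library's real slugs may be computed differently.

```python
import re
from collections import defaultdict


def slugify(text):
    """Assumed slug rule: lowercase, runs of non-alphanumerics become '_'."""
    return re.sub(r"[^a-z0-9]+", "_", (text or "").lower()).strip("_")


def find_duplicate_groups(works, min_records=5):
    """Group works by (title slug, subtitle slug, first author).

    `works` is an iterable of dicts with hypothetical fields:
    'key', 'title', 'subtitle' (may be None) and 'authors' (list of author keys).
    Only groups with at least `min_records` members are returned.
    """
    groups = defaultdict(list)
    for work in works:
        authors = work.get("authors") or []
        if not authors:
            continue  # no first author to compare on, so skip the record
        group_key = (
            slugify(work.get("title")),
            slugify(work.get("subtitle")),
            authors[0],
        )
        groups[group_key].append(work["key"])
    return {k: v for k, v in groups.items() if len(v) >= min_records}
```

Note that a group found this way only says the records look alike. It cannot tell a bad import from separate volumes of a multivolume work, so the results would still need the manual checks mentioned above before any merging.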
