About 1/3 of the 1M ebooks on OpenLibrary.org have full MARC records, and you can retrieve the record via the API. There is also a "secret" record format that returns not the full MARC for the hard copy (which is what the records represent because these are digitized books) but a record that has been modified to represent the ebook.

The MARC records for the hard copy follow the pattern:

https://archive.org/download/[archive identifier]/[archive identifier]_marc.[xml|mrc]

Download MARC XML https://archive.org/download/myantonia00cathrich/myantonia00cathrich_marc.xml Download MARC binary https://www.archive.org/download/myantonia00cathrich/myantonia00cathrich_meta.mrc <https://archive.org/download/myantonia00cathrich/myantonia00cathrich_meta.mrc>


To get the one that represents the ebook, do:

https://archive.org/download/[archive identifier]/[archive identifier]_archive_marc.xml

https://archive.org/download/myantonia00cathrich/myantonia00cathrich_archive_marc.xml

This one has an 007, the 245 $h, and a few other things.

Tom Morris did some code that helps you search for books by author and title and retrieve a MARC record. I don't recall where his github archive is, but I'll find out and post it here. The code is open source. We used it for a project that added ebook records to a public library catalog.

You can also use the OPenLibrary API to select all open access ebooks. What I'd like to see is a way to create a list or bibliography in OL that then is imported into a program that will find MARC records for those books. The list function is still under development, though.

kc

On 8/18/14, 3:04 PM, Stuart Yeates wrote:
There are a stack of great free ebook repositories available on the web, things like https://unglue.it/ http://www.gutenberg.org/ https://en.wikibooks.org/wiki/Main_Page http://www.gutenberg.net.au/ https://www.smashwords.com/books/category/1/newest/0/free/any etc, etc

What there doesn't appear to be, is high-quality AACR2 / RDA records available for these. There are things like https://ebooks.adelaide.edu.au/meta/pg/ which are elaborate dublin core to MARC converters, but these lack standardisation of names, authority control (people, entities, places, etc), interlinking, etc.

It seems to me that quality metadata would greatly increase the value / findability / use of these projects and thus their visibility and available sources.

Are there any projects working in this space already? Are there suitable tools available?

cheers
stuart

--
Karen Coyle
kco...@kcoyle.net http://kcoyle.net
m: +1-510-435-8234
skype: kcoylenet/+1-510-984-3600

Reply via email to