Re: [CODE4LIB] Millions of Harvard Library Catalog Records Publicly Available

2012-04-25 Thread Matt Amory
I think this DPLA notice covers this release of metadata:

John Palfrey jpalf...@law.harvard.edu
Apr 24 (1 day ago)


to dpla-discussion
Dear colleagues interested in the DPLA:

 Below, please find a news release that we have just issued from Harvard
about a major open access metadata release that will benefit our DPLA
initiative.  The NYT has coverage in its Bits Blog:


http://bits.blogs.nytimes.com/2012/04/24/harvard-releases-big-data-for-books/

 We at Harvard very much hope that other institutions will join us in open
access metadata releases made available through the DPLA's emerging new
platform.

   Much to talk about later this week in San Francisco!  Again, thanks for
everything you're doing.

 Best,
John

 --

*Millions of Harvard Library Catalog Records Publicly Available*

*Harvard releases nearly 100% of its records*

April 24, 2012 – The Harvard Library announced it would make more than 12
million catalog records from Harvard’s 73 libraries publicly available
(http://openmetadata.lib.harvard.edu/).




The records contain bibliographic information about books, videos, audio
recordings, images, manuscripts, maps, and more. The Harvard Library is
making these records available in accordance with its Open Metadata
Policy (http://openmetadata.lib.harvard.edu/) and under a Creative
Commons 0 public domain license. In addition, the Harvard Library
announced its open distribution of metadata from its Digital Access to
Scholarship at Harvard (DASH, http://dash.harvard.edu/) scholarly article
repository under a similar CC0 license.



“The Harvard Library is committed to collaboration and open access. We hope
this contribution is one of many steps toward sharing the vital cultural
knowledge held by libraries with all,” said Mary Lee Kennedy, Senior
Associate Provost for the Harvard Library.



The catalog records are available for bulk download from Harvard, and are
available for programmatic access by software applications via APIs at the
Digital Public Library of America (DPLA, http://dp.la/). The records are in
the standard MARC21 format.
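
For readers who want to look inside a bulk file, here is a minimal sketch of how a MARC21 (ISO 2709) record is laid out, using only the Python standard library. It is not part of the release and not Harvard's or the DPLA's tooling; the function names `iter_records` and `parse_record` are hypothetical, and the sketch assumes the records are UTF-8 encoded (MARC-8 data would need a proper MARC library).

```python
# Minimal MARC21 (ISO 2709) reader sketch, standard library only.
# Assumption: text is UTF-8; undecodable bytes are replaced, not raised.
FIELD_END = b"\x1e"   # field terminator
RECORD_END = b"\x1d"  # record terminator
SUBFIELD = b"\x1f"    # subfield delimiter


def iter_records(data: bytes):
    """Yield raw records from a buffer of concatenated MARC21 records."""
    pos = 0
    while pos + 24 <= len(data):
        # Leader bytes 0-4 hold the total record length as ASCII digits.
        length = int(data[pos:pos + 5].decode("ascii"))
        if length <= 0:
            break  # malformed leader; stop rather than loop forever
        yield data[pos:pos + length]
        pos += length


def parse_record(raw: bytes) -> dict:
    """Return {tag: [value, ...]} for one raw record.

    Control fields (tags below 010) map to strings; data fields map to
    lists of (subfield code, value) pairs. Indicators are skipped.
    """
    # Leader bytes 12-16 hold the base address where field data starts.
    base = int(raw[12:17].decode("ascii"))
    # The directory sits between the 24-byte leader and its terminator.
    directory = raw[24:base - 1]
    fields = {}
    for i in range(0, len(directory), 12):
        entry = directory[i:i + 12]
        tag = entry[0:3].decode("ascii")
        flen = int(entry[3:7].decode("ascii"))
        start = int(entry[7:12].decode("ascii"))
        body = raw[base + start:base + start + flen].rstrip(FIELD_END)
        if tag < "010":
            value = body.decode("utf-8", errors="replace")
        else:
            # Drop the two indicator bytes, then split on the delimiter.
            parts = body[2:].split(SUBFIELD)[1:]
            value = [
                (p[:1].decode("ascii"), p[1:].decode("utf-8", errors="replace"))
                for p in parts if p
            ]
        fields.setdefault(tag, []).append(value)
    return fields
```

With a downloaded file read into memory as bytes, something like `for raw in iter_records(data): print(parse_record(raw).get("245"))` would list the title fields; the file name and how the download is packaged are left to the release documentation.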



“By instituting a policy of open metadata, the Harvard Library has expressed
its appreciation for the great potential that library metadata has for
innovative uses. The two metadata releases today are prime examples,” said
Stuart Shieber, Library Board Member, Director of the Office for Scholarly
Communication and Professor of Computer Science at Harvard.



John Palfrey, chair of the DPLA, said, “With this major contribution,
developers will be able to start experimenting with building innovative
applications that put to use the vital national resource that consists of
our local public and research libraries, museums, archives and cultural
collections.” He added that he hoped that this would encourage other
institutions to make their own collection metadata publicly available
(http://dp.la/dev/wiki/Metadata_upload).



The records consist of information describing works (including creator,
title, publisher, date, language, and subject headings) as well as other
descriptors usually invisible to end users, such as the equalization system
used in a recording. Harvard’s Kennedy noted, “The accessibility of the
entire set of data for each item will, we hope, spur imaginative uses that
will find new value in what libraries know.”

Media Contact:
Kira Poplowski kira_poplow...@harvard.edu
Director of Communications
The Harvard Library
617.496.3758

On Wed, Apr 25, 2012 at 9:57 AM, Joseph Montibello 
joseph.montibe...@dartmouth.edu wrote:

 Thanks for noting this, Will, it was news to me.

 This is nothing like an official statement, but here's an interesting
 perspective from someone at OCLC:

 http://bit.ly/hvrdrecOCLC

 Joe Montibello, MLIS
 Library Systems Manager
 Dartmouth College Library
 603.646.9394
 joseph.montibe...@dartmouth.edu






 On 4/24/12 4:43 PM, Will Kurt wck...@gmail.com wrote:

 Apologies if this is old news, but I was very excited to see Harvard
 making all this data public:
 http://isites.harvard.edu/icb/icb.do?keyword=k77982pageid=icb.page498373
 
 Tons of cool data analysis / machine learning work to be done here!
 Warm up your SVMs ;)
 
 --Will
 




-- 
Matt Amory
(917) 771-4157
matt.am...@gmail.com
http://www.linkedin.com/pub/matt-amory/8/515/239


[CODE4LIB] Millions of Harvard Library Catalog Records Publicly Available

2012-04-24 Thread Will Kurt
Apologies if this is old news, but I was very excited to see Harvard
making all this data public:
http://isites.harvard.edu/icb/icb.do?keyword=k77982pageid=icb.page498373

Tons of cool data analysis / machine learning work to be done here!
Warm up your SVMs ;)

--Will