On Fri, 9 May 2008, Bess Sadler wrote:

Those of us involved in the Blacklight and VuFind projects are
spending lots of time recently thinking about marc records indexing.
We're about to start running some performance tests, and we want to
create unit tests for our marc to solr indexer, and also people
wanting to download and play with the software need to have easy
access to a small but representative set of marc records that they
can play with.

[trimmed]

It seems to me that the set that Casey donated to Open Library
(http://www.archive.org/details/marc_records_scriblio_net) would be a
good place from which to draw records, because although IANAL, this
seems to sidestep any legal hurdles. I'd also love to see the ability
for the community to contribute test cases. Assuming such a set
doesn't exist already (see my question below) this seems like the
ideal sort of project for code4lib to host, too.

OpenLibrary has other datasets that you might be able to use / combine /
whatever to meet your requirements:

       http://openlibrary.org/dev/docs/data


-----
Joe Hourcle

Reply via email to