Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-29 Thread stuart yeates
On 29/08/12 19:46, Michael Hopwood wrote: Thanks for this pointer Owen. It's a nice illustration of the fact that what users actually want (well, I know I did back when I actually worked in large information services departments!) is something more like an intranet where the content I find is

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-29 Thread Michael Hopwood
mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Owen Stephens Sent: 28 August 2012 21:37 To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google The JISC funded CLOCK project did some thinking around cataloguing processes and tracking changes to statements and/or records - e

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-28 Thread Lars Aronsson
A week ago, I wrote: What I have done is just to search (worldcat.org and hathitrust.org) for some common Swedish words, and I don't have to do this for long before some very obvious (to a native speaker) spelling mistakes appear. I've reported 38 errors to Hathitrust, and got feedback that the

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-28 Thread Owen Stephens
The JISC funded CLOCK project did some thinking around cataloguing processes and tracking changes to statements and/or records - e.g. http://clock.blogs.lincoln.ac.uk/2012/05/23/its-a-model-and-its-looking-good/ Not solutions of course, but hopefully of interest Owen Owen Stephens Owen Stephen

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-28 Thread Simon Spero
On Aug 28, 2012, at 2:17 PM, Joe Hourcle wrote: > I seem to recall seeing a presentation a couple of years ago from someone in > the intelligence community, where they'd keep all of their intelligence, but > they stored RDF quads so they could track the source. > > They'd then assign a confiden

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-28 Thread Joe Hourcle
On Aug 28, 2012, at 12:05 PM, Galen Charlton wrote: > Hi, > > On 08/27/2012 04:36 PM, Karen Coyle wrote: >> I also assumed that Ed wasn't suggesting that we literally use github as >> our platform, but I do want to remind folks how far we are from having >> "people friendly" versioning software -

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-28 Thread Simon Spero
An interesting reference is this: High, W. M. (1990). Editing Changes to Monographic Cataloging Records in the OCLC Database: An Analysis of the Practice in Five University Libraries. PhD thesis, University of North Carolina at Chapel Hill. It's in UMI (and Heavy Trussed). Simon On Aug 28

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-28 Thread Galen Charlton
Hi, On 08/27/2012 04:36 PM, Karen Coyle wrote: I also assumed that Ed wasn't suggesting that we literally use github as our platform, but I do want to remind folks how far we are from having "people friendly" versioning software -- at least, none that I have seen has felt "intuitive." The featur

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-27 Thread stuart yeates
On 28/08/12 12:07, Peter Noerr wrote: They are not descendents of the same original, they are independent entities, whether they are recorded as singular MARC records or collections of LD triples. That depends on which end of the stick one grasps. Conceptually these are descendants of the ab

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-27 Thread Peter Noerr
ntity. Or this may all prove to be an unnecessary complication. Peter > -Original Message- > From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of > stuart yeates > Sent: Monday, August 27, 2012 3:42 PM > To: CODE4LIB@LISTSERV.ND.EDU > Subject: Re: [

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-27 Thread stuart yeates
These have to be named graphs, or at least collections of triples which can be processed through workflows as a single unit. In terms of LD there version needs to be defined in terms of: (a) synchronisation with the non-bibliographic real world (i.e. Dataset Z version X was released at time Y)

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-27 Thread Karen Coyle
Ed, Corey - I also assumed that Ed wasn't suggesting that we literally use github as our platform, but I do want to remind folks how far we are from having "people friendly" versioning software -- at least, none that I have seen has felt "intuitive." The features of git are great, and people h

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-27 Thread Ed Summers
On Mon, Aug 27, 2012 at 1:33 PM, Corey A Harper wrote: > I think there's a useful distinction here. Ed can correct me if I'm > wrong, but I suspect he was not actually suggesting that Git itself be > the user-interface to a github-for-data type service, but rather that > such a service can be buil

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-27 Thread Ed Summers
On Mon, Aug 27, 2012 at 8:49 AM, Karen Coyle wrote: > Actually, Ed, this would not only make for a good blog post (please, so it > doesn't get lost in email space), but I would love to see a discussion of > what kind of revision control would work: > > 1) for libraries (git is gawdawful nerdy) > 2

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-27 Thread Ed Summers
On Mon, Aug 27, 2012 at 10:36 AM, Ross Singer wrote: > For MARC data, while I don't know of any examples of this, it seems like > something like CouchDB [2] and marc-in-json [3] would be a fantastic way to > make something like this available. Great idea...and there are 4 years of transactions

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-27 Thread Corey A Harper
Karen, I think there's a useful distinction here. Ed can correct me if I'm wrong, but I suspect he was not actually suggesting that Git itself be the user-interface to a github-for-data type service, but rather that such a service can be built *on top* of an infrastructure component like GitHub.

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-27 Thread Ross Singer
On Aug 27, 2012, at 10:25 AM, Galen Charlton wrote: > Hi, > > On 08/27/2012 08:49 AM, Karen Coyle wrote: >> Actually, Ed, this would not only make for a good blog post (please, so >> it doesn't get lost in email space), but I would love to see a >> discussion of what kind of revision control woul

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-27 Thread Galen Charlton
Hi, On 08/27/2012 08:49 AM, Karen Coyle wrote: Actually, Ed, this would not only make for a good blog post (please, so it doesn't get lost in email space), but I would love to see a discussion of what kind of revision control would work: 1) for libraries (git is gawdawful nerdy) 2) for linked d

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-27 Thread Karen Coyle
Actually, Ed, this would not only make for a good blog post (please, so it doesn't get lost in email space), but I would love to see a discussion of what kind of revision control would work: 1) for libraries (git is gawdawful nerdy) 2) for linked data kc p.s. the Ramsay book is now showing on

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-26 Thread Ed Summers
Thanks for sharing this bit of detective work. I noticed something similar fairly recently myself [1], but didn't discover as plausible of a scenario for what had happened as you did. I imagine others have noticed this network effect before as well. On Tue, Aug 21, 2012 at 11:42 AM, Lars Aronsson

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-25 Thread Lars Aronsson
On August 22 ya'aQovwrote: *Lars, so the wrong spelling of that Swedish author is based on your browsing it, not on an automated procedure, or reference to an online thesaurus. Given your Swedish resources, is there any quality control mechanism you can suggest? This author's name was just one

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-21 Thread ya'aQov
*Lars, so the wrong spelling of that Swedish author is based on your browsing it, not on an automated procedure, or reference to an online thesaurus. Given your Swedish resources, is there any quality control mechanism you can suggest? * On Tue, Aug 21, 2012 at 10:42 AM, Lars Aronsson wrote: >

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-21 Thread Lars Aronsson
On 2012-08-20 22:38, Roy Tennant wrote: Any errors in WorldCat can be reported to bibcha...@oclc.org. We take record quality seriously, but as you can imagine when you take in records from thousands of sources around the world, this is a constant struggle. We have our own quality control efforts,

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-21 Thread Angelina Z
Lars, With regards to HathiTrust and Google, HathiTrust receives bib records from partner libraries. Google also receives bib records from partner libraries. HathiTrust has never received bib records from Google - sometimes we provide corrected records to them, but the flow is only one-sided. For

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-20 Thread ya'aQov
*Lars, Thank you. Which online thesaurus are you matching these names with (when you declare them mistakes)? if an online procedure, please be specific; how can VIAF benefit, it at all, from *Project Runeberg? Ya'aqov Ziso

Re: [CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-20 Thread Roy Tennant
Any errors in WorldCat can be reported to bibcha...@oclc.org. We take record quality seriously, but as you can imagine when you take in records from thousands of sources around the world, this is a constant struggle. We have our own quality control efforts, but individuals reporting problems are al

[CODE4LIB] Corrections to Worldcat/Hathi/Google

2012-08-20 Thread Lars Aronsson
This big mess of broken bibliographic records that started out as Google Book Search, and then spread to Hathi Trust, is now also spreading to OCLC Worldcat. I find the same kind of misspellings and misconceptions in all three places. So far, the VIAF author names seem to be more correct, but mayb