Re: [RDA-L] Browse and search BNB open data

2011-08-03 Thread Bernhard Eversberg

02.08.2011 18:34, J. McRee Elrod:

   http://www.allegro-c.de/db/a30/bl.htm

Am I correct that there is no MARC display available?


OK, for what it's worth and for good measure, I've added that in;
no big deal since we've got what it takes.
Now, MARC appears directly underneath the regular display. But only
as complete and as correct as the stuff that was released.
The format made available by BL is an XML schema of their
own design, documented here:

  http://www.bl.uk/bibliographic/datafree.html
  (under Data model  draft schema)

A sample XML record:

rdf:Description
dcterms:titleThe elves and the emperor/dcterms:title
dcterms:creator
rdf:Description
rdfs:labelRobinson, Hilary, 1962-/rdfs:label
/rdf:Description
/dcterms:creator
dcterms:contributor
rdf:Description
rdfs:labelSanfilippo, Simona./rdfs:label
/rdf:Description
/dcterms:contributor
dcterms:type
rdf:Description
rdfs:labeltext/rdfs:label
/rdf:Description
/dcterms:type
dcterms:type
rdf:Description
rdfs:labelmonographic/rdfs:label
/rdf:Description
/dcterms:type
isbd:P1016
rdf:Description
rdfs:labelLondon/rdfs:label
/rdf:Description
/isbd:P1016
dcterms:publisher
rdf:Description
rdfs:labelWayland/rdfs:label
/rdf:Description
/dcterms:publisher
dcterms:issued2009/dcterms:issued
dcterms:language
rdf:Description
rdf:value rdf:datatype=http://purl.org/dc/terms/ISO639-2;eng/rdf:value
/rdf:Description
/dcterms:language
dcterms:extent
rdf:Description
rdfs:label31 p/rdfs:label
/rdf:Description
/dcterms:extent
dcterms:descriptionOriginally published: 2008./dcterms:description
dcterms:subject
skos:Concept
skos:notation rdf:datatype=ddc:Notation428.6/skos:notation
skos:inScheme rdf:resource=http://dewey.info/scheme/e22; /
/skos:Concept
/dcterms:subject
dcterms:isPartOf
rdf:Description
rdfs:labelFairytale jumbles/rdfs:label
/rdf:Description
/dcterms:isPartOf
dcterms:isPartOf
rdf:Description
rdfs:labelStart reading. Purple band 8/rdfs:label
/rdf:Description
/dcterms:isPartOf
dcterms:identifier(Uk)015346892/dcterms:identifier
dcterms:identifierGBA979108/dcterms:identifier
bibo:isbn9780750255233/bibo:isbn
bibo:isbn0750255234/bibo:isbn
dcterms:identifierURN:ISBN:9780750255233/dcterms:identifier
dcterms:identifierURN:ISBN:0750255234/dcterms:identifier
/rdf:Description

which translates like this:

=LDR  01234cam a22002771i 45e0
=001  015346892
=007  ta
=008  \\991231s2009n\\\eng\d
=020  \\$a9780750255233
=040   $ea
=082  00$a428.6
=100  1\$aRobinson, Hilary (1962-)
=245  04$aThe elves and the emperor /$cHilary Robinson
=260  \\$aLondon :$bWayland,$c2009
=300  \\$a31 p
=440  \0$aFairytale jumbles
=440  \0$aStart reading. Purple band 8
=500  \\$aOriginally published: 2008.
=700  12$aSanfilippo, Simona


B.E.


Re: [RDA-L] Browse and search BNB open data

2011-08-03 Thread James Weinheimer

On 03/08/2011 08:34, Bernhard Eversberg wrote:
snip

02.08.2011 18:34, J. McRee Elrod:

   http://www.allegro-c.de/db/a30/bl.htm

Am I correct that there is no MARC display available?


OK, for what it's worth and for good measure, I've added that in;
no big deal since we've got what it takes.
Now, MARC appears directly underneath the regular display. But only
as complete and as correct as the stuff that was released.
The format made available by BL is an XML schema of their
own design, documented here:

  http://www.bl.uk/bibliographic/datafree.html
  (under Data model  draft schema)

/snip

This is interesting. From the table 
http://www.bl.uk/bibliographic/pdfs/marctordfxmlmappingsv0-3-2.pdf, we 
see how some of the semantics of the MARC format are lost in the 
conversion. As we evolve away from the MARC format, I am sure the 
direction will be toward simplification, so it seems valuable to discuss 
what could be eliminated from MARC with the fewest consequences. From a 
very quick review of that table, I see the 534 being translated to 
dcterms:description, losing some handy subfields, and all of the 
subfields in the 100/700 fields mapping to dcterms:creator. Also, all of 
the subfields in the 6xx fields are being placed into dcterms:subject, 
and there is a loss of the subfield description avxyz.


I need to emphasize that this is discussing losing the specific subfield 
*coding*, NOT losing the information, e.g.

100 0_*|a *Benedict*|b *XVI,*|c *Pope,*|d *1927-
as opposed to
*dcterms:creator*Benedict**XVI,**Pope,**1927-*/dcterms:creator*

In practical terms for all the various metadata communities, where 
precisely is the loss here?


While there is an undoubted loss in semantics, with the future evolution 
of MARC format, we can ask: do such losses have any practical 
consequences? Although I think many subfields (although not the 
information) could disappear without any essential loss, some will have 
important consequences to different communities. For instance, we see in 
the mapping the complete elimination of 245$c, which would obviously 
have important consequences for *librarians* (i.e. necessary for 
determination of a copy), although the loss of 245$c would be much less 
dire for the users. Loss of subfields with some of the most consequences 
would seem to be the subfields in the 6xx fields, since those semantics 
*could* lead to novel computer manipulation, sorting by chronology, 
geographic, and all kinds of other ways. Also, the distinctions of:

650$aHistory$xBibliography
650$aHistory$vBibliography
650$aBibliography$xHistory

would be lost.

Compare this to losing the subfields in the 1xx/7xx, where the 
consequences would appear to be much fewer.


Yet, compare this to what others want: even more semantics, for example, 
to encode 300$a even further to specify pages or leaves or whatever. e.g.

300
a
pages
245
/pages
leaves
56
/leaves
/a
/300
etc.

There are definite advantages with this level of coding but on the 
negative side, it is more work, prone to many more errors, and is more 
difficult to train new people, especially as there will be the push to 
simplify.


I think these questions will begin to be asked (finally!), and answered 
too. This project from the British Library may be a great catalyst for 
the discussion.


--
James Weinheimer  weinheimer.ji...@gmail.com
First Thus: http://catalogingmatters.blogspot.com/
Cooperative Cataloging Rules: http://sites.google.com/site/opencatalogingrules/



Re: [RDA-L] Browse and search BNB open data

2011-08-03 Thread Bernhard Eversberg

Am 03.08.2011 10:55, schrieb James Weinheimer:


There are definite advantages with this level of coding but on the 
negative side, it is more work, prone to many more errors, and is more 
difficult to train new people, especially as there will be the push to 
simplify.


I think these questions will begin to be asked (finally!), and 
answered too. This project from the British Library may be a great 
catalyst for the discussion.

The BL has teamed up with Talis to develop and improve their
open data activities. Here's more about that, together with a nice
diagram any cataloger might love to mount on their office wall:

  http://consulting.talis.com/2011/07/british-library-data-model-overview/

I understand that the current release is only a first step, and
together with Talis they will produce an improved version in the
near future.

B.Eversberg



Re: [RDA-L] Browse and search BNB open data

2011-08-03 Thread metadata
Bernard and all, 

In order to clarify the current situation, The British Library would like to 
take this opportunity to outline the range of free/open BNB options and 
encourage anyone seeking details to check 
http://www.bl.uk/bibliographic/datafree.html for further information. We would 
like to emphasise the experimental nature of this work and the likelihood that 
datasets we make available will be subject to change over time. As a result, we 
would recommend that those wishing to use the most up to date version of the 
BNB dataset obtain it directly from the BL. Older versions available from other 
sites have now been superseded and we will be contacting organisations we 
identify mounting these to offer updated versions.

The current BNB options are: 

1) BNB as linked data (the latest free data release, in association with Talis) 
- Available under a CC0 license using: SPARQL, Describe and Search endpoints. 
This dataset has been updated from an initial preview version of around 400,000 
records to cover over 2.6 million monographs (80,249,538 triples) ; we hope to 
also offer a dump of the file via FTP shortly using the new data model 
(available at http://www.bl.uk/bibliographic/pdfs/datamodelv1_01.pdf) and 
schema (available at: 
(http://www.bl.uk/bibliographic/pdfs/britishlibrarytermsv1-01.pdf) 

2) BNB in basic RDF/XML via FTP (the dataset currently under discussion) - 
Available under a CC0 license to individual researchers or organisations not 
requiring MARC21 data but wishing to data mine, mash up or otherwise 
interrogate the data set in bulk. An updated version is currently being 
produced which will be available via FTP directly from the BL - please contact 
metad...@bl.uk for access details. 

3) BNB Z39.50 MARC21 Access - A free registration based service for 
non-commercial use under terms outlined on the British Library free data web 
page at: http://www.bl.uk/bibliographic/datafree.html 

If you have any queries about any of the BNB data offerings, please contact us 
at metad...@bl.uk 

Thank you

Best regards

Corine

Corine Deliot on behalf of Metadata Services, The British Library. 
email: metad...@bl.uk 


From: Resource Description and Access / Resource Description and Access 
[mailto:RDA-L@LISTSERV.LAC-BAC.GC.CA] On Behalf Of Bernhard Eversberg
Sent: 03 August 2011 10:14
To: RDA-L@LISTSERV.LAC-BAC.GC.CA
Subject: Re: [RDA-L] Browse and search BNB open data

Am 03.08.2011 10:55, schrieb James Weinheimer: 

There are definite advantages with this level of coding but on the negative 
side, it is more work, prone to many more errors, and is more difficult to 
train new people, especially as there will be the push to simplify. 

I think these questions will begin to be asked (finally!), and answered too. 
This project from the British Library may be a great catalyst for the 
discussion.
The BL has teamed up with Talis to develop and improve their
open data activities. Here's more about that, together with a nice
diagram any cataloger might love to mount on their office wall:

  http://consulting.talis.com/2011/07/british-library-data-model-overview/

I understand that the current release is only a first step, and 
together with Talis they will produce an improved version in the
near future.

B.Eversberg


Re: [RDA-L] Browse and search BNB open data

2011-08-03 Thread J. McRee Elrod
In article 4e38ebe3.5090...@biblio.tu-bs.de, you wrote:

OK, for what it's worth and for good measure, I've added that in;
no big deal since we've got what it takes.

Bless your sweet heart.

Did you notice the not for commercial purposes in the BL posting? We
are not even going to ask.  No matter how much we give back, as
outsourcer we are made to feel dirty.

How anyone comparing the XML and MARC versions could prefer the XML is
beyond me.  We find it simple to crosswalk from MARC to XML for anyone
who wants it, but not back again.

Mac


   __   __   J. McRee (Mac) Elrod (m...@slc.bc.ca)
  {__  |   / Special Libraries Cataloguing   HTTP://www.slc.bc.ca/
  ___} |__ \__


Re: [RDA-L] Browse and search BNB open data

2011-08-03 Thread Karen Coyle

Quoting James Weinheimer weinheimer.ji...@gmail.com:


While there is an undoubted loss in semantics, with the future  
evolution of MARC format, we can ask: do such losses have any  
practical consequences? Although I think many subfields (although  
not the information) could disappear without any essential loss,  
some will have important consequences to different communities.


Jim, this is much of the motivation for the work that I have been  
doing to try to identify the actual elements of MARC21 -- elements  
in the semantic sense, trying to ignore the MARC21 structure (which  
results in much repetition, etc.) A report on my study is available in  
the recent Code4Lib journal:


http://journal.code4lib.org/articles/5468

One of the difficulties of deciding what we do and do not want to keep  
in MARC, or what we want to move over to the RDA environment, is that  
we have no dictionary of everything that MARC covers. For example,  
what standard identifiers are available in MARC? They are scattered  
all over the format, so it's hard to know. What about things like  
language and date? Those appear in different fields with somewhat  
different meanings.


My assumption is that a complete inventory of MARC elements is  
essential for any move away from MARC. Unfortunately, I have gotten  
now to the 1xx-8xx fields (the study so far is 00x and 0xx, that's  
already pretty complex!) and may not have the energy to complete the  
study on my own. However, what I have done so far at least sets down  
some possible principles to follow.


I'm doing it all on the futurelib wiki so my process is as transparent  
as I can make it:

  http://futurelib.pbworks.com/w/page/29114548/MARC%20elements

kc

--
Karen Coyle
kco...@kcoyle.net http://kcoyle.net
ph: 1-510-540-7596
m: 1-510-435-8234
skype: kcoylenet


Re: [RDA-L] Article on RDA implementation

2011-08-03 Thread J. McRee Elrod
http://www.libraryjournal.com/lj/home/891482-264/cataloging_community_galvanized_as_u.s..csp

It says:

However, the JSC will no longer update AACR2. So continuing to use these 
rules does not remain a viable long term option. 


This does not preclude others doing so.  The most widely used rules in
the world are various translations of Concise AACR2.

If the conditions set by the US national libraries are not met, and
RDA is not implemented, JSC might have to reconsider its decision not
to update AACR2.

Thanks to Allen Mullen for posting this URL to Autocat.


   __   __   J. McRee (Mac) Elrod (m...@slc.bc.ca)
  {__  |   / Special Libraries Cataloguing   HTTP://www.slc.bc.ca/
  ___} |__ \__