On Thu, Jun 10, 2010 at 5:02 PM, Samuele Kaplun <[email protected]> wrote:
Currently when you use xmlmarc2textmarc utility, and you export to
Aleph, a dummy leader is generated, which proved to be enough for Aleph.
Ahh, I wasn't aware of that existing utility. I showed the marc output
from bibformat to some librarian coder friends and they said, "huh,
looks like aleph sequential" :)
So Aleph doesn't care about the leader values?
I am currently not an expert on the leader subject, but to me it seems
that the leader makes sense mostly in the MARC21 binary format, and when
dealing with plain library records. And it exists in MARCXML just as a
conversion consequence. Is this correct? Proof is that Invenio can do
powerful and extremely flexible things without any need for the leader.
Not an expert either. I think for Invenio it really just boils down to
interoperability. I mean, saying, Software X can do incredible,
amazing things with internal, non-standard format Y, isn't really a
remarkable statement.
In particular if Invenio has to support the leader in MARCXML how can we
map its workflows with the rigid schema of the leader:
I think sensible defaults for some values combined with a minimum of
conditional logic should suffice. The first part of that may be the
trickier as I'm still trying to figure out defaults myself.
Also what is the meaning of certain bytes of the leader in MARCXML:
(from <http://www.loc.gov/marc/bibliographic/bdleader.html>):
[...]
Character Positions
00-04 - Record length
[...]
12-16 - Base address of data
[...]
leader/05 = 'n' - the term "new" in this context is confusing but
I've been told "don't overthink it"
leader/06 = 'a' - "...electronic resources that are basically textual in nature"
leader/07 is where i'm less confident but I *think* the logic is
simply 'b' for articles, 'a' for things that are part of a collection
or proceedings, and 'm' for everything else. For ADS we are currently
storing our internal item-type description in the 690a (which may be
incorrect) and this is how i'm determining the leader/07
leader/08 = '#' - not sure about this one
leader/09 = 'a' - assuming unicode
I've been looking at http://www.itsmarc.com/crs/bib1465.htm for some
guidance and when I have a dumb question about something I'll ask in
the #code4lib irc channel.
Apparently the 008 control field
(http://www.loc.gov/marc/bibliographic/bd008.html) is also important
to many applications, but I haven't really explored it or determined
the level of importance.
In the end, probably the best thing is still to put a fake leader like
xmlmarc2textmarc currently does, with the most neutral values.
Yeah, I guess I agree, although I'm not sure what a "neutral" value
would be for something like the leader/07. also it's important to get
the leader/09 correct as tools like pymarc need to know how to decode.
It is true that, on the other hand, when Invenio records have been
imported from original MARC21 or from MARCXML with a leader, Invenio
should not throw away such information.
Agreed.
--jay
******************************************************
Jay Luker Astrophysics Data System (ADS)
[email protected] Center for Astrophysics
617-495-4588 60 Garden Street MS 67
617-495-7356 fax Cambridge, MA 02138
******************************************************