"access points" as 'textual identifiers' ??


I have thought for a while now that RDA needs a better, more
rigorous, clear, elegant and simple conceptual grounding of what are
currently being called 'controlled access points' (or sometimes just
'access points', which sometimes means the same thing and sometimes
not; previously 'headings', 'citations', 'main/added entries', etc.).


At the CC:DA meeting in Seattle, I heard someone (Jennifer Bowen?)
talking about "access points" vs. "identifiers", and how RDA would
like to make identifiers optional or even preferred for linking
records instead of 'access points', but it wasn't clear how that
would be done.


This got me started thinking, and lead to a line of thought,
influenced very significantly by Martha Yee's frequent use of the
term/concept 'work identifier', that I  explicate here in this draft
essay unfortunately more lengthy than I wish it were.


My proposal is that our traditional system of controlled 'headings',
'access points', or 'main entries', really _is_ a system of
identifiers, and best understood and discussed that way.   A
traditional heading is best understood as intending to uniquely and
unambiguously identify a bibliographic entity---in the FRBR regime, a
FRBR entity of some kind.  A work, manifestation, item, person,
corporate body, subject, etc.  Constructing a heading (or 'controlled
access point' in the not always consistent and somewhat confusing
terminology of current RDA) is really constructing an identifier for
one of these entities, within a particular system that has
traditionally been the primary system of identifiers in the catalog.
For lack of a better term, let us call identifiers constructued
according to one version or another of this traditional system of
headings: "Textual Identifiers".[1]


RDA terminology behind "controlled access points", discussing
"primary access points" vs "secondary access points", implies that
there are two KINDS of "controlled access points."  This is
inaccurate and misleading terminology. In fact, there are instead two
_uses_ of textual identifiers, or two tasks involving textual
identifiers that the cataloger has to face in creating a record:


1) ESTABLISHING 'textual identifiers' for newly established FRBR
entities. This includes, of course, establishing a textual identifier
for the record being cataloged (a manifestation). This use or task
roughly corresponds to the concept of 'main entry'.  This also
includes establishing identifiers for People, Corporate Bodies,
perhaps Subjects, perhaps Works and Expressions if cataloging codes
are to require such (as Bowen suggested RDA was inclined to do).


2) REFERENCING the textual identifiers of _foreign_ entities in the
record for the entity at hand, in order to establish RELATIONSHIPS.
These textual identifiers of foreign entities may have already
existed before the moment of cataloging, or may have been ESTABLISHED
_in order to_ allow their referencing to encode a relationship. This
corresponds very closely to 'added entry'--we will understand the
'added entry' as a _use_ of _another entity's_ textual identifier, in
order to establish a relationship between the entity at hand and that
foreign entity.  This is in fact also quite what happens with 6xx
subjects (referencing a 'textual identifier' for a topic or other
subject entity or entities), and with most other data elements you
can think of which REFERENCE a 'textual identifier' (that is, a
'heading', or 'controlled access point') for a foreign entity, in
order to encode a relationship between the entity at hand and that
foreign entity.


Using this new terminology, how do we translate or explain certain
cataloging practices?


"Title Main Entry"  =>  "The textual identifier for this Group 1
entity is constructed using only the entity's Title element."


"Author Main Entry" => "The textual identifier for this Group 1
entity is constructed using BOTH the Title element, and the Author."
Actually, to be clear. "... the Title element and the Textual
Identifier for the author, or one of the authors when multiple
authors exist, as chosen according to the code."  It should be clear
that 'author main entry' really means the textual identifier consists
of the Author AND the Title, not _just_ the author.  [This is a
difference between the 'main entry' terminology, which pre-dates even
the card catalog, and really makes sense only in the environment of
the bound catalog(!) --And the 'textual identifier' terminology more
suited for the digital era.]


It also becomes clear that a MARC 100, for instance, in current
cataloging, in fact serves at least TWO purposes--it is one component
of the Textual Identifier ESTABLISHED for the manifestation at hand
(the one described by the bib record). AND, it REFERENCES the textual
identifier of a Person entity, to describe a relationship between the
manifestation at hand and that Person. An author 700, on the other
hand, plays no part in ESTABLISHING the textual identifier for the
manifestation at hand, but only describes a relationship.  An author-
title 700 likewise simply REFERENCES a textual identifier--in this
case of a foreign group 1 entity--in order to establish a relationship.


This conceptual framework and terminology also points out some
ambiguities in our current practice.  What is a uniform title, or
'uniform title main entry'? Does a string in a MARC 240 ESTABLISH a
textual identifier for the Manifestation described in the bib record,
OR for an Expression OR for a Work?  Or does it somehow do multiple
duties for all of these?  We really aren't neccesarily sure from
current bib records and practice. Does it REFERENCE a foreign
identifier of a Work or Expression?  Almost certianly, but which?
Well, deciding/understanding that depends as a pre-requisite on
understanding the establishment of textual identifiers for all
entities involved.


This terminology provides a more rigorous and clear conceptual
framework for what's going on, that not only will be easier to
understand for 'non library communities', but will allow the kind of
principle based foundation that RDA aspires to, making it more clear
what the point is. This framework suggests two tasks of cataloging
practice:


1) Whenever a new record for any FRBR entity is created, the
cataloger must determine and establish the 'textual identifier' for
that record/entity. (To allow other records to encode relationships
to this record by traditional 'textual identifier' system.) [And our
records must be structured such that this textual identifier is
explicit and machine-addressable!]  [Even if a manifestation bib
record is traditional 'uniform title main entry', you still need to
establish a textual identfier for the manifestation as distinct from
a textual identifier for the work or expression.]


2) Whenever a cataloger references a textual identifier for a foreign
entity---that is, enters a 'controlled access point' for some foreign
entity--the cataloger must first explicitly decide precisely which
entity she means to reference (the ambiguity of current uniform
titles is highly undesirable), and secondly discover the already
established Textual Identifier for that entity by consulting a bib or
authority record, or by instead establishing the textual identifier
for that foreign entity. (And ideally by establishing a corresponding
bib or authority record to record that decision, but this isn't
always the way our current cataloging environment works).


This makes it clear how to talk about our traditional textual
identifiers vs. 'new fangled' identifiers too. You can establish
identifiers from any other system of identifiers (some type of URI or
URN perhaps) in #1 along with our traditional textual identifiers;
and you can reference these 'new fangled' identifiers in #2 instead
of our traditional textual identifiers.


It is my suggestion that this terminology and framework ought to be
at the heart of RDA, especially part B.  Of course, this conceptual
frameowrk doesn't entirely match the way things are currently done or
described. It may or may not be feasible to suggest changing the way
things are done to fit into this more rigorous and clear conceptual
framework---but it would still be valuable for RDA to re-organize
it's treatment baed on this framework, and to be clear about when
legacy practices are kept _even though_ they do not match RDA's
conceptual framework.


But even to go this far would require significant re-thinking, re-
organizing, and  re-writing, beyond which seems to be contemplated by
the very short schedule for RDA Part B and the continued assurances
that this is because Part B will be "unchanged from AACR2".   This
way of thinking about headings is not particularly new or original to
me; as I mentioned it is something Martha Yee (and others) have been
approaching for years, and I think it is in fact the way the JSC and
Editor are thinking about things. But it does not seem like it will
make it into RDA in a real way. This worries me. I think these issues
of controlled textual identifiers as relationship indicators may be
more important for bringing cataloging practice into the digital era
than the minutia of transcription decisions which much of Part A
consists of.


I appreciate any comments, feedback, debate, disagremeent on these
ideas.


Jonathan Rochkind


--


[1] I realize it could be argued that many other kinds of identifiers
are 'textual', depending on what is meant by 'textual', perhaps
including URIs and such. The intention of defining the term 'textual
identifier' as I have is not to argue this point. But we need a term
that refers explicitly to headings constructed according to our
traditional systems, and makes it clear that they are to be
understood as one type of identifier. 'Textual identifier' will do as
a term of art, unless someone can come up with a better one.

Reply via email to