Re: [RDA-L] Automatically adding relationship designators (was Cost of Retrospective Conversion for Legacy Data...)

James Weinheimer Tue, 10 Dec 2013 04:33:15 -0800

On 09/12/2013 0.04, Kelley McGrath wrote:
<snip>

OLAC is attempting a project of this sort for film and video credits.We are trying to teach a computer to recognize the names and rolesthat appear in 245$c, 260+$b, 508 and 511 (and if we get really bravemaybe 505) and also connect them to the correct 1xx/7xx if present.The current program, which uses natural language processing (NLP)techniques, is reasonably successful with personal names and withroles given in English. We are working on building a multilingualvocabulary. It tends to choke on complicated statements that involve alot of corporate bodies.

</snip>

I hesitate to bring this up because most probably everybody alreadythinks of me as a purveyor of doom and gloom, but I still believe thatwe must consider these things in realistic terms. Although the attemptis laudable, I still say that we must first of all see through the eyesof the users who would be interested in this kind of information. Forinstance, if I am a regular user and I wanted to know the moviesdirected by John Huston, what would be the first thing I would think of?

"Google it". I am sure almost everybody would. So I did a naturallanguage search: "what movies did john huston direct" and what happens?https://www.google.it/search?q=what+movies+did+john+huston+direct (Thisis linked data in action!) We find that down below in the links area (atleast in the results I get), #1 is a link to John Huston in Wikipedia,#2 goes to "Category:Films directed by John Huston" also in Wikipedia,and #3 goes into his page at the IMDB (which I personally prefer). Allhave lists of the movies he directed. This is incredibly easy to do andfree to all.

Putting aside for the moment the linked data result, the 3 links performexactly the same function as in the past when someone would ask areference librarian, "I need a list of the movies John Huston directed"and the knowledgeable reference librarian would reply: "Here. You canfind the list in this book." and would hand the user the latest issue ofthis title http://lccn.loc.gov/sn99044419 (or something similar) whichwas very possibly shelved in the reference collection for quick and easyaccess.

Therefore, just as the reference librarian would take the user'squestion and convert it into, "He needs to look in Film directors : acomplete guide", today a reference librarian would do the same thing butanswer/include, "He needs to look in the IMDB". Without any doubt, thatis the ethical answer for such a question and will remain so for a long,long time in the future.

The huge difference is that today, people rarely consult referencelibrarians. The librarian would already know that if you want to findthe films of specific directors, the library catalog is currently notthe right place to look for this information and when viewedrealistically, it never will be the right place. There is nothing at allwrong with that. Not every tool is good for every use, just as if youwant the latest business news or to find out why your XML won'tvalidate, the best place is not JSTOR, and it never will be. Thatdoesn't mean JSTOR is no good--it just means that you have to look inother places for that kind of information. Today, the correct place tolook for the films people have directed is the IMDB or perhaps a fewother places on the web. We are *really lucky* that we have such optionsfor free today. The reference librarians would be able to help thesearcher in these directions *if* they were asked, but sadly, that ishappening less and less.

So, adding the relator codes automatically will still demand manualcleanup, perhaps (probably) on a massive scale, if it is ever to becomeas good as IMDB is *right now*. I suggest that the correct method for alibrary catalog is to lead the person to the *right resource* that he orshe wants and perhaps even do it *better* than Google. In this case offilm directors, I find it very difficult even to imagine how we could dobetter than Google because the Google search works so incredibly well.Perhaps a film librarian could discover that the IMDB and Wikipedia areincorrect or incomplete. In that respect perhaps library efforts couldbe better focused on improving IMDB and Wikipedia than adding relatorcodes.

There is also the option that the library catalog could interact withthe IMDB (and/or Wikipedia) using the APIs.

This opens up a highly pertinent question for me: I don't even know whata library catalog is supposed to provide in today's semi-totalinformation environment. This is a great example. We can't ignore thesewonderful sites. What should the catalog do today?

--
James Weinheimer weinheimer.ji...@gmail.com
First Thus http://catalogingmatters.blogspot.com/
First Thus Facebook Page https://www.facebook.com/FirstThus

Cooperative Cataloging Ruleshttp://sites.google.com/site/opencatalogingrules/Cataloging Matters Podcastshttp://blog.jweinheimer.net/p/cataloging-matters-podcasts.html


To unsubscribe from RDA-L send an e-mail to the following address from the 
address you are subscribed under to:
lists...@listserv.lac-bac.gc.ca
In the body of the message:
SIGNOFF RDA-L

Re: [RDA-L] Automatically adding relationship designators (was Cost of Retrospective Conversion for Legacy Data...)

Reply via email to