Re: [CODE4LIB] Implementing OpenURL for simple web resources

Eric Hellman Mon, 14 Sep 2009 10:37:30 -0700

As I'm sure you're aware, the OpenURL spec only talks about providingservices, and resolving to full text is only one of many possibleservices. If *all* you know about a referent is the url, thenredirecting the user to the url is going to be the best you can do inalmost all cases. In particular, I don't think the dublin coreprofile, which is what Owen suggests to use, has much to say aboutresolving to full text.

http://catalog.library.jhu.edu/bib/NUM identifies a catalog record- Imean what else would you use to id the catalog record. unless you'veimplemented the http-range 303 redirect recommendation in your catalog(http://www.w3.org/TR/cooluris/), it shouldn't be construed asidentifying the thing it describes, except as a private id, and youshould use another field for that.


IIRC Google, Worldcat, and Wikipedia used rft_id.

I'm not in a position to answer any questions about specific linkresolver software that I no longer am associated with, however good itis/was.


Eric


On Sep 14, 2009, at 12:57 PM, Jonathan Rochkind wrote:

Well, in the 'wild' I barely see any rft_id's at all, heh. Asidefrom the obvious non-http URIs in rft_id, I'm not sure if I've seenhttp URIs that don't resolve to full text. BUT -- you can doanything with an http URI that you can do with an info uri. There isno requirement or guarantee in any spec that an HTTP uri willresolve at all, let alone resolve to full text for the documentcited in an OpenURL.The OpenURL spec says that rft_id is "An Identifier Descriptorunambiguously specifies the Entity by means of a Uniform ResourceIdentifier (URI)." It doesn't say that it needs to resolve to fulltext.
In my own OpenURL link-generating software, I _frequently_ putidentifiers which are NOT open access URLs to full text in rft_id.Because there's no other place to put them. And I frequently usehttp URIs even for things that don't resolve to full text, becausethe conventional wisdom is to always use http for URIs, whether ornot they resolve at all, and certainly no requirement that theyresolve to something in particular like full text.
Examples that I use myself when generating OpenURL rft_ids, of httpURIs that do not resolve to full text include ones identifying bibrecords in my own catalog:http://catalog.library.jhu.edu/bib/NUM [ Will resolve to mycatalog record, but not to full text!]
Or similarly, WorldCat http URIs.
Or, an rft_id to unambigously identify something in terms of it'sGoogle Books record: http://books.google.com/books?id=tl8MAAAACAAJ
Also, URIs to unambiguously specify a referent in terms of sudoc: http://purl.org/NET/sudoc/[sudoc] => will, as the purl is presently set up by rsinger,resolve to a GPO catalog record, but there's no guarantee of onlinepublic full text.
I'm pretty sure what I'm doing is perfectly appropriate based on thedefinition of rft_id, but it's definitely incompatible with areceiving link resolver assuming that all rft_id http URIs willresolve to full text for the rft cited. I don't think it'sappropriate to assume that just because a URI is http, that means itwill resolve to full text -- it's merely an identifier thatunambiguously specifies the referent, same as any other URI scheme.Isn't that what the sem web folks are always insisting in thearguments about how it's okay to use http URIs for any type ofidentifier at all -- that http is just an identifier (at least in acontext where all that's called for is a URI to identify), you can'tassume that it resolves to anything in particular? (Although it'snice when it resolves to RDF saying more about the thing identified,it's certainly not expected that it will resolve to full text).
Eric, out of curiosity, will your own link resolver softwareautomatically take rft_id's and display them to the user as links?
Jonathan

Eric Hellman wrote:
Could you give us examples of http urls in rft_id that are likethat? I've never seen such.
On Sep 14, 2009, at 11:58 AM, Jonathan Rochkind wrote:
In general, identifiers in URI form are put in rft_id that areNOT meant for providing to the user as a navigable URL. So thereceiving software can't assume that whatever url is in rft_isrepresents an actual access point (available to the user) for thedocument.
Eric Hellman
President, Gluejar, Inc.
41 Watchung Plaza, #132
Montclair, NJ 07042
USA

e...@hellman.net
http://go-to-hellman.blogspot.com/




Eric Hellman
President, Gluejar, Inc.
41 Watchung Plaza, #132
Montclair, NJ 07042
USA

e...@hellman.net
http://go-to-hellman.blogspot.com/

Re: [CODE4LIB] Implementing OpenURL for simple web resources

Reply via email to