[CODE4LIB] Access 2009 Conference

2009-05-14 Thread Paul Pound
Apologies for cross-posting...

The Access 2009 Conference clock is officially ticking down and the  
conference team is working hard to bring you the best program and  
social events possible. The dates of the best little library tech  
conference anywhere are Thursday Oct 1 - Saturday October 3, with the  
Hackfest taking place September 30. You can get more information and  
register today at:

http://vre.upei.ca/access2009 

We are working hard on the program, but have finished the important  
bits: Cory Doctorow is our keynote (and a participant at the Hackfest)  
and the social events will be awesome! The session details will be  
filled in over the coming weeks, but we have a great line-up in store.

We would encourage people who want to reserve a spot to register and  
book you hotels as soon as possible. The summer and fall are beautiful  
on PEI and full of incredible events (such as Fall Flavours - 
http://www.fallflavours.ca/) 
  so the hotels book up quickly. We have reserved a bank of rooms in  
the hotels closest to the conference venue but you still need to book  
soon, as the room blocks will expire in August for most. You might  
even want to book an extra day or 2 on either side of the Conference  
to make the most of your stay - we promise you won't regret it...

See you in Charlottetown!

Paul Pound
Systems Programmer/Analyst
Robertson Library
UPEI
902 566 0353
ppo...@upei.ca


Re: [CODE4LIB] One Data Format Identifier (and Registry) to Rule Them All

2009-05-14 Thread Mike Taylor
Alexander Johannesen writes:
  Anyway, I'm suspecting I don't see what the problem seems to be. To
  create the best identifier for things seems a bit of a strange
  notion to me, but is this based on that there is only (or rather,
  that you're trying to create) one identifier for any one thing?

Yes, this is exactly it.  RDF things that each concept should have
exactly one identifier; Topic Maps says its fine to have multiple
identifiers.  That seems to be 99% of the conceptual difference
between them.

My position: it seems obvious that one is the CORRECT number of
identifiers for a thing to have.  But since we live in a formal
world, the Topics Map approach may be more practical.

In other words, I might end up _advocating_ Topic Maps, but don't
expect me to _like_ it :-)

 _/|____
/o ) \/  Mike Taylorm...@indexdata.comhttp://www.miketaylor.org.uk
)_v__/\  I think it's too consistently wrong not to be fixable --
 Phil Baldwin.


Re: [CODE4LIB] One Data Format Identifier (and Registry) to Rule Them All

2009-05-14 Thread Rob Sanderson
RDF is fine with one 'thing' having multiple identifiers, it just hands
the problem up a level to the application to deal with.

For example, the owl:sameAs predicate is used to express that the
subject and object are the same 'thing'.  Then the application can infer
that if a owl:sameAs b, and a x y, then b x y.

Rob

On Thu, 2009-05-14 at 13:00 +0100, Mike Taylor wrote:
 Alexander Johannesen writes:
   Anyway, I'm suspecting I don't see what the problem seems to be. To
   create the best identifier for things seems a bit of a strange
   notion to me, but is this based on that there is only (or rather,
   that you're trying to create) one identifier for any one thing?
 
 Yes, this is exactly it.  RDF things that each concept should have
 exactly one identifier; Topic Maps says its fine to have multiple
 identifiers.  That seems to be 99% of the conceptual difference
 between them.
 
 My position: it seems obvious that one is the CORRECT number of
 identifiers for a thing to have.  But since we live in a formal
 world, the Topics Map approach may be more practical.
 
 In other words, I might end up _advocating_ Topic Maps, but don't
 expect me to _like_ it :-)
 
  _/|_  ___
 /o ) \/  Mike Taylorm...@indexdata.comhttp://www.miketaylor.org.uk
 )_v__/\  I think it's too consistently wrong not to be fixable --
Phil Baldwin.


Re: [CODE4LIB] Formats and its identifiers

2009-05-14 Thread Mike Taylor
Rob is correct on all points.

Namespace URIs can, in some cases, be overloaded to function as schema
identifiers.  But they absolutely can't be used blindly in this way
for arbitrary formats -- there are all kinds of potential gotchas.
That being so, I think it is wiser and more explicit _always_ to
define a separate identitifier for a format.

 _/|____
/o ) \/  Mike Taylorm...@indexdata.comhttp://www.miketaylor.org.uk
)_v__/\  ... currently trading under the name Gently for reasons which it
 would be otiose, for the moment, to rehearse -- Douglas Adams,
 Dirk Gently


Rob Sanderson writes:
  On Mon, 2009-05-11 at 14:53 +0100, Jakob Voss wrote:
  
A format should be described with a schema (XML Schema, OWL etc.) or at 
least a standard. Mostly this schema already has a namespace or similar 
identifier that can be used for the whole format.

This is unfortunately not the case.
   
   It is mostly the case - but people like to misinterpret schemas and 
   tailor them to their needs.
  
  You're advocating an approach that mostly works, as opposed to one
  that works in all cases?
  
  
For instance MODS Version 3 (currently 3.0, 3.1, 3.2, 3.4) has the XML 
Namespace http://www.loc.gov/mods/v3 so this is the best identifier to 
identify MODS. 

And this is a perfect example of why this is not the case. 
The same mods schema (let alone namespace) defines TWO formats, mods and
modsCollection.
  
   That's your interpretation. According to the schema, the MODS format 
   *is* either a single mods-element or a modsCollection-element. 
  
  According to the __schema__ yes.  Not according to the namespace. The
  namespace is a collection of names only and says precisely nothing about
  structure.
  
  And, yes, given no definition of format, my interpretation is that the
  mods schema defines two formats, as it defines two top level elements
  with different contents (eg one may contain the other).  This is
  typically how people would define format in this context, I would say.  
  
  This is, of course, tangential to the fact that you cannot use the __XML
  Namespace__ as an identifier for the format, no matter how you define
  it.
  
  
   That's 
   exactely what you can refer to with the namespace identifier 
   http://www.loc.gov/mods/v3.
  
  No, that's a collection of elements, not a schema.
  
  
   If you need to identify the specific element 'mods' of the format only, 
   then you need another identifer.
  
  Correct. I'm glad you agree with me.
  
  Given that namespaces do not specify anything to do with structure, you
  thus need a new identifier for EVERY element in a namespace as they
  could be used as the top level tag of ANY schema.
  
  There isn't a widely accepted identifier system for schemas, only schema
  locations.  There are also many methods for defining schemas
  (schematron, relax-ng, DTDs, xml schema) which can all define exactly
  the same format.
  
  
   But if the MODS specification defines that you can refer to any element 
   with an URI fragment identifier, then the right identifier would be 
   http://www.loc.gov/mods/v3#mods
  
  That would be an identifier for the *element*.
  
   The namespace http://www.loc.gov/mods/v3 of the top level element 'mods' 
   does not identify the top level element but the MODS *format* (in any of 
   the versions 3.0-3.4) itself. This format *includes* the top level 
   element 'mods'.
  
  No, it identifies a collection of names.  These names are structured
  according to a schema, which is what we need an identifier for. Beyond
  that, we may also need identifiers for which structure we mean within
  the schema (eg mods vs modsCollection)
  
  
  Rob


Re: [CODE4LIB] One Data Format Identifier (and Registry) to Rule Them All

2009-05-14 Thread Alexander Johannesen
On Thu, May 14, 2009 at 17:35, Rob Sanderson azar...@liverpool.ac.uk wrote:
 For example, the owl:sameAs predicate is used to express that the
 subject and object are the same 'thing'.  Then the application can infer
 that if a owl:sameAs b, and a x y, then b x y.

Yes, but there's a snag; as RDF work only on the URI resource level
(no added semantics to the typification of the URI resource) if
someone does an owl:sameAs between an identifier of a thing and a
locator of a thing (a locator being the resource itself as opposed to
being an identifier; example are you talking about Sun Corp
(http://sun.com/) or are you talking about their website
(http://sun.com/)) you can get a nasty case of integrity rot, and I've
not seen any proposals to address this issue (the RDF world is
essentially assuming modeling from the viewpoint of everything being
true).

I guess Mike don't like RDF *nor* Topic Maps now. :)


Regards,

Alex
-- 
---
 Project Wrangler, SOA, Information Alchemist, UX, RESTafarian, Topic Maps
-- http://shelter.nu/blog/ 


[CODE4LIB] RIRI 2009 Open for Registration

2009-05-14 Thread Paul Pound
Apologies for cross-posting:

Register now for the 2009 Red Island Repository Institute on Prince  
Edward Island - July 20-24.

Reserve your spot at the Fedora-focused Repository Institute on Prince 

Edward Island, one of Canada’s premiere travel destinations known for
 
its sandy beaches, golfing, seafood and iconic red dirt roads. The 1- 
week hands-on workshop will be led by Chris Wilper and Thorny Staples 

(Fedora Commons), Matt Zumwault (MediaShelf) and Mark Leggott  
(Islandora, UPEI). Register soon, as the seats are limited to 20  
participants and they tend to go fast. If you plan on attending you  
should also book a room as soon as you can, as the summer is premiere 

time on the Island.

The Institute is hands-on and is targeted at individuals from  
institutions planning or running a repository program and is intended 

for users with a wide range of experience, from managers to  
programmers. Based on last year's feedback, we have added a number of 

break-out sessions tailored for either managers or developers.  
Attendees will be provided all the information and tools needed to  
implement and maintain a flexible repository program using Fedora.  
Since the Institute is a combination of lecture and hands-on  
experience, we encourage all participants to bring their own laptops. 

This will allow participants to return to their place of work with a  
fully-functional Fedora installation for further development and  
testing. Those participants who are not able to bring a laptop will be 

provided with one to use for the duration of the Institute.

Registration includes meals (except dinners for Tuesday to Thursday), 

special events and all materials.  The workshop agenda and  
registration form are now available at http://vre.upei.ca/riri/.

If you have questions about this year's RIRI, please contact Mark  
Leggott at r...@upei.ca.

Paul Pound
System Programmer/Analyst
Robertson Library 
UPEI
902 566 0353
ppo...@upei.ca


Re: [CODE4LIB] One Data Format Identifier (and Registry) to Rule Them All

2009-05-14 Thread Alexander Johannesen
On Thu, May 14, 2009 at 17:45, Rob Sanderson azar...@liverpool.ac.uk wrote:
 I'll quote Mike (and most common approaches to the problem):
        Don't Do That Then.
 :)

Oh, for sure. :) But these are very subtle things that are hard to
understand, and certainly the long-term implications, so people *will*
do this, and they *will* put rot into the SemWeb chains people create.
It's unavoidable, but I know lots are trying to work out some kind of
solution. Unfortunately, this one is being routed to software
frameworks rather than the RDF core itself. Oh well.


Regards,

Alex
-- 
---
 Project Wrangler, SOA, Information Alchemist, UX, RESTafarian, Topic Maps
-- http://shelter.nu/blog/ 


Re: [CODE4LIB] One Data Format Identifier (and Registry) to Rule Them All

2009-05-14 Thread Eric Lease Morgan
[ /me is creating an email filter/rule against the Code4Lib mailing  
list to automatically delete messages whose subject lines contain One  
Data Format Identifier because he has acquired carpal tunnel syndrome  
after pressing the delete key so often. ]


--
Earache Least Moron


[CODE4LIB] symposium on mass digitization

2009-05-14 Thread Eric Lease Morgan

[Please excuse the cross-postings. --ELM]

The Hesburgh Libraries of the University of Notre Dame is sponsoring a  
mini-symposium on the topic of mass digitization one week from today,  
Thursday, May 21 from 1 - 4:30. You are invited.


The purpose of the symposium it to discuss, learn, and explore the  
possibilities and ramifications of huge amounts of digitized texts in  
libraries. Our speakers include:


  * Maura Marx (Executive Director, Open Knowledge Commons) - Ms.
Marx will speak about the history of mass digitization
initiatives, the Google Books settlement, and why we need to work
towards a viable alternative, and current open plans and
programs. She will describe how the old notions of privacy and
confidentiality are disappearing in the profit driven commercial
environment, where revenues come from better targeting and
personalization. All of this impacts not only how we work today,
but how future generations will access knowledge.

  * Gary Charbonneau (Indiana University) - Mr. Charbonneau will
speak on Google Books and its impact on mass digitization efforts
within the academy. Gary is currently the Systems Librarian at
IU, of which he serves as the Google Digitizing Project Manager
at the Herman B. Wells Library.  Google's innovative search
technologies connect millions of people around the world with
information every day, and its impact on mass digitization
efforts have been extraordinary and in some ways, controversial.

  * Sian Meikle (University of Toronto) - Ms. Meikle will share
many thought-provoking insights into the experiences with the
Open Content Alliance (OCA) which is a collaborative effort of a
group of cultural, technology, non-profit, and governmental
organizations from around the world that helps build a permanent
archive of multilingual digitized texts and multimedia materials.

Everybody is welcome. There is no registration fee or form. For more  
details, please see the website:


  http://www.library.nd.edu/symposium/

--
Eric Lease Morgan
Head, Digital Access and Information Architecture Department
Hesburgh Libraries, University of Notre Dame

(574) 631-8604


[CODE4LIB] openurl.info ?

2009-05-14 Thread Walker, David
It appears that the openurl.info domain name has expired.  I get an error from 
the host:

http://www.openurl.info/registry/docs/mtx/info:ofi/fmt:kev:mtx:ctx

I've been using the registry at OCLC as a reference source for OpenURL.  But 
all of the identifiers and links pointing to openurl.info no longer work.

   http://alcme.oclc.org/openurl/servlet/OAIHandler?verb=ListSets

Is there a different place I should be going now for OpenURL info instead?  Or 
maybe this is just a snafu?

--Dave

==
David Walker
Library Web Services Manager
California State University
http://xerxes.calstate.edu


[CODE4LIB] Review of Daniel Tunkelang's new book: Faceted Search

2009-05-14 Thread Sol Lederman
Hi,

I've written a review of Daniel Tunkelang's (Chief Scientist at Endeca)
soon-to-be-released book on Faceted Search:

http://federatedsearchblog.com/2009/05/14/review-faceted-search/http://www.linkedin.com/redirect?url=http%3A%2F%2Ffederatedsearchblog%2Ecom%2F2009%2F05%2F14%2Freview-faceted-search%2Furlhash=-rhl_t=disc_detail_link

Enjoy.

Sol Lederman
http://FederatedSearchBlog.com


Re: [CODE4LIB] openurl.info ?

2009-05-14 Thread Klein, Michael
openurl.info's domain registration looks up to date...Registered to NISO
through 12-May-2010. Last updated 10-May-2009. I'm going to speculate (with
very little basis) that someone managed to hijack the DNS by pointing the
record's name server entries somewhere other than where they're supposed to
be. Hence the redirection to a fraudulent parking page.

Michael 

-- 
Michael B. Klein
Digital Initiatives Technology Librarian
Boston Public Library
(617) 859-2391
mkl...@bpl.org


 From: Walker, David dwal...@calstate.edu
 Reply-To: Code for Libraries CODE4LIB@LISTSERV.ND.EDU
 CODE4LIB@LISTSERV.ND.EDU
 Date: Thu, 14 May 2009 12:05:03 -0700
 To: Code for Libraries CODE4LIB@LISTSERV.ND.EDU CODE4LIB@LISTSERV.ND.EDU
 Subject: [CODE4LIB] openurl.info ?
 
 It appears that the openurl.info domain name has expired.  I get an error from
 the host:
 
 http://www.openurl.info/registry/docs/mtx/info:ofi/fmt:kev:mtx:ctx
 
 I've been using the registry at OCLC as a reference source for OpenURL.  But
 all of the identifiers and links pointing to openurl.info no longer work.
 
http://alcme.oclc.org/openurl/servlet/OAIHandler?verb=ListSets
 
 Is there a different place I should be going now for OpenURL info instead?  Or
 maybe this is just a snafu?
 
 --Dave
 
 ==
 David Walker
 Library Web Services Manager
 California State University
 http://xerxes.calstate.edu


Re: [CODE4LIB] openurl.info ?

2009-05-14 Thread Sol Lederman
I wouldn't assume that the DNS entry has been hijacked. I very recently had
this same experience with a domain I work on. It got redirected to a parking
page. I thought it had been hijacked. It turned out that the hosting
provider had accidentally changed the IP address associated with the domain
and that that had somehow caused the site to go to the parking page.

Sol

On Thu, May 14, 2009 at 2:32 PM, Klein, Michael mkl...@bpl.org wrote:

 openurl.info's domain registration looks up to date...Registered to NISO
 through 12-May-2010. Last updated 10-May-2009. I'm going to speculate (with
 very little basis) that someone managed to hijack the DNS by pointing the
 record's name server entries somewhere other than where they're supposed to
 be. Hence the redirection to a fraudulent parking page.

 Michael

 --
 Michael B. Klein
 Digital Initiatives Technology Librarian
 Boston Public Library
 (617) 859-2391
 mkl...@bpl.org


  From: Walker, David dwal...@calstate.edu
  Reply-To: Code for Libraries CODE4LIB@LISTSERV.ND.EDU
  CODE4LIB@LISTSERV.ND.EDU
  Date: Thu, 14 May 2009 12:05:03 -0700
  To: Code for Libraries CODE4LIB@LISTSERV.ND.EDU 
 CODE4LIB@LISTSERV.ND.EDU
  Subject: [CODE4LIB] openurl.info ?
 
  It appears that the openurl.info domain name has expired.  I get an
 error from
  the host:
 
  http://www.openurl.info/registry/docs/mtx/info:ofi/fmt:kev:mtx:ctx
 
  I've been using the registry at OCLC as a reference source for OpenURL.
  But
  all of the identifiers and links pointing to openurl.info no longer
 work.
 
 http://alcme.oclc.org/openurl/servlet/OAIHandler?verb=ListSets
 
  Is there a different place I should be going now for OpenURL info
 instead?  Or
  maybe this is just a snafu?
 
  --Dave
 
  ==
  David Walker
  Library Web Services Manager
  California State University
  http://xerxes.calstate.edu



[CODE4LIB] Diebold-o-tron-o-matic IG

2009-05-14 Thread Ross Singer
Hi everybody.

We're probably 6 months (or less) from the voting season in Code4libya
and I want to preemptively counter the catcalls, jeers, the calls for
the Drupal voting module, etc. prior to 4 days before the first vote
opening.

So, if you're interested in participating in this, let me know.  If
you're interested in /leading/ this, /please/ let me know, because I'm
perfectly happy just firing up the Diebold-o-tron-o-matic for another
year, so if you've got a real bone to pick with how things work, stand
and deliver.

Thanks,
-Ross.