[CODE4LIB] Access 2009 Conference
Apologies for cross-posting... The Access 2009 Conference clock is officially ticking down and the conference team is working hard to bring you the best program and social events possible. The dates of the best little library tech conference anywhere are Thursday Oct 1 - Saturday October 3, with the Hackfest taking place September 30. You can get more information and register today at: http://vre.upei.ca/access2009 We are working hard on the program, but have finished the important bits: Cory Doctorow is our keynote (and a participant at the Hackfest) and the social events will be awesome! The session details will be filled in over the coming weeks, but we have a great line-up in store. We would encourage people who want to reserve a spot to register and book you hotels as soon as possible. The summer and fall are beautiful on PEI and full of incredible events (such as Fall Flavours - http://www.fallflavours.ca/) so the hotels book up quickly. We have reserved a bank of rooms in the hotels closest to the conference venue but you still need to book soon, as the room blocks will expire in August for most. You might even want to book an extra day or 2 on either side of the Conference to make the most of your stay - we promise you won't regret it... See you in Charlottetown! Paul Pound Systems Programmer/Analyst Robertson Library UPEI 902 566 0353 ppo...@upei.ca
Re: [CODE4LIB] One Data Format Identifier (and Registry) to Rule Them All
Alexander Johannesen writes: Anyway, I'm suspecting I don't see what the problem seems to be. To create the best identifier for things seems a bit of a strange notion to me, but is this based on that there is only (or rather, that you're trying to create) one identifier for any one thing? Yes, this is exactly it. RDF things that each concept should have exactly one identifier; Topic Maps says its fine to have multiple identifiers. That seems to be 99% of the conceptual difference between them. My position: it seems obvious that one is the CORRECT number of identifiers for a thing to have. But since we live in a formal world, the Topics Map approach may be more practical. In other words, I might end up _advocating_ Topic Maps, but don't expect me to _like_ it :-) _/|____ /o ) \/ Mike Taylorm...@indexdata.comhttp://www.miketaylor.org.uk )_v__/\ I think it's too consistently wrong not to be fixable -- Phil Baldwin.
Re: [CODE4LIB] One Data Format Identifier (and Registry) to Rule Them All
RDF is fine with one 'thing' having multiple identifiers, it just hands the problem up a level to the application to deal with. For example, the owl:sameAs predicate is used to express that the subject and object are the same 'thing'. Then the application can infer that if a owl:sameAs b, and a x y, then b x y. Rob On Thu, 2009-05-14 at 13:00 +0100, Mike Taylor wrote: Alexander Johannesen writes: Anyway, I'm suspecting I don't see what the problem seems to be. To create the best identifier for things seems a bit of a strange notion to me, but is this based on that there is only (or rather, that you're trying to create) one identifier for any one thing? Yes, this is exactly it. RDF things that each concept should have exactly one identifier; Topic Maps says its fine to have multiple identifiers. That seems to be 99% of the conceptual difference between them. My position: it seems obvious that one is the CORRECT number of identifiers for a thing to have. But since we live in a formal world, the Topics Map approach may be more practical. In other words, I might end up _advocating_ Topic Maps, but don't expect me to _like_ it :-) _/|_ ___ /o ) \/ Mike Taylorm...@indexdata.comhttp://www.miketaylor.org.uk )_v__/\ I think it's too consistently wrong not to be fixable -- Phil Baldwin.
Re: [CODE4LIB] Formats and its identifiers
Rob is correct on all points. Namespace URIs can, in some cases, be overloaded to function as schema identifiers. But they absolutely can't be used blindly in this way for arbitrary formats -- there are all kinds of potential gotchas. That being so, I think it is wiser and more explicit _always_ to define a separate identitifier for a format. _/|____ /o ) \/ Mike Taylorm...@indexdata.comhttp://www.miketaylor.org.uk )_v__/\ ... currently trading under the name Gently for reasons which it would be otiose, for the moment, to rehearse -- Douglas Adams, Dirk Gently Rob Sanderson writes: On Mon, 2009-05-11 at 14:53 +0100, Jakob Voss wrote: A format should be described with a schema (XML Schema, OWL etc.) or at least a standard. Mostly this schema already has a namespace or similar identifier that can be used for the whole format. This is unfortunately not the case. It is mostly the case - but people like to misinterpret schemas and tailor them to their needs. You're advocating an approach that mostly works, as opposed to one that works in all cases? For instance MODS Version 3 (currently 3.0, 3.1, 3.2, 3.4) has the XML Namespace http://www.loc.gov/mods/v3 so this is the best identifier to identify MODS. And this is a perfect example of why this is not the case. The same mods schema (let alone namespace) defines TWO formats, mods and modsCollection. That's your interpretation. According to the schema, the MODS format *is* either a single mods-element or a modsCollection-element. According to the __schema__ yes. Not according to the namespace. The namespace is a collection of names only and says precisely nothing about structure. And, yes, given no definition of format, my interpretation is that the mods schema defines two formats, as it defines two top level elements with different contents (eg one may contain the other). This is typically how people would define format in this context, I would say. This is, of course, tangential to the fact that you cannot use the __XML Namespace__ as an identifier for the format, no matter how you define it. That's exactely what you can refer to with the namespace identifier http://www.loc.gov/mods/v3. No, that's a collection of elements, not a schema. If you need to identify the specific element 'mods' of the format only, then you need another identifer. Correct. I'm glad you agree with me. Given that namespaces do not specify anything to do with structure, you thus need a new identifier for EVERY element in a namespace as they could be used as the top level tag of ANY schema. There isn't a widely accepted identifier system for schemas, only schema locations. There are also many methods for defining schemas (schematron, relax-ng, DTDs, xml schema) which can all define exactly the same format. But if the MODS specification defines that you can refer to any element with an URI fragment identifier, then the right identifier would be http://www.loc.gov/mods/v3#mods That would be an identifier for the *element*. The namespace http://www.loc.gov/mods/v3 of the top level element 'mods' does not identify the top level element but the MODS *format* (in any of the versions 3.0-3.4) itself. This format *includes* the top level element 'mods'. No, it identifies a collection of names. These names are structured according to a schema, which is what we need an identifier for. Beyond that, we may also need identifiers for which structure we mean within the schema (eg mods vs modsCollection) Rob
Re: [CODE4LIB] One Data Format Identifier (and Registry) to Rule Them All
On Thu, May 14, 2009 at 17:35, Rob Sanderson azar...@liverpool.ac.uk wrote: For example, the owl:sameAs predicate is used to express that the subject and object are the same 'thing'. Then the application can infer that if a owl:sameAs b, and a x y, then b x y. Yes, but there's a snag; as RDF work only on the URI resource level (no added semantics to the typification of the URI resource) if someone does an owl:sameAs between an identifier of a thing and a locator of a thing (a locator being the resource itself as opposed to being an identifier; example are you talking about Sun Corp (http://sun.com/) or are you talking about their website (http://sun.com/)) you can get a nasty case of integrity rot, and I've not seen any proposals to address this issue (the RDF world is essentially assuming modeling from the viewpoint of everything being true). I guess Mike don't like RDF *nor* Topic Maps now. :) Regards, Alex -- --- Project Wrangler, SOA, Information Alchemist, UX, RESTafarian, Topic Maps -- http://shelter.nu/blog/
[CODE4LIB] RIRI 2009 Open for Registration
Apologies for cross-posting: Register now for the 2009 Red Island Repository Institute on Prince Edward Island - July 20-24. Reserve your spot at the Fedora-focused Repository Institute on Prince Edward Island, one of Canada’s premiere travel destinations known for its sandy beaches, golfing, seafood and iconic red dirt roads. The 1- week hands-on workshop will be led by Chris Wilper and Thorny Staples (Fedora Commons), Matt Zumwault (MediaShelf) and Mark Leggott (Islandora, UPEI). Register soon, as the seats are limited to 20 participants and they tend to go fast. If you plan on attending you should also book a room as soon as you can, as the summer is premiere time on the Island. The Institute is hands-on and is targeted at individuals from institutions planning or running a repository program and is intended for users with a wide range of experience, from managers to programmers. Based on last year's feedback, we have added a number of break-out sessions tailored for either managers or developers. Attendees will be provided all the information and tools needed to implement and maintain a flexible repository program using Fedora. Since the Institute is a combination of lecture and hands-on experience, we encourage all participants to bring their own laptops. This will allow participants to return to their place of work with a fully-functional Fedora installation for further development and testing. Those participants who are not able to bring a laptop will be provided with one to use for the duration of the Institute. Registration includes meals (except dinners for Tuesday to Thursday), special events and all materials. The workshop agenda and registration form are now available at http://vre.upei.ca/riri/. If you have questions about this year's RIRI, please contact Mark Leggott at r...@upei.ca. Paul Pound System Programmer/Analyst Robertson Library UPEI 902 566 0353 ppo...@upei.ca
Re: [CODE4LIB] One Data Format Identifier (and Registry) to Rule Them All
On Thu, May 14, 2009 at 17:45, Rob Sanderson azar...@liverpool.ac.uk wrote: I'll quote Mike (and most common approaches to the problem): Don't Do That Then. :) Oh, for sure. :) But these are very subtle things that are hard to understand, and certainly the long-term implications, so people *will* do this, and they *will* put rot into the SemWeb chains people create. It's unavoidable, but I know lots are trying to work out some kind of solution. Unfortunately, this one is being routed to software frameworks rather than the RDF core itself. Oh well. Regards, Alex -- --- Project Wrangler, SOA, Information Alchemist, UX, RESTafarian, Topic Maps -- http://shelter.nu/blog/
Re: [CODE4LIB] One Data Format Identifier (and Registry) to Rule Them All
[ /me is creating an email filter/rule against the Code4Lib mailing list to automatically delete messages whose subject lines contain One Data Format Identifier because he has acquired carpal tunnel syndrome after pressing the delete key so often. ] -- Earache Least Moron
[CODE4LIB] symposium on mass digitization
[Please excuse the cross-postings. --ELM] The Hesburgh Libraries of the University of Notre Dame is sponsoring a mini-symposium on the topic of mass digitization one week from today, Thursday, May 21 from 1 - 4:30. You are invited. The purpose of the symposium it to discuss, learn, and explore the possibilities and ramifications of huge amounts of digitized texts in libraries. Our speakers include: * Maura Marx (Executive Director, Open Knowledge Commons) - Ms. Marx will speak about the history of mass digitization initiatives, the Google Books settlement, and why we need to work towards a viable alternative, and current open plans and programs. She will describe how the old notions of privacy and confidentiality are disappearing in the profit driven commercial environment, where revenues come from better targeting and personalization. All of this impacts not only how we work today, but how future generations will access knowledge. * Gary Charbonneau (Indiana University) - Mr. Charbonneau will speak on Google Books and its impact on mass digitization efforts within the academy. Gary is currently the Systems Librarian at IU, of which he serves as the Google Digitizing Project Manager at the Herman B. Wells Library. Google's innovative search technologies connect millions of people around the world with information every day, and its impact on mass digitization efforts have been extraordinary and in some ways, controversial. * Sian Meikle (University of Toronto) - Ms. Meikle will share many thought-provoking insights into the experiences with the Open Content Alliance (OCA) which is a collaborative effort of a group of cultural, technology, non-profit, and governmental organizations from around the world that helps build a permanent archive of multilingual digitized texts and multimedia materials. Everybody is welcome. There is no registration fee or form. For more details, please see the website: http://www.library.nd.edu/symposium/ -- Eric Lease Morgan Head, Digital Access and Information Architecture Department Hesburgh Libraries, University of Notre Dame (574) 631-8604
[CODE4LIB] openurl.info ?
It appears that the openurl.info domain name has expired. I get an error from the host: http://www.openurl.info/registry/docs/mtx/info:ofi/fmt:kev:mtx:ctx I've been using the registry at OCLC as a reference source for OpenURL. But all of the identifiers and links pointing to openurl.info no longer work. http://alcme.oclc.org/openurl/servlet/OAIHandler?verb=ListSets Is there a different place I should be going now for OpenURL info instead? Or maybe this is just a snafu? --Dave == David Walker Library Web Services Manager California State University http://xerxes.calstate.edu
[CODE4LIB] Review of Daniel Tunkelang's new book: Faceted Search
Hi, I've written a review of Daniel Tunkelang's (Chief Scientist at Endeca) soon-to-be-released book on Faceted Search: http://federatedsearchblog.com/2009/05/14/review-faceted-search/http://www.linkedin.com/redirect?url=http%3A%2F%2Ffederatedsearchblog%2Ecom%2F2009%2F05%2F14%2Freview-faceted-search%2Furlhash=-rhl_t=disc_detail_link Enjoy. Sol Lederman http://FederatedSearchBlog.com
Re: [CODE4LIB] openurl.info ?
openurl.info's domain registration looks up to date...Registered to NISO through 12-May-2010. Last updated 10-May-2009. I'm going to speculate (with very little basis) that someone managed to hijack the DNS by pointing the record's name server entries somewhere other than where they're supposed to be. Hence the redirection to a fraudulent parking page. Michael -- Michael B. Klein Digital Initiatives Technology Librarian Boston Public Library (617) 859-2391 mkl...@bpl.org From: Walker, David dwal...@calstate.edu Reply-To: Code for Libraries CODE4LIB@LISTSERV.ND.EDU CODE4LIB@LISTSERV.ND.EDU Date: Thu, 14 May 2009 12:05:03 -0700 To: Code for Libraries CODE4LIB@LISTSERV.ND.EDU CODE4LIB@LISTSERV.ND.EDU Subject: [CODE4LIB] openurl.info ? It appears that the openurl.info domain name has expired. I get an error from the host: http://www.openurl.info/registry/docs/mtx/info:ofi/fmt:kev:mtx:ctx I've been using the registry at OCLC as a reference source for OpenURL. But all of the identifiers and links pointing to openurl.info no longer work. http://alcme.oclc.org/openurl/servlet/OAIHandler?verb=ListSets Is there a different place I should be going now for OpenURL info instead? Or maybe this is just a snafu? --Dave == David Walker Library Web Services Manager California State University http://xerxes.calstate.edu
Re: [CODE4LIB] openurl.info ?
I wouldn't assume that the DNS entry has been hijacked. I very recently had this same experience with a domain I work on. It got redirected to a parking page. I thought it had been hijacked. It turned out that the hosting provider had accidentally changed the IP address associated with the domain and that that had somehow caused the site to go to the parking page. Sol On Thu, May 14, 2009 at 2:32 PM, Klein, Michael mkl...@bpl.org wrote: openurl.info's domain registration looks up to date...Registered to NISO through 12-May-2010. Last updated 10-May-2009. I'm going to speculate (with very little basis) that someone managed to hijack the DNS by pointing the record's name server entries somewhere other than where they're supposed to be. Hence the redirection to a fraudulent parking page. Michael -- Michael B. Klein Digital Initiatives Technology Librarian Boston Public Library (617) 859-2391 mkl...@bpl.org From: Walker, David dwal...@calstate.edu Reply-To: Code for Libraries CODE4LIB@LISTSERV.ND.EDU CODE4LIB@LISTSERV.ND.EDU Date: Thu, 14 May 2009 12:05:03 -0700 To: Code for Libraries CODE4LIB@LISTSERV.ND.EDU CODE4LIB@LISTSERV.ND.EDU Subject: [CODE4LIB] openurl.info ? It appears that the openurl.info domain name has expired. I get an error from the host: http://www.openurl.info/registry/docs/mtx/info:ofi/fmt:kev:mtx:ctx I've been using the registry at OCLC as a reference source for OpenURL. But all of the identifiers and links pointing to openurl.info no longer work. http://alcme.oclc.org/openurl/servlet/OAIHandler?verb=ListSets Is there a different place I should be going now for OpenURL info instead? Or maybe this is just a snafu? --Dave == David Walker Library Web Services Manager California State University http://xerxes.calstate.edu
[CODE4LIB] Diebold-o-tron-o-matic IG
Hi everybody. We're probably 6 months (or less) from the voting season in Code4libya and I want to preemptively counter the catcalls, jeers, the calls for the Drupal voting module, etc. prior to 4 days before the first vote opening. So, if you're interested in participating in this, let me know. If you're interested in /leading/ this, /please/ let me know, because I'm perfectly happy just firing up the Diebold-o-tron-o-matic for another year, so if you've got a real bone to pick with how things work, stand and deliver. Thanks, -Ross.