Re: [CODE4LIB] RFC 5005 ATOM extension and OAI
Hi Clay, I completely agree with everything you just wrote, especially about Atom + APP being more than just a technology for blogs. APP is a great lightweight alternative to WebDAV, and promising for all sorts of data transfer. The fact that it has developer groundswell is a huge plus. During my Princeton days Kevin Clarke and I briefly talked about what a METS + APP metadata editing application could do. (I can't remember the answer, but I bet it would be snazzy.) On the one side you are right: Atom + APP is becoming popular and the standards are good, so digital libraries should get into it. On the other side I was just reminded to the ECDL2006-paper Repository Replication Using NNTP and SMTP: You can almost use any protocol (HTTP, OAI, ATOM APP, WebDAV, NNTP...) for most of digital libraries' use cases - but the best standard without approriate tools and support is pretty worthless. I came to this realization out of frustration that most OAI toolkits (at the time, ca. 2005) didn't support that functionality well -- or at all. I don't know if that's still the case. However, the need to delete records is a reality for most projects, and OAI has somewhat awkwardly made us rethink how to delete a record in repositories and the like, both on the service and data provider end. You almost have to build your entire system around handling deleted records just for OAI exposure. In reality it seems like you just end up masquerading or re-representing its outward visibility on our local systems, which gets onerous. I guess the difference is that the growing number of Atom developers are heeding the requirement for deletions, whereas the few existing OAI toolkit developers have deemed that functionality as optional. Most repositories do not even track deletions so they cannot syndicate them. If OAI-delete was mandatory, maybe OAI-PMH had not been used that much? OAI did a good job in promoting and documenting OAI-PMH but deletions were always treated as an orphan - I would not blame the standard but the lacking implementation. Also ATOM and RFC 5005 is not much better than other solutions - but its much more likely to get it implemented in Weblog and other software then OAI which is not that known outside the library world. Greetings, Jakob P.S: Maybe we would all be happy with Z39.50 if we had that wonderful Indexdata tools right from the beginning - instead there were only closed source specifications and different closed source partial implementations. A standard without easy to use open source implementations is condemned to be violated and die. -- Jakob Voß [EMAIL PROTECTED], skype: nichtich Verbundzentrale des GBV (VZG) / Common Library Network Platz der Goettinger Sieben 1, 37073 Göttingen, Germany +49 (0)551 39-10242, http://www.gbv.de
Re: [CODE4LIB] RFC 5005 ATOM extension and OAI
Peter wrote: Also, re: blog mirroring, I highly recommend the current discussions floating aroung the blogosphere regarding distributed source control (Git, Mercurial, etc.). It's a fundamental paradigm shift from centralized control to distributed control that points the way toward the future of libraries as they (we) become less and less the gatekeepers for the stuff be it digital or physical and more and more the facilitators of the bidirectional replication that assures ubiquitous access and long-term preservation. The library becomes (actually it has already happended) simply a node on a network of trust and should act accordingly. See the thoroughly entertaining/thought-provoking Google tech talk by Linus Torvalds on Git: http://www.youtube.com/watch?v=4XpnKHJAok8 Thanks for pointing to this interesting discussion. This goes even further then the current paradigm shift from the old model (author - publisher - distributor - reader) to a world of user-generated content and collaboration! I was glad if we finally got to model and archive Weblogs and Wikis - modelling and archiving the whole process of content copying, changing and remixing and republication is far beyong libraries capabilities! Greetings, Jakob -- Jakob Voß [EMAIL PROTECTED], skype: nichtich Verbundzentrale des GBV (VZG) / Common Library Network Platz der Goettinger Sieben 1, 37073 Göttingen, Germany +49 (0)551 39-10242, http://www.gbv.de
Re: [CODE4LIB] OpenContent SRU search of OAISter, weirdness?
On Oct 24, 2007, at 6:04 PM, Jonathan Rochkind wrote: I have some behavior I can't explain. There's this article that is in OAISter, called Resurrection and Appropriation: Reputational Trajectories, Memory Work, and the Political Use of Historical Figures by Robert S. Jensen. I do an SRU search with query: dc.title = Resurrection and Appropriation: Reputational Trajectories, Memory Work, and the Political Use of Historical Figures And I find the record, one hit. Good. You too could try, and see what the DC returned looks like. It does have a dc:creator of Robert S. Jensen. But I try a search that includes the author. dc.title = Resurrection and Appropriation: Reputational Trajectories, Memory Work, and the Political Use of Historical Figures and dc.creator = Jensen Yes, this looks weird to me too. I tried a number of query variations and could not get a hit. -- Eric Lease Morgan Head, Digital Access and Information Architecture Department University Libraries of Notre Dame (574) 631-8604
Re: [CODE4LIB] OpenContent SRU search of OAISter, weirdness?
It doesn't actually work with OAIster's own SRU interface, either. A query for the title works, but adding the author's name fails. OAIster's SRU implementation is very strange, though, so I could very well be doing something wrong. -Ross. On 10/25/07, Eric Lease Morgan [EMAIL PROTECTED] wrote: On Oct 24, 2007, at 6:04 PM, Jonathan Rochkind wrote: I have some behavior I can't explain. There's this article that is in OAISter, called Resurrection and Appropriation: Reputational Trajectories, Memory Work, and the Political Use of Historical Figures by Robert S. Jensen. I do an SRU search with query: dc.title = Resurrection and Appropriation: Reputational Trajectories, Memory Work, and the Political Use of Historical Figures And I find the record, one hit. Good. You too could try, and see what the DC returned looks like. It does have a dc:creator of Robert S. Jensen. But I try a search that includes the author. dc.title = Resurrection and Appropriation: Reputational Trajectories, Memory Work, and the Political Use of Historical Figures and dc.creator = Jensen Yes, this looks weird to me too. I tried a number of query variations and could not get a hit. -- Eric Lease Morgan Head, Digital Access and Information Architecture Department University Libraries of Notre Dame (574) 631-8604
Re: [CODE4LIB] OpenContent SRU search of OAISter, weirdness?
You're not getting any hits because the name is not Jensen, it's Jansen. I'm not sure where Jensen came from but the OAIster indexes here have Jansen. josh On 10/24/07 6:04 PM, Jonathan Rochkind [EMAIL PROTECTED] wrote: I'm messing with SRU search of http://indexdata.dk/opencontent/oaister I have some behavior I can't explain. There's this article that is in OAISter, called Resurrection and Appropriation: Reputational Trajectories, Memory Work, and the Political Use of Historical Figures by Robert S. Jensen. I do an SRU search with query: dc.title = Resurrection and Appropriation: Reputational Trajectories, Memory Work, and the Political Use of Historical Figures And I find the record, one hit. Good. You too could try, and see what the DC returned looks like. It does have a dc:creator of Robert S. Jensen. But I try a search that includes the author. dc.title = Resurrection and Appropriation: Reputational Trajectories, Memory Work, and the Political Use of Historical Figures and dc.creator = Jensen 0 hits. and cql.serverChoice = Jensen= 0 hits Same using full name Robert S. Jensen (just as it appears in the record), with cql.serverChoice or dc.creator. Is this just a bad index, or is something else going on, or what? As I try sample searches on title and author, I keep running into false negatives for things that ought to be in the OAISter index. Sometimes I can figure out why (title not quite right; title has curly quotes, index does not, etc.), but in this case I have no idea. But the net result is it's hard to actually find your known item this way, via an automated search on known item metadata. Jonathan -- Jonathan Rochkind Digital Services Software Engineer The Sheridan Libraries Johns Hopkins University 410.516.8886 rochkind (at) jhu.edu
Re: [CODE4LIB] OpenContent SRU search of OAISter, weirdness?
That was a typo in my problem report, I'm afraid. I was actually searching Jansen, and that still exhibits the problems I mentioned. I've also moved this conversation to indexdata's own list for this service, at http://lists.indexdata.dk/cgi-bin/mailman/listinfo/oclist (thanks to Jason Ronallo for bringing that list to my attention). Jonathan Joshua Santelli wrote: You're not getting any hits because the name is not Jensen, it's Jansen. I'm not sure where Jensen came from but the OAIster indexes here have Jansen. josh On 10/24/07 6:04 PM, Jonathan Rochkind [EMAIL PROTECTED] wrote: I'm messing with SRU search of http://indexdata.dk/opencontent/oaister I have some behavior I can't explain. There's this article that is in OAISter, called Resurrection and Appropriation: Reputational Trajectories, Memory Work, and the Political Use of Historical Figures by Robert S. Jensen. I do an SRU search with query: dc.title = Resurrection and Appropriation: Reputational Trajectories, Memory Work, and the Political Use of Historical Figures And I find the record, one hit. Good. You too could try, and see what the DC returned looks like. It does have a dc:creator of Robert S. Jensen. But I try a search that includes the author. dc.title = Resurrection and Appropriation: Reputational Trajectories, Memory Work, and the Political Use of Historical Figures and dc.creator = Jensen 0 hits. and cql.serverChoice = Jensen= 0 hits Same using full name Robert S. Jensen (just as it appears in the record), with cql.serverChoice or dc.creator. Is this just a bad index, or is something else going on, or what? As I try sample searches on title and author, I keep running into false negatives for things that ought to be in the OAISter index. Sometimes I can figure out why (title not quite right; title has curly quotes, index does not, etc.), but in this case I have no idea. But the net result is it's hard to actually find your known item this way, via an automated search on known item metadata. Jonathan -- Jonathan Rochkind Digital Services Software Engineer The Sheridan Libraries Johns Hopkins University 410.516.8886 rochkind (at) jhu.edu -- Jonathan Rochkind Digital Services Software Engineer The Sheridan Libraries Johns Hopkins University 410.516.8886 rochkind (at) jhu.edu
[CODE4LIB] Distributed Models the Library (was: Re: [CODE4LIB] RFC 5005 ATOM extension and OAI)
Hi Jakob- Yes, I think you are correct that it is a bit much to think that a distributed archiving model is a bit much for libraries to even consider now, but I do think there are useful insights to be gained here. As it stands now, linux developers using Git can carry around the entire change history of the linux kernel (well, I think they just included the 2.6 kernel when they moved to Git) on their laptop, make changes, create patches, etc and then make that available to others. Well, undoubtedly change history is is a bit much for the library to think about, by why not, for instance, and entire library catalog? If I could check out the library catalog onto my computer use whatever tools I wished to search, organize, annotate, etc., then perhaps mix-in data (say holdings data from other that are near me) OR even create the sort of relationships between records that the Open Library folks are talking about (http://www.hyperorg.com/blogger/mtarchive/berkman_lunch_aaron_swartz_on.html) then share that added data, we have quite a powerful distributed development model. It may seem a bit far-fetched, but I think that some of the pieces (or at least a better understanding of how this might all work) are beginning to take shape. -Peter On Thu, 25 Oct 2007, Jakob Voss wrote: Peter wrote: Also, re: blog mirroring, I highly recommend the current discussions floating aroung the blogosphere regarding distributed source control (Git, Mercurial, etc.). It's a fundamental paradigm shift from centralized control to distributed control that points the way toward the future of libraries as they (we) become less and less the gatekeepers for the stuff be it digital or physical and more and more the facilitators of the bidirectional replication that assures ubiquitous access and long-term preservation. The library becomes (actually it has already happended) simply a node on a network of trust and should act accordingly. See the thoroughly entertaining/thought-provoking Google tech talk by Linus Torvalds on Git: http://www.youtube.com/watch?v=4XpnKHJAok8 Thanks for pointing to this interesting discussion. This goes even further then the current paradigm shift from the old model (author - publisher - distributor - reader) to a world of user-generated content and collaboration! I was glad if we finally got to model and archive Weblogs and Wikis - modelling and archiving the whole process of content copying, changing and remixing and republication is far beyong libraries capabilities! Greetings, Jakob -- Jakob Voß [EMAIL PROTECTED], skype: nichtich Verbundzentrale des GBV (VZG) / Common Library Network Platz der Goettinger Sieben 1, 37073 Göttingen, Germany +49 (0)551 39-10242, http://www.gbv.de
Re: [CODE4LIB] OpenContent SRU search of OAISter, weirdness?
Dumb question, no experience with the syntax, but should there be a wildcard -or- use of something other than equals (sorry, if I'm way off base, but most query syntax I use requires like or wildcarding). -t On Thu, 25 Oct 2007, Jonathan Rochkind wrote: That was a typo in my problem report, I'm afraid. I was actually searching Jansen, and that still exhibits the problems I mentioned. I've also moved this conversation to indexdata's own list for this service, at http://lists.indexdata.dk/cgi-bin/mailman/listinfo/oclist (thanks to Jason Ronallo for bringing that list to my attention). Jonathan Joshua Santelli wrote: You're not getting any hits because the name is not Jensen, it's Jansen. I'm not sure where Jensen came from but the OAIster indexes here have Jansen. josh On 10/24/07 6:04 PM, Jonathan Rochkind [EMAIL PROTECTED] wrote: I'm messing with SRU search of http://indexdata.dk/opencontent/oaister I have some behavior I can't explain. There's this article that is in OAISter, called Resurrection and Appropriation: Reputational Trajectories, Memory Work, and the Political Use of Historical Figures by Robert S. Jensen. I do an SRU search with query: dc.title = Resurrection and Appropriation: Reputational Trajectories, Memory Work, and the Political Use of Historical Figures And I find the record, one hit. Good. You too could try, and see what the DC returned looks like. It does have a dc:creator of Robert S. Jensen. But I try a search that includes the author. dc.title = Resurrection and Appropriation: Reputational Trajectories, Memory Work, and the Political Use of Historical Figures and dc.creator = Jensen 0 hits. and cql.serverChoice = Jensen= 0 hits Same using full name Robert S. Jensen (just as it appears in the record), with cql.serverChoice or dc.creator. Is this just a bad index, or is something else going on, or what? As I try sample searches on title and author, I keep running into false negatives for things that ought to be in the OAISter index. Sometimes I can figure out why (title not quite right; title has curly quotes, index does not, etc.), but in this case I have no idea. But the net result is it's hard to actually find your known item this way, via an automated search on known item metadata. Jonathan -- Jonathan Rochkind Digital Services Software Engineer The Sheridan Libraries Johns Hopkins University 410.516.8886 rochkind (at) jhu.edu -- Jonathan Rochkind Digital Services Software Engineer The Sheridan Libraries Johns Hopkins University 410.516.8886 rochkind (at) jhu.edu
[CODE4LIB] Distributed Models the Library (was: Re: [CODE4LIB] RFC 5005 ATOM extension and OAI)
not, for instance, and entire library catalog? If I could check out the library catalog onto my computer use whatever tools I wished to search, Peter, You might be interested in Art Rhyno's experiment. Here's Jon Udell's summary: Art Rhyno’s science project Art Rhyno’s title is Systems Librarian but he should consider adding Mad Scientist to his business card because his is full of wild and crazy and — to me, at least — brilliant ideas. Last year, when I was a judge for the Talis “Mashing up the Library” competion, one of my favorite entries was this one from Art. The project mirrors a library catalog to the desktop and integrates it with desktop search. The searcher in this case is Google Desktop, but could be another, and the integration is accomplished by exposing the catalog as a set of Web Folders, which Art correctly describes as “Microsoft’s in-built and oft-overlooked WebDAV option.” http://blog.jonudell.net/2007/03/16/art-rhynos-science-project/ Jason -- Jason Stirnaman OME/Biomedical Digital Projects Librarian A.R. Dykes Library The University of Kansas Medical Center Kansas City, Kansas Work: 913-588-7319 Email: [EMAIL PROTECTED] On 10/25/2007 at 10:47 AM, in message [EMAIL PROTECTED], pkeane [EMAIL PROTECTED] wrote: Hi Jakob- Yes, I think you are correct that it is a bit much to think that a distributed archiving model is a bit much for libraries to even consider now, but I do think there are useful insights to be gained here. As it stands now, linux developers using Git can carry around the entire change history of the linux kernel (well, I think they just included the 2.6 kernel when they moved to Git) on their laptop, make changes, create patches, etc and then make that available to others. Well, undoubtedly change history is is a bit much for the library to think about, by why not, for instance, and entire library catalog? If I could check out the library catalog onto my computer use whatever tools I wished to search, organize, annotate, etc., then perhaps mix-in data (say holdings data from other that are near me) OR even create the sort of relationships between records that the Open Library folks are talking about (http://www.hyperorg.com/blogger/mtarchive/berkman_lunch_aaron_swartz_on.htm l) then share that added data, we have quite a powerful distributed development model. It may seem a bit far-fetched, but I think that some of the pieces (or at least a better understanding of how this might all work) are beginning to take shape. -Peter On Thu, 25 Oct 2007, Jakob Voss wrote: Peter wrote: Also, re: blog mirroring, I highly recommend the current discussions floating aroung the blogosphere regarding distributed source control (Git, Mercurial, etc.). It's a fundamental paradigm shift from centralized control to distributed control that points the way toward the future of libraries as they (we) become less and less the gatekeepers for the stuff be it digital or physical and more and more the facilitators of the bidirectional replication that assures ubiquitous access and long-term preservation. The library becomes (actually it has already happended) simply a node on a network of trust and should act accordingly. See the thoroughly entertaining/thought-provoking Google tech talk by Linus Torvalds on Git: http://www.youtube.com/watch?v=4XpnKHJAok8 Thanks for pointing to this interesting discussion. This goes even further then the current paradigm shift from the old model (author - publisher - distributor - reader) to a world of user-generated content and collaboration! I was glad if we finally got to model and archive Weblogs and Wikis - modelling and archiving the whole process of content copying, changing and remixing and republication is far beyong libraries capabilities! Greetings, Jakob -- Jakob Voß [EMAIL PROTECTED], skype: nichtich Verbundzentrale des GBV (VZG) / Common Library Network Platz der Goettinger Sieben 1, 37073 Göttingen, Germany +49 (0)551 39-10242, http://www.gbv.de
[CODE4LIB] OpenURL Referrer for IE
OCLC's OpenURL Referrer is now available for Internet Explorer! Previously available only for Firefox, this popular browser extension inserts OpenURLs into Google Scholar and Google News Archive search results. It also detects and makes links out of web COinS, such as those found in Wikipedia and Worldcat.org. The extension can be downloaded for free at http://openly.oclc.org/openurlref/ OpenURL Referrer uses your institution's link resolver settings from the OCLC WorldCat Registry, so there is no need to manually configure the extension. Institutions can register their resolver in the OCLC WorldCat Registry by visiting http://worldcat.org/registry/ institutions. All institutions can register for free, even if they are not OCLC member libraries. The IE version leans much more heavily on the Worldcat Resolver Registry than the Firefox version does. The reason for this is that IE does not have a nice XUL-based way to make user interfaces, so we instead rely on the Registry to do baseurl management. I hope this proves to be useful. Eric Hellman, Director OCLC Openly Informatics Division [EMAIL PROTECTED] 2 Broad St., Suite 208 tel 1-973-509-7800 Bloomfield, NJ 07003 http://openly.oclc.org/
Re: [CODE4LIB] Distributed Models the Library (was: Re: [CODE4LIB] RFC 5005 ATOM extension and OAI)
Very interesting! I will check it out -Peter On Thu, 25 Oct 2007, Jason Stirnaman wrote: not, for instance, and entire library catalog? If I could check out the library catalog onto my computer use whatever tools I wished to search, Peter, You might be interested in Art Rhyno's experiment. Here's Jon Udell's summary: Art Rhyno?s science project Art Rhyno?s title is Systems Librarian but he should consider adding Mad Scientist to his business card because his is full of wild and crazy and ? to me, at least ? brilliant ideas. Last year, when I was a judge for the Talis ?Mashing up the Library? competion, one of my favorite entries was this one from Art. The project mirrors a library catalog to the desktop and integrates it with desktop search. The searcher in this case is Google Desktop, but could be another, and the integration is accomplished by exposing the catalog as a set of Web Folders, which Art correctly describes as ?Microsoft?s in-built and oft-overlooked WebDAV option.? http://blog.jonudell.net/2007/03/16/art-rhynos-science-project/ Jason -- Jason Stirnaman OME/Biomedical Digital Projects Librarian A.R. Dykes Library The University of Kansas Medical Center Kansas City, Kansas Work: 913-588-7319 Email: [EMAIL PROTECTED] On 10/25/2007 at 10:47 AM, in message [EMAIL PROTECTED], pkeane [EMAIL PROTECTED] wrote: Hi Jakob- Yes, I think you are correct that it is a bit much to think that a distributed archiving model is a bit much for libraries to even consider now, but I do think there are useful insights to be gained here. As it stands now, linux developers using Git can carry around the entire change history of the linux kernel (well, I think they just included the 2.6 kernel when they moved to Git) on their laptop, make changes, create patches, etc and then make that available to others. Well, undoubtedly change history is is a bit much for the library to think about, by why not, for instance, and entire library catalog? If I could check out the library catalog onto my computer use whatever tools I wished to search, organize, annotate, etc., then perhaps mix-in data (say holdings data from other that are near me) OR even create the sort of relationships between records that the Open Library folks are talking about (http://www.hyperorg.com/blogger/mtarchive/berkman_lunch_aaron_swartz_on.htm l) then share that added data, we have quite a powerful distributed development model. It may seem a bit far-fetched, but I think that some of the pieces (or at least a better understanding of how this might all work) are beginning to take shape. -Peter On Thu, 25 Oct 2007, Jakob Voss wrote: Peter wrote: Also, re: blog mirroring, I highly recommend the current discussions floating aroung the blogosphere regarding distributed source control (Git, Mercurial, etc.). It's a fundamental paradigm shift from centralized control to distributed control that points the way toward the future of libraries as they (we) become less and less the gatekeepers for the stuff be it digital or physical and more and more the facilitators of the bidirectional replication that assures ubiquitous access and long-term preservation. The library becomes (actually it has already happended) simply a node on a network of trust and should act accordingly. See the thoroughly entertaining/thought-provoking Google tech talk by Linus Torvalds on Git: http://www.youtube.com/watch?v=4XpnKHJAok8 Thanks for pointing to this interesting discussion. This goes even further then the current paradigm shift from the old model (author - publisher - distributor - reader) to a world of user-generated content and collaboration! I was glad if we finally got to model and archive Weblogs and Wikis - modelling and archiving the whole process of content copying, changing and remixing and republication is far beyong libraries capabilities! Greetings, Jakob -- Jakob Voß [EMAIL PROTECTED], skype: nichtich Verbundzentrale des GBV (VZG) / Common Library Network Platz der Goettinger Sieben 1, 37073 Göttingen, Germany +49 (0)551 39-10242, http://www.gbv.de