Re: [CODE4LIB] RFC 5005 ATOM extension and OAI

2007-10-25 Thread Jakob Voss

Hi Clay,


I completely agree with everything you just wrote, especially about
Atom + APP being more than just a technology for blogs.  APP is a
great lightweight alternative to WebDAV, and promising for all sorts
of data transfer.  The fact that it has developer groundswell is a
huge plus.  During my Princeton days Kevin Clarke and I briefly
talked about what a METS + APP metadata editing application could
do.  (I can't remember the answer, but I bet it would be snazzy.)


On the one side you are right: Atom + APP is becoming popular and the
standards are good, so digital libraries should get into it. On the
other side I was just reminded to the ECDL2006-paper Repository
Replication Using NNTP and SMTP: You can almost use any protocol (HTTP,
OAI, ATOM APP, WebDAV, NNTP...) for most of digital libraries' use cases
- but the best standard without approriate tools and support is pretty
worthless.


I came to this realization out of frustration that most OAI toolkits
(at the time, ca. 2005) didn't support that functionality well -- or
at all.  I don't know if that's still the case.  However, the need to
delete records is a reality for most projects, and OAI has somewhat
awkwardly made us rethink how to delete a record in repositories
and the like, both on the service and data provider end.   You almost
have to build your entire system around handling deleted records
just for OAI exposure.   In reality it seems like you just end up
masquerading or re-representing its outward visibility on our local
systems, which gets onerous.

I guess the difference is that the growing number of Atom developers
are heeding the requirement for deletions, whereas the few existing
OAI toolkit developers have deemed that functionality as optional.


Most repositories do not even track deletions so they cannot syndicate
them. If OAI-delete was mandatory, maybe OAI-PMH had not been used that
much? OAI did a good job in promoting and documenting OAI-PMH but
deletions were always treated as an orphan - I would not blame the
standard but the lacking implementation.

Also ATOM and RFC 5005 is not much better than other solutions - but its
much more likely to get it implemented in Weblog and other software then
OAI which is not that known outside the library world.

Greetings,
Jakob

P.S: Maybe we would all be happy with Z39.50 if we had that wonderful
Indexdata tools right from the beginning - instead there were only
closed source specifications and different closed source partial
implementations. A standard without easy to use open source
implementations is condemned to be violated and die.


--
Jakob Voß [EMAIL PROTECTED], skype: nichtich
Verbundzentrale des GBV (VZG) / Common Library Network
Platz der Goettinger Sieben 1, 37073 Göttingen, Germany
+49 (0)551 39-10242, http://www.gbv.de


Re: [CODE4LIB] RFC 5005 ATOM extension and OAI

2007-10-25 Thread Jakob Voss

Peter wrote:


Also, re: blog mirroring, I highly recommend the current discussions
floating aroung the blogosphere regarding distributed source control (Git,
Mercurial, etc.).  It's a fundamental paradigm shift from centralized
control to distributed control that points the way toward the future of
libraries as they (we) become less and less the gatekeepers for the
stuff be it digital or physical and more and more the facilitators of
the bidirectional replication that assures ubiquitous access and
long-term preservation.  The library becomes (actually it has already
happended) simply a node on a network of trust and should act accordingly.

See the thoroughly entertaining/thought-provoking Google tech talk by
Linus Torvalds on Git:  http://www.youtube.com/watch?v=4XpnKHJAok8


Thanks for pointing to this interesting discussion. This goes even
further then the current paradigm shift from the old model
(author - publisher - distributor - reader) to a world of
user-generated content and collaboration! I was glad if we finally got
to model and archive Weblogs and Wikis - modelling and archiving the
whole process of content copying, changing and remixing and
republication is far beyong libraries capabilities!

Greetings,
Jakob

--
Jakob Voß [EMAIL PROTECTED], skype: nichtich
Verbundzentrale des GBV (VZG) / Common Library Network
Platz der Goettinger Sieben 1, 37073 Göttingen, Germany
+49 (0)551 39-10242, http://www.gbv.de


Re: [CODE4LIB] OpenContent SRU search of OAISter, weirdness?

2007-10-25 Thread Eric Lease Morgan

On Oct 24, 2007, at 6:04 PM, Jonathan Rochkind wrote:


I have some behavior I can't explain. There's this article that is in
OAISter, called Resurrection and Appropriation: Reputational
Trajectories, Memory Work, and the Political Use of Historical
Figures
by Robert S. Jensen.

I do an SRU search with query:
dc.title = Resurrection and Appropriation: Reputational Trajectories,
Memory Work, and the Political Use of Historical Figures
And I find the record, one hit. Good. You too could try, and see what
the DC returned looks like. It does have a dc:creator of Robert S.
Jensen.

But I try a search that includes the author.
dc.title = Resurrection and Appropriation: Reputational Trajectories,
Memory Work, and the Political Use of Historical Figures and
dc.creator
= Jensen



Yes, this looks weird to me too. I tried a number of query variations
and could not get a hit.

--
Eric Lease Morgan
Head, Digital Access and Information Architecture Department
University Libraries of Notre Dame

(574) 631-8604


Re: [CODE4LIB] OpenContent SRU search of OAISter, weirdness?

2007-10-25 Thread Ross Singer
It doesn't actually work with OAIster's own SRU interface, either.  A
query for the title works, but adding the author's name fails.

OAIster's SRU implementation is very strange, though, so I could very
well be doing something wrong.

-Ross.

On 10/25/07, Eric Lease Morgan [EMAIL PROTECTED] wrote:
 On Oct 24, 2007, at 6:04 PM, Jonathan Rochkind wrote:

  I have some behavior I can't explain. There's this article that is in
  OAISter, called Resurrection and Appropriation: Reputational
  Trajectories, Memory Work, and the Political Use of Historical
  Figures
  by Robert S. Jensen.
 
  I do an SRU search with query:
  dc.title = Resurrection and Appropriation: Reputational Trajectories,
  Memory Work, and the Political Use of Historical Figures
  And I find the record, one hit. Good. You too could try, and see what
  the DC returned looks like. It does have a dc:creator of Robert S.
  Jensen.
 
  But I try a search that includes the author.
  dc.title = Resurrection and Appropriation: Reputational Trajectories,
  Memory Work, and the Political Use of Historical Figures and
  dc.creator
  = Jensen


 Yes, this looks weird to me too. I tried a number of query variations
 and could not get a hit.

 --
 Eric Lease Morgan
 Head, Digital Access and Information Architecture Department
 University Libraries of Notre Dame

 (574) 631-8604



Re: [CODE4LIB] OpenContent SRU search of OAISter, weirdness?

2007-10-25 Thread Joshua Santelli
You're not getting any hits because the name is not Jensen, it's Jansen.
I'm not sure where Jensen came from but the OAIster indexes here have
Jansen.

josh


On 10/24/07 6:04 PM, Jonathan Rochkind [EMAIL PROTECTED] wrote:

 I'm messing with SRU search of http://indexdata.dk/opencontent/oaister

 I have some behavior I can't explain. There's this article that is in
 OAISter, called Resurrection and Appropriation: Reputational
 Trajectories, Memory Work, and the Political Use of Historical Figures
 by Robert S. Jensen.

 I do an SRU search with query:
 dc.title = Resurrection and Appropriation: Reputational Trajectories,
 Memory Work, and the Political Use of Historical Figures
 And I find the record, one hit. Good. You too could try, and see what
 the DC returned looks like. It does have a dc:creator of Robert S. Jensen.

 But I try a search that includes the author.
 dc.title = Resurrection and Appropriation: Reputational Trajectories,
 Memory Work, and the Political Use of Historical Figures and dc.creator
 = Jensen

 0 hits.
 and cql.serverChoice = Jensen= 0 hits

 Same using full name Robert S. Jensen (just as it appears in the
 record), with cql.serverChoice or dc.creator.

 Is this just a bad index, or is something else going on, or what?  As I
 try sample searches on title and author, I keep running into false
 negatives for things that ought to be in the OAISter index. Sometimes I
 can figure out why (title not quite right; title has curly quotes, index
 does not, etc.), but in this case I have no idea. But the net result is
 it's hard to actually find your known item this way, via an automated
 search on known item metadata.

 Jonathan


 --
 Jonathan Rochkind
 Digital Services Software Engineer
 The Sheridan Libraries
 Johns Hopkins University
 410.516.8886
 rochkind (at) jhu.edu


Re: [CODE4LIB] OpenContent SRU search of OAISter, weirdness?

2007-10-25 Thread Jonathan Rochkind

That was a typo in my problem report, I'm afraid. I was actually
searching Jansen, and that still exhibits the problems I mentioned.

I've also moved this conversation to indexdata's own list for this
service, at

http://lists.indexdata.dk/cgi-bin/mailman/listinfo/oclist (thanks to Jason 
Ronallo for bringing that list to my attention).

Jonathan


Joshua Santelli wrote:

You're not getting any hits because the name is not Jensen, it's Jansen.
I'm not sure where Jensen came from but the OAIster indexes here have
Jansen.

josh


On 10/24/07 6:04 PM, Jonathan Rochkind [EMAIL PROTECTED] wrote:



I'm messing with SRU search of http://indexdata.dk/opencontent/oaister

I have some behavior I can't explain. There's this article that is in
OAISter, called Resurrection and Appropriation: Reputational
Trajectories, Memory Work, and the Political Use of Historical Figures
by Robert S. Jensen.

I do an SRU search with query:
dc.title = Resurrection and Appropriation: Reputational Trajectories,
Memory Work, and the Political Use of Historical Figures
And I find the record, one hit. Good. You too could try, and see what
the DC returned looks like. It does have a dc:creator of Robert S. Jensen.

But I try a search that includes the author.
dc.title = Resurrection and Appropriation: Reputational Trajectories,
Memory Work, and the Political Use of Historical Figures and dc.creator
= Jensen

0 hits.
and cql.serverChoice = Jensen= 0 hits

Same using full name Robert S. Jensen (just as it appears in the
record), with cql.serverChoice or dc.creator.

Is this just a bad index, or is something else going on, or what?  As I
try sample searches on title and author, I keep running into false
negatives for things that ought to be in the OAISter index. Sometimes I
can figure out why (title not quite right; title has curly quotes, index
does not, etc.), but in this case I have no idea. But the net result is
it's hard to actually find your known item this way, via an automated
search on known item metadata.

Jonathan


--
Jonathan Rochkind
Digital Services Software Engineer
The Sheridan Libraries
Johns Hopkins University
410.516.8886
rochkind (at) jhu.edu






--
Jonathan Rochkind
Digital Services Software Engineer
The Sheridan Libraries
Johns Hopkins University
410.516.8886
rochkind (at) jhu.edu


[CODE4LIB] Distributed Models the Library (was: Re: [CODE4LIB] RFC 5005 ATOM extension and OAI)

2007-10-25 Thread pkeane

Hi Jakob-

Yes, I think you are correct that it is a bit much to think that a
distributed archiving model is a bit much for libraries to even consider
now, but I do think there are useful insights to be gained here.

As it stands now, linux developers using Git can carry around the entire
change history of the linux kernel (well, I think they just included the
2.6 kernel when they moved to Git) on their laptop, make changes, create
patches, etc and then make that available to others.  Well, undoubtedly
change history is is a bit much for the library to think about, by why
not, for instance, and entire library catalog?  If I could check out the
library catalog onto my computer  use whatever tools I wished to search,
organize, annotate, etc., then perhaps mix-in data (say holdings data
from other that are near me) OR even create the sort of relationships
between records that the Open Library folks are talking about
(http://www.hyperorg.com/blogger/mtarchive/berkman_lunch_aaron_swartz_on.html)
then share that added data, we have quite a powerful distributed
development model.  It may seem a bit far-fetched, but I think that some
of the pieces (or at least a better understanding of how this might all
work) are beginning to take shape.

-Peter

On Thu, 25 Oct 2007, Jakob Voss wrote:


Peter wrote:


Also, re: blog mirroring, I highly recommend the current discussions
floating aroung the blogosphere regarding distributed source control (Git,
Mercurial, etc.).  It's a fundamental paradigm shift from centralized
control to distributed control that points the way toward the future of
libraries as they (we) become less and less the gatekeepers for the
stuff be it digital or physical and more and more the facilitators of
the bidirectional replication that assures ubiquitous access and
long-term preservation.  The library becomes (actually it has already
happended) simply a node on a network of trust and should act accordingly.

See the thoroughly entertaining/thought-provoking Google tech talk by
Linus Torvalds on Git:  http://www.youtube.com/watch?v=4XpnKHJAok8


Thanks for pointing to this interesting discussion. This goes even
further then the current paradigm shift from the old model
(author - publisher - distributor - reader) to a world of
user-generated content and collaboration! I was glad if we finally got
to model and archive Weblogs and Wikis - modelling and archiving the
whole process of content copying, changing and remixing and
republication is far beyong libraries capabilities!

Greetings,
Jakob

--
Jakob Voß [EMAIL PROTECTED], skype: nichtich
Verbundzentrale des GBV (VZG) / Common Library Network
Platz der Goettinger Sieben 1, 37073 Göttingen, Germany
+49 (0)551 39-10242, http://www.gbv.de


Re: [CODE4LIB] OpenContent SRU search of OAISter, weirdness?

2007-10-25 Thread Tim Shearer

Dumb question, no experience with the syntax, but should there be a
wildcard -or- use of something other than equals (sorry, if I'm way off
base, but most query syntax I use requires like or wildcarding).

-t

On Thu, 25 Oct 2007, Jonathan Rochkind wrote:


That was a typo in my problem report, I'm afraid. I was actually
searching Jansen, and that still exhibits the problems I mentioned.

I've also moved this conversation to indexdata's own list for this
service, at

http://lists.indexdata.dk/cgi-bin/mailman/listinfo/oclist (thanks to Jason
Ronallo for bringing that list to my attention).

Jonathan


Joshua Santelli wrote:

You're not getting any hits because the name is not Jensen, it's Jansen.
I'm not sure where Jensen came from but the OAIster indexes here have
Jansen.

josh


On 10/24/07 6:04 PM, Jonathan Rochkind [EMAIL PROTECTED] wrote:



I'm messing with SRU search of http://indexdata.dk/opencontent/oaister

I have some behavior I can't explain. There's this article that is in
OAISter, called Resurrection and Appropriation: Reputational
Trajectories, Memory Work, and the Political Use of Historical Figures
by Robert S. Jensen.

I do an SRU search with query:
dc.title = Resurrection and Appropriation: Reputational Trajectories,
Memory Work, and the Political Use of Historical Figures
And I find the record, one hit. Good. You too could try, and see what
the DC returned looks like. It does have a dc:creator of Robert S. Jensen.

But I try a search that includes the author.
dc.title = Resurrection and Appropriation: Reputational Trajectories,
Memory Work, and the Political Use of Historical Figures and dc.creator
= Jensen

0 hits.
and cql.serverChoice = Jensen= 0 hits

Same using full name Robert S. Jensen (just as it appears in the
record), with cql.serverChoice or dc.creator.

Is this just a bad index, or is something else going on, or what?  As I
try sample searches on title and author, I keep running into false
negatives for things that ought to be in the OAISter index. Sometimes I
can figure out why (title not quite right; title has curly quotes, index
does not, etc.), but in this case I have no idea. But the net result is
it's hard to actually find your known item this way, via an automated
search on known item metadata.

Jonathan


--
Jonathan Rochkind
Digital Services Software Engineer
The Sheridan Libraries
Johns Hopkins University
410.516.8886
rochkind (at) jhu.edu






--
Jonathan Rochkind
Digital Services Software Engineer
The Sheridan Libraries
Johns Hopkins University
410.516.8886
rochkind (at) jhu.edu



[CODE4LIB] Distributed Models the Library (was: Re: [CODE4LIB] RFC 5005 ATOM extension and OAI)

2007-10-25 Thread Jason Stirnaman
 not, for instance, and entire library catalog?  If I could check out the
 library catalog onto my computer  use whatever tools I wished to search,

Peter,

You might be interested in Art Rhyno's experiment.  Here's Jon Udell's summary:

Art Rhyno’s science project
Art Rhyno’s title is Systems Librarian but he should consider adding Mad 
Scientist to his business card because his is full of wild and crazy and — to 
me, at least — brilliant ideas. Last year, when I was a judge for the Talis 
“Mashing up the Library” competion, one of my favorite entries was this one 
from Art. The project mirrors a library catalog to the desktop and integrates 
it with desktop search. The searcher in this case is Google Desktop, but could 
be another, and the integration is accomplished by exposing the catalog as a 
set of Web Folders, which Art correctly describes as “Microsoft’s in-built and 
oft-overlooked WebDAV option.”

http://blog.jonudell.net/2007/03/16/art-rhynos-science-project/

Jason
--

Jason Stirnaman
OME/Biomedical  Digital Projects Librarian
A.R. Dykes Library
The University of Kansas Medical Center
Kansas City, Kansas
Work: 913-588-7319
Email: [EMAIL PROTECTED]


 On 10/25/2007 at 10:47 AM, in message
[EMAIL PROTECTED], pkeane
[EMAIL PROTECTED] wrote:
 Hi Jakob-

 Yes, I think you are correct that it is a bit much to think that a
 distributed archiving model is a bit much for libraries to even consider
 now, but I do think there are useful insights to be gained here.

 As it stands now, linux developers using Git can carry around the entire
 change history of the linux kernel (well, I think they just included the
 2.6 kernel when they moved to Git) on their laptop, make changes, create
 patches, etc and then make that available to others.  Well, undoubtedly
 change history is is a bit much for the library to think about, by why
 not, for instance, and entire library catalog?  If I could check out the
 library catalog onto my computer  use whatever tools I wished to search,
 organize, annotate, etc., then perhaps mix-in data (say holdings data
 from other that are near me) OR even create the sort of relationships
 between records that the Open Library folks are talking about
 (http://www.hyperorg.com/blogger/mtarchive/berkman_lunch_aaron_swartz_on.htm
 l)
 then share that added data, we have quite a powerful distributed
 development model.  It may seem a bit far-fetched, but I think that some
 of the pieces (or at least a better understanding of how this might all
 work) are beginning to take shape.

 -Peter

 On Thu, 25 Oct 2007, Jakob Voss wrote:

 Peter wrote:

 Also, re: blog mirroring, I highly recommend the current discussions
 floating aroung the blogosphere regarding distributed source control (Git,
 Mercurial, etc.).  It's a fundamental paradigm shift from centralized
 control to distributed control that points the way toward the future of
 libraries as they (we) become less and less the gatekeepers for the
 stuff be it digital or physical and more and more the facilitators of
 the bidirectional replication that assures ubiquitous access and
 long-term preservation.  The library becomes (actually it has already
 happended) simply a node on a network of trust and should act accordingly.

 See the thoroughly entertaining/thought-provoking Google tech talk by
 Linus Torvalds on Git:  http://www.youtube.com/watch?v=4XpnKHJAok8

 Thanks for pointing to this interesting discussion. This goes even
 further then the current paradigm shift from the old model
 (author - publisher - distributor - reader) to a world of
 user-generated content and collaboration! I was glad if we finally got
 to model and archive Weblogs and Wikis - modelling and archiving the
 whole process of content copying, changing and remixing and
 republication is far beyong libraries capabilities!

 Greetings,
 Jakob

 --
 Jakob Voß [EMAIL PROTECTED], skype: nichtich
 Verbundzentrale des GBV (VZG) / Common Library Network
 Platz der Goettinger Sieben 1, 37073 Göttingen, Germany
 +49 (0)551 39-10242, http://www.gbv.de



[CODE4LIB] OpenURL Referrer for IE

2007-10-25 Thread Eric Hellman

OCLC's OpenURL Referrer is now available for Internet Explorer!
Previously available only for Firefox, this popular browser extension
inserts OpenURLs into Google Scholar and Google News Archive search
results.  It also detects and makes links out of web COinS, such as
those found in Wikipedia and Worldcat.org.

The extension can be downloaded for free at
http://openly.oclc.org/openurlref/

OpenURL Referrer uses your institution's link resolver settings from
the OCLC WorldCat Registry, so there is no need to manually configure
the extension. Institutions can register their resolver in the OCLC
WorldCat Registry by visiting http://worldcat.org/registry/
institutions.  All institutions can register for free, even if they
are not OCLC member libraries.

The IE version leans much more heavily on the Worldcat Resolver
Registry than the Firefox version does. The reason for this is that
IE does not have a nice XUL-based way to make user interfaces, so we
instead rely on the Registry to do baseurl management. I hope this
proves to be useful.


Eric Hellman, Director  OCLC Openly Informatics
Division
[EMAIL PROTECTED]   2 Broad St., Suite 208
tel 1-973-509-7800   Bloomfield, NJ 07003
http://openly.oclc.org/


Re: [CODE4LIB] Distributed Models the Library (was: Re: [CODE4LIB] RFC 5005 ATOM extension and OAI)

2007-10-25 Thread pkeane

Very interesting!  I will check it out

-Peter

On Thu, 25 Oct 2007, Jason Stirnaman wrote:


not, for instance, and entire library catalog?  If I could check out the
library catalog onto my computer  use whatever tools I wished to search,


Peter,

You might be interested in Art Rhyno's experiment.  Here's Jon Udell's summary:

Art Rhyno?s science project
Art Rhyno?s title is Systems Librarian but he should consider adding Mad 
Scientist to his business card because his is full of wild and crazy and ? to 
me, at least ? brilliant ideas. Last year, when I was a judge for the Talis 
?Mashing up the Library? competion, one of my favorite entries was this one 
from Art. The project mirrors a library catalog to the desktop and integrates 
it with desktop search. The searcher in this case is Google Desktop, but could 
be another, and the integration is accomplished by exposing the catalog as a 
set of Web Folders, which Art correctly describes as ?Microsoft?s in-built and 
oft-overlooked WebDAV option.?

http://blog.jonudell.net/2007/03/16/art-rhynos-science-project/

Jason
--

Jason Stirnaman
OME/Biomedical  Digital Projects Librarian
A.R. Dykes Library
The University of Kansas Medical Center
Kansas City, Kansas
Work: 913-588-7319
Email: [EMAIL PROTECTED]



On 10/25/2007 at 10:47 AM, in message

[EMAIL PROTECTED], pkeane
[EMAIL PROTECTED] wrote:

Hi Jakob-

Yes, I think you are correct that it is a bit much to think that a
distributed archiving model is a bit much for libraries to even consider
now, but I do think there are useful insights to be gained here.

As it stands now, linux developers using Git can carry around the entire
change history of the linux kernel (well, I think they just included the
2.6 kernel when they moved to Git) on their laptop, make changes, create
patches, etc and then make that available to others.  Well, undoubtedly
change history is is a bit much for the library to think about, by why
not, for instance, and entire library catalog?  If I could check out the
library catalog onto my computer  use whatever tools I wished to search,
organize, annotate, etc., then perhaps mix-in data (say holdings data
from other that are near me) OR even create the sort of relationships
between records that the Open Library folks are talking about
(http://www.hyperorg.com/blogger/mtarchive/berkman_lunch_aaron_swartz_on.htm
l)
then share that added data, we have quite a powerful distributed
development model.  It may seem a bit far-fetched, but I think that some
of the pieces (or at least a better understanding of how this might all
work) are beginning to take shape.

-Peter

On Thu, 25 Oct 2007, Jakob Voss wrote:


Peter wrote:


Also, re: blog mirroring, I highly recommend the current discussions
floating aroung the blogosphere regarding distributed source control (Git,
Mercurial, etc.).  It's a fundamental paradigm shift from centralized
control to distributed control that points the way toward the future of
libraries as they (we) become less and less the gatekeepers for the
stuff be it digital or physical and more and more the facilitators of
the bidirectional replication that assures ubiquitous access and
long-term preservation.  The library becomes (actually it has already
happended) simply a node on a network of trust and should act accordingly.

See the thoroughly entertaining/thought-provoking Google tech talk by
Linus Torvalds on Git:  http://www.youtube.com/watch?v=4XpnKHJAok8


Thanks for pointing to this interesting discussion. This goes even
further then the current paradigm shift from the old model
(author - publisher - distributor - reader) to a world of
user-generated content and collaboration! I was glad if we finally got
to model and archive Weblogs and Wikis - modelling and archiving the
whole process of content copying, changing and remixing and
republication is far beyong libraries capabilities!

Greetings,
Jakob

--
Jakob Voß [EMAIL PROTECTED], skype: nichtich
Verbundzentrale des GBV (VZG) / Common Library Network
Platz der Goettinger Sieben 1, 37073 Göttingen, Germany
+49 (0)551 39-10242, http://www.gbv.de