Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

2010-05-18 Thread Kuba
Hi,

Does the current draft include any support for asynchronous operation
of the protocol (either by status notifications and polling and/or
streaming), e.g some chunk of results coming back before others?
Sometime ago I read through an early draft published on the LOC site
and it mentioned support for federated search but it's hard to imagine
how could that be implemented without any async support.

On Tue, May 18, 2010 at 12:39 AM, Jonathan Rochkind rochk...@jhu.edu wrote:
 Cool, we're on the same page.

 This means that the asset parameter not only does not need to be, but
 should NOT be, an actual parameterized value in the OpenSearch desc URL
 template, in contrast to Tony's current SRU-in-opensearch-extension draft
 spec.

 At least that's my argument. Seems pretty clear to me.

 Make sense:

 Url type=application/rss+xml
 template=http://my-sru-server.com?q={searchTerms}amp;accept=application%2Frss/

 Does NOT make sense:

 Url type=application/rss+xml
 template=http://my-sru-server.com?q={searchTerms}amp;accept={sru:accept}/

 [that parameter conflicts with the type in the Url template, does not make
 sense to be parameterized in the OpenSearch URL template.  Whether it's
 parameterized to your SRU server is of no concern of OpenSearch, but to the
 extent it's in the URL, it has to be FIXED in the OpenSearch URL template.
 Is my argument. ]



 Ray Denenberg, Library of Congress wrote:

 Rather, OpenSearch descriptions provide a _different URL template_ for
 every response IANA content type available

 Yes, of course, that's what I meant, I said it somewhat slopily, but in
 many of the examples we've looked at, it comes down to the same thing, that
 the different templates differ only in a single (hard-coded) parameter.

 --Ray


 - Original Message - From: Jonathan Rochkind rochk...@jhu.edu
 To: CODE4LIB@LISTSERV.ND.EDU
 Sent: Monday, May 17, 2010 6:11 PM
 Subject: Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts




 Ray Denenberg, Library of Congress wrote:


 Ralph will probably be able to articulate this better than I can, but
 the
 accept parameter is driven by the requirement to be able to use
 OpenSearch
 (for example) to query an SRU server. The description document isn't
 going
 to provide templates that allow you to do this via content negotiation,
 they
 provide a parameter instead, to allow the client to tell the server that
 it
 wants, for example, an rss response.



 No, they don't.  I am having this same debate with Tony Hammond.

 OpenSearch descriptions do NOT provide a parameter to allow the client to
 tell the server what response it wants. They also don't easily provide for
 content negotiation, it is true.
 Rather, OpenSearch descriptions provide a _different URL template_ for
 every response IANA content type available.  Here is the example from the
 OpenSearch documentation:

 Url type=application/rss+xml
 xmlns:example=http://example.com/opensearchextensions/1.0/;

 template=http://example.com?q={searchTerms}amp;c={example:color?}/

 Please note that application/rss+xml is an attribute of the URL template
 itself, it is NOT a parameter in the template.

 If SRU added an accept parameter to try and make OpenSearch happy, this
 is a big mistake, because it in fact _conflicts_ with OpenSearch desc -- to
 make it available as an actual parameter in the OpenSearch URL template.

 If on the other hand, you just want to hard-code it in though, that could
 make some sense.

 Url type=application/rss+xml
 xmlns:example=http://example.com/opensearchextensions/1.0/;


 template=http://my-sru-server.com?q={searchTerms}amp;accept=application%2Frss/

 That might make sense, if that's the use case.  But actually trying to
 provide a parameter for the _client_ to fill out in an OpenSearch desc in
 fact _conflicts_ with OpenSearch, for better or for worse the respones type
 is _hard coded_ into the template.

 Jonathan





 (I suggest, though, that you move further discussion of this to the SRU
 list.)

 --Ray

 -Original Message-
 From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of
 Robert Sanderson
 Sent: Monday, May 17, 2010 3:44 PM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

 In today's RESTful world, what's the requirement for the httpAccept
 parameter?  Isn't straight content negotiation sufficient rather than
 pulling the headers into the URI?
 What happens if the accept header and the httpAccept parameter say
 different
 things?


 Rob

 On Mon, May 17, 2010 at 1:37 PM, LeVan,Ralph le...@oclc.org wrote:



 I'd code it.  (I have already coded to it.)  For me, the httpAccept
 parameter and support for content negotiation on responses is a wonderful
 addition to the standard.  It lets us be OpenSearch compliant finally.

 The virtue of coding to the draft is that there's a chance we can fix
 any problems you encounter.  While we consider the draft stable, that
 doesn't 

Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

2010-05-18 Thread Ray Denenberg, Library of Congress
There is no synchronous operation in SRU.   

As for federated  search .  

To digress a moment, you may recall -- I believe it was on this list --
there was discussion (maybe a year ago?) of what that even means and whether
it is the same or differs from metasearch, whatever that means.  That
discussion was inconclusive.  Anyway, earlier drafts of SRU 2.0  describe a
metasearch model.  Recently, the committee decided that the terms
metasearch and federated search are undefined jargon.  We now choose to
call it multi-server search.

So to answer the question of whether there is federated search support: yes,
limited support, if by federated search you mean multi server search.   

There is no multi-server support in terms of separate result sets for
different servers.   However,  for (1) faceted search results, and (2)
subquery results, these can be grouped according to server. 

--Ray



-Original Message-
From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Kuba
Sent: Tuesday, May 18, 2010 4:07 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

Hi,

Does the current draft include any support for asynchronous operation of the
protocol (either by status notifications and polling and/or streaming), e.g
some chunk of results coming back before others?
Sometime ago I read through an early draft published on the LOC site and it
mentioned support for federated search but it's hard to imagine how could
that be implemented without any async support.


Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

2010-05-18 Thread Mike Taylor
On 18 May 2010 15:24, Ray Denenberg, Library of Congress r...@loc.gov wrote:
 There is no synchronous operation in SRU.

 As for federated  search .

 To digress a moment, you may recall -- I believe it was on this list --
 there was discussion (maybe a year ago?) of what that even means and whether
 it is the same or differs from metasearch, whatever that means.  That
 discussion was inconclusive.  Anyway, earlier drafts of SRU 2.0  describe a
 metasearch model.  Recently, the committee decided that the terms
 metasearch and federated search are undefined jargon.  We now choose to
 call it multi-server search.

Way to go.  Introducting yet ANOTHER synonym can only help!

(And don't forget broadcast search.)


Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

2010-05-18 Thread Jonathan Rochkind
What terms do you suggest, Mike? 

I think we're doomed no matter what with these, after certain 
communities started to use federated search and metasearch in 
directly opposite ways. 

I also was told recently that what is called an accordion in English 
is called a bandoneon in Spanish, and what is called a accordeon in 
Spanish is called a bandoneon in English.


Hope this helps.

Mike Taylor wrote:

On 18 May 2010 15:24, Ray Denenberg, Library of Congress r...@loc.gov wrote:
  

There is no synchronous operation in SRU.

As for federated  search .

To digress a moment, you may recall -- I believe it was on this list --
there was discussion (maybe a year ago?) of what that even means and whether
it is the same or differs from metasearch, whatever that means.  That
discussion was inconclusive.  Anyway, earlier drafts of SRU 2.0  describe a
metasearch model.  Recently, the committee decided that the terms
metasearch and federated search are undefined jargon.  We now choose to
call it multi-server search.



Way to go.  Introducting yet ANOTHER synonym can only help!

(And don't forget broadcast search.)

  


Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

2010-05-18 Thread Ray Denenberg, Library of Congress
On 18 May 2010 15:24, Ray Denenberg, Library of Congress r...@loc.gov
wrote:
 There is no synchronous operation in SRU.

Sorry, meant to say no asynchronous .

--Ray


Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

2010-05-18 Thread Jonathan Rochkind

Mike Taylor wrote:


What communities?  
All  I know is we here on this very list had some people _insisting_ 
that federated search _really_ meant aggregated index, and meta 
search _really_ meant broadcast search, and other people insisting the 
opposite.  Both sides had citations to the literature.


I'm uninterested in arguing about what words really mean. Clearly 
these words have been used both ways in the profession and the literature.


So I don't see any way around it by trying to find other words that 
attempt be harder to use more than one way. As you can see, I like 
broadcast search and aggregated index.



Maybe we on the CODE4LIB list collectively carry
enough weight that we could take the most prevalent meanings and
propagate them?
  
Ha, ha. Good luck with that.  What we CAN do is spend a couple months 
arguing about what the words REALLY mean on the list (we're good at 
that), and then vote, and then have everyone else ignore the vote anyway.



Jonathan


Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

2010-05-18 Thread Mike Taylor
On 18 May 2010 16:29, Jonathan Rochkind rochk...@jhu.edu wrote:
 Mike Taylor wrote:

 What communities?

 All  I know is we here on this very list had some people _insisting_ that
 federated search _really_ meant aggregated index, and meta search
 _really_ meant broadcast search, and other people insisting the opposite.
  Both sides had citations to the literature.

 I'm uninterested in arguing about what words really mean. Clearly these
 words have been used both ways in the profession and the literature.

 So I don't see any way around it by trying to find other words that attempt
 be harder to use more than one way. As you can see, I like broadcast
 search and aggregated index.

 Maybe we on the CODE4LIB list collectively carry
 enough weight that we could take the most prevalent meanings and
 propagate them?


 Ha, ha. Good luck with that.  What we CAN do is spend a couple months
 arguing about what the words REALLY mean on the list (we're good at that),
 and then vote, and then have everyone else ignore the vote anyway.

Ha, ha indeed.  You are of course right.

You're also right that broadcast search and aggregated index are
both explicit terms not susceptible to misinterpretation.  We should
try to remember to use those instead.  Sign me up!



 Jonathan




Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

2010-05-18 Thread Kuba
That is quite unfortunate, as we were looking at SRU 2.0 as a possible
candidate for the front-end protocol for Index Data's pazpar2. The
main problem with federate/broadcast/meta (however you want to call it
;) searching is that the back-end databases are scattered in different
locations or simply slow in their response times and in order to
provide decent user experience you need to be able to present some
results sooner than others. Waiting for the slowest database to
respond is usually not an option.

On Tue, May 18, 2010 at 5:24 PM, Ray Denenberg, Library of Congress
r...@loc.gov wrote:
 On 18 May 2010 15:24, Ray Denenberg, Library of Congress r...@loc.gov
 wrote:
 There is no synchronous operation in SRU.

 Sorry, meant to say no asynchronous .

 --Ray




-- 

Cheers,
Jakub


Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

2010-05-18 Thread Ray Denenberg, Library of Congress
What advantage do you see in having a concurrent operations feature (like
Z39.50) versus opening several connections?

(Concurrent operations introduced significant complexity into Z39.50 -
including reference ids, operations, etc, and I'm not sure anyone ever
really thought it was worth it.)

--Ray

-Original Message-
From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Kuba
Sent: Tuesday, May 18, 2010 12:58 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

That is quite unfortunate, as we were looking at SRU 2.0 as a possible
candidate for the front-end protocol for Index Data's pazpar2. The main
problem with federate/broadcast/meta (however you want to call it
;) searching is that the back-end databases are scattered in different
locations or simply slow in their response times and in order to provide
decent user experience you need to be able to present some results sooner
than others. Waiting for the slowest database to respond is usually not an
option.

On Tue, May 18, 2010 at 5:24 PM, Ray Denenberg, Library of Congress
r...@loc.gov wrote:
 On 18 May 2010 15:24, Ray Denenberg, Library of Congress 
 r...@loc.gov
 wrote:
 There is no synchronous operation in SRU.

 Sorry, meant to say no asynchronous .

 --Ray




-- 

Cheers,
Jakub


Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

2010-05-18 Thread Jonathan Rochkind

Jakub Skoczen wrote:



I wonder if someone, like Kuba, could design an 'extended async SRU' on top
of SRU, that is very SRU like, but builds on top of it to add just enough
operations for Kuba's use case area.  I think that's the right way to
approach it.



Is there a particular extensibility feature in the protocol that
allows for this?
  
I don't know, but that's not what I was suggesting. I was suggesting you 
read the SRU spec, and then design your own SRU-async spec, which is 
defined as exactly like SRU 2.0, except it also has the following 
operations, and is identified in an Explain document like X.


Jonathan


Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

2010-05-18 Thread Ray Denenberg, Library of Congress
First, no. There are extensibility features in SRU but nothing that would
help here. 

Actually, Jonathan, what I though you were suggesting was the creation of a
(I hesitate to say it) metasearch engine. I use that term because it is what
NISO called it, when they started their metasearch initiative five or so
years ago, to create a standard for a metasearch engine, but they got
distracted and the effort really came to nothing.   

The premise of the metasearch engine is that there exists a single-thread
protocol, for example, SRU, and the need is to manage many threads, which is
what the metasearch engine would have done if it had ever been defined. This
is probably not an area for OASIS work, but if someone wanted to revive the
effort in NISO (and put it on the right track) it could be useful. 

--Ray


-Original Message-
From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of
Jonathan Rochkind
Sent: Tuesday, May 18, 2010 2:56 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

Jakub Skoczen wrote:

 I wonder if someone, like Kuba, could design an 'extended async SRU' 
 on top of SRU, that is very SRU like, but builds on top of it to add 
 just enough operations for Kuba's use case area.  I think that's the 
 right way to approach it.
 

 Is there a particular extensibility feature in the protocol that 
 allows for this?
   
I don't know, but that's not what I was suggesting. I was suggesting you
read the SRU spec, and then design your own SRU-async spec, which is
defined as exactly like SRU 2.0, except it also has the following
operations, and is identified in an Explain document like X.

Jonathan


Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

2010-05-18 Thread Jakub Skoczen
On Tue, May 18, 2010 at 9:17 PM, Ray Denenberg, Library of Congress
r...@loc.gov wrote:
 First, no. There are extensibility features in SRU but nothing that would
 help here.

 Actually, Jonathan, what I though you were suggesting was the creation of a
 (I hesitate to say it) metasearch engine. I use that term because it is what
 NISO called it, when they started their metasearch initiative five or so
 years ago, to create a standard for a metasearch engine, but they got
 distracted and the effort really came to nothing.

I'm not sure if Jonathan was suggesting that but that's exactly what I
had in mind - using SRU 2.0 as a front-end protocol for a meta-search
engine. And yes while creating a third-party, SRU-inspired protocol
for that purpose could work, I see very little value in such exercise.
I suspect that, as any standard, SRU has certain limitations and, as
an implementer, you have to work around them but you do end up with an
obvious gain: standards compliance. SRU-inspired protocol is not quite
the same thing, and it's probably easier to go all the way and create
a custom, proprietary protocol.

 The premise of the metasearch engine is that there exists a single-thread
 protocol, for example, SRU, and the need is to manage many threads, which is
 what the metasearch engine would have done if it had ever been defined. This
 is probably not an area for OASIS work, but if someone wanted to revive the
 effort in NISO (and put it on the right track) it could be useful.

 --Ray


 -Original Message-
 From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of
 Jonathan Rochkind
 Sent: Tuesday, May 18, 2010 2:56 PM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] OASIS SRU and CQL, access to most-current drafts

 Jakub Skoczen wrote:

 I wonder if someone, like Kuba, could design an 'extended async SRU'
 on top of SRU, that is very SRU like, but builds on top of it to add
 just enough operations for Kuba's use case area.  I think that's the
 right way to approach it.


 Is there a particular extensibility feature in the protocol that
 allows for this?

 I don't know, but that's not what I was suggesting. I was suggesting you
 read the SRU spec, and then design your own SRU-async spec, which is
 defined as exactly like SRU 2.0, except it also has the following
 operations, and is identified in an Explain document like X.

 Jonathan




-- 

Cheers,
Jakub