Re: [Dspace-tech] External Searching of DSpace contents

2015-03-05 Thread helix84
I used to use this SRU module [1] with DSpace 1.6-1.8. It should still
work, but it uses Lucene as the search backend. It was annoying to
upgrade, though, so I changed my client to talk to DSpace's Solr
directly. The client is a component on another server, so I didn't
have to secure Solr; I only had to add the client's IP to
RemoteAddrValve. Back then I had the idea of creating an SRU interface
as a thin layer on top of Solr. It should be quite easy to do [2]; see
the link there.
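
For illustration only, here is a minimal Python sketch of a client that
queries the DSpace Solr core directly. The Solr URL, the field names
(handle, dc.title) and the search.resourcetype:2 filter for items are
assumptions about a typical Discovery setup, not something taken from
my own client; check them against your schema. Also keep in mind that a
direct Solr query only works from hosts allowed by RemoteAddrValve.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumption: location of the DSpace "search" core; adjust to your install.
SOLR_SELECT = "http://localhost:8080/solr/search/select"


def build_solr_url(user_query, rows=10, base=SOLR_SELECT):
    """Return a Solr select URL for the query, restricted to items."""
    params = [
        ("q", user_query),
        ("fq", "search.resourcetype:2"),  # assumed: type 2 means "item"
        ("fl", "handle,dc.title"),        # assumed field names
        ("rows", str(rows)),
        ("wt", "json"),
    ]
    return base + "?" + urlencode(params)


def extract_hits(solr_response):
    """Pull (handle, title) pairs out of a parsed Solr JSON response."""
    hits = []
    for doc in solr_response.get("response", {}).get("docs", []):
        title = doc.get("dc.title", "")
        if isinstance(title, list):  # multi-valued fields come back as lists
            title = title[0] if title else ""
        hits.append((doc.get("handle"), title))
    return hits


def search(user_query, rows=10):
    """Run the query; needs network access to the Solr host."""
    with urlopen(build_solr_url(user_query, rows)) as resp:
        return extract_hits(json.load(resp))
```

Keeping URL construction and response parsing separate from the network
call makes both easy to test without a running Solr.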

Anyway, I asked you what your use case was. Is it just search query
autosuggestion using jQuery? We used to have that before 3.0, but it
was removed in the huge patch which brought Access Rights Awareness to
Discovery, supposedly because it was memory-inefficient. It's worth
looking at how it was implemented there.

For completeness, I'll also mention solr-security-proxy [3] here, but
generally you might want to avoid that approach [4]. Going off on a
tangent, Elasticsearch has the same problem, but the company behind ES
very recently released a commercial product that addresses this highly
requested feature [5].


[1] https://code.google.com/p/oclcsrwdspacelucene/
[2] https://jira.duraspace.org/browse/DS-1264
[3] https://github.com/dergachev/solr-security-proxy
[4] http://stackoverflow.com/questions/3637014/running-solr-in-read-only-mode
[5] http://www.elasticsearch.org/overview/shield/

Regards,
~~helix84

Compulsory reading: DSpace Mailing List Etiquette
https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette

--
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/
___
DSpace-tech mailing list
DSpace-tech@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dspace-tech
List Etiquette: https://wiki.duraspace.org/display/DSPACE/Mailing+List+Etiquette


Re: [Dspace-tech] External Searching of DSpace contents

2015-03-05 Thread LiYu Lilly
It might not be very relevant, but has anyone started, or is anyone
considering, integrating VIVO and DSpace?

Thanks,
Lilly


Re: [Dspace-tech] External Searching of DSpace contents

2015-03-05 Thread Bruno Zanette
Peter,

I'm glad you brought this up! I've been thinking about it for some
time too, since it's a requirement for the project I'm working on.

I thought about creating a RubyGem to integrate with Solr directly and
using it side by side with the rest-api gem, but I'm pretty sure that
idea has several security issues. The other idea is to use the REST
API, as you said, but I don't know how far it can go. Still, I think
that's the best solution for our problem, since we're already using
rest-api.

I'll do some research on OpenSearch and the other ideas you mentioned.
I hadn't heard of them before.

Right now we're fixing some bugs on our site, and in the next few weeks
we'll start working on the search feature.

I'll follow this thread closely!

2015-03-05 14:27 GMT-03:00 Terry Brady terry.br...@georgetown.edu:
 We have a bento-box style search built into our Drupal website.

 http://www.library.georgetown.edu/search/gusuper/yearbook

 We have de-emphasized this interface since we adopted a discovery layer,
 because the discovery layer also searches our DSpace instance.

 If you are curious, here is the code:
 https://github.com/Georgetown-University-Libraries/gusuper

 The code constructs a PHP SolrClient to perform the search.

 https://github.com/Georgetown-University-Libraries/gusuper/blob/master/gudspacesolr/gudspacesolr.module#L83

 Terry


 --
 Terry Brady
 Applications Programmer Analyst
 Georgetown University Library Information Technology
 https://www.library.georgetown.edu/lit/code
 425-298-5498 (Seattle, WA)




-- 
Bruno Nocera Zanette
+55 41 9992-2508



[Dspace-tech] External Searching of DSpace contents

2015-03-05 Thread Peter Dietz
I was wondering if anyone has built any integrations where an external web
site (such as your institution's website or CMS) can search and show results
from your DSpace? One approach that ought to be pretty straightforward is
the OpenSearch web service.

https://trydspace.longsight.com/open-search/discover?query=test

Getting that to work on a site would require some AJAX to update as the
user types in a search query, and then display the updated search results.
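
As a rough sketch of the OpenSearch route (in Python rather than
browser-side AJAX, to keep it self-contained), the snippet below builds
the query URL and parses an Atom response into title/link pairs. The
format=atom parameter and the assumption that each result is an Atom
entry with a title and a link element should be verified against a real
response from your DSpace; the helper names are made up for this
example.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode
from urllib.request import urlopen

ATOM = "{http://www.w3.org/2005/Atom}"
# URL from the example above; replace with your own DSpace host.
OPENSEARCH = "https://trydspace.longsight.com/open-search/discover"


def opensearch_url(query, base=OPENSEARCH):
    """Build the OpenSearch URL (format=atom is an assumed parameter)."""
    return base + "?" + urlencode({"query": query, "format": "atom"})


def parse_atom(feed_xml):
    """Turn an Atom feed (str or bytes) into a list of title/url dicts."""
    root = ET.fromstring(feed_xml)
    results = []
    for entry in root.iter(ATOM + "entry"):
        title = entry.findtext(ATOM + "title", default="")
        link = entry.find(ATOM + "link")
        href = link.get("href") if link is not None else None
        results.append({"title": title.strip(), "url": href})
    return results


def search(query):
    """Fetch and parse results; needs network access to the DSpace host."""
    with urlopen(opensearch_url(query)) as resp:
        return parse_atom(resp.read())
```

A CMS could call something like search() on the server side and render
the returned list, which also avoids cross-origin issues in the browser.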

Another route could be the DSpace 5 REST API. There is a
find-by-metadata-field (key, value) endpoint, but it isn't suited to
full-text searching. Other possible routes I'm considering are SRW/U or
Z39.50.
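
To make the REST option concrete, here is a hedged Python sketch of a
find-by-metadata-field call. It assumes the endpoint is a POST to
/items/find-by-metadata-field under the REST base URL and that it takes
a JSON body with key, value and language; the host name is hypothetical,
so check the path and body shape against the REST documentation for
your DSpace version. It matches an exact metadata value, which is why it
is not a substitute for full-text search.

```python
import json
from urllib.request import Request, urlopen

REST_BASE = "https://dspace.example.org/rest"  # hypothetical host


def find_by_metadata_request(key, value, language=None, base=REST_BASE):
    """Build (but do not send) the POST request for an exact-match lookup."""
    body = json.dumps({"key": key, "value": value, "language": language})
    return Request(
        base + "/items/find-by-metadata-field",  # assumed endpoint path
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json",
                 "Accept": "application/json"},
        method="POST",
    )


def find_items(key, value, language=None):
    """Send the request; needs network access to the REST API."""
    with urlopen(find_by_metadata_request(key, value, language)) as resp:
        return json.load(resp)
```

For example, find_items("dc.title", "Test Webpage", "en_US") would ask
for items whose dc.title is exactly that string.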

So, I was just wondering if anyone has already accomplished something in
this area and has client integration examples. I suppose a cheap route
would be some DSpace-iframe solution, where your homepage or search page
just embeds the DSpace search page.

Thanks

Peter Dietz
Longsight
www.longsight.com
pe...@longsight.com
p: 740-599-5005 x809