[CODE4LIB] Anyone implementing common LIS applications on PaaS providers?

2012-03-29 Thread Erik Mitchell
Hi all,

I have been toying with the process of implementing common LIS
applications (e.g. Vufind, Dspace, Blacklight. .  .) on PaaS providers
like Heroku and Amazon Elastic Beanstalk.  I have just tried out of
the box distributions so far and have not made much progress but was
wondering if someone else had tried this or had ideas about what
issues I might run into.

Thanks,

Erik

Erik Mitchell
Assistant Professor
College of Information Studies
University of Maryland, College Park
http://ischool.umd.edu


Re: [CODE4LIB] Anyone implementing common LIS applications on PaaS providers?

2012-03-29 Thread Seth van Hooland
Dear Erik,

Bram Wiercx and myself have given a talk on how to put together a package to 
install CollectiveAccess on Red Hat's OpenShift: 
http://www.dish2011.nl/sessions/open-source-software-platform-collectiveacces-as-a-service-solution.

My students are currently happily playing around with CollectiveAccess, which 
they have installed on OpenShift. My teaching assistant Max De Wilde has 
developed clear guidelines on how to run the installation procedure: 
http://homepages.ulb.ac.be/~svhoolan/redhat_ca_install.pdf.

It would be wonderful to aggregate these kind of installation procedure's for 
other types of LIS applications...

Kind regards and looking forward to your book!

Seth van Hooland
Président du Master en Sciences et Technologies de l'Information et de la 
Communication (MaSTIC)
Université Libre de Bruxelles
Av. F.D. Roosevelt, 50 CP 123  | 1050 Bruxelles
http://homepages.ulb.ac.be/~svhoolan/
http://twitter.com/#!/sethvanhooland
http://mastic.ulb.ac.be
0032 2 650 4765
Office: DC11.113

Le 29 mars 2012 à 14:10, Erik Mitchell a écrit :

 Hi all,
 
 I have been toying with the process of implementing common LIS
 applications (e.g. Vufind, Dspace, Blacklight. .  .) on PaaS providers
 like Heroku and Amazon Elastic Beanstalk.  I have just tried out of
 the box distributions so far and have not made much progress but was
 wondering if someone else had tried this or had ideas about what
 issues I might run into.
 
 Thanks,
 
 Erik
 
 Erik Mitchell
 Assistant Professor
 College of Information Studies
 University of Maryland, College Park
 http://ischool.umd.edu


Re: [CODE4LIB] Anyone implementing common LIS applications on PaaS providers?

2012-03-29 Thread Rosalyn Metz
Erik,

I haven't tried it (recently) on PaaS providers, but I have on IaaS.  The
AMIs I've created in association with start up scripts (if you're
interested in seeing those let me know, I'd have to look for them somewhere
or other) mean that the application automagically starts up on its own, all
you need to do is go to the URL.  I've used this as a back up method in the
past and I think would be a great way for people to be able to play with
the different apps before committing.

To this end, I created an AMI for Blacklight a while back:
http://www.rosalynmetz.com/ami-3c10f255/  I guarantee you it is grossly out
of date.  I also have instructions on creating an EBS backed AMI:
http://rosalynmetz.com/ideas/2011/04/14/creating-an-ebs-backed-ami/ which
is the method I used for creating the Blacklight AMI. These instructions
are also fairly old, but I still get comments on my blog now and then that
the method works.

I also played around with it on Heroku, but that was so long ago I don't
think any of the things I learned still apply (this was when Heroku was
fairly new to the scene).  Hope some of this helps.

Rosalyn



On Thu, Mar 29, 2012 at 8:34 AM, Seth van Hooland svhoo...@ulb.ac.bewrote:

 Dear Erik,

 Bram Wiercx and myself have given a talk on how to put together a package
 to install CollectiveAccess on Red Hat's OpenShift:
 http://www.dish2011.nl/sessions/open-source-software-platform-collectiveacces-as-a-service-solution
 .

 My students are currently happily playing around with CollectiveAccess,
 which they have installed on OpenShift. My teaching assistant Max De Wilde
 has developed clear guidelines on how to run the installation procedure:
 http://homepages.ulb.ac.be/~svhoolan/redhat_ca_install.pdf.

 It would be wonderful to aggregate these kind of installation procedure's
 for other types of LIS applications...

 Kind regards and looking forward to your book!

 Seth van Hooland
 Président du Master en Sciences et Technologies de l'Information et de la
 Communication (MaSTIC)
 Université Libre de Bruxelles
 Av. F.D. Roosevelt, 50 CP 123  | 1050 Bruxelles
 http://homepages.ulb.ac.be/~svhoolan/
 http://twitter.com/#!/sethvanhooland
 http://mastic.ulb.ac.be
 0032 2 650 4765
 Office: DC11.113

 Le 29 mars 2012 à 14:10, Erik Mitchell a écrit :

  Hi all,
 
  I have been toying with the process of implementing common LIS
  applications (e.g. Vufind, Dspace, Blacklight. .  .) on PaaS providers
  like Heroku and Amazon Elastic Beanstalk.  I have just tried out of
  the box distributions so far and have not made much progress but was
  wondering if someone else had tried this or had ideas about what
  issues I might run into.
 
  Thanks,
 
  Erik
 
  Erik Mitchell
  Assistant Professor
  College of Information Studies
  University of Maryland, College Park
  http://ischool.umd.edu



Re: [CODE4LIB] Anyone implementing common LIS applications on PaaS providers?

2012-03-29 Thread Chris Fitzpatrick
Hi,

I've deployed Blacklight on both Heroku and Elastic BeanStalk.

Heroku is still a much better choice. The only issue I had was I
needed to make sure the sass-rails gem in installed in the :production
gem group and not just development.

 I still have an issue of getting heroku to compile all my
sass/coffeescript/etc assets on update, but it actually doesn't seem
to make much of an impact on performance. The minor issue is that it
would be nice to figure out a way to slim down BL's slug size. The
lowest I've been able to get it is about 30mb and Heroku recommends
having it be below 25mb.

I have not used Heroku's solr service (I still use EC2 for my solr
deployments).
EngineYard would also be another option.

There is also an AMI for DSpace, so deploying that to EC2 should be
pretty easy

b,chris.



On Thu, Mar 29, 2012 at 3:55 PM, Rosalyn Metz rosalynm...@gmail.com wrote:
 Erik,

 I haven't tried it (recently) on PaaS providers, but I have on IaaS.  The
 AMIs I've created in association with start up scripts (if you're
 interested in seeing those let me know, I'd have to look for them somewhere
 or other) mean that the application automagically starts up on its own, all
 you need to do is go to the URL.  I've used this as a back up method in the
 past and I think would be a great way for people to be able to play with
 the different apps before committing.

 To this end, I created an AMI for Blacklight a while back:
 http://www.rosalynmetz.com/ami-3c10f255/  I guarantee you it is grossly out
 of date.  I also have instructions on creating an EBS backed AMI:
 http://rosalynmetz.com/ideas/2011/04/14/creating-an-ebs-backed-ami/ which
 is the method I used for creating the Blacklight AMI. These instructions
 are also fairly old, but I still get comments on my blog now and then that
 the method works.

 I also played around with it on Heroku, but that was so long ago I don't
 think any of the things I learned still apply (this was when Heroku was
 fairly new to the scene).  Hope some of this helps.

 Rosalyn



 On Thu, Mar 29, 2012 at 8:34 AM, Seth van Hooland svhoo...@ulb.ac.bewrote:

 Dear Erik,

 Bram Wiercx and myself have given a talk on how to put together a package
 to install CollectiveAccess on Red Hat's OpenShift:
 http://www.dish2011.nl/sessions/open-source-software-platform-collectiveacces-as-a-service-solution
 .

 My students are currently happily playing around with CollectiveAccess,
 which they have installed on OpenShift. My teaching assistant Max De Wilde
 has developed clear guidelines on how to run the installation procedure:
 http://homepages.ulb.ac.be/~svhoolan/redhat_ca_install.pdf.

 It would be wonderful to aggregate these kind of installation procedure's
 for other types of LIS applications...

 Kind regards and looking forward to your book!

 Seth van Hooland
 Président du Master en Sciences et Technologies de l'Information et de la
 Communication (MaSTIC)
 Université Libre de Bruxelles
 Av. F.D. Roosevelt, 50 CP 123  | 1050 Bruxelles
 http://homepages.ulb.ac.be/~svhoolan/
 http://twitter.com/#!/sethvanhooland
 http://mastic.ulb.ac.be
 0032 2 650 4765
 Office: DC11.113

 Le 29 mars 2012 à 14:10, Erik Mitchell a écrit :

  Hi all,
 
  I have been toying with the process of implementing common LIS
  applications (e.g. Vufind, Dspace, Blacklight. .  .) on PaaS providers
  like Heroku and Amazon Elastic Beanstalk.  I have just tried out of
  the box distributions so far and have not made much progress but was
  wondering if someone else had tried this or had ideas about what
  issues I might run into.
 
  Thanks,
 
  Erik
 
  Erik Mitchell
  Assistant Professor
  College of Information Studies
  University of Maryland, College Park
  http://ischool.umd.edu



Re: [CODE4LIB] Anyone implementing common LIS applications on PaaS providers?

2012-03-29 Thread Sean Hannan
If you already have everything indexed in Solr elsewhere, a way to cut down
the BL slug size is to remove/ignore the SolrMarc.jar. It's pretty sizable.

-Sean


On 3/29/12 12:16 PM, Chris Fitzpatrick chrisfitz...@gmail.com wrote:

 Hi,
 
 I've deployed Blacklight on both Heroku and Elastic BeanStalk.
 
 Heroku is still a much better choice. The only issue I had was I
 needed to make sure the sass-rails gem in installed in the :production
 gem group and not just development.
 
  I still have an issue of getting heroku to compile all my
 sass/coffeescript/etc assets on update, but it actually doesn't seem
 to make much of an impact on performance. The minor issue is that it
 would be nice to figure out a way to slim down BL's slug size. The
 lowest I've been able to get it is about 30mb and Heroku recommends
 having it be below 25mb.
 
 I have not used Heroku's solr service (I still use EC2 for my solr
 deployments).
 EngineYard would also be another option.
 
 There is also an AMI for DSpace, so deploying that to EC2 should be
 pretty easy
 
 b,chris.
 
 
 
 On Thu, Mar 29, 2012 at 3:55 PM, Rosalyn Metz rosalynm...@gmail.com wrote:
 Erik,
 
 I haven't tried it (recently) on PaaS providers, but I have on IaaS.  The
 AMIs I've created in association with start up scripts (if you're
 interested in seeing those let me know, I'd have to look for them somewhere
 or other) mean that the application automagically starts up on its own, all
 you need to do is go to the URL.  I've used this as a back up method in the
 past and I think would be a great way for people to be able to play with
 the different apps before committing.
 
 To this end, I created an AMI for Blacklight a while back:
 http://www.rosalynmetz.com/ami-3c10f255/  I guarantee you it is grossly out
 of date.  I also have instructions on creating an EBS backed AMI:
 http://rosalynmetz.com/ideas/2011/04/14/creating-an-ebs-backed-ami/ which
 is the method I used for creating the Blacklight AMI. These instructions
 are also fairly old, but I still get comments on my blog now and then that
 the method works.
 
 I also played around with it on Heroku, but that was so long ago I don't
 think any of the things I learned still apply (this was when Heroku was
 fairly new to the scene).  Hope some of this helps.
 
 Rosalyn
 
 
 
 On Thu, Mar 29, 2012 at 8:34 AM, Seth van Hooland svhoo...@ulb.ac.bewrote:
 
 Dear Erik,
 
 Bram Wiercx and myself have given a talk on how to put together a package
 to install CollectiveAccess on Red Hat's OpenShift:
 http://www.dish2011.nl/sessions/open-source-software-platform-collectiveacce
 s-as-a-service-solution
 .
 
 My students are currently happily playing around with CollectiveAccess,
 which they have installed on OpenShift. My teaching assistant Max De Wilde
 has developed clear guidelines on how to run the installation procedure:
 http://homepages.ulb.ac.be/~svhoolan/redhat_ca_install.pdf.
 
 It would be wonderful to aggregate these kind of installation procedure's
 for other types of LIS applications...
 
 Kind regards and looking forward to your book!
 
 Seth van Hooland
 Président du Master en Sciences et Technologies de l'Information et de la
 Communication (MaSTIC)
 Université Libre de Bruxelles
 Av. F.D. Roosevelt, 50 CP 123  | 1050 Bruxelles
 http://homepages.ulb.ac.be/~svhoolan/
 http://twitter.com/#!/sethvanhooland
 http://mastic.ulb.ac.be
 0032 2 650 4765
 Office: DC11.113
 
 Le 29 mars 2012 à 14:10, Erik Mitchell a écrit :
 
 Hi all,
 
 I have been toying with the process of implementing common LIS
 applications (e.g. Vufind, Dspace, Blacklight. .  .) on PaaS providers
 like Heroku and Amazon Elastic Beanstalk.  I have just tried out of
 the box distributions so far and have not made much progress but was
 wondering if someone else had tried this or had ideas about what
 issues I might run into.
 
 Thanks,
 
 Erik
 
 Erik Mitchell
 Assistant Professor
 College of Information Studies
 University of Maryland, College Park
 http://ischool.umd.edu
 


Re: [CODE4LIB] Anyone implementing common LIS applications on PaaS providers?

2012-03-29 Thread Chris Fitzpatrick
Hey Sean,

Jah, I did that...my .slugignore is:
tmp/*
log/*
coverage/*
spec/*
koha/*
jetty/*

That dropped it down to 30 from ~50mb, so that's good .
(koha has some scripts wrote to pull from our ILS).

I think the slug size is a really minor issue. Heroku says under 25mb
is good, but over 50mb is not so good.  Not Good,  but not Chaotic
Evil . Neutral Good.



On Thu, Mar 29, 2012 at 6:26 PM, Sean Hannan shan...@jhu.edu wrote:
 If you already have everything indexed in Solr elsewhere, a way to cut down
 the BL slug size is to remove/ignore the SolrMarc.jar. It's pretty sizable.

 -Sean


 On 3/29/12 12:16 PM, Chris Fitzpatrick chrisfitz...@gmail.com wrote:

 Hi,

 I've deployed Blacklight on both Heroku and Elastic BeanStalk.

 Heroku is still a much better choice. The only issue I had was I
 needed to make sure the sass-rails gem in installed in the :production
 gem group and not just development.

  I still have an issue of getting heroku to compile all my
 sass/coffeescript/etc assets on update, but it actually doesn't seem
 to make much of an impact on performance. The minor issue is that it
 would be nice to figure out a way to slim down BL's slug size. The
 lowest I've been able to get it is about 30mb and Heroku recommends
 having it be below 25mb.

 I have not used Heroku's solr service (I still use EC2 for my solr
 deployments).
 EngineYard would also be another option.

 There is also an AMI for DSpace, so deploying that to EC2 should be
 pretty easy

 b,chris.



 On Thu, Mar 29, 2012 at 3:55 PM, Rosalyn Metz rosalynm...@gmail.com wrote:
 Erik,

 I haven't tried it (recently) on PaaS providers, but I have on IaaS.  The
 AMIs I've created in association with start up scripts (if you're
 interested in seeing those let me know, I'd have to look for them somewhere
 or other) mean that the application automagically starts up on its own, all
 you need to do is go to the URL.  I've used this as a back up method in the
 past and I think would be a great way for people to be able to play with
 the different apps before committing.

 To this end, I created an AMI for Blacklight a while back:
 http://www.rosalynmetz.com/ami-3c10f255/  I guarantee you it is grossly out
 of date.  I also have instructions on creating an EBS backed AMI:
 http://rosalynmetz.com/ideas/2011/04/14/creating-an-ebs-backed-ami/ which
 is the method I used for creating the Blacklight AMI. These instructions
 are also fairly old, but I still get comments on my blog now and then that
 the method works.

 I also played around with it on Heroku, but that was so long ago I don't
 think any of the things I learned still apply (this was when Heroku was
 fairly new to the scene).  Hope some of this helps.

 Rosalyn



 On Thu, Mar 29, 2012 at 8:34 AM, Seth van Hooland svhoo...@ulb.ac.bewrote:

 Dear Erik,

 Bram Wiercx and myself have given a talk on how to put together a package
 to install CollectiveAccess on Red Hat's OpenShift:
 http://www.dish2011.nl/sessions/open-source-software-platform-collectiveacce
 s-as-a-service-solution
 .

 My students are currently happily playing around with CollectiveAccess,
 which they have installed on OpenShift. My teaching assistant Max De Wilde
 has developed clear guidelines on how to run the installation procedure:
 http://homepages.ulb.ac.be/~svhoolan/redhat_ca_install.pdf.

 It would be wonderful to aggregate these kind of installation procedure's
 for other types of LIS applications...

 Kind regards and looking forward to your book!

 Seth van Hooland
 Président du Master en Sciences et Technologies de l'Information et de la
 Communication (MaSTIC)
 Université Libre de Bruxelles
 Av. F.D. Roosevelt, 50 CP 123  | 1050 Bruxelles
 http://homepages.ulb.ac.be/~svhoolan/
 http://twitter.com/#!/sethvanhooland
 http://mastic.ulb.ac.be
 0032 2 650 4765
 Office: DC11.113

 Le 29 mars 2012 à 14:10, Erik Mitchell a écrit :

 Hi all,

 I have been toying with the process of implementing common LIS
 applications (e.g. Vufind, Dspace, Blacklight. .  .) on PaaS providers
 like Heroku and Amazon Elastic Beanstalk.  I have just tried out of
 the box distributions so far and have not made much progress but was
 wondering if someone else had tried this or had ideas about what
 issues I might run into.

 Thanks,

 Erik

 Erik Mitchell
 Assistant Professor
 College of Information Studies
 University of Maryland, College Park
 http://ischool.umd.edu



Re: [CODE4LIB] Anyone implementing common LIS applications on PaaS providers?

2012-03-29 Thread Jeremy Nelson
Hi Erik,
While my FRBR-Redis datastore is not yet a common LIS application, I develop 
and run it on a Rackspace Linux VM with Django and Redis. I am in the process 
of migrating this stack over to Heroku for a Library App development workshop 
I'm leading in May.

Jeremy Nelson
Metadata and Systems Librarian
Tutt Library, Colorado College

-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Erik 
Mitchell
Sent: Thursday, March 29, 2012 6:11 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: [CODE4LIB] Anyone implementing common LIS applications on PaaS 
providers?

Hi all,

I have been toying with the process of implementing common LIS applications 
(e.g. Vufind, Dspace, Blacklight. .  .) on PaaS providers like Heroku and 
Amazon Elastic Beanstalk.  I have just tried out of the box distributions so 
far and have not made much progress but was wondering if someone else had tried 
this or had ideas about what issues I might run into.

Thanks,

Erik

Erik Mitchell
Assistant Professor
College of Information Studies
University of Maryland, College Park
http://ischool.umd.edu


Re: [CODE4LIB] Anyone implementing common LIS applications on PaaS providers?

2012-03-29 Thread Erik Mitchell
Chris - where did you deploy your SOLR instance and did that create
any issues for deployment (other than ignoring files)?

Erik

On Thu, Mar 29, 2012 at 12:37 PM, Chris Fitzpatrick
chrisfitz...@gmail.com wrote:
 Hey Sean,

 Jah, I did that...my .slugignore is:
 tmp/*
 log/*
 coverage/*
 spec/*
 koha/*
 jetty/*

 That dropped it down to 30 from ~50mb, so that's good .
 (koha has some scripts wrote to pull from our ILS).

 I think the slug size is a really minor issue. Heroku says under 25mb
 is good, but over 50mb is not so good.  Not Good,  but not Chaotic
 Evil . Neutral Good.



 On Thu, Mar 29, 2012 at 6:26 PM, Sean Hannan shan...@jhu.edu wrote:
 If you already have everything indexed in Solr elsewhere, a way to cut down
 the BL slug size is to remove/ignore the SolrMarc.jar. It's pretty sizable.

 -Sean


 On 3/29/12 12:16 PM, Chris Fitzpatrick chrisfitz...@gmail.com wrote:

 Hi,

 I've deployed Blacklight on both Heroku and Elastic BeanStalk.

 Heroku is still a much better choice. The only issue I had was I
 needed to make sure the sass-rails gem in installed in the :production
 gem group and not just development.

  I still have an issue of getting heroku to compile all my
 sass/coffeescript/etc assets on update, but it actually doesn't seem
 to make much of an impact on performance. The minor issue is that it
 would be nice to figure out a way to slim down BL's slug size. The
 lowest I've been able to get it is about 30mb and Heroku recommends
 having it be below 25mb.

 I have not used Heroku's solr service (I still use EC2 for my solr
 deployments).
 EngineYard would also be another option.

 There is also an AMI for DSpace, so deploying that to EC2 should be
 pretty easy

 b,chris.



 On Thu, Mar 29, 2012 at 3:55 PM, Rosalyn Metz rosalynm...@gmail.com wrote:
 Erik,

 I haven't tried it (recently) on PaaS providers, but I have on IaaS.  The
 AMIs I've created in association with start up scripts (if you're
 interested in seeing those let me know, I'd have to look for them somewhere
 or other) mean that the application automagically starts up on its own, all
 you need to do is go to the URL.  I've used this as a back up method in the
 past and I think would be a great way for people to be able to play with
 the different apps before committing.

 To this end, I created an AMI for Blacklight a while back:
 http://www.rosalynmetz.com/ami-3c10f255/  I guarantee you it is grossly out
 of date.  I also have instructions on creating an EBS backed AMI:
 http://rosalynmetz.com/ideas/2011/04/14/creating-an-ebs-backed-ami/ which
 is the method I used for creating the Blacklight AMI. These instructions
 are also fairly old, but I still get comments on my blog now and then that
 the method works.

 I also played around with it on Heroku, but that was so long ago I don't
 think any of the things I learned still apply (this was when Heroku was
 fairly new to the scene).  Hope some of this helps.

 Rosalyn



 On Thu, Mar 29, 2012 at 8:34 AM, Seth van Hooland 
 svhoo...@ulb.ac.bewrote:

 Dear Erik,

 Bram Wiercx and myself have given a talk on how to put together a package
 to install CollectiveAccess on Red Hat's OpenShift:
 http://www.dish2011.nl/sessions/open-source-software-platform-collectiveacce
 s-as-a-service-solution
 .

 My students are currently happily playing around with CollectiveAccess,
 which they have installed on OpenShift. My teaching assistant Max De Wilde
 has developed clear guidelines on how to run the installation procedure:
 http://homepages.ulb.ac.be/~svhoolan/redhat_ca_install.pdf.

 It would be wonderful to aggregate these kind of installation procedure's
 for other types of LIS applications...

 Kind regards and looking forward to your book!

 Seth van Hooland
 Président du Master en Sciences et Technologies de l'Information et de la
 Communication (MaSTIC)
 Université Libre de Bruxelles
 Av. F.D. Roosevelt, 50 CP 123  | 1050 Bruxelles
 http://homepages.ulb.ac.be/~svhoolan/
 http://twitter.com/#!/sethvanhooland
 http://mastic.ulb.ac.be
 0032 2 650 4765
 Office: DC11.113

 Le 29 mars 2012 à 14:10, Erik Mitchell a écrit :

 Hi all,

 I have been toying with the process of implementing common LIS
 applications (e.g. Vufind, Dspace, Blacklight. .  .) on PaaS providers
 like Heroku and Amazon Elastic Beanstalk.  I have just tried out of
 the box distributions so far and have not made much progress but was
 wondering if someone else had tried this or had ideas about what
 issues I might run into.

 Thanks,

 Erik

 Erik Mitchell
 Assistant Professor
 College of Information Studies
 University of Maryland, College Park
 http://ischool.umd.edu



[CODE4LIB] Strategies and use-cases library search box

2012-03-29 Thread Patrick Hochstenbach
Dear All

At Ghent Universty Library in Belgium we are re-evaluating our web strategy in 
several internal workshops with faculty librarians. The theme of our next 
meeting will be about discovery, and one-search-box integrated search 
solutions. We see these days a lot of vendors and cloud providing solutions 
that provide these features which addresses strategic goals of libraries. We 
are also interested in a methodological question: what are the underlying 
use-cases libraries are trying to solve for where a new suite of services 
provide a solution. Does you have insights you are willing to share. Searching 
for methodologies, principles of heuristics, statistics, research and teaching 
practices. 

Best from Belgium
Patrick

Skype: patrick.hochstenbach
Patrick Hochstenbach  Digital Architect
University Library+32(0)92647980
Ghent University * Rozier 9 * 9000 * Gent


Re: [CODE4LIB] Anyone implementing common LIS applications on PaaS providers?

2012-03-29 Thread Mark A. Matienzo
Like Chris, I've deployed Blacklight on Heroku, and this thread
(particularly Rosalyn's message) has gotten me to write up a quick
HOWTO on the Blacklight wiki [0].

For Solr hosting I've used both a VM that I run (on Slicehost) and EC2.

Mark

  [0] https://github.com/projectblacklight/blacklight/wiki/Blacklight-on-Heroku


Re: [CODE4LIB] Anyone implementing common LIS applications on PaaS providers?

2012-03-29 Thread Erik Mitchell
Neat!  Thanks Mark,

Erik

On Thu, Mar 29, 2012 at 2:19 PM, Mark A. Matienzo m...@matienzo.org wrote:
 Like Chris, I've deployed Blacklight on Heroku, and this thread
 (particularly Rosalyn's message) has gotten me to write up a quick
 HOWTO on the Blacklight wiki [0].

 For Solr hosting I've used both a VM that I run (on Slicehost) and EC2.

 Mark

  [0] https://github.com/projectblacklight/blacklight/wiki/Blacklight-on-Heroku


Re: [CODE4LIB] Anyone implementing common LIS applications on PaaS providers?

2012-03-29 Thread Jonathan Rochkind
Older 3.x versions of Blacklight may have put a solrmarc.jar inside your 
app's ./config/SolrMarc.  That may not be caught by your slug ignore.


This was an error, it was never meant to do that. If you have one in a 
BL 3.x you should be safe to remove it.


Other than that, I'm curious what's making a BL app so large!

Incidentally, you don't need a ./jetty in your local app _at all_, 
unless you actually want to keep a jetty Solr there. BL will optionally 
install one there, but it's not required.


(Does slug size include your gem dependencies? I am not familiar with 
heroku. Cause the BL gem itself _does_ also include a SolrMarc.jar, if 
that's a problem, we'd have to refactor things on the BL side to make it 
an optional dependency instead of baked into BL).


On 3/29/2012 12:37 PM, Chris Fitzpatrick wrote:

Hey Sean,

Jah, I did that...my .slugignore is:
tmp/*
log/*
coverage/*
spec/*
koha/*
jetty/*

That dropped it down to 30 from ~50mb, so that's good .
(koha has some scripts wrote to pull from our ILS).

I think the slug size is a really minor issue. Heroku says under 25mb
is good, but over 50mb is not so good.  Not Good,  but not Chaotic
Evil . Neutral Good.



On Thu, Mar 29, 2012 at 6:26 PM, Sean Hannanshan...@jhu.edu  wrote:

If you already have everything indexed in Solr elsewhere, a way to cut down
the BL slug size is to remove/ignore the SolrMarc.jar. It's pretty sizable.

-Sean


On 3/29/12 12:16 PM, Chris Fitzpatrickchrisfitz...@gmail.com  wrote:


Hi,

I've deployed Blacklight on both Heroku and Elastic BeanStalk.

Heroku is still a much better choice. The only issue I had was I
needed to make sure the sass-rails gem in installed in the :production
gem group and not just development.

  I still have an issue of getting heroku to compile all my
sass/coffeescript/etc assets on update, but it actually doesn't seem
to make much of an impact on performance. The minor issue is that it
would be nice to figure out a way to slim down BL's slug size. The
lowest I've been able to get it is about 30mb and Heroku recommends
having it be below 25mb.

I have not used Heroku's solr service (I still use EC2 for my solr
deployments).
EngineYard would also be another option.

There is also an AMI for DSpace, so deploying that to EC2 should be
pretty easy

b,chris.



On Thu, Mar 29, 2012 at 3:55 PM, Rosalyn Metzrosalynm...@gmail.com  wrote:

Erik,

I haven't tried it (recently) on PaaS providers, but I have on IaaS.  The
AMIs I've created in association with start up scripts (if you're
interested in seeing those let me know, I'd have to look for them somewhere
or other) mean that the application automagically starts up on its own, all
you need to do is go to the URL.  I've used this as a back up method in the
past and I think would be a great way for people to be able to play with
the different apps before committing.

To this end, I created an AMI for Blacklight a while back:
http://www.rosalynmetz.com/ami-3c10f255/  I guarantee you it is grossly out
of date.  I also have instructions on creating an EBS backed AMI:
http://rosalynmetz.com/ideas/2011/04/14/creating-an-ebs-backed-ami/ which
is the method I used for creating the Blacklight AMI. These instructions
are also fairly old, but I still get comments on my blog now and then that
the method works.

I also played around with it on Heroku, but that was so long ago I don't
think any of the things I learned still apply (this was when Heroku was
fairly new to the scene).  Hope some of this helps.

Rosalyn



On Thu, Mar 29, 2012 at 8:34 AM, Seth van Hoolandsvhoo...@ulb.ac.bewrote:


Dear Erik,

Bram Wiercx and myself have given a talk on how to put together a package
to install CollectiveAccess on Red Hat's OpenShift:
http://www.dish2011.nl/sessions/open-source-software-platform-collectiveacce
s-as-a-service-solution
.

My students are currently happily playing around with CollectiveAccess,
which they have installed on OpenShift. My teaching assistant Max De Wilde
has developed clear guidelines on how to run the installation procedure:
http://homepages.ulb.ac.be/~svhoolan/redhat_ca_install.pdf.

It would be wonderful to aggregate these kind of installation procedure's
for other types of LIS applications...

Kind regards and looking forward to your book!

Seth van Hooland
Président du Master en Sciences et Technologies de l'Information et de la
Communication (MaSTIC)
Université Libre de Bruxelles
Av. F.D. Roosevelt, 50 CP 123  | 1050 Bruxelles
http://homepages.ulb.ac.be/~svhoolan/
http://twitter.com/#!/sethvanhooland
http://mastic.ulb.ac.be
0032 2 650 4765
Office: DC11.113

Le 29 mars 2012 à 14:10, Erik Mitchell a écrit :


Hi all,

I have been toying with the process of implementing common LIS
applications (e.g. Vufind, Dspace, Blacklight. .  .) on PaaS providers
like Heroku and Amazon Elastic Beanstalk.  I have just tried out of
the box distributions so far and have not made much progress but was
wondering if someone else had tried this or had ideas 

Re: [CODE4LIB] Anyone implementing common LIS applications on PaaS providers?

2012-03-29 Thread Chris Fitzpatrick
Hey Erik,

I used this AMI for solr =
http://www.lucidimagination.com/blog/2010/02/01/solr-shines-through-the-cloud-lucidworks-solr-on-ec2/

Note : You will have to change the schema and solrconfig files on the image...

b,chris.


On Thu, Mar 29, 2012 at 9:44 PM, Erik Mitchell mitch...@gmail.com wrote:
 Neat!  Thanks Mark,

 Erik

 On Thu, Mar 29, 2012 at 2:19 PM, Mark A. Matienzo m...@matienzo.org wrote:
 Like Chris, I've deployed Blacklight on Heroku, and this thread
 (particularly Rosalyn's message) has gotten me to write up a quick
 HOWTO on the Blacklight wiki [0].

 For Solr hosting I've used both a VM that I run (on Slicehost) and EC2.

 Mark

  [0] 
 https://github.com/projectblacklight/blacklight/wiki/Blacklight-on-Heroku


Re: [CODE4LIB] Anyone implementing common LIS applications on PaaS providers?

2012-03-29 Thread Chris Fitzpatrick
Oh, yeah...Mark's wiki page also points out that you'll need to change
to postgres for production. Very important.

From what I understand, the compiled slug is the app + all it's
dependancies. So, yeah,  I guess since the gem does have the
SolrMarc.jar in there, it would have the jar in the slug somewhere

However, I did just see a page on heroku that says:
Generally speaking any slug under 15MB is small and nimble; 30MB is
average; and 50MB or above is weighty.

So maybe I was being slightly over sensitive to my slug's size? I was
just worried all the other slugs would call it names.

I am going to try and figure out why my pipeline assets aren't
compiling when I push. The solution is probably just to compile them
locally and push them rather than rely on Heroku to precompile them
(currently when I push, Heroku's precompile fails, so it reverts to
compile at runtime mode) if anyone has insight into this, please
lemme know...I believe having them compile at runtime does slow down
the application...

b,chris.

On Thu, Mar 29, 2012 at 10:49 PM, Chris Fitzpatrick
chrisfitz...@gmail.com wrote:
 Hey Erik,

 I used this AMI for solr =
 http://www.lucidimagination.com/blog/2010/02/01/solr-shines-through-the-cloud-lucidworks-solr-on-ec2/

 Note : You will have to change the schema and solrconfig files on the image...

 b,chris.


 On Thu, Mar 29, 2012 at 9:44 PM, Erik Mitchell mitch...@gmail.com wrote:
 Neat!  Thanks Mark,

 Erik

 On Thu, Mar 29, 2012 at 2:19 PM, Mark A. Matienzo m...@matienzo.org wrote:
 Like Chris, I've deployed Blacklight on Heroku, and this thread
 (particularly Rosalyn's message) has gotten me to write up a quick
 HOWTO on the Blacklight wiki [0].

 For Solr hosting I've used both a VM that I run (on Slicehost) and EC2.

 Mark

  [0] 
 https://github.com/projectblacklight/blacklight/wiki/Blacklight-on-Heroku


Re: [CODE4LIB] Anyone implementing common LIS applications on PaaS providers?

2012-03-29 Thread Jonathan Rochkind

On 3/29/2012 5:05 PM, Chris Fitzpatrick wrote:

locally and push them rather than rely on Heroku to precompile them
(currently when I push, Heroku's precompile fails, so it reverts to
compile at runtime mode) if anyone has insight into this, please
lemme know...I believe having them compile at runtime does slow down
the application...


Have no idea why it's not working in heroku, no experience with heroku 
(although I'm familiar with the concept).


But compile at runtime _will_ slow down your app, yeah. Here's a 
stackoverflow I asked on it myself:


http://stackoverflow.com/questions/8821864/config-assets-compile-true-in-rails-production-why-not

Compiling locally and then pushing should work, and is arguably better 
in some ways (why waste cycles on the production machine compiling 
assets?)  But, if you choose to compile and check into your source 
control repo,  here's a trick that will keep it from driving you crazy 
in development using your on-disk compiled assets... eh, I can't find 
the blog post on google now, but it's something like changing 
config.assets.path = /dev-assets in environments/development.rb, so in 
development it will ignore your on disk compiled assets.


Re: [CODE4LIB] presenting merged records?

2012-03-29 Thread Peter Noerr
Hi Graham,

What we do in our federated search system, and have been doing for some few 
years, is basically give the designer a choice of what options the user gets 
for de-duped records.

Firstly de-duping can be of a number of levels of sophistication, and a many of 
them lead to the situation you have - records which are similar rather than 
identical. On the web search side of things there are a surprising number of 
real duplicates (well maybe not surprising if you study more than one page of 
web search engine results), and on Twitter the duplicates well outnumber the 
original posts (many thanks 're-tweet').

Where we get duplicate records the usual options are: 1) keep the first and 
just drop all the rest. 2) keep the largest (assumed to have the most 
information) and drop the rest. These work well for WSE results where they are 
all almost identical (the differences often are just in the advertising 
attached to the pages and the results), but not for bibliographic records.

Less draconian is 3) Mark all the duplicates and keep them in the list (so you 
get 1, 2, 3, 4, 5, 5.1, 5.2, 5.3, 6, ...). This groups all the similar records 
together under the sort key of the first one, and does enable the user to 
easily skip them.

More user friendly is 4) Mark all duplicates and hide them in a sub-list 
attached to the head record. This gets them out of the main display, but 
allows the user who is interested in that record to expand the list and see 
the variants. This could be of use to you.

After that we planned to do what you are proposing and actually merge record 
content into a single virtual record, and worked on algorithms to do it. But 
nobody was interested. All our partners (who provide systems to lots of 
libraries, both public, academic, and special) decided that it would confuse 
their users more than it would help. I have my doubts, but they spoke and we 
put the development on ice.

I'm not sure this will help, but it has stood the test of time, and is well 
used in its various guises. Since no-one else seems interested in this topic, 
you could email me off list and we could discuss what we worked through in the 
way of algorithms, etc.

Peter


 -Original Message-
 From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of graham
 Sent: Wednesday, March 28, 2012 8:05 AM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] presenting merged records?
 
 Hi Michael
 
 On 03/27/12 11:50, Michael Hopwood wrote:
  Hi Graham, do I know you from RHUL?
 
 Yes indeed :-)
 
  My thoughts on merged records would be:
 
  1. don't do it - use separate IDs and just present links between related 
  manifestations; thus
 avoiding potential confusions.
 
 In my case, I can't avoid it as it's a specific requirement: I'm doing a 
 federated search across a
 large number of libraries, and if closely similar items aren't merged, the 
 results become excessively
 large and repetitive. I'm merging all the similar items, displaying a summary 
 of the merged
 bibliographic data, and providing links to each of the libraries with a copy. 
  So it's not really
 FRBRization in the normal sense, I just thought that FRBRization would lead 
 to similar problems, so
 that there might be some well-known discussion of the issues around... The 
 merger of the records does
 have advantages, especially if some libraries have very underpopulated 
 records (eg subject fields).
 
 Cheers
 Graham
 
 
  http://www.bic.org.uk/files/pdfs/identification-digibook.pdf
 
  possible relationships - see 
  http://www.editeur.org/ONIX/book/codelists/current.html - lists 51
 (manifestation)and 164 (work).
 
  2. c.f. the way Amazon displays rough and ready categories (paperback,
  hardback, audiobooks, *ahem* ebooks of some sort...)
 
  On dissection and reconstitution of records - there is a lot of talk going 
  on about RDFizing MaRC
 records and re-using in various ways, e.g.:
 
  http://www.slideshare.net/JenniferBowen/moving-library-metadata-toward
  -linked-data-opportunities-provided-by-the-extensible-catalog
 
  Cheers,
 
  Michael
 
  -Original Message-
  From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf
  Of graham
  Sent: 27 March 2012 11:06
  To: CODE4LIB@LISTSERV.ND.EDU
  Subject: [CODE4LIB] presenting merged records?
 
  Hi
 
  There seems to be a general trend to presenting merged records to users, as 
  part of the move towards
 FRBRization. If records need merging this generally means they weren't 
 totally identical to start with,
 so you can end up with conflicting bibliographic data to display.
 
  Two examples I've come across with this: Summon can merge
  print/electronic versions of texts, so uses a new 'merged' material
  type of 'book/ebook' (it doesn't yet seem to have all the other
  possible permutations, eg book/audiobook). Pazpar2 (which I'm working
  with at the
  moment) has a merge option for publication dates which presents dates as a 
  period eg