Re: [CODE4LIB] Solr for Internal Searching

2008-08-06 Thread Cloutman, David
I think for my purposes, which it to index our processes and procedures
manual and eventually our new 100 page web site, a search appliance is
overkill. I'm going to try both Nutch and Swish-e, and see which works
better. They both seem like perfectly suitable options for my needs.
Actually, both programs suggested probably could do our entire County's
intranet to be quite honest. It might be worth setting up an index for
that as well. :)

Anyhow, thanks for the guidance, everyone. 

- David

---
David Cloutman <[EMAIL PROTECTED]>
Electronic Services Librarian
Marin County Free Library 

-Original Message-
From: Code for Libraries [mailto:[EMAIL PROTECTED] On Behalf Of
Nate Vack
Sent: Tuesday, August 05, 2008 8:25 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Solr for Internal Searching


I know this is code4lib, not buystuff4lib, but the Google Mini is
reputed to be rather quick, bulletproof and configurable, and starts
at $3k. For example, it works nicely with lots of file formats
(including Office documents) out of the box. And works with LDAP and
NTLM for authentication and authorization.

I suspect it'll probably be challenging to deliver a quality search
solution for a lower total cost.

Of course, this all depends on what your intranet looks like on the
inside. I've seen 'intranet' mean a things that would call for wildly
different search solutions.

So... solr is great, but this question doesn't contain nearly enough
information to answer whether it's a good fit for your task at hand.

Cheers
-Nate

On Tue, Aug 5, 2008 at 6:03 PM, Cloutman, David
<[EMAIL PROTECTED]> wrote:
> Today my boss asked me to come up with a solution that would let us
> index and search our intranet. I was already thinking of using Solr on
> our public Web site we are building, and thought this might be a good
> opportunity to knock two items off the to-do list with the same
> technology. I know there was a preconference session on Solr this
year,
> and I have the sense that this is gaining traction in the library
> community. Is there any reason why I shouldn't do this?
>
> Thanks,
>
> - David
>
>
>
> ---
> David Cloutman <[EMAIL PROTECTED]>
> Electronic Services Librarian
> Marin County Free Library
>
> Email Disclaimer:
http://www.co.marin.ca.us/nav/misc/EmailDisclaimer.cfm
>


[CODE4LIB] Job Opening: Programmer/Analyst on APIS VI grant (New York University)

2008-08-06 Thread j.g. pawletko
New York University
Programmer/Analyst (7421BR)
 
New York University¹s Division of the Libraries seeks a Programmer/Analyst
to work on the Advanced Papyrological Information System (APIS) project:
http://www.columbia.edu/cu/lweb/projects/digital/apis/index.html. APIS is a
web-based, searchable database of information about, and images of,
papyrological materials (e.g. papyri, ostraca, wood tablets) located in
collections around the world.

One component of APIS is known as the Papyrological Navigator (PN):
http://www.columbia.edu/cu/lweb/digital/pn/about.html. The PN is a custom
web application capable of searching both metadata and texts and displaying
metadata records, texts, and images pulled from various source datasets. The
programmer/analyst will bring the PN into a robust production-ready state,
conduct formal usability testing and make improvements based on the results,
revise portions of the APIS infrastructure to enhance PN integration, load
additional data and images into the system, and work to expand the APIS
long-term digital preservation repository.
 
This is a grant-funded position and is available for 2 years.
 
Candidates should have the following skills:
* Bachelor's degree in computer or information science and 3 years of
relevant experience or equivalent combination
* Must include experience developing applications using Java
* Demonstrated knowledge of Java, Tomcat, Saxon, Lucene, Apache, SQL, XML,
XSLT 
* Experience with metadata standards (e.g. TEI, EpiDoc)
* Experience working in a Unix/Linux environments
* Preferred: Experience with image serving software (eRez/FSI), Java
Portlets, Apache Jetspeed-2, and Velocity templates.  Experience designing,
building, and deploying distributed systems.
* Excellent communication and analytical skills
 
Applicants should submit resume and cover letter, which reflects how
applicant¹s education and experience match the job requirements.
 
Please apply through NYU's application management system:
www.nyu.edu/hr/jobs/apply .
At this page click on "External Applicants" then "Search Openings." Type
7421BR in the "Keyword Search" field and select search.   NYU offers a
generous benefit package including 22 days of vacation annually.  NYU is an
Equal Opportunity/Affirmative Action Employer.
 
New York University Libraries:  Library facilities at New York University
serve the school¹s 40,000 students and faculty and contain more than 4
million volumes.  New York University is a member of the Association of
Research Libraries, the Research Libraries Group, the Digital Library
Federation; serves as the administrative headquarters of the Research
Library Association of South Manhattan, a consortium that includes three
academic institutions. The Library¹s website URL is http://library.nyu.edu


Re: [CODE4LIB] Solr for Internal Searching

2008-08-06 Thread Jason Stirnaman
I haven't used it, but there is an AJAX extension for GSA and Mini that
does faceting - they refer to it as "parametric" search:
http://code.google.com/p/parametric/
We use the Mini on our university web site, but not for intranet (yet).
 
Jason
-- 

Jason Stirnaman
Digital Projects Librarian/School of Medicine Support
A.R. Dykes Library, University of Kansas Medical Center
[EMAIL PROTECTED]
913-588-7319


>>> On 8/6/2008 at 4:00 AM, in message
<[EMAIL PROTECTED]>, "Stephens,
Owen"
<[EMAIL PROTECTED]> wrote:
> Google Mini didn't facet when I used it (about a year ago). It is
very
> simple to setup, and required very little maintenance.
> 
> The Mini had some restrictions compared to the Google Appliance, if
> these apply it would be worth looking at the differences - the
Appliance
> certainly was significantly more expensive than the Mini.
> 
> Overall I'd recommend the Mini if you want something cheap, very
quick
> to get going, with a brand that users will recognise and (generally)
> trust, and you are happy to sacrifice some flexibility for these
> features.
> 
> Owen
> 
> Owen Stephens
> Assistant Director: eStrategy and Information Resources
> Central Library
> Imperial College London
> South Kensington Campus
> London
> SW7 2AZ
>  
> t: +44 (0)20 7594 8829
> e: [EMAIL PROTECTED] 
>> -Original Message-
>> From: Code for Libraries [mailto:[EMAIL PROTECTED] On
Behalf
> Of
>> Tim Spalding
>> Sent: 06 August 2008 05:08
>> To: CODE4LIB@LISTSERV.ND.EDU 
>> Subject: Re: [CODE4LIB] Solr for Internal Searching
>> 
>> Does Google Mini facet? It seems to have a concept of collections,
but
>> does it facet by them?
>> 
>> T
>> 
>> On Wed, Aug 6, 2008 at 12:05 AM, Bill Dueber <[EMAIL PROTECTED]>
wrote:
>> > At UMich, we use space on a Google Appliance as our site search
>> > (different setups for internal vs. public pages) and have been
> pretty
>> > happy. I've been able to abuse the "google ads" space to our
benefit
>> > -- e.g., go to http://lib.umich.edu/ and search "Web Pages" for
>> "grad"
>> > (get today's hours) or 'dueber' (find me).
>> >
>> > On Tue, Aug 5, 2008 at 11:24 PM, Nate Vack <[EMAIL PROTECTED]>
wrote:
>> >> I know this is code4lib, not buystuff4lib, but the Google Mini
is
>> >> reputed to be rather quick, bulletproof and configurable, and
> starts
>> >> at $3k. For example, it works nicely with lots of file formats
>> >> (including Office documents) out of the box. And works with LDAP
> and
>> >> NTLM for authentication and authorization.
>> >>
>> >> I suspect it'll probably be challenging to deliver a quality
search
>> >> solution for a lower total cost.
>> >>
>> >> Of course, this all depends on what your intranet looks like on
the
>> >> inside. I've seen 'intranet' mean a things that would call for
>> wildly
>> >> different search solutions.
>> >>
>> >> So... solr is great, but this question doesn't contain nearly
> enough
>> >> information to answer whether it's a good fit for your task at
> hand.
>> >>
>> >> Cheers
>> >> -Nate
>> >>
>> >> On Tue, Aug 5, 2008 at 6:03 PM, Cloutman, David
>> >> <[EMAIL PROTECTED]> wrote:
>> >>> Today my boss asked me to come up with a solution that would
let
> us
>> >>> index and search our intranet. I was already thinking of using
> Solr
>> on
>> >>> our public Web site we are building, and thought this might be
a
>> good
>> >>> opportunity to knock two items off the to-do list with the same
>> >>> technology. I know there was a preconference session on Solr
this
>> year,
>> >>> and I have the sense that this is gaining traction in the
library
>> >>> community. Is there any reason why I shouldn't do this?
>> >>>
>> >>> Thanks,
>> >>>
>> >>> - David
>> >>>
>> >>>
>> >>>
>> >>> ---
>> >>> David Cloutman <[EMAIL PROTECTED]>
>> >>> Electronic Services Librarian
>> >>> Marin County Free Library
>> >>>
>> >>> Email Disclaimer:
>> http://www.co.marin.ca.us/nav/misc/EmailDisclaimer.cfm 
>> >>>
>> >>
>> >
>> >
>> >
>> > --
>> > Bill Dueber
>> > Library Systems Programmer
>> > University of Michigan Library
>> >
>> 
>> 
>> 
>> --
>> Check out my library at
> http://www.librarything.com/profile/timspalding


Re: [CODE4LIB] Solr for Internal Searching

2008-08-06 Thread Stephens, Owen
Google Mini didn't facet when I used it (about a year ago). It is very
simple to setup, and required very little maintenance.

The Mini had some restrictions compared to the Google Appliance, if
these apply it would be worth looking at the differences - the Appliance
certainly was significantly more expensive than the Mini.

Overall I'd recommend the Mini if you want something cheap, very quick
to get going, with a brand that users will recognise and (generally)
trust, and you are happy to sacrifice some flexibility for these
features.

Owen

Owen Stephens
Assistant Director: eStrategy and Information Resources
Central Library
Imperial College London
South Kensington Campus
London
SW7 2AZ
 
t: +44 (0)20 7594 8829
e: [EMAIL PROTECTED]
> -Original Message-
> From: Code for Libraries [mailto:[EMAIL PROTECTED] On Behalf
Of
> Tim Spalding
> Sent: 06 August 2008 05:08
> To: CODE4LIB@LISTSERV.ND.EDU
> Subject: Re: [CODE4LIB] Solr for Internal Searching
> 
> Does Google Mini facet? It seems to have a concept of collections, but
> does it facet by them?
> 
> T
> 
> On Wed, Aug 6, 2008 at 12:05 AM, Bill Dueber <[EMAIL PROTECTED]> wrote:
> > At UMich, we use space on a Google Appliance as our site search
> > (different setups for internal vs. public pages) and have been
pretty
> > happy. I've been able to abuse the "google ads" space to our benefit
> > -- e.g., go to http://lib.umich.edu/ and search "Web Pages" for
> "grad"
> > (get today's hours) or 'dueber' (find me).
> >
> > On Tue, Aug 5, 2008 at 11:24 PM, Nate Vack <[EMAIL PROTECTED]> wrote:
> >> I know this is code4lib, not buystuff4lib, but the Google Mini is
> >> reputed to be rather quick, bulletproof and configurable, and
starts
> >> at $3k. For example, it works nicely with lots of file formats
> >> (including Office documents) out of the box. And works with LDAP
and
> >> NTLM for authentication and authorization.
> >>
> >> I suspect it'll probably be challenging to deliver a quality search
> >> solution for a lower total cost.
> >>
> >> Of course, this all depends on what your intranet looks like on the
> >> inside. I've seen 'intranet' mean a things that would call for
> wildly
> >> different search solutions.
> >>
> >> So... solr is great, but this question doesn't contain nearly
enough
> >> information to answer whether it's a good fit for your task at
hand.
> >>
> >> Cheers
> >> -Nate
> >>
> >> On Tue, Aug 5, 2008 at 6:03 PM, Cloutman, David
> >> <[EMAIL PROTECTED]> wrote:
> >>> Today my boss asked me to come up with a solution that would let
us
> >>> index and search our intranet. I was already thinking of using
Solr
> on
> >>> our public Web site we are building, and thought this might be a
> good
> >>> opportunity to knock two items off the to-do list with the same
> >>> technology. I know there was a preconference session on Solr this
> year,
> >>> and I have the sense that this is gaining traction in the library
> >>> community. Is there any reason why I shouldn't do this?
> >>>
> >>> Thanks,
> >>>
> >>> - David
> >>>
> >>>
> >>>
> >>> ---
> >>> David Cloutman <[EMAIL PROTECTED]>
> >>> Electronic Services Librarian
> >>> Marin County Free Library
> >>>
> >>> Email Disclaimer:
> http://www.co.marin.ca.us/nav/misc/EmailDisclaimer.cfm
> >>>
> >>
> >
> >
> >
> > --
> > Bill Dueber
> > Library Systems Programmer
> > University of Michigan Library
> >
> 
> 
> 
> --
> Check out my library at
http://www.librarything.com/profile/timspalding