[email protected] wrote on 5/6/12 1:54 AM:
> I have used Nutch for the last couple of years mostly to maintain an index
> and search website. I am looking however to start looking at using Lucy
> mostly because of the Perl interface. I wanted to find out if Nutch indexes
> will work with Lucy since they are both extended from the Lucene
> project. From what I can see, Lucy does not include the crawling/fetching
> features of Nutch, but my new site is using all Perl with Catalyst MVC. I
> want to move away from maintaining a web server and a Java servlet
> container.

Hi Jerry,

It would be more accurate to say that Lucy is "inspired by" Lucene rather than
derived from or based on it. Unlike Plucene, CLucene, or the other ports,
Lucy has never tried to be index-compatible with Lucene, so Nutch indexes will
not work with Lucy. Only the class structure and some of the architectural
design are similar to Lucene. Hence the 'loose' in Lucy's description as a
"loose port" of Lucene.

Swish3[0] -- which is written all in Perl -- provides some of the features of
Nutch, notably a web crawler and document conversion (.pdf, .doc, .xls, etc.).

There is a Lucy backend[1] for Swish3.

The Dezi[2] platform gives a REST interface to Swish3 indexes.

Here's an example:

% swish3 -S spider -F lucy -i http://www.peknet.com/ -f dezi.index
% dezi &
% dezi-client -q peknet
--
 uri: http://www.peknet.com/
 title: <b class="h">peknet</b> :: an eddy in the bit stream
 score: 91
========================================
       hits: 1
search time: 0.06957
 build time: 0.14598
      query: peknet
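
Since Dezi exposes the index over HTTP, the same search can in principle be
issued from any HTTP client rather than dezi-client. The following is only a
rough sketch: the helper name dezi_search_url is hypothetical, and the base URL
(http://localhost:5000), the /search path, and the q parameter are assumptions
about a default local setup that should be checked against the Dezi
documentation for your installed version.

```shell
#!/bin/sh
# Hypothetical helper: build a search URL for a locally running Dezi server.
# ASSUMPTION: the server listens on http://localhost:5000 and exposes a
# /search endpoint that takes the query in a "q" parameter.
DEZI_BASE="${DEZI_BASE:-http://localhost:5000}"

dezi_search_url() {
    # Only spaces are encoded here; use a real URL encoder for arbitrary input.
    query=$(printf '%s' "$1" | sed 's/ /%20/g')
    printf '%s/search?q=%s\n' "$DEZI_BASE" "$query"
}

# Example (needs the server started with `dezi &` to be running):
# curl -s "$(dezi_search_url peknet)"
```

Keeping the URL construction in one small function means a Catalyst
application or a cron job can point at a different host by setting DEZI_BASE,
without touching the calling code.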



[0] http://swish-e.org/swish3/
[1] https://metacpan.org/module/SWISH::Prog::Lucy
[2] http://dezi.org/

-- 
Peter Karman  .  http://peknet.com/  .  [email protected]
