: and abandon moving to Perl/Catalyst. It is too bad that all of the Lucene : based projects are disjointed islands and you are forced to basically stick : with only one technology once you start.
a) I'm not sure what you mean by "disjointed" .. any project using Lucene Core has an index that should be readable by any other project using Lucene Core -- there will just be some caveats you have to keep in mind (ie: if you want to use Solr with an index you build elsewhere, you have to configure Solr with an appropraite schema.xml) b) Lucy doesn't use Lucene Core - so it's a completley seperate thing c) The current versions of Nutch, last i heard, do not even build indexes directly using the Lucene Core -- instead nutch focuses on the crawling, and then uses an indexer to push the crawled/parsed documents to Solr for searching -- so you could probably use Nutch with Lucy very easily by hooking into that pipeline. Either replace the "SolrIndexer" in nutch with something that writes directly to a Lucy index, or use the "SolrIndexer" as is and write a little app that emulates the Solr HTTP interface and writes to Lucy... http://wiki.apache.org/nutch/bin/nutch%20solrindex -Hoss
