Re: Differences between Nutch and Solr

Jasper Kamperman Wed, 22 Oct 2008 09:37:22 -0700

Don't pin me down on the details but AFAIK there are many othersignificant differences between Nutch and Solr. At a high level thisis my understanding:

Solr is really meant for enterprise search where there is morestructured data and it makes sense to enforce a schema on at leastpart of the index, so you can do better sorting (e.g. by date or partnumber). I also believe it has a more controlled way of dealing withcaches, warming etc.

Nutch has a rich set of plugins that is geared to analyzingunstructured data including office documents etc. There is a patchfor SOLR to achieve this as well but as far as I can see it is notyet in the main line.

But at a high level, if you want to use Solr for general webindexing, you'll need a spider/crawler from some other place.


On Oct 22, 2008, at 4:50 AM, John Martyniak wrote:

Are the main differences between Nutch and Solr, that Solr doesn'thave a spider. So in order to use it you would have to spider theweb your self, or with some other tool?
-John

Re: Differences between Nutch and Solr

Reply via email to