Yes, Nutch works quite well as a crawler for Solr.
----- Original Message -----
From: Tony Wang <ivyt...@gmail.com>
To: solr-user@lucene.apache.org
Sent: Thursday, March 5, 2009 5:32:57 PM GMT -06:00 US/Canada Central
Subject: what crawler do you use for Solr indexing?
Hi,
I wonder if there's any
I'm wondering, is there some way (out of the box) to tell Solr that
we're only interested in indexing certain parts of a page? For example,
let's say I have a bunch of pages in my site that contain some common
navigation elements, roughly like this:
<html>
<head><title></title></head>
<body>
<div>