I want to run Nutch on a set of local files that will also be served over HTTP from the same machine. I'd rather avoid the overhead of fetching the files over HTTP just to index them, and then keeping a local cached copy.
What's the best way to do this? Failing that, pointers into the source
code would be appreciated. :)
/r$
--
STSM, Senior Security Architect
DataPower SOA Appliances
http://www.ibm.com/software/integration/datapower/
