Although i haven't crawled the local file system, i have no doubt this will just work as expected. Make just your url filters won't filter out your HTTP url's and you might need to check your http.* parameters in your Nutch configuration, although they just work out of the box.
On Wednesday, October 06, 2010 03:46:00 pm webdev1977 wrote: > That is a very good question! I am currently only crawling my local file > system, but am about to add an http url, I would love to know the answer. > > Have you given it a try yet? -- Markus Jelsma - CTO - Openindex http://www.linkedin.com/in/markus17 050-8536600 / 06-50258350

