I've recently just installed and configured Nutch from source. From what I've read by default, Nutch will parse text and html based documents only. I have a site I'm trying to crawl which is all asp pages. I put the asp mime type in the mime-type.xml document. What else do I need to do in order for Nutch to crawl asp pages?
Thanks, Seth [EMAIL PROTECTED]
