Hi,

I posted to the user list but didn't get a reply. I want to crawl a protected site, but there doesn't seem to be an option for that in Nutch at the moment.

However, it doesn't sound like something that would be too hard to add, assuming the java http client library can handle that. As I'm not familiar with the code, could someone point me at the file (or files) in the source which do the crawling please? I'm not professing to be a top Java programmer (perl's my speciality) but I'll give it a shot, unless anyone else wants to?!

Many thanks,

Ed.




-------------------------------------------------------
SF.Net email is Sponsored by the Better Software Conference & EXPO
September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to