On Tue, May 6, 2008 at 10:47 PM, Duan, Niu <[EMAIL PROTECTED]> wrote:
> Looks like Nutch doesn't support form-based authentication out of the box.
> You may have to create your own httpclient or modify it for dealing with
> form-based authentication. Form-based authentication requires dedicated
> input parameters (j_username, j_password) to be placed in the initial
> request message sent to the server. Once authenticated, a cookie named
> jsessionid is going to be used to track the user session.

Thank you Nick. What I'm actually looking for is a little different. My server uses a custom cookie name and value to indicate an authenticated user. I have this cookie (a valid version thereof, and let's assume for now I've gotten past expiration issues) in a text file.

How do I tell Nutch's crawler to include a cookie name and value with each HTTP request?

Yoav
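
To make the request concrete, here is a minimal sketch, in plain Java and outside Nutch, of the behaviour being asked about: read a cookie from a text file and attach it to an outgoing request. It does not use any Nutch API and does not show how Nutch itself would be configured. The class name `CookieRequestSketch`, the helper names, and the one-line `name=value` file format are all assumptions made for illustration.

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.URI;
import java.net.URL;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical illustration only; not part of Nutch.
public class CookieRequestSketch {

    // Turns one "name=value" line (assumed file format) into a Cookie header value.
    static String toCookieHeader(String line) {
        String trimmed = line.trim();
        int eq = trimmed.indexOf('=');
        if (eq <= 0) {
            throw new IllegalArgumentException("expected name=value, got: " + trimmed);
        }
        String name = trimmed.substring(0, eq).trim();
        String value = trimmed.substring(eq + 1).trim();
        return name + "=" + value;
    }

    // Opens a connection that will send the given cookie with the request.
    static HttpURLConnection openWithCookie(URL url, String cookieHeader) throws IOException {
        HttpURLConnection conn = (HttpURLConnection) url.openConnection();
        conn.setRequestProperty("Cookie", cookieHeader);
        return conn;
    }

    // Usage: java CookieRequestSketch <cookie-file> <url>
    public static void main(String[] args) throws IOException {
        // Only the first line of the cookie file is used.
        String line = Files.readAllLines(Paths.get(args[0]), StandardCharsets.UTF_8).get(0);
        URL url = URI.create(args[1]).toURL();
        HttpURLConnection conn = openWithCookie(url, toCookieHeader(line));
        System.out.println("HTTP status: " + conn.getResponseCode());
        conn.disconnect();
    }
}
```

A crawler would need to apply the same header to every fetch, which is the part that has to happen inside Nutch's HTTP protocol code rather than in a standalone program like this one.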
