On Tue, May 6, 2008 at 10:47 PM, Duan, Niu <[EMAIL PROTECTED]> wrote:
> Looks like Nutch doesn't support form-based authentication out of the box.  
> You may have to create your own httpclient or modify it for >dealing with 
> form-based authentication.  Form-based authentication requires dedicated 
> input parameters (j_username, j_password) to be >placed in the initial 
> request message sent to the server.  Once authenticated, a cookie named 
> jsessionid is going to be used to track the >user session.

Thank you Nick.

What I'm actually looking for is a little different.  My server uses a
custom cookie name and value to indicate an authenticated user.  I
have this cookie (a valid version thereof, and let's assume for now
I've gotten past expiration issues) in a text file.

How do I tell Nutch's crawler to include a cookie name and value with
each HTTP request?

Yoav

Reply via email to