Hi guys, I have several websites that set a session cookie and then let the user browse the site. I would like to crawl those sites, but I don't know whether Nutch can manage session cookies. Could you confirm?
I'm completely lost among the different plugins used to crawl over the HTTP protocol. Is it lib-http, protocol-http, or protocol-httpclient? What is the difference between them? I would appreciate your views; they will help me implement cookie management in Nutch. Thanks
