Sameer,
Thanks for the reply. I could configure and use protocol-http plugin for
crawling site that's using https protocol.
Also, has anyone worked with crawling password protected sites?
My requirement is crawling an intranet site that uses https and user
authentication. I searched through the forum but couldn't find anybody
who has successfully implemented it. I'm also going through the source
files for protocol-http plugin to see if any changes can be made there
for my specific requirement.
Thanks,
Mohini


-----Original Message-----
From: Sameer Tamsekar [mailto:[EMAIL PROTECTED] 
Sent: Wednesday, March 01, 2006 10:31 PM
To: [email protected]
Subject: Re: https plugin for Nutch

If you use protocol-httpclient (versus protocol-http) then it should
support https.

I have got this reply from one of the mailing list user.

Regards,

Sameer

On 3/2/06, Mohini Padhye <[EMAIL PROTECTED]> wrote:
>
> I am using nutch-0.7.1. I wanted to know if anyone has successfully 
> implemented https plugin for nutch.
> If not, can someone provide guidelines about developing it and I can 
> start with the implementation?
> -Mohini
>
>


-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid0944&bid$1720&dat1642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to