Hi,
I have been trying to use the httpclient patch to enable fetcher
operations through a proxy.
When I try to compile the files, the http class requires the following imports:
import org.apache.nutch.protocol.ProtocolOutput;
import org.apache.nutch.protocol.ProtocolStatus;
I have downloaded the current build from the nutch nightly downloads,
but I haven't had any luck locating these two files.
Andrzej, Piotr, Andy, Ken, can anyone give me some pointers?
Regards
Chetan
_______________________________
Tel +91-20-5652 5000 ext 2513
KPITCummins Infosystems Limited
Hinjwadi
Pune INDIA
_______________________________
-----Original Message-----
From: Piotr Kosiorowski [mailto:[EMAIL PROTECTED]
Sent: Wednesday, May 11, 2005 6:08 PM
To: [email protected]
Subject: Re: proxy
Hello,
As far as I remember, the current implementation cannot fetch through
a proxy that requires authorization. But Andrzej Bialecki is working on
an httpclient-based implementation right now, and since httpclient
supports proxies with authorization, it should be no problem to do so
in nutch.
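
To illustrate why a "user:password@host" value fails when used directly as a proxy host name, here is a minimal sketch that separates the credentials from the host and port before building the proxy. It uses only the standard java.net classes, not the nutch or httpclient APIs, so it does not show how the patch itself works. The class name ProxySketch, the helper ProxySpec, the "user:password@host:port" format and the example values are all assumptions made for illustration.

```java
import java.net.Authenticator;
import java.net.InetSocketAddress;
import java.net.PasswordAuthentication;
import java.net.Proxy;

public class ProxySketch {

    /** Hypothetical holder for the parts of a "user:password@host:port" string. */
    static final class ProxySpec {
        final String user;      // null when no credentials were given
        final String password;  // null when no credentials were given
        final String host;
        final int port;

        ProxySpec(String user, String password, String host, int port) {
            this.user = user;
            this.password = password;
            this.host = host;
            this.port = port;
        }
    }

    /** Splits credentials from host and port; the input format is an assumption. */
    static ProxySpec parse(String spec, int defaultPort) {
        String user = null;
        String password = null;
        String hostPort = spec;

        // Everything before the last '@' is treated as credentials.
        int at = spec.lastIndexOf('@');
        if (at >= 0) {
            String cred = spec.substring(0, at);
            hostPort = spec.substring(at + 1);
            int sep = cred.indexOf(':');
            user = sep >= 0 ? cred.substring(0, sep) : cred;
            password = sep >= 0 ? cred.substring(sep + 1) : "";
        }

        // An optional ":port" suffix overrides the default port.
        String host = hostPort;
        int port = defaultPort;
        int colon = hostPort.lastIndexOf(':');
        if (colon >= 0) {
            host = hostPort.substring(0, colon);
            port = Integer.parseInt(hostPort.substring(colon + 1));
        }
        return new ProxySpec(user, password, host, port);
    }

    /** Builds an HTTP proxy from the host and port only (no DNS lookup here). */
    static Proxy toProxy(ProxySpec s) {
        return new Proxy(Proxy.Type.HTTP,
                InetSocketAddress.createUnresolved(s.host, s.port));
    }

    /** Registers the credentials so the JVM can answer proxy auth challenges. */
    static void installCredentials(final ProxySpec s) {
        if (s.user == null) {
            return; // nothing to register
        }
        Authenticator.setDefault(new Authenticator() {
            @Override
            protected PasswordAuthentication getPasswordAuthentication() {
                // Only answer challenges that come from a proxy.
                if (getRequestorType() == RequestorType.PROXY) {
                    return new PasswordAuthentication(s.user, s.password.toCharArray());
                }
                return null;
            }
        });
    }

    public static void main(String[] args) {
        // Placeholder values, not taken from the original message.
        ProxySpec s = parse("user:secret@proxy.example.com:8080", 8080);
        installCredentials(s);
        System.out.println(toProxy(s));
    }
}
```

The point of the sketch is only that the host name handed to the resolver must not contain the credentials; passing the whole string as the host is what leads to an UnknownHostException.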
You can check the current status of the httpclient-based patch by looking
for the thread: "Update: HTTPClient for protocol-http and protocol-https".
You can even try the patch yourself.
Regards
Piotr
k-team wrote:
> Hi all,
> I'm testing nutch on my PC, and need to get through a proxy to crawl pages.
> I've tried to set the "http.proxy.host" property like this:
> user:[EMAIL PROTECTED]
> but I get this error message:
> fetch of http://www.host.com/ failed with:
> net.nutch.protocol.http.HttpException: java.net.UnknownHostException:
> user:[EMAIL PROTECTED]
>
> How can I set my proxy with user/pwd? Is it possible?
>
> thanks,
> Kteam