Hi,

I have been trying to use the httpclient patch to enable fetcher 
operations through a proxy.

When I try to compile the files, the http class looks for the following imports:

import org.apache.nutch.protocol.ProtocolOutput;
import org.apache.nutch.protocol.ProtocolStatus;

I have downloaded the current build from the Nutch nightly downloads,
but I haven't had any luck locating these two files.

Andrzej, Piotr, Andy, Ken, can anyone give me some pointers?

Regards 
Chetan 
_______________________________ 

Tel +91-20-5652 5000 ext 2513

KPITCummins Infosystems Limited 
Hinjwadi
Pune INDIA 
_______________________________ 



-----Original Message-----
From: Piotr Kosiorowski [mailto:[EMAIL PROTECTED]
Sent: Wednesday, May 11, 2005 6:08 PM
To: [email protected]
Subject: Re: proxy


Hello,
As far as I remember, in the current implementation it is not possible to 
fetch through a proxy that requires authorization. But Andrzej Bialecki 
is working on an httpclient-based implementation right now, and since 
httpclient allows one to use a proxy with authorization, it should be no 
problem to do so in Nutch.
You can check the current status of the httpclient-based patch by looking 
for the thread: "Update: HTTPClient for protocol-http and protocol-https".
You can even try the patch yourself.
Regards
Piotr


k-team wrote:
> Hi all, 
> I'm testing nutch on my PC, and need to get through a proxy to crawl pages. 
> I've tried to set the "http.proxy.host" property like this:
> user:[EMAIL PROTECTED]
> but I get this error message:
> fetch of http://www.host.com/ failed with:
> net.nutch.protocol.http.HttpException: java.net.UnknownHostException:
> user:[EMAIL PROTECTED]
> 
> How can I set my proxy with user/pwd? Is it possible?
> 
> thanks,
> Kteam
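
On the UnknownHostException quoted above: the exception message shows the
whole property value, credentials included, being looked up as a host name.
A proxy host and its user/password have to be handled as separate pieces.
The sketch below only illustrates that separation using standard java.net
classes. It is not Nutch code and not the httpclient patch; the class name
ProxySpec, the "user:password@host:port" input format, and the sample values
are all hypothetical.

```java
import java.net.InetSocketAddress;
import java.net.PasswordAuthentication;
import java.net.Proxy;

// Hypothetical helper (not part of Nutch): splits "user:password@host:port"
// into credentials and a proxy address, so the credentials never end up
// inside the host name that gets resolved.
class ProxySpec {
    final String user;      // null when the spec carries no credentials
    final String password;  // null when the spec carries no credentials
    final String host;
    final int port;

    ProxySpec(String user, String password, String host, int port) {
        this.user = user;
        this.password = password;
        this.host = host;
        this.port = port;
    }

    static ProxySpec parse(String spec, int defaultPort) {
        String user = null;
        String password = null;
        String hostPort = spec;

        // Everything before the last '@' is treated as "user[:password]".
        int at = spec.lastIndexOf('@');
        if (at >= 0) {
            String cred = spec.substring(0, at);
            hostPort = spec.substring(at + 1);
            int sep = cred.indexOf(':');
            user = sep >= 0 ? cred.substring(0, sep) : cred;
            password = sep >= 0 ? cred.substring(sep + 1) : "";
        }

        // Everything after the last ':' in the remainder is the port.
        String host = hostPort;
        int port = defaultPort;
        int colon = hostPort.lastIndexOf(':');
        if (colon >= 0) {
            host = hostPort.substring(0, colon);
            port = Integer.parseInt(hostPort.substring(colon + 1));
        }
        return new ProxySpec(user, password, host, port);
    }

    // Proxy address only; createUnresolved avoids a DNS lookup at this point.
    Proxy toProxy() {
        return new Proxy(Proxy.Type.HTTP,
                InetSocketAddress.createUnresolved(host, port));
    }

    // Credentials only, or null when none were given.
    PasswordAuthentication credentials() {
        return user == null
                ? null
                : new PasswordAuthentication(user, password.toCharArray());
    }
}
```

For example, ProxySpec.parse("alice:secret@proxy.example.com:8080", 3128)
yields host "proxy.example.com", port 8080, user "alice". Whether the fetcher
can actually use such credentials depends on the protocol plugin, as Piotr
describes above.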
