Mayank,

 

The best source of information is the Nutch wiki
(http://wiki.apache.org/nutch/). On it you will find a step by step
tutorial helping you configure a Nutch 0.9 installation
(http://peterpuwang.googlepages.com/NutchGuideForDummies.htm).

 

About your error, I've just found that on the web: 

 

407 Proxy Authentication Required

This code is similar to 401, but is only returned by proxies. A proxy
returns a 407 message upon receipt of a client request that doesn't have
valid authentication credentials. 

 

If you check your nutch-default.xml file you will find those two
properties:

 

<property>

  <name>http.proxy.host</name>

  <value></value>

  <description>The proxy hostname.  If empty, no proxy is
used.</description>

</property>

 

<property>

  <name>http.proxy.port</name>

  <value></value>

  <description>The proxy port.</description>

</property>

 

 

You can directly set those two properties directly in this file, but
best practices guidelines suggest you to add them in your nutch-file.xml
file. They will be taking into account the next time you crawl your
source(s).

 

Sincerely,

 

David

 

-----------------------------------------

David Poirier

E-business Consultant - Software Engineer

 

 

 

 

________________________________

From: pooja m [mailto:[EMAIL PROTECTED] 
Sent: vendredi, 28. mars 2008 18:34
To: POIRIER David
Subject: need ur help

 

Hey, i found u thru the nutch users list. I've just downloaed Nutch. I'm
quite new to the concepts of serching and indexing. Can u plz tell me
step
by step how to configure it on my machine . Earlier also i tried but was
getting HTTP 407 authentication error. can u plz help me out. This is
very
urgent.


Best REagrds,
Mayank

Reply via email to