On Mon, Jul 6, 2009 at 3:10 PM, youyou wu<wuyouyou...@hotmail.com> wrote:
> hi Susam,
>
> I have read your work about HttpAuthenticationSchemes,and configed all the
> file according to what you said ,  but no authentication occured .
> I can find the word "Credentials" but no "auth.AuthChallengeProcessor " in
> my logs. I have also tried it in different websites requiring
> Authentication, no one succeeded.
> Is there anything wrong with my Configuration that lead to the failure?
> The sixth of "before asking for help" points out "probably something needs
> to be fixed at the server side", would you mind telling me the details?
>
> The version of nutch running in my machine is 1.0.
> My 'conf/httpclient-auth.xml' is like this:
> <auth-configuration>
> <credentials username="wendysky98" password="666666">
>  <default/>
> </credentials>
> </auth-configuration>
>
> in the attachment is the complete hadoop.log
>
>
> looking forward to your answer.
> Thanks a lot!
>
> Yours
>
> Wuyu
>
> ________________________________
> What can you do with the new Windows Live? Find out

If you can not see "auth.AuthChallengeProcessor " lines in the log
files, probably the website does not require authentication. What kind
of authentication were you expecting? You can also verify whether the
server really requires authentication using telnet, netcat (nc) or a
sniffer. With telnet you would type this and press the enter key
twice:

telnet example.server.com 80
GET / HTTP/1.0
Host: example.server.com

If the server requires authentication, you would see a
"WWW-Authenticate" in the response.

Please delete the current logs, enable DEBUG logs for
log4j.logger.org.apache.nutch.protocol.httpclient, start a new crawl
with only one site URL that you are unable to authenticate to and then
send the following files:

1. conf/nutch-site.xml
2. conf/httpclient-auth.xml
3. logs/hadoop.log
4. Output from telnet, netcat, etc.

Please go through "Need Help?" section of
http://wiki.apache.org/nutch/HttpAuthenticationSchemes to make sure
you haven't missed anything important.

Regards,
Susam Pal

Reply via email to