Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "HttpAuthenticationSchemes" page has been changed by LewisJohnMcgibbney:
https://wiki.apache.org/nutch/HttpAuthenticationSchemes?action=diff&rev1=27&rev2=28

  <<TableOfContents>>
  
  == Introduction ==
- This is a feature in Nutch that allows the crawler to authenticate itself to 
websites requiring NTLM, Basic or Digest authentication. This feature can not 
do POST based authentication that depends on cookies. More information on this 
can be found at: HttpPostAuthentication
+ This is a feature in Nutch that allows the crawler to authenticate itself to 
websites requiring NTLM, Basic or Digest authentication. Work and information 
to support POST based authentication that depends on cookies can be found at: 
HttpPostAuthentication
  
  == Necessity ==
  There were two plugins already present, viz. 'protocol-http' and 
'protocol-httpclient'. However, 'protocol-http' could not support HTTP 1.1, 
HTTPS and NTLM, Basic and Digest authentication schemes. 'protocol-httpclient' 
supported HTTPS and had code for NTLM authentication but the NTLM 
authentication didn't work due to a bug. Some portions of 'protocol-httpclient' 
were re-written to solve these problems, provide additional features like 
authentication support for proxy server and better inline documentation for the 
properties to be used to configure authentication.

Reply via email to