Steve Yao created NUTCH-2280:
--------------------------------

             Summary: HTTP Post form authentication CookiePolicy configuration
                 Key: NUTCH-2280
                 URL: https://issues.apache.org/jira/browse/NUTCH-2280
             Project: Nutch
          Issue Type: New Feature
          Components: protocol
    Affects Versions: 1.11
            Reporter: Steve Yao
            Priority: Minor


The protocol-httpclient plugin supports HTTP form authentication: it posts 
the form values to the configured login URL and stores the session cookie for 
subsequent content retrieval.
The plugin currently uses the httpclient default CookiePolicy. This default 
rejects cookies whose domain attribute starts with ".", for example 
domain=".domain.com", even though most web browsers accept such a domain 
value. 
I suggest adding a configurable option to conf/httpclient-auth.xml:
{code:xml}<credentials authMethod="formMethod" ...>
...
  <loginCookie>
    <policy>DEFAULT | BROWSER_COMPATIBILITY | NETSCAPE | RFC_2109 | RFC_2965</policy>
  </loginCookie>
</credentials>{code}
The httpclient would then use this cookie policy value for the login request.
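
To make the idea concrete, here is a rough sketch of how the proposed element might be read and turned into a policy identifier. It is not the patch itself: the class name CookiePolicyConfig and the method readPolicy are hypothetical, and the mapping to the Commons HttpClient 3.x policy strings ("default", "compatibility", "netscape", "rfc2109", "rfc2965") is my assumption about what the plugin would pass to the client. It uses only the JDK XML parser so it runs without Nutch on the classpath.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

/** Hypothetical helper: reads the proposed loginCookie/policy value. */
final class CookiePolicyConfig {

    // Assumed mapping from the proposed option names to the policy identifier
    // strings used by Commons HttpClient 3.x (CookiePolicy constants).
    private static final Map<String, String> POLICY_IDS = new LinkedHashMap<>();
    static {
        POLICY_IDS.put("DEFAULT", "default");
        POLICY_IDS.put("BROWSER_COMPATIBILITY", "compatibility");
        POLICY_IDS.put("NETSCAPE", "netscape");
        POLICY_IDS.put("RFC_2109", "rfc2109");
        POLICY_IDS.put("RFC_2965", "rfc2965");
    }

    /** Returns the policy identifier, falling back to "default" if no policy element is present. */
    static String readPolicy(String credentialsXml) {
        Document doc;
        try {
            doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(credentialsXml)));
        } catch (Exception e) {
            throw new IllegalStateException("Cannot parse credentials XML", e);
        }
        // Simplification: takes the first <policy> element anywhere in the fragment.
        NodeList nodes = doc.getElementsByTagName("policy");
        if (nodes.getLength() == 0) {
            return POLICY_IDS.get("DEFAULT");
        }
        String name = nodes.item(0).getTextContent().trim();
        String id = POLICY_IDS.get(name);
        if (id == null) {
            throw new IllegalArgumentException("Unknown cookie policy: " + name);
        }
        return id;
    }

    public static void main(String[] args) {
        String xml = "<credentials authMethod=\"formMethod\">"
                + "<loginCookie><policy>BROWSER_COMPATIBILITY</policy></loginCookie>"
                + "</credentials>";
        String policyId = readPolicy(xml);
        System.out.println(policyId); // prints "compatibility"
        // With Commons HttpClient 3.x on the classpath, the value could then be
        // applied to the client, e.g. client.getParams().setCookiePolicy(policyId);
    }
}
```

Falling back to "default" when the element is missing would keep existing httpclient-auth.xml files working unchanged, which seems the safest behaviour for a new optional setting.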

I am working on a patch for this feature, but before I implement the 
configuration format change, I would like to hear any other suggestions or 
comments. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
