[
https://issues.apache.org/jira/browse/NUTCH-2363?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15889994#comment-15889994
]
Markus Jelsma commented on NUTCH-2363:
--------------------------------------
Hello Julien, NUTCH-2355 dealt with sending cookies to servers and implemented
it int protocol-http and protocol-httpclient, just as you already pictured.
The fetcher code deals with the problem of distributing a received cookie (and
merging with existing cookie) to unfetched records in the queue, e.g. a
session-cookie or cookie-wall-cookie (what a word). The merged cookie is put
back in the content metadata so the scoring filter can distribute it to
outlinks.
> Fetcher support for reading and setting cookies
> -----------------------------------------------
>
> Key: NUTCH-2363
> URL: https://issues.apache.org/jira/browse/NUTCH-2363
> Project: Nutch
> Issue Type: New Feature
> Components: fetcher
> Reporter: Markus Jelsma
> Assignee: Markus Jelsma
> Fix For: 1.13
>
> Attachments: NUTCH-2363.patch
>
>
> Patch adds basic support for cookies in the fetcher, and a scoring plugin
> that passes cookies to its outlinks, within the domain. Sub-domain or path
> based is not supported.
> This is useful if you want to maintain sessions or need to get around a
> cookie wall.
--
This message was sent by Atlassian JIRA
(v6.3.15#6346)