[
https://issues.apache.org/jira/browse/NUTCH-1448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Ferdy Galema updated NUTCH-1448:
--------------------------------
Description: This is specifically for Nutch 2.x. Handling a redirected URL
like an outlink is much cleaner because it makes it simpler to trace how new
URLs are added to the webpage database. Instant fetching of redirects won't
work, but this is a small price to pay. (Note that instant fetching currently
does not work at all, because the http.max.redirect property has no effect.)
I will attach a patch in the coming days.
Fix Version/s: 2.1
> Redirected urls should be handled more cleanly (more like an outlink url)
> -------------------------------------------------------------------------
>
> Key: NUTCH-1448
> URL: https://issues.apache.org/jira/browse/NUTCH-1448
> Project: Nutch
> Issue Type: Improvement
> Reporter: Ferdy Galema
> Fix For: 2.1
>
>
> This is specifically for Nutch 2.x. Handling a redirected URL like an outlink
> is much cleaner because it makes it simpler to trace how new URLs are added
> to the webpage database. Instant fetching of redirects won't work, but this
> is a small price to pay. (Note that instant fetching currently does not work
> at all, because the http.max.redirect property has no effect.) I will attach
> a patch in the coming days.