Hi Rafael,

The honest truth is that there needs to be comprehensive documentation on
the wiki for the way that Nutch handles redirects. This is a question that
has gone fully unanswered for sometime. That's just the way it is I suppose.
I'll get my head around everything and try to get some wiki page up and
running ASAP. In the meantime, can you adivise if there is anything over
and above the files in nutch-default.xml and o.a.n.protocol package which
you would like to see documented?

Thanks

On Wed, Nov 16, 2011 at 7:17 PM, Rafael Pappert <[email protected]> wrote:

> Hello List,
>
> is it possible to follow http 301 redirects immediately?
>
> I tried to set http.redirect.max to 3 but the page is
> still not indexed. readdb is still showing 1 page is
> unfetched / db_redir_perm. And I can't find the
> redirection target in the crawldb.
>
> How does nutch handle redirects?
>
> Thanks in advance,
> Rafael.
>
>
>
>
>


-- 
*Lewis*

Reply via email to