Hello-

I should point out that these are HTTP codes, not nutch specific stuff, so if you want more information you might get more thorough results referencing that.

                       see you
                           -J


----- Original Message ----- From: "Andrzej Bialecki" <[EMAIL PROTECTED]>
To: <[email protected]>
Sent: Tuesday, September 18, 2007 8:57 AM
Subject: Re: nutch fetch status codes


eyal edri wrote:
hi,

Can someone explain on the various status codes and their meaning?
fetched, unfetched  - pretty obvious

db_gone - ?

We tried several times to retrieve this page (3 times by default), and it was either forbidden by robots.txt, or we got HTTP 404.

db_redir_perm - ?

This url is redirected to a different url using HTTP 301 (Permanently Moved). The HTTP spec says that in this case the original url should not be used anymore.

db_redir_temp - ?

This url is redirected to a different url using HTTP 302 (Temporarily Moved).


--
Best regards,
Andrzej Bialecki     <><
 ___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com


Reply via email to