Please check out the http.redirect.max property in your nutch-default and subsequently nutch-site.xml file. This should be set to a responsible level taking into consideration the nature of the pages you are crawling.
hth Lewis On Wed, May 23, 2012 at 9:40 AM, Tolga <[email protected]> wrote: > Yes, a redirect. > > > On 5/23/12 11:37 AM, Lewis John Mcgibbney wrote: >> >> Can you please elaborate on a re-write rule? Do you mean a redirect? >> >> On Wed, May 23, 2012 at 7:39 AM, Tolga<[email protected]> wrote: >>> >>> Thank you all, especially Lewis, Markus, and whomever I might have >>> forgotten! It is working; I can crawl, index and search. >>> >>> One last question though. On my drupal website, I am redirecting >>> www.example.com to example.com. However, I noticed that nutch doesn't >>> crawl >>> the web site if there is a rewrite rule involved. Is there a workaround? >>> >>> Thanks a lot! >> >> >> > -- Lewis

