Hi Vyacheslav,

On Thu, Jun 15, 2017 at 1:41 AM, <user-digest-h...@nutch.apache.org> wrote:
>
> From: Vyacheslav Pascarel <vpasc...@opentext.com>
> To: "user@nutch.apache.org" <user@nutch.apache.org>
> Cc:
> Bcc:
> Date: Wed, 14 Jun 2017 22:15:49 +0000
> Subject: Outlinks field is not populated when page from seed URL when
> fetched page contains "refresh" meta tag
>
> Hello,
>
> I am trying to crawl http://www.msnbc.com/ but having problem to get
> anything else beside the original seed URL. The INJECT/GENERATE/FETCH steps
> complete without problems but after executing PARSE I see only one outlink
> pointing to the original seed URL:
>
> ...

Which version of Nutch are you using?

Lewis