Hi Vyacheslav, On Thu, Jun 15, 2017 at 1:41 AM, <[email protected]> wrote:
> > From: Vyacheslav Pascarel <[email protected]> > To: "[email protected]" <[email protected]> > Cc: > Bcc: > Date: Wed, 14 Jun 2017 22:15:49 +0000 > Subject: Outlinks field is not populated when page from seed URL when > fetched page contains "refresh" meta tag > Hello, > > I am trying to crawl http://www.msnbc.com/ but having problem to get > anything else beside the original seed URL. The INJECT/GENERATE/FETCH steps > complete without problems but after executing PARSE I see only one outlink > pointing to the original seed URL: > > ... Which version of Nutch are you using? Lewis

