Hi Vyacheslav,

On Thu, Jun 15, 2017 at 1:41 AM, <[email protected]> wrote:

>
> From: Vyacheslav Pascarel <[email protected]>
> To: "[email protected]" <[email protected]>
> Cc:
> Bcc:
> Date: Wed, 14 Jun 2017 22:15:49 +0000
> Subject: Outlinks field is not populated when page from seed URL when
> fetched page contains "refresh" meta tag
> Hello,
>
> I am trying to crawl http://www.msnbc.com/ but am having trouble getting
> anything other than the original seed URL. The INJECT/GENERATE/FETCH steps
> complete without problems, but after executing PARSE I see only one outlink,
> pointing to the original seed URL:
>
> ...
Which version of Nutch are you using?
Lewis
