I am using the one you told.
Now my question is after specifying protocol-selenium as initial Fetcher,
What will happen if I try to crawl a https website.
And what will happen if don't setup the selenium and try crawl a website.
Because it's not throwing any error.

On Mon, 5 Mar 2018, 13:59 Sebastian Nagel, <wastl.na...@googlemail.com>
wrote:

> Hi,
>
> it is not used as Fetcher but Fetcher will use it if it fetches content
> via http.
> If not used at all, it's likely a configuration issue (plugin.includes) or
> an unsupported protocol (that's true for https, see NUTCH-2310).
>
> Just to confirm: are you really using
>   https://github.com/momer/nutch-selenium-grid-plugin
> instead of protocol-selenium which is part of Nutch?
>
> Best,
> Sebastian
>
> On 03/05/2018 09:00 AM, narendra singh arya wrote:
> > How can I know that protocol-selinium is used as Fetcher. Because I don't
> > think after going through all the steps it is being used at all.
> >
> > On Fri, 2 Mar 2018, 18:28 narendra singh arya, <nsary...@gmail.com>
> wrote:
> >
> >> I want to crawl ajax populated content using nutch.
> >> I tried this with selenium-grid-plugin on nutch 1.14.
> >> After following all the steps from github page
> nutch-selenium-grid-plugin
> >> I am not able to fetch the ajax loaded content.
> >> I have docker-selnium hub and node running on my mac.
> >> But I am still not able to fetch the ajax loaded content.
> >> Help regarding any version of nutch will be appreciated.
> >> Thanks
> >>
> >
>
>

Reply via email to