I am using the one you told. Now my question is after specifying protocol-selenium as initial Fetcher, What will happen if I try to crawl a https website. And what will happen if don't setup the selenium and try crawl a website. Because it's not throwing any error.
On Mon, 5 Mar 2018, 13:59 Sebastian Nagel, <wastl.na...@googlemail.com> wrote: > Hi, > > it is not used as Fetcher but Fetcher will use it if it fetches content > via http. > If not used at all, it's likely a configuration issue (plugin.includes) or > an unsupported protocol (that's true for https, see NUTCH-2310). > > Just to confirm: are you really using > https://github.com/momer/nutch-selenium-grid-plugin > instead of protocol-selenium which is part of Nutch? > > Best, > Sebastian > > On 03/05/2018 09:00 AM, narendra singh arya wrote: > > How can I know that protocol-selinium is used as Fetcher. Because I don't > > think after going through all the steps it is being used at all. > > > > On Fri, 2 Mar 2018, 18:28 narendra singh arya, <nsary...@gmail.com> > wrote: > > > >> I want to crawl ajax populated content using nutch. > >> I tried this with selenium-grid-plugin on nutch 1.14. > >> After following all the steps from github page > nutch-selenium-grid-plugin > >> I am not able to fetch the ajax loaded content. > >> I have docker-selnium hub and node running on my mac. > >> But I am still not able to fetch the ajax loaded content. > >> Help regarding any version of nutch will be appreciated. > >> Thanks > >> > > > >