Is there a way to fetch https websites using selenium?

On 5 Mar 2018 14:10, "Sebastian Nagel" <wastl.na...@googlemail.com> wrote:
> > What will happen if I try to crawl a https website.
>
> I didn't try it, but I would expect that
> - if no protocol plugin other than protocol-selenium is active:
>   fetching fails (as reported in NUTCH-2310)
> - if another protocol plugin is active which supports https:
>   Fetcher will use it to fetch https content
>
>
> On 03/05/2018 09:35 AM, narendra singh arya wrote:
> > I am using the one you mentioned.
> > Now my question is, after specifying protocol-selenium as initial Fetcher,
> > what will happen if I try to crawl a https website?
> > And what will happen if I don't set up Selenium and try to crawl a website?
> > Because it's not throwing any error.
> >
> > On Mon, 5 Mar 2018, 13:59 Sebastian Nagel, <wastl.na...@googlemail.com> wrote:
> >
> >> Hi,
> >>
> >> it is not used as Fetcher but Fetcher will use it if it fetches content
> >> via http.
> >> If not used at all, it's likely a configuration issue (plugin.includes) or
> >> an unsupported protocol (that's true for https, see NUTCH-2310).
> >>
> >> Just to confirm: are you really using
> >>   https://github.com/momer/nutch-selenium-grid-plugin
> >> instead of protocol-selenium which is part of Nutch?
> >>
> >> Best,
> >> Sebastian
> >>
> >> On 03/05/2018 09:00 AM, narendra singh arya wrote:
> >>> How can I know that protocol-selenium is used as Fetcher? Because I don't
> >>> think after going through all the steps it is being used at all.
> >>>
> >>> On Fri, 2 Mar 2018, 18:28 narendra singh arya, <nsary...@gmail.com> wrote:
> >>>
> >>>> I want to crawl ajax populated content using Nutch.
> >>>> I tried this with selenium-grid-plugin on Nutch 1.14.
> >>>> After following all the steps from the github page nutch-selenium-grid-plugin
> >>>> I am not able to fetch the ajax loaded content.
> >>>> I have docker-selenium hub and node running on my mac.
> >>>> But I am still not able to fetch the ajax loaded content.
> >>>> Help regarding any version of Nutch will be appreciated.
> >>>> Thanks
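
For illustration only, the behaviour Sebastian describes above (an active
protocol plugin is used only for the URL schemes it supports, and fetching
fails if none matches) can be sketched in a few lines of Python. This is not
Nutch code. The scheme table is an assumption made up for the example (in
particular that protocol-http handles https), as is the idea that
plugin.includes is matched against the whole plugin id; plugin_for and
active_plugins are hypothetical helper names.

```python
import re
from urllib.parse import urlparse

# ASSUMPTION: illustrative table, not read from any Nutch plugin.xml.
# protocol-selenium is listed as http-only because of NUTCH-2310;
# that protocol-http also handles https is assumed for the example.
SUPPORTED_SCHEMES = {
    "protocol-selenium": {"http"},
    "protocol-http": {"http", "https"},
}


def active_plugins(plugin_includes):
    """Return the protocol plugins whose id matches the plugin.includes regex."""
    pattern = re.compile(plugin_includes)
    return [p for p in SUPPORTED_SCHEMES if pattern.fullmatch(p)]


def plugin_for(url, plugin_includes):
    """Pick an active plugin that supports the URL's scheme, or None."""
    scheme = urlparse(url).scheme
    for plugin in active_plugins(plugin_includes):
        if scheme in SUPPORTED_SCHEMES[plugin]:
            return plugin
    return None  # no active plugin supports this scheme: fetching fails


if __name__ == "__main__":
    # Only protocol-selenium active: nothing can fetch https.
    print(plugin_for("https://example.com/", "protocol-selenium|parse-html"))
    # A second, https-capable plugin is active: it handles the https URL.
    print(plugin_for("https://example.com/", "protocol-(selenium|http)|parse-html"))
```

When two active plugins both support a scheme, the sketch simply takes the
first one in the table; how Nutch itself resolves that case is not covered by
this thread.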