Is there a way to fetch https websites using selenium?

On 5 Mar 2018 14:10, "Sebastian Nagel" <wastl.na...@googlemail.com> wrote:

> > What will happen if I try to crawl a https website.
>
> I didn't try it, but I would expect that
> - if except protocol-selenium no other protocol plugins are active:
>    fetching fails (as reported in NUTCH-2310)
> - if another protocol plugin is active which supports https:
>    Fetcher will uses it to fetch https content
>
>
> On 03/05/2018 09:35 AM, narendra singh arya wrote:
> > I am using the one you told.
> > Now my question is after specifying protocol-selenium as initial Fetcher,
> > What will happen if I try to crawl a https website.
> > And what will happen if don't setup the selenium and try crawl a website.
> > Because it's not throwing any error.
> >
> > On Mon, 5 Mar 2018, 13:59 Sebastian Nagel, <wastl.na...@googlemail.com>
> > wrote:
> >
> >> Hi,
> >>
> >> it is not used as Fetcher but Fetcher will use it if it fetches content
> >> via http.
> >> If not used at all, it's likely a configuration issue (plugin.includes)
> or
> >> an unsupported protocol (that's true for https, see NUTCH-2310).
> >>
> >> Just to confirm: are you really using
> >>   https://github.com/momer/nutch-selenium-grid-plugin
> >> instead of protocol-selenium which is part of Nutch?
> >>
> >> Best,
> >> Sebastian
> >>
> >> On 03/05/2018 09:00 AM, narendra singh arya wrote:
> >>> How can I know that protocol-selinium is used as Fetcher. Because I
> don't
> >>> think after going through all the steps it is being used at all.
> >>>
> >>> On Fri, 2 Mar 2018, 18:28 narendra singh arya, <nsary...@gmail.com>
> >> wrote:
> >>>
> >>>> I want to crawl ajax populated content using nutch.
> >>>> I tried this with selenium-grid-plugin on nutch 1.14.
> >>>> After following all the steps from github page
> >> nutch-selenium-grid-plugin
> >>>> I am not able to fetch the ajax loaded content.
> >>>> I have docker-selnium hub and node running on my mac.
> >>>> But I am still not able to fetch the ajax loaded content.
> >>>> Help regarding any version of nutch will be appreciated.
> >>>> Thanks
> >>>>
> >>>
> >>
> >>
> >
>
>

Reply via email to