[ 
https://issues.apache.org/jira/browse/NUTCH-3018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17781483#comment-17781483
 ] 

Tim Allison edited comment on NUTCH-3018 at 10/31/23 6:46 PM:
--------------------------------------------------------------

It looks like we cannot create more web drivers than the number of worker nodes 
multiplied by {{SE_NODE_MAX_SESSIONS}}. 

I think it would still be useful to reuse the webdriver(s) if we can. We could 
reconnect on exception, etc...

This may be a horribly misguided approach.  Let me know. :D
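
To make the idea concrete, here is a rough sketch of what a bounded pool with reconnect-on-exception might look like. It is not taken from any Nutch code. {{DriverPool}}, {{withDriver}}, and the constructor parameters are hypothetical names, and the type parameter T stands in for a remote webdriver session so the sketch has no Selenium dependency. The cap is assumed to be worker nodes multiplied by {{SE_NODE_MAX_SESSIONS}}, as described above.

```java
import java.util.Queue;
import java.util.concurrent.ConcurrentLinkedQueue;
import java.util.concurrent.Semaphore;
import java.util.concurrent.TimeUnit;
import java.util.function.Consumer;
import java.util.function.Function;
import java.util.function.Supplier;

/** Hypothetical bounded pool; T stands in for a remote webdriver session. */
final class DriverPool<T> {
    private final Semaphore permits;                 // caps sessions in use at once
    private final Queue<T> idle = new ConcurrentLinkedQueue<>();
    private final Supplier<T> factory;               // opens a new session
    private final Consumer<T> closer;                // disposes of a broken session

    DriverPool(int workerNodes, int maxSessionsPerNode,
               Supplier<T> factory, Consumer<T> closer) {
        // Assumed cap: worker nodes x SE_NODE_MAX_SESSIONS.
        this.permits = new Semaphore(workerNodes * maxSessionsPerNode);
        this.factory = factory;
        this.closer = closer;
    }

    /** Runs task with a pooled session, waiting up to timeoutMs for a free slot. */
    <R> R withDriver(Function<T, R> task, long timeoutMs) {
        try {
            if (!permits.tryAcquire(timeoutMs, TimeUnit.MILLISECONDS)) {
                throw new IllegalStateException("no webdriver session available");
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting for a session", e);
        }
        T driver = null;
        try {
            driver = idle.poll();                    // reuse an idle session if any
            if (driver == null) {
                driver = factory.get();              // otherwise connect lazily
            }
            R result = task.apply(driver);
            idle.offer(driver);                      // healthy: keep it for reuse
            return result;
        } catch (RuntimeException e) {
            // Broken session: discard it so the next caller reconnects.
            if (driver != null) {
                try {
                    closer.accept(driver);
                } catch (RuntimeException ignored) {
                    // best effort; the session is being thrown away anyway
                }
            }
            throw e;
        } finally {
            permits.release();
        }
    }
}
```

With Selenium, the factory would presumably be something like {{() -> new RemoteWebDriver(hubUrl, options)}} and the closer {{WebDriver::quit}}, where {{hubUrl}} and {{options}} are placeholders. Whether a failed fetch should always discard the session, rather than only on connection-level errors, is an open design question.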


was (Author: talli...@mitre.org):
It looks like we cannot create more web drivers than the 
{{SE_NODE_MAX_SESSIONS}} which defaults to 1. 

I think it would still be useful to reuse the webdriver(s) if we can. We could 
reconnect on exception, etc...

This may be a horribly misguided approach.  Let me know. :D

> Consider pooling remote webdrivers for Selenium?
> ------------------------------------------------
>
>                 Key: NUTCH-3018
>                 URL: https://issues.apache.org/jira/browse/NUTCH-3018
>             Project: Nutch
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Minor
>
> It looks like initializing the remote webdriver in Selenium takes between 2x 
> and 4x as long as it does to render/fetch a couple of test pages I'm 
> working with.
> On a mac with a chrome driver, ~1.5 seconds to load the driver and then .5 of 
> a second to fetch/render the page. On a mac, ~1.2 seconds to load and then .5 
> of a second to fetch/render.  
> On a mac with firefox driver, ~3.7 seconds to load the driver and ~1 second 
> to fetch/render a page.
> Is it worth pooling webdrivers or does that add too much complexity/overhead?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
