Jira (BOLT-1454) Bolt is super slow

2019-07-12 Thread Cyril Cordoui (JIRA)
Cyril Cordoui commented on BOLT-1454

Re: Bolt is super slow
 
Nice, you pinned down the issue. Just adding a temporary UserKnownHostsFile=/dev/null to our ssh config gives:

  time /usr/local/bin/bolt command run 'echo \"{"hello": "tha world"}\"' --nodes=all > /dev/null
  /usr/local/bin/bolt command run 'echo \"{"hello": "tha world"}\"' > 8.22s user 0.50s system 76% cpu 11.367 total

We do indeed have tens of thousands of hosts in that file.
 
This message was sent by Atlassian JIRA (v7.7.1#77002-sha1:e75ca93)

-- 
You received this message because you are subscribed to the Google Groups "Puppet Bugs" group.
To unsubscribe from this group and stop receiving emails from it, send an email to puppet-bugs+unsubscr...@googlegroups.com.
To post to this group, send email to puppet-bugs@googlegroups.com.
Visit this group at https://groups.google.com/group/puppet-bugs.
To view this discussion on the web visit https://groups.google.com/d/msgid/puppet-bugs/JIRA.315226.1562163484000.13548.1562929620089%40Atlassian.JIRA.
For more options, visit https://groups.google.com/d/optout.


Jira (BOLT-1454) Bolt is super slow

2019-07-11 Thread Nick Lewis (JIRA)
Nick Lewis commented on BOLT-1454

Re: Bolt is super slow
 
I filed a PR against net-ssh with some improvements to known_hosts parsing. There's more work to be done to only parse it once, but this is a substantial improvement anyway.

https://github.com/net-ssh/net-ssh/pull/682
 

Jira (BOLT-1454) Bolt is super slow

2019-07-11 Thread Nick Lewis (JIRA)
Nick Lewis commented on BOLT-1454

Re: Bolt is super slow
 
 I've found a couple things. The first is that net-ssh's parsing of known_hosts is inherently quite slow (~300ms with 20k lines). The other is that net-ssh parses the known_hosts file once for every host being targeted. The first issue is relatively easy to fix, but the latter requires a bigger restructure.  
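
To make the second point concrete, here is a minimal Python sketch of the difference between re-parsing known_hosts for every target and parsing it once into a lookup table. It is not net-ssh's code: the function names, the simplified line format (no hashed hostnames, no wildcard matching), and the fake keys are all assumptions made for illustration. As rough arithmetic on the figure quoted above, a ~300 ms parse repeated for 100 targets would be on the order of 30 seconds spent on parsing alone.

```python
import time


def parse_known_hosts(lines):
    """Turn known_hosts-style lines into (host_patterns, key_type, key) tuples."""
    entries = []
    for line in lines:
        fields = line.split()
        # Ignore blank lines and comments.
        if not fields or fields[0].startswith("#"):
            continue
        # Drop an optional leading marker such as @cert-authority or @revoked.
        if fields[0].startswith("@"):
            fields = fields[1:]
        if len(fields) < 3:
            continue
        entries.append((fields[0].split(","), fields[1], fields[2]))
    return entries


def build_index(entries):
    """Map each literal host name to its list of (key_type, key) pairs."""
    index = {}
    for hosts, key_type, key in entries:
        for host in hosts:
            index.setdefault(host, []).append((key_type, key))
    return index


def keys_per_host_reparse(lines, targets):
    """Parse the whole file again for every target (the slow pattern)."""
    return {t: build_index(parse_known_hosts(lines)).get(t, []) for t in targets}


def keys_parse_once(lines, targets):
    """Parse the file a single time and reuse the index for every target."""
    index = build_index(parse_known_hosts(lines))
    return {t: index.get(t, []) for t in targets}


if __name__ == "__main__":
    # Synthetic data: host names and keys are fake placeholders.
    lines = [f"host{i}.example.com ssh-ed25519 FAKEKEY{i}" for i in range(2000)]
    targets = [f"host{i}.example.com" for i in range(20)]

    for func in (keys_per_host_reparse, keys_parse_once):
        start = time.perf_counter()
        func(lines, targets)
        print(f"{func.__name__}: {time.perf_counter() - start:.3f} s")
```

Both functions return the same result; only the number of times the file is parsed differs, which is why the cost of the first grows with the number of targets.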

Jira (BOLT-1454) Bolt is super slow

2019-07-11 Thread Nick Lewis (JIRA)
Nick Lewis commented on BOLT-1454

Re: Bolt is super slow
 
It's also only using one core on my machine. That's a limitation of Ruby, unfortunately. But it doesn't seem to be a computationally intensive task in my case, since it's still running very quickly.

I found a potentially related issue: https://github.com/net-ssh/net-ssh/issues/567

If I cat my known_hosts file into itself a few times, Bolt takes more than twice as long to run. Does the host you're running from have a particularly large known_hosts file? Weirdly, this seems to even be the case if I run with --no-host-key-check set...
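
The doubling experiment can be imitated without touching a real known_hosts file. The Python sketch below is only an illustration under stated assumptions: the entries are synthetic, the helper names are made up, and the linear scan stands in for a naive parser rather than for what net-ssh or Bolt actually does. It grows an in-memory list the way repeatedly appending a file to itself would, and times one lookup at each size.

```python
import time


def synthetic_known_hosts(n):
    """Build n fake known_hosts lines (placeholder hosts and keys)."""
    return [f"host{i}.example.com ssh-ed25519 FAKEKEY{i}" for i in range(n)]


def doubled(lines, times):
    """Mimic appending the file to itself `times` times."""
    for _ in range(times):
        lines = lines + lines
    return lines


def scan_for_host(lines, target):
    """Linear scan over every line, collecting keys listed for `target`."""
    matches = []
    for line in lines:
        fields = line.split()
        if len(fields) >= 3 and target in fields[0].split(","):
            matches.append((fields[1], fields[2]))
    return matches


def time_scan(lines, target, repeats=5):
    """Average wall-clock seconds for one full scan."""
    start = time.perf_counter()
    for _ in range(repeats):
        scan_for_host(lines, target)
    return (time.perf_counter() - start) / repeats


if __name__ == "__main__":
    base = synthetic_known_hosts(5000)
    for times in range(4):
        lines = doubled(base, times)
        elapsed_ms = time_scan(lines, "host42.example.com") * 1000
        print(f"{len(lines)} lines: {elapsed_ms:.1f} ms per scan")
```

Since every line is visited on each lookup, the scan time should grow roughly in proportion to the file size, which is consistent with the slowdown described above.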

Jira (BOLT-1454) Bolt is super slow

2019-07-11 Thread Cyril Cordoui (JIRA)
Cyril Cordoui commented on BOLT-1454

Re: Bolt is super slow
 
We are on RHEL 7.6 with 128 GB of RAM and 24 threads. Authentication is done through ssh keys (for all three tools used in the benchmark).

When you run the test on your box, is Bolt using multiple cores? From what we observed, that seems to be the bottleneck.
 

Jira (BOLT-1454) Bolt is super slow

2019-07-10 Thread Nick Lewis (JIRA)
Nick Lewis commented on BOLT-1454

Re: Bolt is super slow
 
I tested on 50 nodes and a simple command run took ~3.3 seconds. The same thing takes ~0.8 seconds in pssh and ~6.6 seconds in ansible. This is with four cores and concurrency set to 50 for all of them.

Is it possible that Bolt is using a different, slower authentication method? I'm not sure how that would account for such a large difference though. Unfortunately, since it seems to be an environmental issue of some sort, it's hard to say what might be going on...
 

Jira (BOLT-1454) Bolt is super slow

2019-07-10 Thread Cyril Cordoui (JIRA)
Cyril Cordoui commented on BOLT-1454

Re: Bolt is super slow
 
No, same issue with bolt command run:

  time /usr/local/bin/bolt command run 'echo \"{"hello": "tha world"}\"' --nodes=all > /dev/null
  /usr/local/bin/bolt command run 'echo \"{"hello": "tha world"}\"' > 81.92s user 2.22s system 103% cpu 1:21.32 total
 

Jira (BOLT-1454) Bolt is super slow

2019-07-09 Thread Nick Lewis (JIRA)
Nick Lewis commented on BOLT-1454

Re: Bolt is super slow
 
This is really interesting, thanks for the report. Do you notice the same slowness when using bolt command run as well, or is it specific to tasks? I wonder if the difference could be related to having to scp the task file...

I wouldn't be surprised if Bolt were a little slower in this case, but 80 seconds is extremely wrong.
 

Jira (BOLT-1454) Bolt is super slow

2019-07-03 Thread Cyril Cordoui (JIRA)
Cyril Cordoui created an issue

Puppet Task Runner / BOLT-1454

Bolt is super slow
 
Issue Type: Bug
Assignee: Unassigned
Created: 2019/07/03 7:18 AM
Priority: Normal
Reporter: Cyril Cordoui
 
When running Bolt on a couple of servers, tasks are super slow to execute. With a simple shell (or Python) task doing only an echo, it takes more than 80 seconds to run on 100 servers, whereas other tools are nearly twenty times faster:

  # ansible -f100 -oi inventory.ini -a '/bin/echo "{\"hello\":\"Tha world\"}"' srvs > /dev/null
  ansible -f100 -oi inventory.ini -a srvs > /dev/null 14.38s user 14.77s system 366% cpu 7.964 total

  # ansible -oi inventory.ini -a '/bin/echo "{\"hello\":\"Tha world\"}"' srvs > /dev/null
  ansible -oi inventory.ini -a '/bin/echo "{\"hello\":\"Tha world\"}"' srv 11.70s user 9.47s system 144% cpu 14.626 total