Re: [zones-discuss] Re: Re: Re: zone to zone networking slow!!

Jeff Savit Fri, 12 Jan 2007 13:39:03 -0800

Gino Ruopolo wrote:

When I get a chance, I'll reproduce many
connections/s (instead of
bytes/s) across zones, compared to intra-zone, using
Apache's 'ab'
tool, and report what I get.  -- Jeff
that would be very similar to my setup.thankyou
gino
This message posted from opensolaris.org

As promised, I've done some tests on the scenario Gino described: higherCPU overhead for a workload containing many connections per second whenclient and server processes are in different zones than when in the samezone. Sorry for the delay, but besides other activities getting in theway, I wanted to conduct good tests where I had a reasonable degree ofconfidence that I was correctly measuring the right thing. My apologiesfor the long e-mail - send to bit bucket if it's too much detail :-)

I tested with Nevada build 55 (perhaps I would get different numbers onstock Solaris 10, so maybe I'll retry this next week on a machinerunning S10), with most tests done on a fairly old PC (Pentium III at1Ghz). I created zones named zone1 and zone2 (very creative, I know...),and configured Apache 2. For workload, I used the Apache benchmark tool('ab'), and the http_load from Acme.com. In each test, I fetched thesame URL, pointing to static HTML, in order to eliminate variableoverhead from dynamic content and/or disk I/O.

Many thousands of web hits later and multiple runs with an otherwiseidle box, I measured an average 692.82 hits per second when bothhttp_load and httpd were in the same zone, and 679.27 going from zone2to zone1. That's a ratio of 0.9804, so inter-zone results were 98.04% ofthe rate of same-zone traffic. Normally I'd be inclined to considerunder 2% difference "noise", but the results over multiple trials (andalso when using 'ab' instead of http_load) seemed consistent: a smalldifference in the same direction.

I then used DTrace to see where I was spending time, both elapsed andCPU time (timestamp and vtimestamp) and got the following results (shownis output from same-zone traffic) for the most dominant system calls:


vtime by function

httpdwritev 71642973httpdaccept 71720121httpdshutdown 76473328http_loadclose 89913014http_loadconnect 149196319


elapsed time by function

httpdwritev 85014366httpdshutdown 90492746http_loadclose 102127944http_loadconnect 162967191httpdaccept 17908653425


total vtime by execname:        http_load     351417154, httpd 434824250
total wall time by execname:  http_load     456124949; httpd 18503892793

The actual times aren't of consequence: what is significant is wheretime was spent (the same system calls consistently appeared in bothsame-zone and different-zone cases), and the different between same-zoneand different zone tests. I measured CPU time slightly higher in thedifferent-zone case: 5.76% higher for client (http_load), and 2.04%higher for Apache httpd. So pending further investigation, it does seemthere is a small difference in time when an application with manyconnections per second is in the same zone compared to different zones.

This does not explain the issues Gino raised in his original post, wherehe reported a bigger change in performance. It does not seem to berelated to the cost of network connections, and we have a few ideas onwhere they may be. We've corresponded privately, and I've sent him someDTrace scripts, so we will pursue this further together. Thank goodnessfor DTrace!


cheers, Jeff

_______________________________________________
zones-discuss mailing list
zones-discuss@opensolaris.org

Re: [zones-discuss] Re: Re: Re: zone to zone networking slow!!

Reply via email to