Re: Strange behavior when concurrent user from 9K to 10K

Shanti Subramanyam - PAE Fri, 25 Sep 2009 11:00:09 -0700

We have received 3-4 copies of this email over different times - notsure what's going on.

Congratulations on reaching such a high scale in your testing. As youprobably know by now that scaling isn't a straight-forward task andinvolves much analysis and tuning.If you're cpu is 90% utilized at 10000 users, I don't see how you canexpect to get more throughput from this system ? In fact, you willprobably find that beyond a certain point (say 6 to 8000 users), youwill need more cpu/user as scalability drops.In any case, at such high rates, there can be lots of issues and it isdifficult to preddict what exactly you may be hitting.

Shanti


On 09/24/09 19:25, Mingfan Lu wrote:

I using faban/oliophp to stress a machine (16 Core) as web server withtwo other DB nodes ( a master_slave cluster, master using a high speedSATA disk while slave using a SSD disk)
When #concurrent users scaling from 9K 10K 11K 12K 13K 14K 15K 16K thethroughput increasing and then decreasing.
9k      *1810.323*
10K    *1969.393*
11K    *1859.053*
12K    *1842.368*
13K    *1849.213*
14K    *1843.955*

 It seems that there are some bottleneck here.

Detail to see the attached run.xml

My ramp time is 300s while steady time is 600s and the rampdown is 60s
The client start up:
    Time between starts (ms) :1
    Start simultaneously: No
    Start agents in parallel: No
But my profiling data shows that the CPU( Highest is about 80%~90% when#concurrent user is 10000, softirq% is about 14% with *4tx and 4rx*queues ) / Networks Bandwidth(70% of 1Gb) /Memory Usage/Disk are not thebottleneck. The Apache error log is very clean with no exception anderror. At the same time I have disabled the static images serving (Justdisable all *img* tag in the HTML)
From the pictures inhttp://docs.google.com/present/view?id=df7282nf_30x8gwmrch&autoStart=true<http://docs.google.com/present/view?id=df7282nf_30x8gwmrch&autoStart=true>, when 9K concurrent user, the response time is steady enough, when 10K,there is pulse lasting 600sec (what happen?) and down to very smallenough in the last 300sec.I want to know what cause the strange pulse when concurrent users reach10K?

Re: Strange behavior when concurrent user from 9K to 10K

Reply via email to