Re: "REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio"
Another update: This is the same testing but for cfs-v7. Summary: 1. context switch rate is much higher than for cfs-v6 for the same sched_granularity_ns 20 2. Video playback suffered. FPS is much lower. 3. Trying different values of sched-granularity_ns didn't help (neither did the default). 4. Firefox is actually much more responsive when the system isn't under stress, better than any other scheduler so far. 5. Generally the system completes operations faster with cfs-v7, but beryl FPS suffers. 6. I've monitored idle context switch rates at ~2000, and some spikes at ~35000 (emerge -s firefox) for example. -- cfs-v7 700m kernel # cat sched_granularity_ns 200 procs ---memory-- ---swap-- -io -system-- cpu r b swpd free buff cache si sobibo in cs us sy id wa 2 0 0 28536 88267200 100 0 3793 13889 67 6 27 0 5 0 0 888708536 8827680096 0 3435 13969 69 7 24 0 1 0 0 888468536 8829920096 0 3704 13645 64 8 28 0 0 0 0 888284536 8830640072 0 4378 13557 53 7 40 0 6 0 0 888224536 8831320068 0 3909 13700 60 9 31 0 0 0 0 888164536 8832160084 0 3726 13670 65 6 29 0 top - 16:53:54 up 10 min, 5 users, load average: 3.33, 2.78, 1.43 Tasks: 102 total, 2 running, 100 sleeping, 0 stopped, 0 zombie Cpu(s): 57.9%us, 5.7%sy, 0.0%ni, 35.4%id, 0.0%wa, 0.7%hi, 0.3%si, 0.0%st Mem: 2057700k total, 1172136k used, 885564k free, 536k buffers Swap: 987988k total,0k used, 987988k free, 885976k cached PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 11701 hechacke 20 0 51268 30m 18m R 18.9 1.5 1:46.30 gmplayer 11620 hechacke 20 0 127m 35m 18m S 15.3 1.7 0:40.64 mono 11695 hechacke 20 0 180m 67m 20m S 14.3 3.4 1:51.19 totem 9421 root 20 0 272m 49m 15m S 11.3 2.4 0:59.29 Xorg 11442 hechacke 20 0 63304 6232 4316 S 3.7 0.3 0:11.86 beryl Observation: Music plays perfectly. Audio of video's play perfectly. New Processes start in about the same amount of time as cfs-v6 Already open applications are responsive. Behavior of video: video's both moving forward. totem is doing ~0.9fps. mplayer is doing ~1 fps. Video's are slideshows in cfs-v7 vs cfs-v6 which was had more fps. Going back to sd-0.46 since so far it provides the most constant FPS under load (and thus the most responsiveness/smoothness). - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio
Another update: This is the same testing but for cfs-v7. Summary: 1. context switch rate is much higher than for cfs-v6 for the same sched_granularity_ns 20 2. Video playback suffered. FPS is much lower. 3. Trying different values of sched-granularity_ns didn't help (neither did the default). 4. Firefox is actually much more responsive when the system isn't under stress, better than any other scheduler so far. 5. Generally the system completes operations faster with cfs-v7, but beryl FPS suffers. 6. I've monitored idle context switch rates at ~2000, and some spikes at ~35000 (emerge -s firefox) for example. -- cfs-v7 700m kernel # cat sched_granularity_ns 200 procs ---memory-- ---swap-- -io -system-- cpu r b swpd free buff cache si sobibo in cs us sy id wa 2 0 0 28536 88267200 100 0 3793 13889 67 6 27 0 5 0 0 888708536 8827680096 0 3435 13969 69 7 24 0 1 0 0 888468536 8829920096 0 3704 13645 64 8 28 0 0 0 0 888284536 8830640072 0 4378 13557 53 7 40 0 6 0 0 888224536 8831320068 0 3909 13700 60 9 31 0 0 0 0 888164536 8832160084 0 3726 13670 65 6 29 0 top - 16:53:54 up 10 min, 5 users, load average: 3.33, 2.78, 1.43 Tasks: 102 total, 2 running, 100 sleeping, 0 stopped, 0 zombie Cpu(s): 57.9%us, 5.7%sy, 0.0%ni, 35.4%id, 0.0%wa, 0.7%hi, 0.3%si, 0.0%st Mem: 2057700k total, 1172136k used, 885564k free, 536k buffers Swap: 987988k total,0k used, 987988k free, 885976k cached PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 11701 hechacke 20 0 51268 30m 18m R 18.9 1.5 1:46.30 gmplayer 11620 hechacke 20 0 127m 35m 18m S 15.3 1.7 0:40.64 mono 11695 hechacke 20 0 180m 67m 20m S 14.3 3.4 1:51.19 totem 9421 root 20 0 272m 49m 15m S 11.3 2.4 0:59.29 Xorg 11442 hechacke 20 0 63304 6232 4316 S 3.7 0.3 0:11.86 beryl Observation: Music plays perfectly. Audio of video's play perfectly. New Processes start in about the same amount of time as cfs-v6 Already open applications are responsive. Behavior of video: video's both moving forward. totem is doing ~0.9fps. mplayer is doing ~1 fps. Video's are slideshows in cfs-v7 vs cfs-v6 which was had more fps. Going back to sd-0.46 since so far it provides the most constant FPS under load (and thus the most responsiveness/smoothness). - To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: "REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio"
so SD context-switched twice as much and saturated the CPU fully, while under cfs-v6 there was 34% idle time left. That double context-switch rate and higher CPU utilization could easily result in you experiencing a 'smoother' desktop (and smoother video playback) on SD. could you try to maximize the preemption ratio on CFS by using a sched_granularity_ns of 0? Does that result in a higher context-switch rate and in better CPU utilization? Thanks, Ingo Thanks for pointing out the idle cpu usage. I hadn't noticed it before. I re-ran the tests with sched_granularity_ns 0, but fyi, I did try that, as well as many values around 500, 1-10ms, and some huge values too just to see how the scheduler reacted. The reason why I settled on 200 was because of trial and error and determining that value produced the best fps. Large values than 500 clearly caused the FPS to drop. Too small values did the same. FYI, when the desktop is sitting on only one cube side, i.e. normal desktop. The idle usage for cfs and SD goes down to nearly 0. But when the cube is in between faces in free form mode (the mouse controls the cube in 3D space) the idle cpu usage goes up, for both cfs, SD, and mainline. I looked carefully at the idle usage values for cfs-v6 and sd-0.46, and cfs hovers around ~35% idle cpu, sd hovers around ~3-30%, usually ~15% overall. So perhaps my video card is fill rate limited during this stress test... but... mplayer and totem are visibly refreshing faster with sd-0.46, almost watchable and smooth during certain points in the video. (clearly the fps for SD is much higher) The new results: --- cfs-v6 700m kernel # cat sched_granularity_ns 0 procs ---memory-- ---swap-- -io -system-- cpu r b swpd free buff cache si sobibo in cs us sy id wa 8 0 0 894204536 8923880045 0 4192 12237 57 7 35 0 0 0 0 893904536 8927080088 139 4298 12511 55 7 29 9 4 0 0 893620536 8929640085 0 4217 12189 55 5 39 0 0 0 0 890272536 8932200085 0 4285 12366 55 7 38 0 4 0 0 881008536 8933480043 0 4020 12610 59 6 35 0 5 0 0 882608536 8938480043 0 3053 12045 81 6 13 0 top - 01:03:47 up 10 min, 7 users, load average: 3.52, 2.81, 1.41 Tasks: 103 total, 1 running, 102 sleeping, 0 stopped, 0 zombie Cpu(s): 55.0%us, 6.3%sy, 0.0%ni, 37.3%id, 0.7%wa, 0.7%hi, 0.0%si, 0.0%st Mem: 2057700k total, 1171508k used, 886192k free, 536k buffers Swap: 987988k total,0k used, 987988k free, 899704k cached PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 20227 hechacke 20 0 51408 30m 18m S 22.9 1.5 1:44.30 gmplayer 20237 hechacke 20 0 128m 37m 18m S 14.0 1.9 0:28.88 mono 19931 root 20 0 271m 48m 15m S 11.0 2.4 0:52.76 Xorg 20214 hechacke 20 0 156m 51m 19m S 10.0 2.6 1:47.80 totem 20164 hechacke 20 0 63456 6332 4316 S 3.0 0.3 0:12.06 beryl Observation: Music plays perfectly. Audio of video's play perfectly. Processes start faster than sched_granularity_ns 200 Browsing the web faster than 200, but still not as fast as mainline (better than cfs) Already open applications are responsive. Behavior of video: video's both moving forward. totem is doing ~0.5fps. mplayer is doing ~2 fps. Visibly slower refreshing vs sched_granularity_ns 200 For some reason top doesn't seem reliable considering the system was stressed at the time? - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio
so SD context-switched twice as much and saturated the CPU fully, while under cfs-v6 there was 34% idle time left. That double context-switch rate and higher CPU utilization could easily result in you experiencing a 'smoother' desktop (and smoother video playback) on SD. could you try to maximize the preemption ratio on CFS by using a sched_granularity_ns of 0? Does that result in a higher context-switch rate and in better CPU utilization? Thanks, Ingo Thanks for pointing out the idle cpu usage. I hadn't noticed it before. I re-ran the tests with sched_granularity_ns 0, but fyi, I did try that, as well as many values around 500, 1-10ms, and some huge values too just to see how the scheduler reacted. The reason why I settled on 200 was because of trial and error and determining that value produced the best fps. Large values than 500 clearly caused the FPS to drop. Too small values did the same. FYI, when the desktop is sitting on only one cube side, i.e. normal desktop. The idle usage for cfs and SD goes down to nearly 0. But when the cube is in between faces in free form mode (the mouse controls the cube in 3D space) the idle cpu usage goes up, for both cfs, SD, and mainline. I looked carefully at the idle usage values for cfs-v6 and sd-0.46, and cfs hovers around ~35% idle cpu, sd hovers around ~3-30%, usually ~15% overall. So perhaps my video card is fill rate limited during this stress test... but... mplayer and totem are visibly refreshing faster with sd-0.46, almost watchable and smooth during certain points in the video. (clearly the fps for SD is much higher) The new results: --- cfs-v6 700m kernel # cat sched_granularity_ns 0 procs ---memory-- ---swap-- -io -system-- cpu r b swpd free buff cache si sobibo in cs us sy id wa 8 0 0 894204536 8923880045 0 4192 12237 57 7 35 0 0 0 0 893904536 8927080088 139 4298 12511 55 7 29 9 4 0 0 893620536 8929640085 0 4217 12189 55 5 39 0 0 0 0 890272536 8932200085 0 4285 12366 55 7 38 0 4 0 0 881008536 8933480043 0 4020 12610 59 6 35 0 5 0 0 882608536 8938480043 0 3053 12045 81 6 13 0 top - 01:03:47 up 10 min, 7 users, load average: 3.52, 2.81, 1.41 Tasks: 103 total, 1 running, 102 sleeping, 0 stopped, 0 zombie Cpu(s): 55.0%us, 6.3%sy, 0.0%ni, 37.3%id, 0.7%wa, 0.7%hi, 0.0%si, 0.0%st Mem: 2057700k total, 1171508k used, 886192k free, 536k buffers Swap: 987988k total,0k used, 987988k free, 899704k cached PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 20227 hechacke 20 0 51408 30m 18m S 22.9 1.5 1:44.30 gmplayer 20237 hechacke 20 0 128m 37m 18m S 14.0 1.9 0:28.88 mono 19931 root 20 0 271m 48m 15m S 11.0 2.4 0:52.76 Xorg 20214 hechacke 20 0 156m 51m 19m S 10.0 2.6 1:47.80 totem 20164 hechacke 20 0 63456 6332 4316 S 3.0 0.3 0:12.06 beryl Observation: Music plays perfectly. Audio of video's play perfectly. Processes start faster than sched_granularity_ns 200 Browsing the web faster than 200, but still not as fast as mainline (better than cfs) Already open applications are responsive. Behavior of video: video's both moving forward. totem is doing ~0.5fps. mplayer is doing ~2 fps. Visibly slower refreshing vs sched_granularity_ns 200 For some reason top doesn't seem reliable considering the system was stressed at the time? - To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: "REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio"
On 27/04/07, hechacker1 <[EMAIL PROTECTED]> wrote: "REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio" Hardware: Dell Inspiron 700m laptop 1.7GHz Pentium M (Dothan 2M cache) 2GB RAM 1000Hz Gentoo Linux dyn-tick 700m # cat /sys/devices/system/cpu/cpu0/cpufreq/ondemand/sampling_rate 1 (microseconds, 10ms) 855gm integrated video/chipset xf86-video-i810 (intel 1.7.4) DRI enabled xorg-server-1.2.0-r3 beryl-core 0.3.0-svn MPlayer dev-SVN-rUNKNOWN-4.1.2 - x11 Gnome totem 2.16.5 - x11-gstreamer reiser4 w/cryptcompress Screenshot: http://ordorica.org/misc/beryl.png muine playing mp3's off mounted windows share Tests run under 16 bit color which provides a constant 75 fps on one cube side (fps forced limited). Drops to ~45-50 fps during animation/rotate/scale (depending on complexity of rendering) Vsync off. 75Hz refresh 1280x800. totem running fullscreen playing 700MB divx "An Inconvenient Truth.avi" on one side of cube/desktop gmplayer running fullscreen on another cube side (same file). The given observations/numbers are when I move the cube with my mouse and view two faces at one time (see screenshot). One face is playing the totem video, the other containing my terminals. Some numbers I've seen other people throw around: I don't know their relevance. cfs-v6: 700m kernel # cat sched_granularity_ns 500 procs ---memory-- ---swap-- -io -system-- cpu r b swpd free buff cache si sobibo in cs us sy id wa 5 0 0 221480300 139461200 181 0 6068 5317 69 6 25 0 4 0 0 220880300 139526800 176 0 6147 5579 68 6 27 0 1 0 0 220340300 139576800 167 0 6052 5393 70 6 24 0 6 0 0 219920300 139620400 103 0 5830 5211 73 6 21 0 top - 18:31:17 up 7:45, 5 users, load average: 5.18, 4.73, 4.28 Tasks: 98 total, 4 running, 94 sleeping, 0 stopped, 0 zombie Cpu(s): 91.6%us, 6.4%sy, 0.0%ni, 0.3%id, 0.0%wa, 1.3%hi, 0.3%si, 0.0%st Mem: 2057700k total, 1845952k used, 211748k free, 300k buffers Swap: 987988k total,0k used, 987988k free, 1404040k cached PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 18046 hechacke 20 0 189m 83m 20m S 38.7 4.2 12:04.64 totem 18059 hechacke 20 0 51280 30m 18m R 25.8 1.5 9:47.36 gmplayer 12117 root 20 0 275m 54m 18m R 20.2 2.7 15:18.38 Xorg 22730 hechacke 20 0 119m 35m 18m R 5.3 1.7 0:12.68 mono 12350 hechacke 20 0 63820 6776 4328 S 3.6 0.3 2:20.36 beryl 16465 hechacke 20 0 43960 15m 10m S 2.3 0.8 0:07.14 gnome-terminal 12200 hechacke 20 0 5308 4016 1740 S 0.3 0.2 0:05.45 gconfd-2 12215 hechacke 20 0 38704 8956 7588 S 0.3 0.4 0:08.90 xfce4-clipman-p Observation: Music plays perfectly. Audio of video's play perfectly. New processes take forever to start. Firefox (already cached in ram) takes about 5 seconds to start; even right after closing it. Browsing the web is slow. Already open applications are responsive. Behavior of video: video's both moving forward. totem is updating about every half second. mplayer updates about every 3 seconds. - cfs-v6: 700m kernel # cat sched_granularity_ns 200 procs ---memory-- ---swap-- -io -system-- cpu r b swpd free buff cache si sobibo in cs us sy id wa 5 0 0 99604 44 151936400 0 0 3903 5575 91 5 5 0 3 0 0 99512 44 151936400 0 0 5990 6783 72 5 23 0 3 0 0 100412 44 151936400 0 0 6858 7261 67 5 28 0 1 0 0 100412 44 151936400 0 0 7426 7634 62 4 34 0 4 0 0 100288 44 151936400 0 0 7039 7442 60 6 34 0 top - 19:05:09 up 8:18, 5 users, load average: 3.62, 4.16, 4.28 Tasks: 98 total, 4 running, 94 sleeping, 0 stopped, 0 zombie Cpu(s): 69.8%us, 5.0%sy, 0.0%ni, 24.5%id, 0.0%wa, 0.7%hi, 0.0%si, 0.0%st Mem: 2057700k total, 2009396k used,48304k free, 300k buffers Swap: 987988k total,0k used, 987988k free, 1555428k cached PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 18059 hechacke 20 0 51332 30m 18m R 30.8 1.5 18:48.17 gmplayer 18046 hechacke 20 0 189m 83m 20m S 20.9 4.2 23:25.49 totem 12117 root 20 0 276m 57m 18m S 9.6 2.8 20:59.01 Xorg 22730 hechacke 20 0 129m 36m 18m R 8.6 1.8 1:28.59 mono 22930 hechacke 20 0 65480 8392 4320 S 4.0 0.4 0:53.38 beryl 12213 hechacke 20 0 34472 7680 6484 S 0.7 0.4 1:16.41 xfce4-battery-p Observation: Music plays perfectly. Audio of video's play perfectly. New processes take forever to start. Browsing the web is slow. Already open applications are responsive. Behavior of video: video's both moving forwar
Re: "REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio"
* hechacker1 <[EMAIL PROTECTED]> wrote: > "REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio" thanks for testing it out. one immediate observation i have is that you used a 2msec granularity setting on CFS, but even that did not cause context-switching as high as SD's rr_interval==2 setting: > cfs-v6: > 700m kernel # cat sched_granularity_ns > 200 > r b swpd free buff cachesi sobibo in cs us sy id > 1 0 0 100412 44 151936400 0 0 7426 7634 62 4 34 > 4 0 0 100288 44 151936400 0 0 7039 7442 60 6 34 > sd-0.46: > 700m kernel # cat rr_interval > 2 > r b swpd free buff cache si sobibo incs us sy id > 5 0 0 918052536 83284000 411 0 2387 15242 89 11 0 > 4 1 0 915600536 83490800 388 0 2283 15428 90 10 0 so SD context-switched twice as much and saturated the CPU fully, while under cfs-v6 there was 34% idle time left. That double context-switch rate and higher CPU utilization could easily result in you experiencing a 'smoother' desktop (and smoother video playback) on SD. could you try to maximize the preemption ratio on CFS by using a sched_granularity_ns of 0? Does that result in a higher context-switch rate and in better CPU utilization? Thanks, Ingo - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: "REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio"
On Thu, 2007-04-26 at 21:01 -0700, hechacker1 wrote: > Overall: > SD-0.46 is my new choice for scheduler. When not under load everything > run's better or similarly to cfs or mainline. Under load however it > shows the most responsiveness. > > Occasionally I had complete mouse freezes with cfs when the system was > busy. But rarely. > > Under SD i haven't seen anything get starved. > > mainline surprisingly works better than i expected, but beryl suffers > and responsiveness suffers under load. Your findings seems somewhat similar to mine, what i have observed is that SD is much more smooth, with vanilla/cfs, under just abit load opengl stuff will stutter, whereas with sd it will simply get slightly(or more, depending on the load) lower fps. > > --hechacker1 > - > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to [EMAIL PROTECTED] > More majordomo info at http://vger.kernel.org/majordomo-info.html > Please read the FAQ at http://www.tux.org/lkml/ > - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio
On Thu, 2007-04-26 at 21:01 -0700, hechacker1 wrote: snip Overall: SD-0.46 is my new choice for scheduler. When not under load everything run's better or similarly to cfs or mainline. Under load however it shows the most responsiveness. Occasionally I had complete mouse freezes with cfs when the system was busy. But rarely. Under SD i haven't seen anything get starved. mainline surprisingly works better than i expected, but beryl suffers and responsiveness suffers under load. Your findings seems somewhat similar to mine, what i have observed is that SD is much more smooth, with vanilla/cfs, under just abit load opengl stuff will stutter, whereas with sd it will simply get slightly(or more, depending on the load) lower fps. --hechacker1 - To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/ - To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio
* hechacker1 [EMAIL PROTECTED] wrote: REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio thanks for testing it out. one immediate observation i have is that you used a 2msec granularity setting on CFS, but even that did not cause context-switching as high as SD's rr_interval==2 setting: cfs-v6: 700m kernel # cat sched_granularity_ns 200 r b swpd free buff cachesi sobibo in cs us sy id 1 0 0 100412 44 151936400 0 0 7426 7634 62 4 34 4 0 0 100288 44 151936400 0 0 7039 7442 60 6 34 sd-0.46: 700m kernel # cat rr_interval 2 r b swpd free buff cache si sobibo incs us sy id 5 0 0 918052536 83284000 411 0 2387 15242 89 11 0 4 1 0 915600536 83490800 388 0 2283 15428 90 10 0 so SD context-switched twice as much and saturated the CPU fully, while under cfs-v6 there was 34% idle time left. That double context-switch rate and higher CPU utilization could easily result in you experiencing a 'smoother' desktop (and smoother video playback) on SD. could you try to maximize the preemption ratio on CFS by using a sched_granularity_ns of 0? Does that result in a higher context-switch rate and in better CPU utilization? Thanks, Ingo - To unsubscribe from this list: send the line unsubscribe linux-kernel in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/
Re: REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio
On 27/04/07, hechacker1 [EMAIL PROTECTED] wrote: REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio Hardware: Dell Inspiron 700m laptop 1.7GHz Pentium M (Dothan 2M cache) 2GB RAM 1000Hz Gentoo Linux dyn-tick 700m # cat /sys/devices/system/cpu/cpu0/cpufreq/ondemand/sampling_rate 1 (microseconds, 10ms) 855gm integrated video/chipset xf86-video-i810 (intel 1.7.4) DRI enabled xorg-server-1.2.0-r3 beryl-core 0.3.0-svn MPlayer dev-SVN-rUNKNOWN-4.1.2 - x11 Gnome totem 2.16.5 - x11-gstreamer reiser4 w/cryptcompress Screenshot: http://ordorica.org/misc/beryl.png muine playing mp3's off mounted windows share Tests run under 16 bit color which provides a constant 75 fps on one cube side (fps forced limited). Drops to ~45-50 fps during animation/rotate/scale (depending on complexity of rendering) Vsync off. 75Hz refresh 1280x800. totem running fullscreen playing 700MB divx An Inconvenient Truth.avi on one side of cube/desktop gmplayer running fullscreen on another cube side (same file). The given observations/numbers are when I move the cube with my mouse and view two faces at one time (see screenshot). One face is playing the totem video, the other containing my terminals. Some numbers I've seen other people throw around: I don't know their relevance. cfs-v6: 700m kernel # cat sched_granularity_ns 500 procs ---memory-- ---swap-- -io -system-- cpu r b swpd free buff cache si sobibo in cs us sy id wa 5 0 0 221480300 139461200 181 0 6068 5317 69 6 25 0 4 0 0 220880300 139526800 176 0 6147 5579 68 6 27 0 1 0 0 220340300 139576800 167 0 6052 5393 70 6 24 0 6 0 0 219920300 139620400 103 0 5830 5211 73 6 21 0 top - 18:31:17 up 7:45, 5 users, load average: 5.18, 4.73, 4.28 Tasks: 98 total, 4 running, 94 sleeping, 0 stopped, 0 zombie Cpu(s): 91.6%us, 6.4%sy, 0.0%ni, 0.3%id, 0.0%wa, 1.3%hi, 0.3%si, 0.0%st Mem: 2057700k total, 1845952k used, 211748k free, 300k buffers Swap: 987988k total,0k used, 987988k free, 1404040k cached PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 18046 hechacke 20 0 189m 83m 20m S 38.7 4.2 12:04.64 totem 18059 hechacke 20 0 51280 30m 18m R 25.8 1.5 9:47.36 gmplayer 12117 root 20 0 275m 54m 18m R 20.2 2.7 15:18.38 Xorg 22730 hechacke 20 0 119m 35m 18m R 5.3 1.7 0:12.68 mono 12350 hechacke 20 0 63820 6776 4328 S 3.6 0.3 2:20.36 beryl 16465 hechacke 20 0 43960 15m 10m S 2.3 0.8 0:07.14 gnome-terminal 12200 hechacke 20 0 5308 4016 1740 S 0.3 0.2 0:05.45 gconfd-2 12215 hechacke 20 0 38704 8956 7588 S 0.3 0.4 0:08.90 xfce4-clipman-p Observation: Music plays perfectly. Audio of video's play perfectly. New processes take forever to start. Firefox (already cached in ram) takes about 5 seconds to start; even right after closing it. Browsing the web is slow. Already open applications are responsive. Behavior of video: video's both moving forward. totem is updating about every half second. mplayer updates about every 3 seconds. - cfs-v6: 700m kernel # cat sched_granularity_ns 200 procs ---memory-- ---swap-- -io -system-- cpu r b swpd free buff cache si sobibo in cs us sy id wa 5 0 0 99604 44 151936400 0 0 3903 5575 91 5 5 0 3 0 0 99512 44 151936400 0 0 5990 6783 72 5 23 0 3 0 0 100412 44 151936400 0 0 6858 7261 67 5 28 0 1 0 0 100412 44 151936400 0 0 7426 7634 62 4 34 0 4 0 0 100288 44 151936400 0 0 7039 7442 60 6 34 0 top - 19:05:09 up 8:18, 5 users, load average: 3.62, 4.16, 4.28 Tasks: 98 total, 4 running, 94 sleeping, 0 stopped, 0 zombie Cpu(s): 69.8%us, 5.0%sy, 0.0%ni, 24.5%id, 0.0%wa, 0.7%hi, 0.0%si, 0.0%st Mem: 2057700k total, 2009396k used,48304k free, 300k buffers Swap: 987988k total,0k used, 987988k free, 1555428k cached PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 18059 hechacke 20 0 51332 30m 18m R 30.8 1.5 18:48.17 gmplayer 18046 hechacke 20 0 189m 83m 20m S 20.9 4.2 23:25.49 totem 12117 root 20 0 276m 57m 18m S 9.6 2.8 20:59.01 Xorg 22730 hechacke 20 0 129m 36m 18m R 8.6 1.8 1:28.59 mono 22930 hechacke 20 0 65480 8392 4320 S 4.0 0.4 0:53.38 beryl 12213 hechacke 20 0 34472 7680 6484 S 0.7 0.4 1:16.41 xfce4-battery-p Observation: Music plays perfectly. Audio of video's play perfectly. New processes take forever to start. Browsing the web is slow. Already open applications are responsive. Behavior of video: video's both moving forward. totem is updating about every 1/3
"REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio"
"REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio" Hardware: Dell Inspiron 700m laptop 1.7GHz Pentium M (Dothan 2M cache) 2GB RAM 1000Hz Gentoo Linux dyn-tick 700m # cat /sys/devices/system/cpu/cpu0/cpufreq/ondemand/sampling_rate 1 (microseconds, 10ms) 855gm integrated video/chipset xf86-video-i810 (intel 1.7.4) DRI enabled xorg-server-1.2.0-r3 beryl-core 0.3.0-svn MPlayer dev-SVN-rUNKNOWN-4.1.2 - x11 Gnome totem 2.16.5 - x11-gstreamer reiser4 w/cryptcompress Screenshot: http://ordorica.org/misc/beryl.png muine playing mp3's off mounted windows share Tests run under 16 bit color which provides a constant 75 fps on one cube side (fps forced limited). Drops to ~45-50 fps during animation/rotate/scale (depending on complexity of rendering) Vsync off. 75Hz refresh 1280x800. totem running fullscreen playing 700MB divx "An Inconvenient Truth.avi" on one side of cube/desktop gmplayer running fullscreen on another cube side (same file). The given observations/numbers are when I move the cube with my mouse and view two faces at one time (see screenshot). One face is playing the totem video, the other containing my terminals. Some numbers I've seen other people throw around: I don't know their relevance. cfs-v6: 700m kernel # cat sched_granularity_ns 500 procs ---memory-- ---swap-- -io -system-- cpu r b swpd free buff cache si sobibo in cs us sy id wa 5 0 0 221480300 139461200 181 0 6068 5317 69 6 25 0 4 0 0 220880300 139526800 176 0 6147 5579 68 6 27 0 1 0 0 220340300 139576800 167 0 6052 5393 70 6 24 0 6 0 0 219920300 139620400 103 0 5830 5211 73 6 21 0 top - 18:31:17 up 7:45, 5 users, load average: 5.18, 4.73, 4.28 Tasks: 98 total, 4 running, 94 sleeping, 0 stopped, 0 zombie Cpu(s): 91.6%us, 6.4%sy, 0.0%ni, 0.3%id, 0.0%wa, 1.3%hi, 0.3%si, 0.0%st Mem: 2057700k total, 1845952k used, 211748k free, 300k buffers Swap: 987988k total,0k used, 987988k free, 1404040k cached PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 18046 hechacke 20 0 189m 83m 20m S 38.7 4.2 12:04.64 totem 18059 hechacke 20 0 51280 30m 18m R 25.8 1.5 9:47.36 gmplayer 12117 root 20 0 275m 54m 18m R 20.2 2.7 15:18.38 Xorg 22730 hechacke 20 0 119m 35m 18m R 5.3 1.7 0:12.68 mono 12350 hechacke 20 0 63820 6776 4328 S 3.6 0.3 2:20.36 beryl 16465 hechacke 20 0 43960 15m 10m S 2.3 0.8 0:07.14 gnome-terminal 12200 hechacke 20 0 5308 4016 1740 S 0.3 0.2 0:05.45 gconfd-2 12215 hechacke 20 0 38704 8956 7588 S 0.3 0.4 0:08.90 xfce4-clipman-p Observation: Music plays perfectly. Audio of video's play perfectly. New processes take forever to start. Firefox (already cached in ram) takes about 5 seconds to start; even right after closing it. Browsing the web is slow. Already open applications are responsive. Behavior of video: video's both moving forward. totem is updating about every half second. mplayer updates about every 3 seconds. - cfs-v6: 700m kernel # cat sched_granularity_ns 200 procs ---memory-- ---swap-- -io -system-- cpu r b swpd free buff cache si sobibo in cs us sy id wa 5 0 0 99604 44 151936400 0 0 3903 5575 91 5 5 0 3 0 0 99512 44 151936400 0 0 5990 6783 72 5 23 0 3 0 0 100412 44 151936400 0 0 6858 7261 67 5 28 0 1 0 0 100412 44 151936400 0 0 7426 7634 62 4 34 0 4 0 0 100288 44 151936400 0 0 7039 7442 60 6 34 0 top - 19:05:09 up 8:18, 5 users, load average: 3.62, 4.16, 4.28 Tasks: 98 total, 4 running, 94 sleeping, 0 stopped, 0 zombie Cpu(s): 69.8%us, 5.0%sy, 0.0%ni, 24.5%id, 0.0%wa, 0.7%hi, 0.0%si, 0.0%st Mem: 2057700k total, 2009396k used,48304k free, 300k buffers Swap: 987988k total,0k used, 987988k free, 1555428k cached PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 18059 hechacke 20 0 51332 30m 18m R 30.8 1.5 18:48.17 gmplayer 18046 hechacke 20 0 189m 83m 20m S 20.9 4.2 23:25.49 totem 12117 root 20 0 276m 57m 18m S 9.6 2.8 20:59.01 Xorg 22730 hechacke 20 0 129m 36m 18m R 8.6 1.8 1:28.59 mono 22930 hechacke 20 0 65480 8392 4320 S 4.0 0.4 0:53.38 beryl 12213 hechacke 20 0 34472 7680 6484 S 0.7 0.4 1:16.41 xfce4-battery-p Observation: Music plays perfectly. Audio of video's play perfectly. New processes take forever to start. Browsing the web is slow. Already open applications are responsive. Behavior of video: video's both moving forward. totem is updating about every 1/3 second. mplayer updates about every
REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio
REPORT: sd-0.46 vs cfs-v6 vs mainline 2.6.21-rc7 Beryl + Video + Audio Hardware: Dell Inspiron 700m laptop 1.7GHz Pentium M (Dothan 2M cache) 2GB RAM 1000Hz Gentoo Linux dyn-tick 700m # cat /sys/devices/system/cpu/cpu0/cpufreq/ondemand/sampling_rate 1 (microseconds, 10ms) 855gm integrated video/chipset xf86-video-i810 (intel 1.7.4) DRI enabled xorg-server-1.2.0-r3 beryl-core 0.3.0-svn MPlayer dev-SVN-rUNKNOWN-4.1.2 - x11 Gnome totem 2.16.5 - x11-gstreamer reiser4 w/cryptcompress Screenshot: http://ordorica.org/misc/beryl.png muine playing mp3's off mounted windows share Tests run under 16 bit color which provides a constant 75 fps on one cube side (fps forced limited). Drops to ~45-50 fps during animation/rotate/scale (depending on complexity of rendering) Vsync off. 75Hz refresh 1280x800. totem running fullscreen playing 700MB divx An Inconvenient Truth.avi on one side of cube/desktop gmplayer running fullscreen on another cube side (same file). The given observations/numbers are when I move the cube with my mouse and view two faces at one time (see screenshot). One face is playing the totem video, the other containing my terminals. Some numbers I've seen other people throw around: I don't know their relevance. cfs-v6: 700m kernel # cat sched_granularity_ns 500 procs ---memory-- ---swap-- -io -system-- cpu r b swpd free buff cache si sobibo in cs us sy id wa 5 0 0 221480300 139461200 181 0 6068 5317 69 6 25 0 4 0 0 220880300 139526800 176 0 6147 5579 68 6 27 0 1 0 0 220340300 139576800 167 0 6052 5393 70 6 24 0 6 0 0 219920300 139620400 103 0 5830 5211 73 6 21 0 top - 18:31:17 up 7:45, 5 users, load average: 5.18, 4.73, 4.28 Tasks: 98 total, 4 running, 94 sleeping, 0 stopped, 0 zombie Cpu(s): 91.6%us, 6.4%sy, 0.0%ni, 0.3%id, 0.0%wa, 1.3%hi, 0.3%si, 0.0%st Mem: 2057700k total, 1845952k used, 211748k free, 300k buffers Swap: 987988k total,0k used, 987988k free, 1404040k cached PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 18046 hechacke 20 0 189m 83m 20m S 38.7 4.2 12:04.64 totem 18059 hechacke 20 0 51280 30m 18m R 25.8 1.5 9:47.36 gmplayer 12117 root 20 0 275m 54m 18m R 20.2 2.7 15:18.38 Xorg 22730 hechacke 20 0 119m 35m 18m R 5.3 1.7 0:12.68 mono 12350 hechacke 20 0 63820 6776 4328 S 3.6 0.3 2:20.36 beryl 16465 hechacke 20 0 43960 15m 10m S 2.3 0.8 0:07.14 gnome-terminal 12200 hechacke 20 0 5308 4016 1740 S 0.3 0.2 0:05.45 gconfd-2 12215 hechacke 20 0 38704 8956 7588 S 0.3 0.4 0:08.90 xfce4-clipman-p Observation: Music plays perfectly. Audio of video's play perfectly. New processes take forever to start. Firefox (already cached in ram) takes about 5 seconds to start; even right after closing it. Browsing the web is slow. Already open applications are responsive. Behavior of video: video's both moving forward. totem is updating about every half second. mplayer updates about every 3 seconds. - cfs-v6: 700m kernel # cat sched_granularity_ns 200 procs ---memory-- ---swap-- -io -system-- cpu r b swpd free buff cache si sobibo in cs us sy id wa 5 0 0 99604 44 151936400 0 0 3903 5575 91 5 5 0 3 0 0 99512 44 151936400 0 0 5990 6783 72 5 23 0 3 0 0 100412 44 151936400 0 0 6858 7261 67 5 28 0 1 0 0 100412 44 151936400 0 0 7426 7634 62 4 34 0 4 0 0 100288 44 151936400 0 0 7039 7442 60 6 34 0 top - 19:05:09 up 8:18, 5 users, load average: 3.62, 4.16, 4.28 Tasks: 98 total, 4 running, 94 sleeping, 0 stopped, 0 zombie Cpu(s): 69.8%us, 5.0%sy, 0.0%ni, 24.5%id, 0.0%wa, 0.7%hi, 0.0%si, 0.0%st Mem: 2057700k total, 2009396k used,48304k free, 300k buffers Swap: 987988k total,0k used, 987988k free, 1555428k cached PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 18059 hechacke 20 0 51332 30m 18m R 30.8 1.5 18:48.17 gmplayer 18046 hechacke 20 0 189m 83m 20m S 20.9 4.2 23:25.49 totem 12117 root 20 0 276m 57m 18m S 9.6 2.8 20:59.01 Xorg 22730 hechacke 20 0 129m 36m 18m R 8.6 1.8 1:28.59 mono 22930 hechacke 20 0 65480 8392 4320 S 4.0 0.4 0:53.38 beryl 12213 hechacke 20 0 34472 7680 6484 S 0.7 0.4 1:16.41 xfce4-battery-p Observation: Music plays perfectly. Audio of video's play perfectly. New processes take forever to start. Browsing the web is slow. Already open applications are responsive. Behavior of video: video's both moving forward. totem is updating about every 1/3 second. mplayer updates about every 1 seconds. Rotating