Here is a set of patches that implements a *simple minded* RSS controller for containers. It would be nice to split up the memory controller design and implementation into phases:
1. RSS control
2. Page Cache control (with split clean and dirty accounting/control)
3. mlock() control
4. Kernel accounting and control

The beancounter implementation follows a very similar approach. The split up
makes the design of the controller easier. RSS, for example, can be tracked
per mm_struct. Page Cache could be tracked per inode, per thread or per
mm_struct (depending on what form is most suitable).

The definition of RSS was debated on lkml, please see

    http://lkml.org/lkml/2006/10/10/130

This patchset is a proof-of-concept implementation, and the accounting can
easily be adapted to meet the definition of RSS as and when it is redefined
or revisited. The changes required should be small.

The reclamation logic has been borrowed from Dave Hansen's challenged memory
controller and from shrink_all_memory(). The accounting was inspired by
Rohit Seth's container patches.

The good
--------

No additional pointers required in struct page. There is also a lot of scope
for code reuse in tracking the rss of a process (this reuse is yet to be
exploited).

The not so good
---------------

The patches contain a lot of debugging code.

Applying the patches
--------------------

This patchset has been developed on top of 2.6.19-rc2 with the latest
containers patch applied. To run and test this patch, additional fixes are
required. Please see

    http://lkml.org/lkml/2006/11/6/10
    http://lkml.org/lkml/2006/11/6/245

Series
------

container-res-groups-fix-parsing.patch
container-memctlr-setup.patch
container-memctlr-callbacks.patch
container-memctlr-acct.patch
container-memctlr-task-migration.patch
container-memctlr-shares.patch
container-memctlr-reclaim.patch

Setup
-----

To test the series, here's what you need to do

0. Get the latest containers patches against 2.6.19-rc2
1. Apply all the fixes
2. Apply these patches
3. Build the kernel and mount the container filesystem
    mount -t container container /container
4. Disable cpusets (to simplify assignment of tasks to resource groups)
    cd /container
    echo 0 > cpuset_enabled
5. Add the current task to a new group
    mkdir /container/a
    echo $$ > tasks
    cat memctlr_stats
6. Set limits
    echo "res=memctlr,max_shares=10" > memctlr_shares
7. Spin the system, hang it, revolve it, crash it!!
8. Please provide feedback, both code review and anything else that can be
   useful for further development

Testing
-------

Kernbench was run on these patches and it did not show any significant
overhead in the tests.

--
    Balbir Singh,
    Linux Technology Center,
    IBM Software Labs

_______________________________________________
ckrm-tech mailing list
https://lists.sourceforge.net/lists/listinfo/ckrm-tech
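
For convenience, setup steps 3 to 6 can be collected into a few shell functions that use explicit paths instead of relying on the current directory. This is only a sketch. The function names are hypothetical. It assumes that tasks, memctlr_stats and memctlr_shares are per-group files inside the newly created group directory, which the steps above do not state explicitly.

```shell
#!/bin/sh
# Sketch only: wraps setup steps 3-6 with explicit paths.
# Assumption (not stated in the original mail): tasks, memctlr_stats and
# memctlr_shares live inside the new group's directory.
# All function names here are hypothetical.

# mount_containers ROOT: step 3, mount the container filesystem (needs root).
mount_containers() {
    mkdir -p "$1" && mount -t container container "$1"
}

# disable_cpusets ROOT: step 4, turn cpusets off at the top level.
disable_cpusets() {
    echo 0 > "$1/cpuset_enabled"
}

# join_group ROOT GROUP MAX_SHARES: steps 5 and 6 for the calling shell.
join_group() {
    group_dir="$1/$2"
    mkdir -p "$group_dir" || return 1
    # Move the current shell into the new group.
    echo "$$" > "$group_dir/tasks" || return 1
    # Print the statistics if the file is present.
    [ -r "$group_dir/memctlr_stats" ] && cat "$group_dir/memctlr_stats"
    # Set the limit using the string format from step 6.
    echo "res=memctlr,max_shares=$3" > "$group_dir/memctlr_shares"
}
```

On a patched kernel, and as root, this would presumably be used as `mount_containers /container && disable_cpusets /container && join_group /container a 10`.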