Paul Menage wrote:
> On 10/30/06, Balbir Singh <[EMAIL PROTECTED]> wrote:
>> You'll also end up with per-zone page cache pools, and a list of
>> active/inactive pages per zone (which will split up the global LRU
>> list).
>
> Yes, these are some of the inefficiencies that we're ironing out.
>
>> What about the hard partitioning? If a container/cpuset is not using
>> its full 64MB of a fake node, can some other container use it?
>
> No. So the granularity at which you can divide up the system depends
> on how big your fake nodes are. For our purposes, we figure that 64MB
> granularity should be fine.
>
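Just to check my understanding of the mechanics, here is roughly how I
picture a job being confined to a couple of 64MB fake nodes through the
cpuset filesystem. This is only a sketch: the numa=fake= boot setup, the
/dev/cpuset mount point, the cpuset name and the node/cpu numbers are
all assumptions on my part, not taken from your patches.

/*
 * Sketch: confine PID 1234 to two fake nodes (~128MB) via the cpuset
 * fs. Assumes the kernel was booted with numa=fake=<N> and that
 * "mount -t cpuset cpuset /dev/cpuset" has already been done.
 */
#include <errno.h>
#include <stdio.h>
#include <stdlib.h>
#include <sys/stat.h>

static void write_file(const char *path, const char *val)
{
        FILE *f = fopen(path, "w");

        if (!f || fprintf(f, "%s", val) < 0) {
                perror(path);
                exit(1);
        }
        fclose(f);
}

int main(void)
{
        if (mkdir("/dev/cpuset/container_a", 0755) && errno != EEXIST) {
                perror("mkdir");
                return 1;
        }

        /* Two 64MB fake nodes give the container a ~128MB hard limit. */
        write_file("/dev/cpuset/container_a/mems", "0-1");
        write_file("/dev/cpuset/container_a/cpus", "0-3");
        /* Attach the job; its allocations are confined from here on. */
        write_file("/dev/cpuset/container_a/tasks", "1234");
        return 0;
}
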
I am still a little concerned about how limit size changes will be
implemented. Will the cpuset "mems" field change to reflect the changed
limits?

>> Also, won't you end up with a big zonelist?
>
> Yes - but PaulJ's recent patch to speed up the zone selection helped
> reduce the overhead of this a lot.

Great! Let me find those patches.

>> Consider the other side of the story. Let's say we have a shared
>> library shared among quite a few containers. We limit the usage of
>> the inode containing the shared library to 50MB. Tasks A and B use
>> some part of the library and cause the container "C" to reach the
>> limit. Container C is charged for all usage of the shared library.
>> Now no other task, irrespective of which container it belongs to,
>> can touch any new pages of the shared library.
>
> Well, if the pages aren't mlocked then presumably some of the existing
> pages can be flushed out to disk and replaced with other pages.
>
>> What you are suggesting is to virtually group the inodes by container
>> rather than by task. It might make sense in some cases, but not all.
>
> Right - I think it's an important feature to be able to support, but I
> agree that it's not suitable for all situations.

>> We could consider implementing the controllers in phases:
>>
>> 1. RSS control (anon + mapped pages)
>> 2. Page cache control
>
> Page cache control is actually more essential than RSS control, in our
> experience - it's pretty easy to track RSS values from userspace, and
> react reasonably quickly to kill things that go over their limit, but
> determining page cache usage (i.e. determining which job on the system
> is flooding the page cache with dirty buffers) is pretty much
> impossible currently.

Hmm... interesting. Why do you think it's impossible? What kinds of
issues have you run into?

> Paul

--
Balbir Singh,
Linux Technology Center,
IBM Software Labs
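
P.S. To make the userspace asymmetry above concrete, here is the sort of
thing a monitoring daemon is limited to today: per-task RSS is exported
in /proc/<pid>/status, but the page cache and dirty pages show up only
as system-wide totals in /proc/meminfo, with nothing to attribute them
to a particular job. A minimal sketch (error handling mostly omitted):

#include <stdio.h>
#include <string.h>

static void grep_file(const char *path, const char *prefix)
{
        char line[256];
        FILE *f = fopen(path, "r");

        if (!f)
                return;
        while (fgets(line, sizeof(line), f))
                if (!strncmp(line, prefix, strlen(prefix)))
                        fputs(line, stdout);
        fclose(f);
}

int main(int argc, char **argv)
{
        char path[64];

        /* Per-task: easy to poll and police from userspace. */
        snprintf(path, sizeof(path), "/proc/%s/status",
                 argc > 1 ? argv[1] : "self");
        grep_file(path, "VmRSS:");

        /* Page cache: only global figures, no per-job breakdown. */
        grep_file("/proc/meminfo", "Cached:");
        grep_file("/proc/meminfo", "Dirty:");
        return 0;
}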