On Thu, 22 Sep 2016, zijun_hu wrote:

> On 2016/9/22 5:21, David Rientjes wrote:
> > On Wed, 21 Sep 2016, zijun_hu wrote:
> > 
> >> From: zijun_hu <zijun...@htc.com>
> >>
> >> correct lazy_max_pages() return value if the number of online
> >> CPUs is power of 2
> >>
> >> Signed-off-by: zijun_hu <zijun...@htc.com>
> >> ---
> >>  mm/vmalloc.c | 4 +++-
> >>  1 file changed, 3 insertions(+), 1 deletion(-)
> >>
> >> diff --git a/mm/vmalloc.c b/mm/vmalloc.c
> >> index a125ae8..2804224 100644
> >> --- a/mm/vmalloc.c
> >> +++ b/mm/vmalloc.c
> >> @@ -594,7 +594,9 @@ static unsigned long lazy_max_pages(void)
> >>  {
> >>    unsigned int log;
> >>  
> >> -  log = fls(num_online_cpus());
> >> +  log = num_online_cpus();
> >> +  if (log > 1)
> >> +          log = (unsigned int)get_count_order(log);
> >>  
> >>    return log * (32UL * 1024 * 1024 / PAGE_SIZE);
> >>  }
> > 
> > The implementation of lazy_max_pages() is somewhat arbitrarily defined, 
> > the existing approximation has been around for eight years and 
> > num_online_cpus() isn't intended to be rounded up to the next power of 2.  
> > I'd be inclined to just leave it as it is.
> > 
> do i understand the intent in current code logic as below ?
> [8, 15) roundup to 16?
> [32, 63) roundup to 64?
> 

The intent is as it is implemented; with your change, lazy_max_pages() is 
potentially increased depending on the number of online cpus.  This is 
only a heuristic, changing it would need justification on why the new 
value is better.  It is opposite to what the comment says: "to be 
conservative and not introduce a big latency on huge systems, so go with
a less aggressive log scale."  NACK to the patch.

Reply via email to