Re: [PATCH v5 2/2] arm64: support batched/deferred tlb shootdown during page reclamation

2022-11-14 Thread Yicong Yang
On 2022/11/14 22:19, Anshuman Khandual wrote: > > > On 11/14/22 14:16, Yicong Yang wrote: >> On 2022/11/14 11:29, Anshuman Khandual wrote: >>> >>> On 10/28/22 13:42, Yicong Yang wrote: +static inline bool arch_tlbbatch_should_defer(struct mm_struct *mm) +{ + /* + * TLB

Re: [PATCH v5 2/2] arm64: support batched/deferred tlb shootdown during page reclamation

2022-11-14 Thread Anshuman Khandual
On 11/14/22 14:16, Yicong Yang wrote: > On 2022/11/14 11:29, Anshuman Khandual wrote: >> >> On 10/28/22 13:42, Yicong Yang wrote: >>> +static inline bool arch_tlbbatch_should_defer(struct mm_struct *mm) >>> +{ >>> + /* >>> +* TLB batched flush is proved to be beneficial for systems with

Re: [PATCH v5 2/2] arm64: support batched/deferred tlb shootdown during page reclamation

2022-11-14 Thread Yicong Yang
On 2022/11/14 11:29, Anshuman Khandual wrote: > > > On 10/28/22 13:42, Yicong Yang wrote: >> +static inline bool arch_tlbbatch_should_defer(struct mm_struct *mm) >> +{ >> +/* >> + * TLB batched flush is proved to be beneficial for systems with large >> + * number of CPUs, especially

Re: [PATCH v5 2/2] arm64: support batched/deferred tlb shootdown during page reclamation

2022-11-14 Thread haoxin
在 2022/10/28 下午4:12, Yicong Yang 写道: From: Barry Song on x86, batched and deferred tlb shootdown has lead to 90% performance increase on tlb shootdown. on arm64, HW can do tlb shootdown without software IPI. But sync tlbi is still quite expensive. Even running a simplest program which

Re: [PATCH v5 2/2] arm64: support batched/deferred tlb shootdown during page reclamation

2022-11-13 Thread Anshuman Khandual
On 10/28/22 13:42, Yicong Yang wrote: > +static inline bool arch_tlbbatch_should_defer(struct mm_struct *mm) > +{ > + /* > + * TLB batched flush is proved to be beneficial for systems with large > + * number of CPUs, especially system with more than 8 CPUs. TLB shutdown > + *

[PATCH v5 2/2] arm64: support batched/deferred tlb shootdown during page reclamation

2022-10-28 Thread Yicong Yang
From: Barry Song on x86, batched and deferred tlb shootdown has lead to 90% performance increase on tlb shootdown. on arm64, HW can do tlb shootdown without software IPI. But sync tlbi is still quite expensive. Even running a simplest program which requires swapout can prove this is true,