> > Joakim Tjernlund wrote: > > > > > I realized that kernel space won't cause ITLB misses when CONFIG_PIN_TLB > > > is set. At least on > > > my system(no module support). > > > > Why do this? For years we have been trying to remove #ifdefs from the > > kernel, > > Then why did you add CONFIG_PIN_TLB in the first place? It is not required for > proper operation. It is a performance enhancement, just like my patch. > > > this does nothing but add useless configuration complexity, it doesn't save > > any space or execution time, > > But it does. 12 instructions removed and 5 added equals 7*4 bytes saved in > the fast path. > Executionwise you save 4 instructions: > stw r20, 0(r0) > andi. r21, r20, 0x0800 > beq 3f > lwz r21, 0(r0) > > There is one instruction between the load of the L1 entry and where it is > used by the cmplwi > instruction: > lwz r21, 0(r20) /* Get the level 1 entry */ > mfcr r20 > cmplwi cr0,r21,0x0fff > > > and the first time you load a module the kernel > > will crash. > > So I have to add a test for module support, not a big deal. > > Jocke
Hi Again I have updated the patch to address the modules issue. I have also included my response to Dans mail since I forgot to include the list in that reply. Also, I count 24 instructions in the fast path with my patch and 31 instructions without my patch. 24 instructions == 6 cache lines and 31 instructions == 7.75 cachelines. 2 cachelines is saved by the patch. Jocke Index: arch/ppc/kernel/head_8xx.S =================================================================== RCS file: /home/cvsadmin/cvsroot/kernel/linuxppc/arch/ppc/kernel/head_8xx.S,v retrieving revision 1.10 diff -u -r1.10 head_8xx.S --- arch/ppc/kernel/head_8xx.S 8 Oct 2003 10:15:40 -0000 1.10 +++ arch/ppc/kernel/head_8xx.S 17 Oct 2003 14:47:34 -0000 @@ -325,8 +325,10 @@ lwz r3, 12(r0) #endif mtspr M_TW, r20 /* Save a couple of working registers */ +#if !CONFIG_PIN_TLB || CONFIG_MODULES mfcr r20 stw r20, 0(r0) +#endif stw r21, 4(r0) mfspr r20, SRR0 /* Get effective address of fault */ #ifdef CONFIG_8xx_CPU6 @@ -337,6 +339,7 @@ mtspr MD_EPN, r20 /* Have to use MD_EPN for walk, MI_EPN can't */ mfspr r20, M_TWB /* Get level 1 table entry address */ +#if !CONFIG_PIN_TLB || CONFIG_MODULES /* If we are faulting a kernel address, we have to use the * kernel page tables. */ @@ -349,6 +352,13 @@ lwz r21, 0(r20) /* Get the level 1 entry */ rlwinm. r20, r21,0,0,19 /* Extract page descriptor page address */ beq 2f /* If zero, don't try to find a pte */ +#else + lwz r21, 0(r20) /* Get the level 1 entry */ + mfcr r20 + cmplwi cr0,r21,0x0fff /* Test page descriptor page address */ + bng- 2f /* If zero, don't try to find a pte */ + mtcr r20 +#endif /* We have a pte table, so load the MI_TWC with the attributes * for this "segment." @@ -390,17 +400,24 @@ mtspr MI_RPN, r20 /* Update TLB entry */ mfspr r20, M_TW /* Restore registers */ +#if !CONFIG_PIN_TLB || CONFIG_MODULES lwz r21, 0(r0) mtcr r21 +#endif lwz r21, 4(r0) #ifdef CONFIG_8xx_CPU6 lwz r3, 8(r0) #endif rfi -2: mfspr r20, M_TW /* Restore registers */ +2: +#if !CONFIG_PIN_TLB || CONFIG_MODULES lwz r21, 0(r0) mtcr r21 +#else + mtcr r20 +#endif + mfspr r20, M_TW /* Restore registers */ lwz r21, 4(r0) #ifdef CONFIG_8xx_CPU6 lwz r3, 8(r0) ** Sent via the linuxppc-embedded mail list. See http://lists.linuxppc.org/