Re: [PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop

2007-06-28 Thread Chuck Ebbert
On 06/25/2007 06:34 PM, Andi Kleen wrote:
> On Tuesday 26 June 2007 00:05:17 Chuck Ebbert wrote:
>> On 06/25/2007 05:38 PM, Loic Prylli wrote:
>>
>> [cc: Andi]
>>
>>> Processors synchronization in set_mtrr requires the .gate field
>>> to be set after .count field is properly initialized. Without an explicit
>>> barrier, the compiler was reordering those memory stores. That was sometimes
>>> causing a processor (in ipi_handler) to see the .gate change and
>>> decrement .count before the latter is set by set_mtrr() (which
>>> then hangs in a infinite loop with irqs disabled).
> 
> Hmm, perhaps we should just put the smp_wmb into atomic_set().
> Near all other atomic operations have memory barriers too. I think
> that would be the better fix.

Can we get something merged before 2.6.22-final?

The original patch seems okay...
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop

2007-06-28 Thread Chuck Ebbert
On 06/25/2007 06:34 PM, Andi Kleen wrote:
 On Tuesday 26 June 2007 00:05:17 Chuck Ebbert wrote:
 On 06/25/2007 05:38 PM, Loic Prylli wrote:

 [cc: Andi]

 Processors synchronization in set_mtrr requires the .gate field
 to be set after .count field is properly initialized. Without an explicit
 barrier, the compiler was reordering those memory stores. That was sometimes
 causing a processor (in ipi_handler) to see the .gate change and
 decrement .count before the latter is set by set_mtrr() (which
 then hangs in a infinite loop with irqs disabled).
 
 Hmm, perhaps we should just put the smp_wmb into atomic_set().
 Near all other atomic operations have memory barriers too. I think
 that would be the better fix.

Can we get something merged before 2.6.22-final?

The original patch seems okay...
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop

2007-06-25 Thread Loic Prylli
On 6/25/2007 6:34 PM, Andi Kleen wrote:
> On Tuesday 26 June 2007 00:05:17 Chuck Ebbert wrote:
>   
>> On 06/25/2007 05:38 PM, Loic Prylli wrote:
>>
>> [cc: Andi]
>>
>> 
>>> Processors synchronization in set_mtrr requires the .gate field
>>> to be set after .count field is properly initialized. Without an explicit
>>> barrier, the compiler was reordering those memory stores. That was sometimes
>>> causing a processor (in ipi_handler) to see the .gate change and
>>> decrement .count before the latter is set by set_mtrr() (which
>>> then hangs in a infinite loop with irqs disabled).
>>>   
>
> Hmm, perhaps we should just put the smp_wmb into atomic_set().
> Near all other atomic operations have memory barriers too. I think
> that would be the better fix.
>
> -Andi
>   


In Documentation/atomic_ops.txt atomic_set/atomic_read are described as
nothing more than a type-safe assignement or reading, without any extra
semantics. For other atomic operations, the rule is that any atomic
operation that doesn't return a value doesn't come with a barrier (and
any operation that returns the atomic value must have memory barriers).

So I guess you are suggesting to change the doc and the implementation
for all arches.

I should admit I did not knew a number of atomic operations did not
imply memory-barriers before. But maybe the extra cost might not be
completely negligible, especially if, for consistency with other
"barrier-implied" atomic operations, a new memory barrier is put before
and after,

Are you suggested changing just atomic_set(), or also other barrier-free
atomic operations :"atomic_dec", "atomic_inc", "atomic_add", "atomic_sub" ?

Independently of what is done to atomic, what about not making the .gate
field an atomic_t, but a simple "int" in the mttr code, since the only
operations done on it are atomic_read and atomic_set?


Loic

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop

2007-06-25 Thread Andi Kleen
On Tuesday 26 June 2007 00:05:17 Chuck Ebbert wrote:
> On 06/25/2007 05:38 PM, Loic Prylli wrote:
> 
> [cc: Andi]
> 
> > Processors synchronization in set_mtrr requires the .gate field
> > to be set after .count field is properly initialized. Without an explicit
> > barrier, the compiler was reordering those memory stores. That was sometimes
> > causing a processor (in ipi_handler) to see the .gate change and
> > decrement .count before the latter is set by set_mtrr() (which
> > then hangs in a infinite loop with irqs disabled).

Hmm, perhaps we should just put the smp_wmb into atomic_set().
Near all other atomic operations have memory barriers too. I think
that would be the better fix.

-Andi
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop

2007-06-25 Thread Chuck Ebbert
On 06/25/2007 05:38 PM, Loic Prylli wrote:

[cc: Andi]

> Processors synchronization in set_mtrr requires the .gate field
> to be set after .count field is properly initialized. Without an explicit
> barrier, the compiler was reordering those memory stores. That was sometimes
> causing a processor (in ipi_handler) to see the .gate change and
> decrement .count before the latter is set by set_mtrr() (which
> then hangs in a infinite loop with irqs disabled).
> 
> Signed-off-by: Loic Prylli <[EMAIL PROTECTED]>
> ---
>  arch/i386/kernel/cpu/mtrr/main.c |4 
>  1 files changed, 4 insertions(+), 0 deletions(-)
> 
> diff --git a/arch/i386/kernel/cpu/mtrr/main.c 
> b/arch/i386/kernel/cpu/mtrr/main.c
> index 55b0051..75dc6d5 100644
> --- a/arch/i386/kernel/cpu/mtrr/main.c
> +++ b/arch/i386/kernel/cpu/mtrr/main.c
> @@ -229,6 +229,8 @@ static void set_mtrr(unsigned int reg, unsigned long base,
>   data.smp_size = size;
>   data.smp_type = type;
>   atomic_set(, num_booting_cpus() - 1);
> + /* make sure data.count is visible before unleashing other CPUs */
> + smp_wmb();
>   atomic_set(,0);
>  
>   /*  Start the ball rolling on other CPUs  */
> @@ -242,6 +244,7 @@ static void set_mtrr(unsigned int reg, unsigned long base,
>  
>   /* ok, reset count and toggle gate */
>   atomic_set(, num_booting_cpus() - 1);
> + smp_wmb();
>   atomic_set(,1);
>  
>   /* do our MTRR business */
> @@ -260,6 +263,7 @@ static void set_mtrr(unsigned int reg, unsigned long base,
>   cpu_relax();
>  
>   atomic_set(, num_booting_cpus() - 1);
> + smp_wmb();
>   atomic_set(,0);
>  
>   /*
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


[PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop

2007-06-25 Thread Loic Prylli
Processors synchronization in set_mtrr requires the .gate field
to be set after .count field is properly initialized. Without an explicit
barrier, the compiler was reordering those memory stores. That was sometimes
causing a processor (in ipi_handler) to see the .gate change and
decrement .count before the latter is set by set_mtrr() (which
then hangs in a infinite loop with irqs disabled).

Signed-off-by: Loic Prylli <[EMAIL PROTECTED]>
---
 arch/i386/kernel/cpu/mtrr/main.c |4 
 1 files changed, 4 insertions(+), 0 deletions(-)

diff --git a/arch/i386/kernel/cpu/mtrr/main.c b/arch/i386/kernel/cpu/mtrr/main.c
index 55b0051..75dc6d5 100644
--- a/arch/i386/kernel/cpu/mtrr/main.c
+++ b/arch/i386/kernel/cpu/mtrr/main.c
@@ -229,6 +229,8 @@ static void set_mtrr(unsigned int reg, unsigned long base,
data.smp_size = size;
data.smp_type = type;
atomic_set(, num_booting_cpus() - 1);
+   /* make sure data.count is visible before unleashing other CPUs */
+   smp_wmb();
atomic_set(,0);
 
/*  Start the ball rolling on other CPUs  */
@@ -242,6 +244,7 @@ static void set_mtrr(unsigned int reg, unsigned long base,
 
/* ok, reset count and toggle gate */
atomic_set(, num_booting_cpus() - 1);
+   smp_wmb();
atomic_set(,1);
 
/* do our MTRR business */
@@ -260,6 +263,7 @@ static void set_mtrr(unsigned int reg, unsigned long base,
cpu_relax();
 
atomic_set(, num_booting_cpus() - 1);
+   smp_wmb();
atomic_set(,0);
 
/*
-- 1.5.2.2 

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


[PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop

2007-06-25 Thread Loic Prylli
Processors synchronization in set_mtrr requires the .gate field
to be set after .count field is properly initialized. Without an explicit
barrier, the compiler was reordering those memory stores. That was sometimes
causing a processor (in ipi_handler) to see the .gate change and
decrement .count before the latter is set by set_mtrr() (which
then hangs in a infinite loop with irqs disabled).

Signed-off-by: Loic Prylli [EMAIL PROTECTED]
---
 arch/i386/kernel/cpu/mtrr/main.c |4 
 1 files changed, 4 insertions(+), 0 deletions(-)

diff --git a/arch/i386/kernel/cpu/mtrr/main.c b/arch/i386/kernel/cpu/mtrr/main.c
index 55b0051..75dc6d5 100644
--- a/arch/i386/kernel/cpu/mtrr/main.c
+++ b/arch/i386/kernel/cpu/mtrr/main.c
@@ -229,6 +229,8 @@ static void set_mtrr(unsigned int reg, unsigned long base,
data.smp_size = size;
data.smp_type = type;
atomic_set(data.count, num_booting_cpus() - 1);
+   /* make sure data.count is visible before unleashing other CPUs */
+   smp_wmb();
atomic_set(data.gate,0);
 
/*  Start the ball rolling on other CPUs  */
@@ -242,6 +244,7 @@ static void set_mtrr(unsigned int reg, unsigned long base,
 
/* ok, reset count and toggle gate */
atomic_set(data.count, num_booting_cpus() - 1);
+   smp_wmb();
atomic_set(data.gate,1);
 
/* do our MTRR business */
@@ -260,6 +263,7 @@ static void set_mtrr(unsigned int reg, unsigned long base,
cpu_relax();
 
atomic_set(data.count, num_booting_cpus() - 1);
+   smp_wmb();
atomic_set(data.gate,0);
 
/*
-- 1.5.2.2 

-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop

2007-06-25 Thread Chuck Ebbert
On 06/25/2007 05:38 PM, Loic Prylli wrote:

[cc: Andi]

 Processors synchronization in set_mtrr requires the .gate field
 to be set after .count field is properly initialized. Without an explicit
 barrier, the compiler was reordering those memory stores. That was sometimes
 causing a processor (in ipi_handler) to see the .gate change and
 decrement .count before the latter is set by set_mtrr() (which
 then hangs in a infinite loop with irqs disabled).
 
 Signed-off-by: Loic Prylli [EMAIL PROTECTED]
 ---
  arch/i386/kernel/cpu/mtrr/main.c |4 
  1 files changed, 4 insertions(+), 0 deletions(-)
 
 diff --git a/arch/i386/kernel/cpu/mtrr/main.c 
 b/arch/i386/kernel/cpu/mtrr/main.c
 index 55b0051..75dc6d5 100644
 --- a/arch/i386/kernel/cpu/mtrr/main.c
 +++ b/arch/i386/kernel/cpu/mtrr/main.c
 @@ -229,6 +229,8 @@ static void set_mtrr(unsigned int reg, unsigned long base,
   data.smp_size = size;
   data.smp_type = type;
   atomic_set(data.count, num_booting_cpus() - 1);
 + /* make sure data.count is visible before unleashing other CPUs */
 + smp_wmb();
   atomic_set(data.gate,0);
  
   /*  Start the ball rolling on other CPUs  */
 @@ -242,6 +244,7 @@ static void set_mtrr(unsigned int reg, unsigned long base,
  
   /* ok, reset count and toggle gate */
   atomic_set(data.count, num_booting_cpus() - 1);
 + smp_wmb();
   atomic_set(data.gate,1);
  
   /* do our MTRR business */
 @@ -260,6 +263,7 @@ static void set_mtrr(unsigned int reg, unsigned long base,
   cpu_relax();
  
   atomic_set(data.count, num_booting_cpus() - 1);
 + smp_wmb();
   atomic_set(data.gate,0);
  
   /*
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop

2007-06-25 Thread Andi Kleen
On Tuesday 26 June 2007 00:05:17 Chuck Ebbert wrote:
 On 06/25/2007 05:38 PM, Loic Prylli wrote:
 
 [cc: Andi]
 
  Processors synchronization in set_mtrr requires the .gate field
  to be set after .count field is properly initialized. Without an explicit
  barrier, the compiler was reordering those memory stores. That was sometimes
  causing a processor (in ipi_handler) to see the .gate change and
  decrement .count before the latter is set by set_mtrr() (which
  then hangs in a infinite loop with irqs disabled).

Hmm, perhaps we should just put the smp_wmb into atomic_set().
Near all other atomic operations have memory barriers too. I think
that would be the better fix.

-Andi
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: [PATCH] MTRR: Fix race causing set_mtrr to go into infinite loop

2007-06-25 Thread Loic Prylli
On 6/25/2007 6:34 PM, Andi Kleen wrote:
 On Tuesday 26 June 2007 00:05:17 Chuck Ebbert wrote:
   
 On 06/25/2007 05:38 PM, Loic Prylli wrote:

 [cc: Andi]

 
 Processors synchronization in set_mtrr requires the .gate field
 to be set after .count field is properly initialized. Without an explicit
 barrier, the compiler was reordering those memory stores. That was sometimes
 causing a processor (in ipi_handler) to see the .gate change and
 decrement .count before the latter is set by set_mtrr() (which
 then hangs in a infinite loop with irqs disabled).
   

 Hmm, perhaps we should just put the smp_wmb into atomic_set().
 Near all other atomic operations have memory barriers too. I think
 that would be the better fix.

 -Andi
   


In Documentation/atomic_ops.txt atomic_set/atomic_read are described as
nothing more than a type-safe assignement or reading, without any extra
semantics. For other atomic operations, the rule is that any atomic
operation that doesn't return a value doesn't come with a barrier (and
any operation that returns the atomic value must have memory barriers).

So I guess you are suggesting to change the doc and the implementation
for all arches.

I should admit I did not knew a number of atomic operations did not
imply memory-barriers before. But maybe the extra cost might not be
completely negligible, especially if, for consistency with other
barrier-implied atomic operations, a new memory barrier is put before
and after,

Are you suggested changing just atomic_set(), or also other barrier-free
atomic operations :atomic_dec, atomic_inc, atomic_add, atomic_sub ?

Independently of what is done to atomic, what about not making the .gate
field an atomic_t, but a simple int in the mttr code, since the only
operations done on it are atomic_read and atomic_set?


Loic

-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/