Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-10 Thread Tony Luck
>> Signed-off-by: Naveen N. Rao > > Acked-by: Borislav Petkov Applied-by: Tony Luck :-) Naveen: Thanks for having this idea, implementing it, and sticking with it through the review process. Once 3.11-rc1 is out I'll ask Ingo to pull this series to the tip tree ... and then on to 3.12 -Tony

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-10 Thread Borislav Petkov
On Wed, Jul 10, 2013 at 02:57:01PM +0530, Naveen N. Rao wrote: > On 07/09/2013 12:30 AM, Tony Luck wrote: > > I was off on vacation last week - looks like you got lots done without me > > :-) > > > > I have parts 1 & 2 applied to an internal tree. Looks like parts 3 & 4 need > > a few final

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-10 Thread Naveen N. Rao
On 07/09/2013 12:30 AM, Tony Luck wrote: > I was off on vacation last week - looks like you got lots done without me > :-) > > I have parts 1 & 2 applied to an internal tree. Looks like parts 3 & 4 need > a few final polishes to get an Ack from Boris. Cool :) Here is the updated patch for part 3

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-10 Thread Naveen N. Rao
On 07/09/2013 12:30 AM, Tony Luck wrote: I was off on vacation last week - looks like you got lots done without me :-) I have parts 1 2 applied to an internal tree. Looks like parts 3 4 need a few final polishes to get an Ack from Boris. Cool :) Here is the updated patch for part 3

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-10 Thread Borislav Petkov
On Wed, Jul 10, 2013 at 02:57:01PM +0530, Naveen N. Rao wrote: On 07/09/2013 12:30 AM, Tony Luck wrote: I was off on vacation last week - looks like you got lots done without me :-) I have parts 1 2 applied to an internal tree. Looks like parts 3 4 need a few final polishes to get an

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-10 Thread Tony Luck
Signed-off-by: Naveen N. Rao naveen.n@linux.vnet.ibm.com Acked-by: Borislav Petkov b...@suse.de Applied-by: Tony Luck :-) Naveen: Thanks for having this idea, implementing it, and sticking with it through the review process. Once 3.11-rc1 is out I'll ask Ingo to pull this series to the

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-08 Thread Tony Luck
On Wed, Jul 3, 2013 at 8:40 AM, Naveen N. Rao wrote: >>> +#ifdef CONFIG_ACPI_APEI_MEMORY_FAILURE >>> + int sec_sev = ghes_severity(gdata->error_severity); >>> + struct cper_sec_mem_err *mem_err; >>> + mem_err = (struct cper_sec_mem_err *)(gdata+1); >> >> >> A newline here

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-08 Thread Tony Luck
On Wed, Jul 3, 2013 at 8:40 AM, Naveen N. Rao naveen.n@linux.vnet.ibm.com wrote: +#ifdef CONFIG_ACPI_APEI_MEMORY_FAILURE + int sec_sev = ghes_severity(gdata-error_severity); + struct cper_sec_mem_err *mem_err; + mem_err = (struct cper_sec_mem_err *)(gdata+1); A

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-03 Thread Naveen N. Rao
On 07/03/2013 08:14 PM, Borislav Petkov wrote: On Tue, Jul 02, 2013 at 05:02:48PM +0530, Naveen N. Rao wrote: Here is the updated patch. I also added printk_ratelimit() in line with the rest of the GHES code. Thanks, Naveen -- If the firmware indicates in GHES error data entry that the error

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-03 Thread Borislav Petkov
On Tue, Jul 02, 2013 at 05:02:48PM +0530, Naveen N. Rao wrote: > Here is the updated patch. I also added printk_ratelimit() in line with the > rest of the GHES code. > > Thanks, > Naveen > > -- > If the firmware indicates in GHES error data entry that the error threshold > has exceeded for a

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-03 Thread Borislav Petkov
On Tue, Jul 02, 2013 at 05:02:48PM +0530, Naveen N. Rao wrote: Here is the updated patch. I also added printk_ratelimit() in line with the rest of the GHES code. Thanks, Naveen -- If the firmware indicates in GHES error data entry that the error threshold has exceeded for a corrected

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-03 Thread Naveen N. Rao
On 07/03/2013 08:14 PM, Borislav Petkov wrote: On Tue, Jul 02, 2013 at 05:02:48PM +0530, Naveen N. Rao wrote: Here is the updated patch. I also added printk_ratelimit() in line with the rest of the GHES code. Thanks, Naveen -- If the firmware indicates in GHES error data entry that the error

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-02 Thread Naveen N. Rao
Here is the updated patch. I also added printk_ratelimit() in line with the rest of the GHES code. Thanks, Naveen -- If the firmware indicates in GHES error data entry that the error threshold has exceeded for a corrected error event, then we try to soft-offline the page. This could be called in

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-02 Thread Naveen N. Rao
On 07/02/2013 04:38 AM, Borislav Petkov wrote: On Mon, Jul 01, 2013 at 09:08:59PM +0530, Naveen N. Rao wrote: If the firmware indicates in GHES error data entry that the error threshold has exceeded for a corrected error event, then we try to soft-offline the page. This could be called in

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-02 Thread Naveen N. Rao
On 07/02/2013 04:38 AM, Borislav Petkov wrote: On Mon, Jul 01, 2013 at 09:08:59PM +0530, Naveen N. Rao wrote: If the firmware indicates in GHES error data entry that the error threshold has exceeded for a corrected error event, then we try to soft-offline the page. This could be called in

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-02 Thread Naveen N. Rao
Here is the updated patch. I also added printk_ratelimit() in line with the rest of the GHES code. Thanks, Naveen -- If the firmware indicates in GHES error data entry that the error threshold has exceeded for a corrected error event, then we try to soft-offline the page. This could be called in

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-01 Thread Borislav Petkov
On Mon, Jul 01, 2013 at 09:08:59PM +0530, Naveen N. Rao wrote: > If the firmware indicates in GHES error data entry that the error threshold > has exceeded for a corrected error event, then we try to soft-offline the > page. This could be called in interrupt context, so we queue this up similar >

[PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-01 Thread Naveen N. Rao
If the firmware indicates in GHES error data entry that the error threshold has exceeded for a corrected error event, then we try to soft-offline the page. This could be called in interrupt context, so we queue this up similar to how we handle memory failure scenarios. Signed-off-by: Naveen N.

[PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-01 Thread Naveen N. Rao
If the firmware indicates in GHES error data entry that the error threshold has exceeded for a corrected error event, then we try to soft-offline the page. This could be called in interrupt context, so we queue this up similar to how we handle memory failure scenarios. Signed-off-by: Naveen N.

Re: [PATCH v3 3/3] mce, acpi/apei: Soft-offline a page on firmware GHES notification

2013-07-01 Thread Borislav Petkov
On Mon, Jul 01, 2013 at 09:08:59PM +0530, Naveen N. Rao wrote: If the firmware indicates in GHES error data entry that the error threshold has exceeded for a corrected error event, then we try to soft-offline the page. This could be called in interrupt context, so we queue this up similar to