[x86_64 MCE] [RFC] mce.c race condition (or: when evil hacks are the only options)

2007-07-12 Thread Joshua Wise
/git/torvalds/linux-2.6.git;a=commit;h=4f84e4be53a04a65d97bf0faa0c8f99e29bc0170 [2] http://investor.google.com/conduct.html -- From: Joshua Wise <[EMAIL PROTECTED]> Background: In some situations, mce_log would race against mce_read and deadlock. This race condition is described in more

[x86_64 MCE] [RFC] mce.c race condition (or: when evil hacks are the only options)

2007-07-12 Thread Joshua Wise
/git/torvalds/linux-2.6.git;a=commit;h=4f84e4be53a04a65d97bf0faa0c8f99e29bc0170 [2] http://investor.google.com/conduct.html -- From: Joshua Wise [EMAIL PROTECTED] Background: In some situations, mce_log would race against mce_read and deadlock. This race condition is described in more detail

[PATCH] Print utsname on Oops on all architectures

2007-07-05 Thread Joshua Wise
Hopefully this patch has not been munged by pine; I have, minimally, unchecked the mung-patches-sent-to-lkml option in pine's config. In the case that it has been munged, I have also attached it. -- From: Joshua Wise <[EMAIL PROTECTED]> Background: This patch is a follow-on to &quo

[PATCH] Print utsname on Oops on all architectures

2007-07-05 Thread Joshua Wise
Hopefully this patch has not been munged by pine; I have, minimally, unchecked the mung-patches-sent-to-lkml option in pine's config. In the case that it has been munged, I have also attached it. -- From: Joshua Wise [EMAIL PROTECTED] Background: This patch is a follow-on to Info dump on Oops

Re: [PATCH] [revised -- version 2] Info dump on Oops or panic()

2007-06-28 Thread Joshua Wise
On Thu, 28 Jun 2007, Andrew Morton wrote: Your email client is doing space-stuffing. It's easy enough to fix at this end, but even easier if you fix it ;) Aw darn :( Stupid PINE. I'll fix it for the next patch. + atomic_notifier_call_chain(_dumper_list, 0, NULL); [...] So... Please

Re: [PATCH] [revised -- version 2] Info dump on Oops or panic()

2007-06-28 Thread Joshua Wise
. Okay, fair enough. Fixed version follows. I also fixed some checkpatch issues that I missed before. However, please note that in general, the info dumped by this will consist of only a few lines. In our implementations, I believe that we only dump one line per notifier. joshua -- From: Joshua

[PATCH] Info dump on Oops or panic()

2007-06-28 Thread Joshua Wise
From: Joshua Wise <[EMAIL PROTECTED]> Background: When managing a large number of servers, as Google does, it's sometimes useful to get an "at-a-glance" view of a machine when it crashes. When no other post-mortem is possible, it's often useful to know how long the machine

[PATCH] Info dump on Oops or panic()

2007-06-28 Thread Joshua Wise
From: Joshua Wise [EMAIL PROTECTED] Background: When managing a large number of servers, as Google does, it's sometimes useful to get an at-a-glance view of a machine when it crashes. When no other post-mortem is possible, it's often useful to know how long the machine has been powered

Re: [PATCH] [revised -- version 2] Info dump on Oops or panic()

2007-06-28 Thread Joshua Wise
PROTECTED] and Masoud Sharbiani [EMAIL PROTECTED]. Changelog: v2 -- fixed some checkpatch issues. Moved dumping to before Oopses, as per the suggestion of Kyle McMartin [EMAIL PROTECTED]. Patch: This patch is against git 48d8d7ee5dd17c64833e0343ab4ae8ef01cc2648. Signed-off-by: Joshua Wise [EMAIL

Re: [PATCH] [revised -- version 2] Info dump on Oops or panic()

2007-06-28 Thread Joshua Wise
On Thu, 28 Jun 2007, Andrew Morton wrote: Your email client is doing space-stuffing. It's easy enough to fix at this end, but even easier if you fix it ;) Aw darn :( Stupid PINE. I'll fix it for the next patch. + atomic_notifier_call_chain(info_dumper_list, 0, NULL); [...] So...

[PATCH] x86_64: Fix misplaced `continue' in mce.c

2007-06-21 Thread Joshua Wise
From: Joshua Wise <[EMAIL PROTECTED]> Background: When a userspace application wants to know about machine check events, it opens /dev/mcelog and does a read(). Usually, we found that this interface works well, but in some cases, when the system was taking large numbers of machine

[PATCH] x86_64: Fix misplaced `continue' in mce.c

2007-06-21 Thread Joshua Wise
From: Joshua Wise [EMAIL PROTECTED] Background: When a userspace application wants to know about machine check events, it opens /dev/mcelog and does a read(). Usually, we found that this interface works well, but in some cases, when the system was taking large numbers of machine check

Re: CPU_IDLE prevents resuming from STR [was: Re: 2.6.21-rc6-mm1]

2007-04-18 Thread Joshua Wise
On Tue, 17 Apr 2007, Shaohua Li wrote: Looks there is init order issue of sysfs files. The new refreshed patch should fix your bug. Yes, that did fix the hang on resume from STR -- that now works fine. However: [EMAIL PROTECTED]:/sys/devices/system/cpu/cpuidle$ cat available_drivers

Re: CPU_IDLE prevents resuming from STR [was: Re: 2.6.21-rc6-mm1]

2007-04-18 Thread Joshua Wise
On Tue, 17 Apr 2007, Shaohua Li wrote: Looks there is init order issue of sysfs files. The new refreshed patch should fix your bug. Yes, that did fix the hang on resume from STR -- that now works fine. However: [EMAIL PROTECTED]:/sys/devices/system/cpu/cpuidle$ cat available_drivers

Re: CPU_IDLE prevents resuming from STR [was: Re: 2.6.21-rc6-mm1]

2007-04-16 Thread Joshua Wise
On Mon, 16 Apr 2007, Shaohua Li wrote: On Sat, 2007-04-14 at 01:45 +0200, Mattia Dongili wrote: ... please check if the patch at http://marc.info/?l=linux-acpi=117523651630038=2 fixed the issue I have the same system as Mattia, and when I applied this patch and turned CPU_IDLE back on, I got

Re: CPU_IDLE prevents resuming from STR [was: Re: 2.6.21-rc6-mm1]

2007-04-16 Thread Joshua Wise
On Mon, 16 Apr 2007, Shaohua Li wrote: On Sat, 2007-04-14 at 01:45 +0200, Mattia Dongili wrote: ... please check if the patch at http://marc.info/?l=linux-acpim=117523651630038w=2 fixed the issue I have the same system as Mattia, and when I applied this patch and turned CPU_IDLE back on, I

Re: NAPI poll routine happens in interrupt context?

2005-08-17 Thread Joshua Wise
On Wednesday 17 August 2005 12:43, Stephen Hemminger wrote: > You will get more response to network issues on netdev@vger.kernel.org Okay. Thanks. > NAPI poll is usually called from softirq context. This means that > hardware interrupts are enabled, but it is not in a thread context that > can

Re: NAPI poll routine happens in interrupt context?

2005-08-17 Thread Joshua Wise
[] start_kernel+0x44c/0x4e8 Apologies for any inconvenience. joshua On Wednesday 17 August 2005 09:32, Joshua Wise wrote: > Hello LKML, > > I have recently been working on a network driver for an emulated > ultra-simple network card, and I've run into a few snags with the NAPI. My >

NAPI poll routine happens in interrupt context?

2005-08-17 Thread Joshua Wise
Hello LKML, I have recently been working on a network driver for an emulated ultra-simple network card, and I've run into a few snags with the NAPI. My current issue is that it seems to me that my poll routine is being called from an atomic context, so when poll calls rx, and rx calls

NAPI poll routine happens in interrupt context?

2005-08-17 Thread Joshua Wise
Hello LKML, I have recently been working on a network driver for an emulated ultra-simple network card, and I've run into a few snags with the NAPI. My current issue is that it seems to me that my poll routine is being called from an atomic context, so when poll calls rx, and rx calls

Re: NAPI poll routine happens in interrupt context?

2005-08-17 Thread Joshua Wise
[8037fcf4] start_kernel+0x44c/0x4e8 Apologies for any inconvenience. joshua On Wednesday 17 August 2005 09:32, Joshua Wise wrote: Hello LKML, I have recently been working on a network driver for an emulated ultra-simple network card, and I've run into a few snags with the NAPI. My

Re: NAPI poll routine happens in interrupt context?

2005-08-17 Thread Joshua Wise
On Wednesday 17 August 2005 12:43, Stephen Hemminger wrote: You will get more response to network issues on netdev@vger.kernel.org Okay. Thanks. NAPI poll is usually called from softirq context. This means that hardware interrupts are enabled, but it is not in a thread context that can