Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2013-08-15 Thread Moritz Muehlenhoff
reassign 628444 src:linux
thanks

On Wed, Mar 14, 2012 at 08:37:17PM -0700, Shannon Dealy wrote:


 Like others, the problems seemed to start around 2.6.39.

 Thought I should note here, my system showed this problem with 2.6.36  
 through 2.6.39.  It seems to have stopped showing the problem (possibly  
 due to a memory upgrade many months ago), but it still has chronic  
 instability of the connection (drops out for varying intervals), and  
 often, network-manager doesn't seem aware of the failure - reloading the  
 driver isn't a reliable solution (sometimes appears to work, often does  
 not)

Does this bug still occur with more recent kernels/firmware, e.g. Wheezy?

Cheers,
Moritz


-- 
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20130815163651.gb15...@inutil.org



Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2013-08-15 Thread Bjørn Mork
Shannon Dealy de...@deatech.com writes:

 I have not seen this problem or any of the other iwl instabilities in
 a long time, however, I have been running with these option lines
 disabling 11n functionality in order to achieve that stability:

 options iwlagn 11n_disable50=1
 options iwlwifi 11n_disable=1

 since I am no longer running kernels which use iwlagn (currently
 running 3.2.xx), I can't really speak to the original issue anymore.
 Of course iwlwifi has had 11n related stability issues as well,
 however, I have never seen the:

   MAC is in deep sleep

 bug (as far as I can recall) while running a newer kernel with iwlwifi
 instead of iwlagn.  Not sure how much code (if any) is common between
 iwlwifi and the older iwlagn module.

AFAIK the code printing that particular error message is still the same
in 3.2. The error message was changed and a stack trace was added in
v3.4, so you have to look for another message if testing newer kernels:


commit aa5affbacb24cb5d8fd6f7c66e57d62164ed6d34
Author: Stanislaw Gruszka sgrus...@redhat.com
Date:   Wed Mar 7 09:52:23 2012 -0800

iwlwifi: dump stack when fail to gain access to the device

Print dump stack when the device is not responding. This should give
some more clue about the reason of failure. Also change the message we
print, since MAC in deep sleep is kinda confusing.

On the way add unlikely(), as fail to gain NIC access is hmm ...
unlikely.

Signed-off-by: Stanislaw Gruszka sgrus...@redhat.com
Signed-off-by: Johannes Berg johannes.b...@intel.com
Signed-off-by: Wey-Yi Guy wey-yi.w@intel.com
Signed-off-by: John W. Linville linvi...@tuxdriver.com

diff --git a/drivers/net/wireless/iwlwifi/iwl-io.c 
b/drivers/net/wireless/iwlwifi/iwl-io.c
index e2e3b5c..fc36535 100644
--- a/drivers/net/wireless/iwlwifi/iwl-io.c
+++ b/drivers/net/wireless/iwlwifi/iwl-io.c
@@ -121,10 +121,10 @@ int iwl_grab_nic_access_silent(struct iwl_trans *trans)
 int iwl_grab_nic_access(struct iwl_trans *trans)
 {
int ret = iwl_grab_nic_access_silent(trans);
-   if (ret) {
+   if (unlikely(ret)) {
u32 val = iwl_read32(trans, CSR_GP_CNTRL);
-   IWL_ERR(trans,
-   MAC is in deep sleep!. CSR_GP_CNTRL = 0x%08X\n, val);
+   WARN_ONCE(1, Timeout waiting for hardware access 
+(CSR_GP_CNTRL 0x%08x)\n, val);
}
 
return ret;



This code has since been refactored and moved into
drivers/net/wireless/iwlwifi/pcie/trans.c but the Timeout waiting for
hardware access message is still there.

 I suppose I should try enabling 11n again to see what happens (I had
 forgotten it was disabled).

Time to move on to 11ac now :)


Bjørn


--
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/87ppte7rg9@nemi.mork.no



Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2013-08-15 Thread Shannon Dealy


Hi Moritz,

I have not seen this problem or any of the other iwl instabilities in a 
long time, however, I have been running with these option lines disabling 
11n functionality in order to achieve that stability:


options iwlagn 11n_disable50=1
options iwlwifi 11n_disable=1

since I am no longer running kernels which use iwlagn (currently running 
3.2.xx), I can't really speak to the original issue anymore.  Of course 
iwlwifi has had 11n related stability issues as well, however, I have 
never seen the:


  MAC is in deep sleep

bug (as far as I can recall) while running a newer kernel with iwlwifi 
instead of iwlagn.  Not sure how much code (if any) is common between 
iwlwifi and the older iwlagn module.


I suppose I should try enabling 11n again to see what happens (I had 
forgotten it was disabled).


The upgrade to wheezy is most notable for much shorter times required to 
make wireless connections, though I am not sure if it is the driver or 
network-manager responsible for that improvement.


FWIW.

Shannon C. Dealy  |   DeaTech Research Inc.
de...@deatech.com |  - Custom Software Development -
Phone: (800) 467-5820 |  - Natural Building Instruction -
   or: (541) 929-4089 |  www.deatech.com


On Thu, 15 Aug 2013, Moritz Muehlenhoff wrote:


reassign 628444 src:linux
thanks

On Wed, Mar 14, 2012 at 08:37:17PM -0700, Shannon Dealy wrote:




Like others, the problems seemed to start around 2.6.39.


Thought I should note here, my system showed this problem with 2.6.36
through 2.6.39.  It seems to have stopped showing the problem (possibly
due to a memory upgrade many months ago), but it still has chronic
instability of the connection (drops out for varying intervals), and
often, network-manager doesn't seem aware of the failure - reloading the
driver isn't a reliable solution (sometimes appears to work, often does
not)


Does this bug still occur with more recent kernels/firmware, e.g. Wheezy?

Cheers,
   Moritz




--
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: 
http://lists.debian.org/alpine.deb.2.02.1308151022300.20...@nashapur.deatech.com



Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-04-01 Thread Dafydd Harries
On Tue, Mar 20, 2012 at 12:28:01AM +, Juha Jäykkä wrote:
 I am starting to believe I am in the clear here, but I am not yet convinced 
 since the root cause is still unknown. In any case, thanks for all of you for 
 helping hunt this down.
 
 Dafydd: I think you might want to try pcie_asmp=off next
 
 Cheers,
 Juha

As I think I mentioned before, pice_aspm=off doesn't seem to help much for me.
I still get wireless failure after a day or two, although they seem to be more
often correlated with resuming from suspend.



-- 
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20120402024231.GE31750@nia



Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-22 Thread Juha Jäykkä
This may or may not be related, but I thought it best to inform you about what 
I just saw anyway:

I accidentally toggled the physical RF kill switch on my laptop and the result 
was very much reminiscent of the bug we are discussing (note the QoS and 
failure to remove key):

Mar 22 20:13:47 rigel kernel: [1259943.614216] iwlwifi :03:00.0: RF_KILL 
bit toggled to disable radio.
Mar 22 20:13:47 rigel kernel: [1259943.614350] iwlwifi :03:00.0: Not 
sending command - RF KILL
Mar 22 20:13:47 rigel kernel: [1259943.614359] iwlwifi :03:00.0: Error 
sending REPLY_QOS_PARAM: enqueue_hcmd failed: -5
Mar 22 20:13:47 rigel kernel: [1259943.614366] iwlwifi :03:00.0: Failed to 
update QoS
Mar 22 20:13:47 rigel kernel: [1259943.614374] iwlwifi :03:00.0: Not 
sending command - RF KILL
Mar 22 20:13:47 rigel kernel: [1259943.614381] iwlwifi :03:00.0: Error 
sending REPLY_RXON: enqueue_hcmd failed: -5
Mar 22 20:13:47 rigel kernel: [1259943.614388] iwlwifi :03:00.0: Error 
clearing ASSOC_MSK on BSS (-5)
Mar 22 20:13:47 rigel kernel: [1259943.614400] iwlwifi :03:00.0: Not 
sending command - RF KILL
Mar 22 20:13:47 rigel kernel: [1259943.614407] iwlwifi :03:00.0: Error 
sending REPLY_RXON: enqueue_hcmd failed: -5
Mar 22 20:13:47 rigel kernel: [1259943.614414] iwlwifi :03:00.0: Error 
clearing ASSOC_MSK on BSS (-5)
Mar 22 20:13:47 rigel kernel: [1259943.614422] wlan0: deauthenticating from 
00:24:17:33:f4:f5 by local choice (reason=3)
Mar 22 20:13:47 rigel kernel: [1259943.624234] iwlwifi :03:00.0: Not 
sending command - RF KILL
Mar 22 20:13:47 rigel kernel: [1259943.624238] iwlwifi :03:00.0: Error 
sending REPLY_ADD_STA: enqueue_hcmd failed: -5
Mar 22 20:13:47 rigel kernel: [1259943.624242] ieee80211 phy0: failed to 
remove key (1, ff:ff:ff:ff:ff:ff) from hardware (-5)
Mar 22 20:13:47 rigel kernel: [1259943.624339] iwlwifi :03:00.0: Not 
sending command - RF KILL
Mar 22 20:13:47 rigel kernel: [1259943.624342] iwlwifi :03:00.0: Error 
sending REPLY_ADD_STA: enqueue_hcmd failed: -5
Mar 22 20:13:47 rigel kernel: [1259943.624346] ieee80211 phy0: failed to 
remove key (0, 00:24:17:33:f4:f5) from hardware (-5)
Mar 22 20:13:47 rigel kernel: [1259943.624351] iwlwifi :03:00.0: Not 
sending command - RF KILL
Mar 22 20:13:47 rigel kernel: [1259943.624354] iwlwifi :03:00.0: Error 
sending REPLY_REMOVE_STA: enqueue_hcmd failed: -5
Mar 22 20:13:47 rigel kernel: [1259943.624358] iwlwifi :03:00.0: Error 
removing station 00:24:17:33:f4:f5
Mar 22 20:13:47 rigel kernel: [1259943.624391] iwlwifi :03:00.0: Not 
sending command - RF KILL
Mar 22 20:13:47 rigel kernel: [1259943.624394] iwlwifi :03:00.0: Error 
sending REPLY_RXON: enqueue_hcmd failed: -5
Mar 22 20:13:47 rigel kernel: [1259943.624397] iwlwifi :03:00.0: Error 
clearing ASSOC_MSK on BSS (-5)
Mar 22 20:13:47 rigel kernel: [1259943.624425] iwlwifi :03:00.0: Not 
sending command - RF KILL
Mar 22 20:13:47 rigel kernel: [1259943.624428] iwlwifi :03:00.0: Error 
sending REPLY_LEDS_CMD: enqueue_hcmd failed: -5
Mar 22 20:13:47 rigel kernel: [1259943.624433] iwlwifi :03:00.0: Not 
sending command - RF KILL
Mar 22 20:13:47 rigel kernel: [1259943.624436] iwlwifi :03:00.0: Error 
sending REPLY_RXON: enqueue_hcmd failed: -5
Mar 22 20:13:47 rigel kernel: [1259943.624439] iwlwifi :03:00.0: Error 
clearing ASSOC_MSK on BSS (-5)
Mar 22 20:13:47 rigel kernel: [1259943.624459] cfg80211: Calling CRDA to 
update world regulatory domain
Mar 22 20:13:47 rigel kernel: [1259943.640238] iwlwifi :03:00.0: Not 
sending command - RF KILL
Mar 22 20:13:47 rigel kernel: [1259943.640245] iwlwifi :03:00.0: Error 
sending REPLY_RXON: enqueue_hcmd failed: -5
Mar 22 20:13:47 rigel kernel: [1259943.640249] iwlwifi :03:00.0: Error 
clearing ASSOC_MSK on BSS (-5)

After switching the radios back on, I get

Mar 22 20:14:22 rigel kernel: [1259978.719860] iwlwifi :03:00.0: RF_KILL 
bit toggled to enable radio.

but *absolutely* *nothing* *else*. No RF LED, nothing.

However, unlike the bug we are discussing, reloading the iwlwifi module fixed 
this.

Hope this helps,
Juha

-- 
 ---
| Juha Jäykkä, ju...@iki.fi |
| http://www.maths.leeds.ac.uk/~juhaj   |
 ---


signature.asc
Description: This is a digitally signed message part.


Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-22 Thread Venkataraman, Meenakshi
Hi Juha,

Mar 22 20:13:47 rigel kernel: [1259943.614216] iwlwifi :03:00.0: RF_KILL
bit toggled to disable radio.
Mar 22 20:13:47 rigel kernel: [1259943.614350] iwlwifi :03:00.0: Not
sending command - RF KILL

 [MV] These are not technically error messages, and we had a patch fixing these 
error messages in the recent past. What kernel version are you running?

Mar 22 20:13:47 rigel kernel: [1259943.624242] ieee80211 phy0: failed to
remove key (1, ff:ff:ff:ff:ff:ff) from hardware (-5)

[MV]  This could potentially be an issue -- we'll look into this.

After switching the radios back on, I get

Mar 22 20:14:22 rigel kernel: [1259978.719860] iwlwifi :03:00.0: RF_KILL
bit toggled to enable radio.

but *absolutely* *nothing* *else*. No RF LED, nothing.

[MV]  Ouch. This is likely also a bug in mac80211/driver...can you file a 
separate bug for this on the intel linux wireless bugzilla: 
http://bugzilla.intellinuxwireless.org/; so we can have someone look at it?

Thanks,
Meenakshi


Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-19 Thread Juha Jäykkä
My update (after a reboot caused by a crashed hibernation attempt at uptime ~8 
days):

juhaj@rigel  uptime 
 00:18:27 up 14 days,  6:50,  9 users,  load average: 0.00, 0.03, 0.05

juhaj@rigel  cat /proc/cmdline 
BOOT_IMAGE=/vmlinuz-3.2.0-1-amd64 root=/dev/mapper/system-root ro quiet 
usbcore.autosuspend=1 video=intelfb:1440x900M@60 pcie_aspm=off

juhaj@rigel  grep . /sys/module/iwlwifi/parameters/*
/sys/module/iwlwifi/parameters/11n_disable:0
/sys/module/iwlwifi/parameters/ack_check:N
/sys/module/iwlwifi/parameters/amsdu_size_8K:1
/sys/module/iwlwifi/parameters/antenna_coupling:0
/sys/module/iwlwifi/parameters/auto_agg:Y
/sys/module/iwlwifi/parameters/bt_ch_inhibition:Y
/sys/module/iwlwifi/parameters/bt_coex_active:Y
/sys/module/iwlwifi/parameters/fw_restart:1
/sys/module/iwlwifi/parameters/led_mode:0
/sys/module/iwlwifi/parameters/no_sleep_autoadjust:Y
/sys/module/iwlwifi/parameters/plcp_check:Y
/sys/module/iwlwifi/parameters/power_level:0
/sys/module/iwlwifi/parameters/power_save:N
/sys/module/iwlwifi/parameters/queues_num:0
/sys/module/iwlwifi/parameters/swcrypto:0
/sys/module/iwlwifi/parameters/ucode_alternative:1
/sys/module/iwlwifi/parameters/wd_disable:0
(I think these are all defaults.)

rigel kernel: [1073316.734468] iwlwifi :03:00.0: L1 Enabled; Disabling L0S

juhaj@rigel  uname -a
Linux rigel 3.2.0-1-amd64 #1 SMP Fri Feb 17 05:17:36 UTC 2012 x86_64 GNU/Linux

There have been TWO changes since I last had problems:

1. pcie_aspm=off
2. re-seating the iwlwifi mini-pci card in its socket

I am starting to believe I am in the clear here, but I am not yet convinced 
since the root cause is still unknown. In any case, thanks for all of you for 
helping hunt this down.

Dafydd: I think you might want to try pcie_asmp=off next

Cheers,
Juha

-- 
 ---
| Juha Jäykkä, ju...@iki.fi |
| http://www.maths.leeds.ac.uk/~juhaj   |
 ---


signature.asc
Description: This is a digitally signed message part.


Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-17 Thread Shannon Dealy

On Fri, 16 Mar 2012, Bjørn Mork wrote:


Shannon Dealy de...@deatech.com writes:


I created a file /etc/modprobe.d/iwlagn.conf and placed the
following line in it:

  options iwlagn 11n_disable=1 11n_disable50=1


Note that the 11n_disable50 options was removed in 3.0 and the iwlagn
module was renamed to iwlwifi in 3.2.


Unfortunate, since further testing shows that it is the 11n_disable50=1 
that matters.  Disabling either of the above options makes my link stable, 
and since 11n_disable=1 would effectively cause 11n_disable50 to happen 
as well, the implication is that the problem is in the code which is/was 
controlled by the 11n_disable50.


Would be nice if anyone can confirm if this fixes the deep sleep problem 
as well.



Which makes this workaround pretty much irrelevant to any current Debian
kernel as noone(?) has seen the bug in 2.6.32.


While it may not be relevant to you, it is completely relevant in that it:

  - provides developers with a narrowed range of places to look for the problem
in all versions of the kernel/driver code, including those were the
option was removed (they can easily check what part of the code it
previously disabled)

  - using just the 11n_disable option should still solve the stability
problems I am seeing and possibly the deep sleep as well (need
confirmation on that) for people running 3.x (though granted it is not
a great workaround given the performance hit)

  - I assume you are speaking with regard to 2.6.32 being the default
stable kernel so it won't affect people running stock Debian stable?
I don't recall seeing confirmation from anyone that the problem went
away by downgrading to 2.6.32.

Are you still using the  2.6.39-1 kernel you originally opened this bug 
against?

[snip]

Yes, though it has been rebuilt for debug tracing of the iwlagn driver. 
Unfortunately I can't afford the down time right now that a kernel 
upgrade beyond 2.6.39 would entail (other software would have to be 
upgraded and that might require that I write some code as well to 
deal with the kernel changes).  I also can't downgrade to 2.6.32 
(assuming that it would make any difference) as it doesn't support some of 
my hardware.


FWIW

Shannon C. Dealy  |   DeaTech Research Inc.
de...@deatech.com |  - Custom Software Development -
Phone: (800) 467-5820 |  - Natural Building Instruction -
   or: (541) 929-4089 |  www.deatech.com

Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-16 Thread Bjørn Mork
Shannon Dealy de...@deatech.com writes:

 I created a file /etc/modprobe.d/iwlagn.conf and placed the
 following line in it:

   options iwlagn 11n_disable=1 11n_disable50=1

Note that the 11n_disable50 options was removed in 3.0 and the iwlagn
module was renamed to iwlwifi in 3.2.

Which makes this workaround pretty much irrelevant to any current Debian
kernel as noone(?) has seen the bug in 2.6.32.  Are you still using the
2.6.39-1 kernel you originally opened this bug against?

FWIW, I enabled power_save a few days ago (was previously at default)
after the nice summary from Meenakshi, just to see if I could replicate
this issue.  My module parameters are now:

bjorn@nemi:~$ grep . /sys/module/iwlwifi/parameters/*
/sys/module/iwlwifi/parameters/11n_disable:0
/sys/module/iwlwifi/parameters/ack_check:N
/sys/module/iwlwifi/parameters/amsdu_size_8K:1
/sys/module/iwlwifi/parameters/antenna_coupling:0
/sys/module/iwlwifi/parameters/auto_agg:Y
/sys/module/iwlwifi/parameters/bt_ch_inhibition:Y
/sys/module/iwlwifi/parameters/bt_coex_active:Y
/sys/module/iwlwifi/parameters/fw_restart:1
/sys/module/iwlwifi/parameters/led_mode:0
/sys/module/iwlwifi/parameters/no_sleep_autoadjust:Y
/sys/module/iwlwifi/parameters/plcp_check:Y
/sys/module/iwlwifi/parameters/power_level:0
/sys/module/iwlwifi/parameters/power_save:Y
/sys/module/iwlwifi/parameters/queues_num:0
/sys/module/iwlwifi/parameters/swcrypto:0
/sys/module/iwlwifi/parameters/ucode_alternative:1
/sys/module/iwlwifi/parameters/wd_disable:0


And I am often using 802.11n at 5GHz.  Still haven't seen the issue.

But I wonder about Juha's observation about this difference between my
setup: 

  [   12.611061] iwlwifi :03:00.0: L1 Disabled; Enabling L0S

and his:

  [245082.407512] iwlwifi :03:00.0: L1 Enabled; Disabling L0S

This looks extremely suspicious to me in light of Meenakshi's comment on
known issues with L1 on these devices and that Intel therefore use L0S
by default.

How come Juha end up with L1 enabled, then?  And is that the case for
everyone seeing this bug?



Bjørn



--
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/87y5r0ex39@nemi.mork.no



Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-16 Thread Dafydd Harries
On Fri, Mar 16, 2012 at 10:03:54AM +0100, Bjørn Mork wrote:
 But I wonder about Juha's observation about this difference between my
 setup: 
 
   [   12.611061] iwlwifi :03:00.0: L1 Disabled; Enabling L0S
 
 and his:
 
   [245082.407512] iwlwifi :03:00.0: L1 Enabled; Disabling L0S
 
 This looks extremely suspicious to me in light of Meenakshi's comment on
 known issues with L1 on these devices and that Intel therefore use L0S
 by default.
 
 How come Juha end up with L1 enabled, then?  And is that the case for
 everyone seeing this bug?

Today I got a failure, and on my machine it was using L0S, not L1S:

Mar 16 11:38:45 localhost kernel: [ 9045.913617] iwlwifi :02:00.0: L1 
Disabled; Enabling L0S

So I don't think this is the difference necessarily.

Syslog from today attached.
Mar 16 11:37:49 localhost kernel: [ 8989.631323] iwlwifi :02:00.0: Error 
sending REPLY_QOS_PARAM: time out after 2000ms.
Mar 16 11:37:49 localhost kernel: [ 8989.631332] iwlwifi :02:00.0: Current 
CMD queue read_ptr 59 write_ptr 63
Mar 16 11:37:49 localhost kernel: [ 8989.631336] iwlwifi :02:00.0: Failed 
to update QoS
Mar 16 11:37:51 localhost kernel: [ 8991.626456] iwlwifi :02:00.0: Error 
sending REPLY_RXON: time out after 2000ms.
Mar 16 11:37:51 localhost kernel: [ 8991.626463] iwlwifi :02:00.0: Current 
CMD queue read_ptr 59 write_ptr 66
Mar 16 11:37:51 localhost kernel: [ 8991.626468] iwlwifi :02:00.0: Error 
clearing ASSOC_MSK on BSS (-110)
Mar 16 11:37:53 localhost kernel: [ 8993.621540] iwlwifi :02:00.0: Error 
sending REPLY_RXON: time out after 2000ms.
Mar 16 11:37:53 localhost kernel: [ 8993.621547] iwlwifi :02:00.0: Current 
CMD queue read_ptr 59 write_ptr 69
Mar 16 11:37:53 localhost kernel: [ 8993.621552] iwlwifi :02:00.0: Error 
clearing ASSOC_MSK on BSS (-110)
Mar 16 11:37:55 localhost kernel: [ 8995.616627] iwlwifi :02:00.0: Error 
sending REPLY_RXON: time out after 2000ms.
Mar 16 11:37:55 localhost kernel: [ 8995.616634] iwlwifi :02:00.0: Current 
CMD queue read_ptr 59 write_ptr 70
Mar 16 11:37:55 localhost kernel: [ 8995.616638] iwlwifi :02:00.0: Error 
clearing ASSOC_MSK on BSS (-110)
Mar 16 11:37:57 localhost kernel: [ 8997.611717] iwlwifi :02:00.0: Error 
sending REPLY_SCAN_CMD: time out after 2000ms.
Mar 16 11:37:57 localhost kernel: [ 8997.611724] iwlwifi :02:00.0: Current 
CMD queue read_ptr 59 write_ptr 71
Mar 16 11:37:59 localhost kernel: [ 8999.606756] iwlwifi :02:00.0: Error 
sending REPLY_ADD_STA: time out after 2000ms.
Mar 16 11:37:59 localhost kernel: [ 8999.606764] iwlwifi :02:00.0: Current 
CMD queue read_ptr 59 write_ptr 72
Mar 16 11:38:01 localhost kernel: [ 9001.601883] iwlwifi :02:00.0: Error 
sending REPLY_REMOVE_STA: time out after 2000ms.
Mar 16 11:38:01 localhost kernel: [ 9001.601891] iwlwifi :02:00.0: Current 
CMD queue read_ptr 59 write_ptr 73
Mar 16 11:38:01 localhost kernel: [ 9001.601898] iwlwifi :02:00.0: Error 
removing station 00:50:7f:cb:4b:58
Mar 16 11:38:03 localhost kernel: [ 9003.596977] iwlwifi :02:00.0: Error 
sending REPLY_RXON: time out after 2000ms.
Mar 16 11:38:03 localhost kernel: [ 9003.596984] iwlwifi :02:00.0: Current 
CMD queue read_ptr 59 write_ptr 74
Mar 16 11:38:03 localhost kernel: [ 9003.596989] iwlwifi :02:00.0: Error 
clearing ASSOC_MSK on BSS (-110)
Mar 16 11:38:05 localhost kernel: [ 9005.600071] iwlwifi :02:00.0: fail to 
flush all tx fifo queues
Mar 16 11:38:07 localhost kernel: [ 9007.595140] iwlwifi :02:00.0: Error 
sending POWER_TABLE_CMD: time out after 2000ms.
Mar 16 11:38:07 localhost kernel: [ 9007.595148] iwlwifi :02:00.0: Current 
CMD queue read_ptr 59 write_ptr 75
Mar 16 11:38:09 localhost kernel: [ 9007.595154] iwlwifi :02:00.0: set 
power fail, ret = -110
Mar 16 11:38:09 localhost kernel: [ 9009.590164] iwlwifi :02:00.0: Error 
sending REPLY_RXON: time out after 2000ms.
Mar 16 11:38:09 localhost kernel: [ 9009.590171] iwlwifi :02:00.0: Current 
CMD queue read_ptr 59 write_ptr 76
Mar 16 11:38:09 localhost kernel: [ 9009.590175] iwlwifi :02:00.0: Error 
clearing ASSOC_MSK on BSS (-110)
Mar 16 11:38:11 localhost kernel: [ 9011.621214] iwlwifi :02:00.0: Error 
sending REPLY_ADD_STA: time out after 2000ms.
Mar 16 11:38:11 localhost kernel: [ 9011.621221] iwlwifi :02:00.0: Current 
CMD queue read_ptr 59 write_ptr 77
Mar 16 11:38:13 localhost kernel: [ 9013.616306] iwlwifi :02:00.0: Error 
sending REPLY_RXON: time out after 2000ms.
Mar 16 11:38:13 localhost kernel: [ 9013.616312] iwlwifi :02:00.0: Current 
CMD queue read_ptr 59 write_ptr 78
Mar 16 11:38:13 localhost kernel: [ 9013.616317] iwlwifi :02:00.0: Error 
clearing ASSOC_MSK on BSS (-110)
Mar 16 11:38:15 localhost kernel: [ 9015.611391] iwlwifi :02:00.0: Error 
sending REPLY_RXON: time out after 2000ms.
Mar 16 11:38:15 localhost kernel: [ 9015.611397] iwlwifi :02:00.0: Current 
CMD queue read_ptr 59 write_ptr 79
Mar 16 11:38:15 localhost 

Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-16 Thread Dafydd Harries
On Fri, Mar 16, 2012 at 01:30:46PM +0100, Dafydd Harries wrote:
 Today I got a failure, and on my machine it was using L0S, not L1S:
 
 Mar 16 11:38:45 localhost kernel: [ 9045.913617] iwlwifi :02:00.0: L1 
 Disabled; Enabling L0S
 
 So I don't think this is the difference necessarily.
 
 Syslog from today attached.

And I got a failure very quickly (50m) after I tried booting with
power_save=1.
Mar 16 13:05:15 localhost kernel: [   31.986202] iwlwifi :02:00.0: PCI INT 
A - GSI 16 (level, low) - IRQ 16
Mar 16 13:05:15 localhost kernel: [   31.986239] iwlwifi :02:00.0: setting 
latency timer to 64
Mar 16 13:05:15 localhost kernel: [   31.986285] iwlwifi :02:00.0: 
pci_resource_len = 0x2000
Mar 16 13:05:15 localhost kernel: [   31.986287] iwlwifi :02:00.0: 
pci_resource_base = c90011794000
Mar 16 13:05:15 localhost kernel: [   31.986288] iwlwifi :02:00.0: HW 
Revision ID = 0x35
Mar 16 13:05:15 localhost kernel: [   31.986414] iwlwifi :02:00.0: irq 42 
for MSI/MSI-X
Mar 16 13:05:15 localhost kernel: [   31.986494] iwlwifi :02:00.0: Detected 
Intel(R) Centrino(R) Ultimate-N 6300 AGN, REV=0x74
Mar 16 13:05:15 localhost kernel: [   31.986602] iwlwifi :02:00.0: L1 
Enabled; Disabling L0S
Mar 16 13:05:15 localhost kernel: [   32.003127] iwlwifi :02:00.0: device 
EEPROM VER=0x436, CALIB=0x6
Mar 16 13:05:15 localhost kernel: [   32.003132] iwlwifi :02:00.0: Device 
SKU: 0X1f0
Mar 16 13:05:15 localhost kernel: [   32.003136] iwlwifi :02:00.0: Valid Tx 
ant: 0X7, Valid Rx ant: 0X7
Mar 16 13:05:15 localhost kernel: [   32.003190] iwlwifi :02:00.0: Tunable 
channels: 13 802.11bg, 24 802.11a channels
Mar 16 13:05:15 localhost kernel: [   32.716291] iwlwifi :02:00.0: loaded 
firmware version 9.221.4.1 build 25532
Mar 16 13:05:25 localhost kernel: [ 1033.032388] iwlwifi :02:00.0: L1 
Enabled; Disabling L0S
Mar 16 13:05:25 localhost kernel: [ 1033.039190] iwlwifi :02:00.0: Radio 
type=0x0-0x3-0x1
Mar 16 13:05:26 localhost kernel: [ 1033.281070] iwlwifi :02:00.0: L1 
Enabled; Disabling L0S
Mar 16 13:05:26 localhost kernel: [ 1033.287845] iwlwifi :02:00.0: Radio 
type=0x0-0x3-0x1
Mar 16 13:36:18 localhost kernel: [ 2880.810592] iwlwifi :02:00.0: Error 
sending POWER_TABLE_CMD: time out after 2000ms.
Mar 16 13:36:18 localhost kernel: [ 2880.810599] iwlwifi :02:00.0: Current 
CMD queue read_ptr 4 write_ptr 6
Mar 16 13:36:21 localhost kernel: [ 2880.810603] iwlwifi :02:00.0: set 
power fail, ret = -110
Mar 16 13:36:21 localhost kernel: [ 2883.304348] iwlwifi :02:00.0: Error 
sending REPLY_QOS_PARAM: time out after 2000ms.
Mar 16 13:36:21 localhost kernel: [ 2883.304355] iwlwifi :02:00.0: Current 
CMD queue read_ptr 4 write_ptr 9
Mar 16 13:36:21 localhost kernel: [ 2883.304360] iwlwifi :02:00.0: Failed 
to update QoS
Mar 16 13:36:23 localhost kernel: [ 2885.299347] iwlwifi :02:00.0: Error 
sending REPLY_RXON: time out after 2000ms.
Mar 16 13:36:23 localhost kernel: [ 2885.299355] iwlwifi :02:00.0: Current 
CMD queue read_ptr 4 write_ptr 12
Mar 16 13:36:23 localhost kernel: [ 2885.299360] iwlwifi :02:00.0: Error 
clearing ASSOC_MSK on BSS (-110)
Mar 16 13:36:25 localhost kernel: [ 2887.294339] iwlwifi :02:00.0: Error 
sending REPLY_RXON: time out after 2000ms.
Mar 16 13:36:25 localhost kernel: [ 2887.294346] iwlwifi :02:00.0: Current 
CMD queue read_ptr 4 write_ptr 15
Mar 16 13:36:25 localhost kernel: [ 2887.294350] iwlwifi :02:00.0: Error 
clearing ASSOC_MSK on BSS (-110)
Mar 16 13:36:27 localhost kernel: [ 2889.289307] iwlwifi :02:00.0: Error 
sending REPLY_RXON: time out after 2000ms.
Mar 16 13:36:27 localhost kernel: [ 2889.289315] iwlwifi :02:00.0: Current 
CMD queue read_ptr 4 write_ptr 18
Mar 16 13:36:27 localhost kernel: [ 2889.289320] iwlwifi :02:00.0: Error 
clearing ASSOC_MSK on BSS (-110)
Mar 16 13:36:29 localhost kernel: [ 2891.284303] iwlwifi :02:00.0: Error 
sending REPLY_ADD_STA: time out after 2000ms.
Mar 16 13:36:29 localhost kernel: [ 2891.284309] iwlwifi :02:00.0: Current 
CMD queue read_ptr 4 write_ptr 21
Mar 16 13:36:31 localhost kernel: [ 2893.279199] iwlwifi :02:00.0: Error 
sending REPLY_REMOVE_STA: time out after 2000ms.
Mar 16 13:36:31 localhost kernel: [ 2893.279208] iwlwifi :02:00.0: Current 
CMD queue read_ptr 4 write_ptr 24
Mar 16 13:36:31 localhost kernel: [ 2893.279215] iwlwifi :02:00.0: Error 
removing station 00:50:7f:cb:4b:58
Mar 16 13:36:33 localhost kernel: [ 2895.278267] iwlwifi :02:00.0: Error 
sending REPLY_RXON: time out after 2000ms.
Mar 16 13:36:33 localhost kernel: [ 2895.278275] iwlwifi :02:00.0: Current 
CMD queue read_ptr 4 write_ptr 27
Mar 16 13:36:33 localhost kernel: [ 2895.278280] iwlwifi :02:00.0: Error 
clearing ASSOC_MSK on BSS (-110)
Mar 16 13:36:35 localhost kernel: [ 2897.281201] iwlwifi :02:00.0: fail to 
flush all tx fifo queues
Mar 16 13:36:37 localhost kernel: [ 2899.276196] iwlwifi :02:00.0: 

Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-15 Thread Dafydd Harries

Another failure today, but this time the device failed to come out of
a sleep state after a resume cycle.

...
Mar 15 16:10:52 localhost kernel: [79800.936071] ehci_hcd :00:1d.0: power 
state changed by ACPI to D0
Mar 15 16:10:52 localhost kernel: [79800.936173] ahci :00:1f.2: restoring 
config space at offset 0x1 (was 0x2b7, writing 0x2b00407)
Mar 15 16:10:52 localhost kernel: [79800.936240] intel ips :00:1f.6: 
restoring config space at offset 0xf (was 0x400, writing 0x40b)
Mar 15 16:10:52 localhost kernel: [79800.936261] intel ips :00:1f.6: 
restoring config space at offset 0x1 (was 0x10, writing 0x12)
Mar 15 16:10:52 localhost kernel: [79800.951431] iwlwifi :02:00.0: Refused 
to change power state, currently in D3
Mar 15 16:10:52 localhost kernel: [79800.951443] iwlwifi :02:00.0: 
restoring config space at offset 0xf (was 0x, writing 0x10b)
Mar 15 16:10:52 localhost kernel: [79800.951451] iwlwifi :02:00.0: 
restoring config space at offset 0xe (was 0x, writing 0x0)
Mar 15 16:10:52 localhost kernel: [79800.951458] iwlwifi :02:00.0: 
restoring config space at offset 0xd (was 0x, writing 0xc8)
...
Mar 15 16:10:53 localhost kernel: [79803.655115] iwlwifi :02:00.0: L1 
Enabled; Disabling L0S
Mar 15 16:10:53 localhost kernel: [79803.672140] iwlwifi :02:00.0: MAC is 
in deep sleep!. CSR_GP_CNTRL = 0x
Mar 15 16:10:53 localhost kernel: [79803.689187] iwlwifi :02:00.0: MAC is 
in deep sleep!. CSR_GP_CNTRL = 0x
...
Mar 15 16:10:59 localhost kernel: [79809.254554] iwlwifi :02:00.0: Could 
not load the INST uCode section
Mar 15 16:10:59 localhost kernel: [79809.254563] iwlwifi :02:00.0: Failed 
to start RT ucode: -110
Mar 15 16:10:59 localhost kernel: [79809.271456] iwlwifi :02:00.0: MAC is 
in deep sleep!. CSR_GP_CNTRL = 0x

Next iteration:

wd_disable=1 power_save=0 bt_coex_active=0

Note that power_save=0 seems to be the default anyway.



-- 
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20120315151842.GA7166@nia



Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-15 Thread Shannon Dealy


Found these two pages discussing what appears to be the same problem:

   https://bugs.launchpad.net/ubuntu/+source/linux/+bug/575492
   http://ubuntuforums.org/showthread.php?t=1470820

in at least one case, it is definitely the same problem (deep sleep 
message shows up in the posted log).  Note: while there were Lenovo 
computers, there were a number of other brands also having problems. 
Wireless card in the discussion was the Intel 5100 AGN.


I created a file /etc/modprobe.d/iwlagn.conf and placed the following 
line in it:


  options iwlagn 11n_disable=1 11n_disable50=1

and reloaded my driver.  Wireless has been rock solid for 14 hours, no 
dropped packets, no long pauses, no mysterious disconnects.  Normally I 
would at least see short pauses, a few disconnects and 100's of dropped 
packets in this time frame.


Since I haven't been seeing the deep sleep problem lately, someone else 
will have to disable N on their system to test if this prevents that 
issue.


This doesn't really constitute a fix and it isn't a great work-around 
either given the major lost of network performance, but if it works for 
others, it would significantly narrow the range of possible causes for 
the problem.


Shannon C. Dealy  |   DeaTech Research Inc.
de...@deatech.com |  - Custom Software Development -
Phone: (800) 467-5820 |  - Natural Building Instruction -
   or: (541) 929-4089 |  www.deatech.com



--
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: 
http://lists.debian.org/alpine.deb.2.00.1203152139090.28...@nashapur.deatech.com



Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-14 Thread Dafydd Harries
On Mon, Mar 12, 2012 at 12:44:29PM -0500, Jonathan Nieder wrote:
 Dafydd Harries wrote:
  Ar 12/03/2012 am 17:11, ysgrifennodd Venkataraman, Meenakshi:
  Dafydd Harries wrote:
 
  Like others, the problems seemed to start around 2.6.39.
 [...]
  Sadly, my dpkg.log only goes back to 3.0, which was installed last July
  (!).
 
 Thanks for checking, and sorry for the lack of clarity.
 /var/log/dpkg.log.1 et al might go back further.

Yes, my logs go back as far as dpkg.log.8, which is last July.



-- 
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20120314153724.GB7496@nia



Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-14 Thread Shannon Dealy




Like others, the problems seemed to start around 2.6.39.


Thought I should note here, my system showed this problem with 2.6.36 
through 2.6.39.  It seems to have stopped showing the problem (possibly 
due to a memory upgrade many months ago), but it still has chronic 
instability of the connection (drops out for varying intervals), and 
often, network-manager doesn't seem aware of the failure - reloading the 
driver isn't a reliable solution (sometimes appears to work, often does 
not)


FWIW.

Shannon C. Dealy  |   DeaTech Research Inc.
de...@deatech.com |  - Custom Software Development -
Phone: (800) 467-5820 |  - Natural Building Instruction -
   or: (541) 929-4089 |  www.deatech.com



--
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: 
http://lists.debian.org/alpine.deb.2.00.1203142027440.28...@nashapur.deatech.com



Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-12 Thread Jonathan Nieder
Hi,

Venkataraman, Meenakshi wrote:

 Hi Shannon and others,

Thanks for a helpful note.  Forwarding to Shannon, Juha, and Bjørn:

 First up, my sincere apologies for not responding earlier. I've been
 swamped with other work, and have had a chance to look at this only
 now.

 I just got caught up with the email thread, and it appears that
 you're seeing a problem with the following configuration most
 frequently:

 1)  Enable power saving in the driver (power_save)
 2)  Enabling 11n
 3)  Leaving aspm at its default
 4)  wd_disable=0 (the default)

 Our devices are known to have issues with being in L1 (a PCIe sleep
 state), and so we use L0S by default - this is a lower latency and
 higher power state.

 We've also not been able to reproduce the MAC in deep sleep
 problem at our end, so not sure at the moment what is causing it.

 However, there was one issue with queue-stuck detection that we
 found and fixed very recently. The patch is available in the
 wireless-next tree, and will likely improve the situation if a stuck
 queue was the initial cause of your problem.

 You can get the source here:
 http://git.kernel.org/pub/scm/linux/kernel/git/linville/wireless-next.git

 And the patch I'm talking about is this:
 git.kernel.org/?p=linux/kernel/git/linville/wireless-next.git;a=commit;h=342bbf3fee2fa9a18147e74b2e3c4229a4564912

 My suggestion is to load the module with power_save=0, wd_disable=1.
 Enabling 11n should not be a problem, but if it is, then please let
 us know. You should not need to use the wd_disable=1 in the upcoming
 versions of the kernel, but for now, I'd suggest using it. Since
 your problem seems to be reducing significantly by using
 pcie_aspm=off, I would appreciate it much if you could tell us what
 the behaviour is with all other parameters being the same
 (power_save=0, wd_disable=1), and just toggling the state of this
 variable.

 We'll try to reproduce the suspend/hibernate/resume issue in-house
 and let you know if we were able to reproduce the problem at our
 end. If not, we'd like you to try out a newer WiFi card; as the 5100
 is a fairly old device, and will likely not get any firmware updates
 (if it is some weird firmware/driver combo that produced the PCIe
 error).

 Thanks!
 Meenakshi Venkataraman



--
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20120312160738.GA18721@burratino



Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-12 Thread Jonathan Nieder
found 628444 linux-2.6/3.2.9-1
tags 628444 + upstream patch moreinfo
quit

Hi Dafydd,

Dafydd Harries wrote:

 I've been seeing similar problems with my Intel Corporation Centrino
 Ultimate-N 6300.

 Like others, the problems seemed to start around 2.6.39.

Odd. What kernel did you use before then?  (/var/log/dpkg.log might
tell.)

 Like othes, the card flakes out a day or two after booting, and a reboot
 always fixes the problem. Occasionally it stays working for longer.

 Like others, I've added RAM. But as far as I can recall the upgrade
 happened well before any poblems started appearing.

Interesting and useful.

 Any ASPM settings are at their default.

 I'll try wd_disable=1 as a workaround for now.

 Meenakshi, will the patch you mentioned be applied in 3.3?

Cc-ing her.  The patch currently seems to be part of the wireless-next
tree but not davem's net tree.

 Below is a syslog excerpt from around the time of failue. It seems to
 support Meenakshi's suggestion that it's related to the queue getting
 stuck.

Well, that can be tested.  Could you try the patch against current
master?  It works like this:

0. Prerequisites:
apt-get install git build-essential

1. Get the kernel history, if you don't already have it:
git clone \
  git://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git

2. Configure and build:
cd linux
git checkout origin/master
cp /boot/config-$(uname -r) .config; # current configuration
make localmodconfig; # optional: minimize configuration
make deb-pkg; # optionally with -jnum for parallel build
dpkg -i ../name of package; # as root
reboot

... test test test ...

3. Hopefully it reproduces the problem.  So try the attached patch:
git am -3sc the patch
make deb-pkg; # maybe with -j4
dpkg -i ../name of package; # as root
reboot

If it works, we can pass this to Dave with information about what
happened and your test result, to get the patch fast-tracked.

Thanks,
Jonathan

 Below is a syslog excerpt from around the time of failue. It seems to
 support Meenakshi's suggestion that it's related to the queue getting
 stuck.
[...]
 iwlwifi :02:00.0: Queue 4 stuck for 2000 ms.
 iwlwifi :02:00.0: Current read_ptr 112 write_ptr 115
 iwlwifi :02:00.0: On demand firmware reload
 iwlwifi :02:00.0: Command REPLY_QOS_PARAM failed: FW Error
 iwlwifi :02:00.0: Failed to update QoS
 iwlwifi :02:00.0: fw recovery, no hcmd send
 iwlwifi :02:00.0: Error sending REPLY_RXON: enqueue_hcmd failed: -5
 iwlwifi :02:00.0: Error clearing ASSOC_MSK on BSS (-5)
 iwlwifi :02:00.0: MAC is in deep sleep!. CSR_GP_CNTRL = 0x
 iwlwifi :02:00.0: MAC is in deep sleep!. CSR_GP_CNTRL = 0x
[...]
 ieee80211 phy0: Hardware restart was requested
 wpa_supplicant[1472]: CTRL-EVENT-DISCONNECTED bssid=00:50:7f:cb:4b:58 reason=4
 ieee80211 phy0: failed to remove key (1, ff:ff:ff:ff:ff:ff) from hardware (-2)
[]
 iwlwifi :02:00.0: Could not load the INST uCode section
 iwlwifi :02:00.0: Failed to start RT ucode: -110
[...]
 iwlwifi :02:00.0: MAC is in deep sleep!. CSR_GP_CNTRL = 0x
[...]
 I get some kind of OOPS but I'm guessing this is just because the driver can't
 communicate with the card when the module is being unloaded:
[...]
 WARNING: at 
 /build/buildd-linux-2.6_3.2.9-1-amd64-KTPapN/linux-2.6-3.2.9/debian/build/source_amd64_none/drivers/net/wireless/iwlwifi/iwl-core.c:1330
  iwlagn_mac_remove_interface+0x48/0xdd [iwlwifi]()
 Hardware name: 3249CTO
 Modules linked in: uvcvideo videodev v4l2_compat_ioctl32 media snd_usb_audio 
 snd_usbmidi_lib pci_stub vboxpci(O) vboxnetadp(O) vboxnetflt(O) vboxdrv(O) 
 acpi_cpufreq mperf cpufreq_stats cpufreq_userspace cpu
 Mar 12 13:15:04 localhost kernel: sync_memcpy async_tx raid1 raid0 multipath 
 linear md_mod sd_mod crc_t10dif usbhid hid ahci libahci ehci_hcd libata 
 scsi_mod usbcore thermal thermal_sys usb_common e1000e [last unloaded: 
 scsi_wait_scan]
 Mar 12 13:15:04 localhost kernel: [48290.674508] Pid: 1405, comm: 
 NetworkManager Tainted: G   O 3.2.0-2-amd64 #1
 Mar 12 13:15:04 localhost kernel: [48290.674511] Call Trace:
 Mar 12 13:15:04 localhost kernel: [48290.674520]  [81046879] ? 
 warn_slowpath_common+0x78/0x8c
 Mar 12 13:15:04 localhost kernel: [48290.674531]  [a03ea9af] ? 
 iwlagn_mac_remove_interface+0x48/0xdd [iwlwifi]
[...]
 Mar 12 13:15:04 localhost kernel: [48290.674647]  [812a35a5] ? 
 netlink_rcv_skb+0x36/0x7a
[...]
 iwlwifi :02:00.0: ctx-vif =   (null), vif = 8801b1c72df0
 iwlwifi :02:00.0:  ID = 0: ctx = 8801b1a834b0  ctx-vif =   
 (null)
From: Johannes Berg johannes.b...@intel.com
Date: Sun, 4 Mar 2012 08:50:46 -0800
Subject: iwlwifi: always monitor for stuck queues

commit 342bbf3fee2fa9a18147e74b2e3c4229a4564912 upstream.

If we only monitor while associated, the following
can happen:
 - 

Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-12 Thread Venkataraman, Meenakshi
Hi,

Dafydd Harries wrote:

 I've been seeing similar problems with my Intel Corporation Centrino
 Ultimate-N 6300.

 Like others, the problems seemed to start around 2.6.39.

Odd. What kernel did you use before then?  (/var/log/dpkg.log might
tell.)

 Like othes, the card flakes out a day or two after booting, and a reboot
 always fixes the problem. Occasionally it stays working for longer.

[MV] what platform are you using? And does your problem appear after a system 
hibernate?


 Like others, I've added RAM. But as far as I can recall the upgrade
 happened well before any poblems started appearing.

Interesting and useful.

 Any ASPM settings are at their default.

[MV] Can you try pcie_aspm=off during boot?

 Meenakshi, will the patch you mentioned be applied in 3.3?

[MV] Yes...it should be applied to 3.3 as well (it is also slated to be 
backported to stable kernels); but it is a fairly recent fix, so it will take 
some time before it gets accepted to the other Linux trees.

 Below is a syslog excerpt from around the time of failue. It seems to
 support Meenakshi's suggestion that it's related to the queue getting
 stuck.
[...]
 iwlwifi :02:00.0: Queue 4 stuck for 2000 ms.

[MV] Any idea what happened before this? Did you see any error sending host 
commands? Did you resume from a hibernate? Can you send me the log?

Thanks for your patience,
Meenakshi



--
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: 
http://lists.debian.org/4595b4d22ab93c4fabba84aad5aa37fd0dc...@orsmsx103.amr.corp.intel.com



Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-12 Thread Dafydd Harries
Ar 12/03/2012 am 17:11, ysgrifennodd Venkataraman, Meenakshi:
 Hi,
 
 Dafydd Harries wrote:
 
  I've been seeing similar problems with my Intel Corporation Centrino
  Ultimate-N 6300.
 
  Like others, the problems seemed to start around 2.6.39.
 
 Odd. What kernel did you use before then?  (/var/log/dpkg.log might
 tell.)

Sadly, my dpkg.log only goes back to 3.0, which was installed last July
(!).

I could try installing e.g. 2.6.32 from snapshot.debian.org if other
things don't help. But that would be a lot of patches to bisect,
especially when reproduction iterations are so long...

  Like othes, the card flakes out a day or two after booting, and a reboot
  always fixes the problem. Occasionally it stays working for longer.
 
 [MV] what platform are you using? And does your problem appear after a system 
 hibernate?

Linux nia 3.2.0-2-amd64 #1 SMP Sun Mar 4 22:48:17 UTC 2012 x86_64 GNU/Linux

The system is Debian unstable.

I don't use hibernation. I do suspend regularly, but I haven't noticed any
correlation with suspend/resume.

 
  Like others, I've added RAM. But as far as I can recall the upgrade
  happened well before any poblems started appearing.
 
 Interesting and useful.
 
  Any ASPM settings are at their default.
 
 [MV] Can you try pcie_aspm=off during boot?

I'm currently waiting to see if wd_disable=1 help at all. This might
take a few days, though, since I don't have any way to reproduce it.

  Meenakshi, will the patch you mentioned be applied in 3.3?
 
 [MV] Yes...it should be applied to 3.3 as well (it is also slated to be 
 backported to stable kernels); but it is a fairly recent fix, so it will take 
 some time before it gets accepted to the other Linux trees.
 
  Below is a syslog excerpt from around the time of failue. It seems to
  support Meenakshi's suggestion that it's related to the queue getting
  stuck.
 [...]
  iwlwifi :02:00.0: Queue 4 stuck for 2000 ms.
 
 [MV] Any idea what happened before this? Did you see any error sending host 
 commands? Did you resume from a hibernate? Can you send me the log?

I haven't noticed any pattern. The first thing that happens is that
the NetworkManager applet seems to be trying to reconnect to the
wireless.

The log seems pretty quiet beforehand.

Mar 12 10:09:43 localhost dhclient: DHCPREQUEST on wlan0 to 192.168.1.1 port 67
Mar 12 10:09:43 localhost dhclient: DHCPACK from 192.168.1.1
Mar 12 10:09:43 localhost dhclient: bound to 192.168.1.15 -- renewal in 15209 
seconds.
Mar 12 10:17:01 localhost /USR/SBIN/CRON[12402]: (root) CMD (   cd /  
run-parts --report /etc/cron.hourly)
Mar 12 10:19:46 localhost smartd[2460]: Device: /dev/sda [SAT], SMART Usage 
Attribute: 194 Temperature_Celsius changed from 161 to 157
Mar 12 10:25:51 localhost dbus[1395]: [system] Activating service 
name='org.freedesktop.PackageKit' (using servicehelper)
Mar 12 10:25:51 localhost dbus[1395]: [system] Successfully activated service 
'org.freedesktop.PackageKit'
Mar 12 11:17:01 localhost /USR/SBIN/CRON[12472]: (root) CMD (   cd /  
run-parts --report /etc/cron.hourly)
Mar 12 11:19:46 localhost smartd[2460]: Device: /dev/sda [SAT], SMART Usage 
Attribute: 194 Temperature_Celsius changed from 157 to 161
Mar 12 11:19:59 localhost dbus[1395]: [system] Activating service 
name='org.freedesktop.PackageKit' (using servicehelper)
Mar 12 11:19:59 localhost dbus[1395]: [system] Successfully activated service 
'org.freedesktop.PackageKit'
Mar 12 11:25:51 localhost dbus[1395]: [system] Activating service 
name='org.freedesktop.PackageKit' (using servicehelper)
Mar 12 11:25:51 localhost dbus[1395]: [system] Successfully activated service 
'org.freedesktop.PackageKit'
Mar 12 11:49:46 localhost smartd[2460]: Device: /dev/sda [SAT], SMART Usage 
Attribute: 194 Temperature_Celsius changed from 161 to 157
Mar 12 12:17:01 localhost /USR/SBIN/CRON[15518]: (root) CMD (   cd /  
run-parts --report /etc/cron.hourly)
Mar 12 12:19:20 localhost dbus[1395]: [system] Reloaded configuration
Mar 12 12:19:21 localhost dbus[1395]: [system] Reloaded configuration
Mar 12 12:19:21 localhost dbus[1395]: [system] Reloaded configuration
Mar 12 12:19:21 localhost dbus[1395]: [system] Reloaded configuration
Mar 12 12:19:22 localhost dbus[1395]: [system] Reloaded configuration
Mar 12 12:19:46 localhost smartd[2460]: Device: /dev/sda [SAT], SMART Usage 
Attribute: 194 Temperature_Celsius changed from 157 to 144
Mar 12 12:20:15 localhost anacron[19324]: Anacron 2.3 started on 2012-03-12
Mar 12 12:20:15 localhost anacron[19324]: Normal exit (0 jobs run)
Mar 12 12:20:22 localhost dbus[1395]: [system] Reloaded configuration
Mar 12 12:20:23 localhost dbus[1395]: [system] Reloaded configuration
Mar 12 12:20:23 localhost dbus[1395]: [system] Activating service 
name='org.freedesktop.PackageKit' (using servicehelper)
Mar 12 12:20:23 localhost dbus[1395]: [system] Successfully activated service 
'org.freedesktop.PackageKit'
Mar 12 12:34:33 localhost dbus[1395]: [system] Activating service 

Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-12 Thread Jonathan Nieder
Dafydd Harries wrote:
 Ar 12/03/2012 am 17:11, ysgrifennodd Venkataraman, Meenakshi:
 Dafydd Harries wrote:

 Like others, the problems seemed to start around 2.6.39.
[...]
 Sadly, my dpkg.log only goes back to 3.0, which was installed last July
 (!).

Thanks for checking, and sorry for the lack of clarity.
/var/log/dpkg.log.1 et al might go back further.

 what platform are you using? And does your problem appear after a
 system hibernate?

 Linux nia 3.2.0-2-amd64 #1 SMP Sun Mar 4 22:48:17 UTC 2012 x86_64 GNU/Linux

 The system is Debian unstable.

Maybe lspci -vvnn output (as an attachment) and dmesg output from
booting if you have it could help to pin down the setup a little more.
(If I understand correctly, Meenakshi is dreaming of a reliable
reproduction recipe. ;-))

Ciao,
Jonathan



-- 
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20120312174429.GA21040@burratino



Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-12 Thread Venkataraman, Meenakshi
Hi,

 what platform are you using? And does your problem appear after a
 system hibernate?

 Linux nia 3.2.0-2-amd64 #1 SMP Sun Mar 4 22:48:17 UTC 2012 x86_64
 GNU/Linux

 The system is Debian unstable.

Maybe lspci -vvnn output (as an attachment) and dmesg output from
booting if you have it could help to pin down the setup a little more.
(If I understand correctly, Meenakshi is dreaming of a reliable reproduction
recipe. ;-))

[MV] That's correct! :-)

Meenakshi



--
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: 
http://lists.debian.org/4595b4d22ab93c4fabba84aad5aa37fd0dc...@orsmsx103.amr.corp.intel.com



Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-03-12 Thread Venkataraman, Meenakshi
Hi Dafydd,

  I've been seeing similar problems with my Intel Corporation Centrino
  Ultimate-N 6300.
 
  Like others, the problems seemed to start around 2.6.39.

[MV] Hmm...this is interesting. Can you load the iwlwifi module with 
bt_coex_active=0 and see if it changes anything? One of the patches that went 
in between 2.6.38 and 2.6.39 changed the behaviour of coexistence with 
Bluetooth devices on some platforms, and caused users some grief; although 
their symptoms were different. The module parameter I mention above solved this 
problem for them.

 Odd. What kernel did you use before then?  (/var/log/dpkg.log might
 tell.)

Sadly, my dpkg.log only goes back to 3.0, which was installed last July
(!).

I could try installing e.g. 2.6.32 from snapshot.debian.org if other
things don't help. But that would be a lot of patches to bisect,
especially when reproduction iterations are so long...

[MV] No...let's leave bisecting as the last option for this problem.

 [MV] what platform are you using? And does your problem appear after a
system hibernate?

Linux nia 3.2.0-2-amd64 #1 SMP Sun Mar 4 22:48:17 UTC 2012 x86_64
GNU/Linux

[MV]  Oh...I was asking about hardware...who is the manufacturer of your system?

I don't use hibernation. I do suspend regularly, but I haven't noticed any
correlation with suspend/resume.

[MV] Interesting as well.

 [MV] Any idea what happened before this? Did you see any error sending
host commands? Did you resume from a hibernate? Can you send me the log?

I haven't noticed any pattern. The first thing that happens is that
the NetworkManager applet seems to be trying to reconnect to the
wireless.

[MV] Hmm...the queue stuck patch could potentially help here, but only provided 
that a queue was stuck prior to the reconnect. Your log doesn't run that far 
back, so I'm not able to say.  :-)

Thanks,
Meenakshi




--
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: 
http://lists.debian.org/4595b4d22ab93c4fabba84aad5aa37fd0dc...@orsmsx103.amr.corp.intel.com



Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-02-21 Thread Juha Jäykkä
Hi Jonathan!

Sorry I took a while to respond and apologies in advance for them being quite 
useless...

  - steps to reproduce, assuming I had the same hardware

Use the computer. My maximum time without hitting the bug has been less than 
48 hours before I added the following module options: 11n_disable=1 
power_save=0 wd_disable=1 which some googling suggested might solve it. They 
do not, but they make it less frequent. With them, I can even get a WEEK 
without seeing this.

But notice I have different hw than the original reporter: mine is X200s with 
Intel Corporation Ultimate N WiFi Link 5300 (pci-id: 8086:4236).

  - expected result, actual result, and how the difference indicates a
bug (should be simple enough in this case)

=) Expected result: wifi keeps working unless I switch it off using rf_kill, 
physical switch, unload the module, or turn off the computer. 

Actual result: suddenly, out of the blue, in the middle of typing an email in 
kmail, see attachment.

Bug: the wifi card has certainly not changed into Unknown hardware type 
suddenly.

  - how reproducible it is (100% of the time?  50%?)

100% when waiting long enough between reboots.

  - which kernel versions you have tested, and results with each

There were no problems in the 2.6-series. The bug occurs at least in the 
Debian kernel versions 3.2.0-1-amd64, 3.0.0-2-amd64, and 3.0.0-1-amd64.

  - full dmesg output from booting and reproducing the bug, as an
attachment

Do not have it now, if really necessary, will get it next time it occurs 
(which may be a while: I am back to 2.6.39 because I need to get work done).

  - any other weird observations or workarounds

No workaround. Above module parameters alleviate the issue. This is a 
regression, so I suggest someone with time (if anyone has it) bisects 2.6 and 
3.2... horrible task, I do not envy anyone doing that.

 upstream to a public mailing list.  So the purpose of these questions
 is to collect data on what's known so far as a starting point.

Please CC me if you do.

Cheers,
Juha

-- 
 ---
| Juha Jäykkä, ju...@iki.fi |
| http://www.maths.leeds.ac.uk/~juhaj   |
 ---
[252832.820219] iwlwifi :03:00.0: Error sending POWER_TABLE_CMD: time out 
after 2000ms.
[252832.820229] iwlwifi :03:00.0: Current CMD queue read_ptr 134 write_ptr 
135
[252832.820237] iwlwifi :03:00.0: set power fail, ret = -110
[252835.320320] iwlwifi :03:00.0: Error sending REPLY_QOS_PARAM: time out 
after 2000ms.
[252835.320331] iwlwifi :03:00.0: Current CMD queue read_ptr 134 write_ptr 
137
[252835.320338] iwlwifi :03:00.0: Failed to update QoS
[252837.320249] iwlwifi :03:00.0: Error sending REPLY_RXON: time out after 
2000ms.
[252837.320259] iwlwifi :03:00.0: Current CMD queue read_ptr 134 write_ptr 
140
[252837.320267] iwlwifi :03:00.0: Error clearing ASSOC_MSK on BSS (-110)
[252839.320130] iwlwifi :03:00.0: Error sending REPLY_RXON: time out after 
2000ms.
[252839.320140] iwlwifi :03:00.0: Current CMD queue read_ptr 134 write_ptr 
143
[252839.320148] iwlwifi :03:00.0: Error clearing ASSOC_MSK on BSS (-110)
[252841.320273] iwlwifi :03:00.0: Error sending REPLY_ADD_STA: time out 
after 2000ms.
[252841.320283] iwlwifi :03:00.0: Current CMD queue read_ptr 134 write_ptr 
146
[252841.320296] ieee80211 phy0: failed to remove key (0, 00:24:17:33:f4:f5) 
from hardware (-110)
[252843.320053] iwlwifi :03:00.0: Error sending REPLY_REMOVE_STA: time out 
after 2000ms.
[252843.320057] iwlwifi :03:00.0: Current CMD queue read_ptr 134 write_ptr 
149
[252843.320061] iwlwifi :03:00.0: Error removing station 00:24:17:33:f4:f5
[252845.324086] iwlwifi :03:00.0: Error sending REPLY_RXON: time out after 
2000ms.
[252845.324096] iwlwifi :03:00.0: Current CMD queue read_ptr 134 write_ptr 
152
[252845.324104] iwlwifi :03:00.0: Error clearing ASSOC_MSK on BSS (-110)
[252847.328250] iwlwifi :03:00.0: fail to flush all tx fifo queues
[252849.328175] iwlwifi :03:00.0: Error sending POWER_TABLE_CMD: time out 
after 2000ms.
[252849.328185] iwlwifi :03:00.0: Current CMD queue read_ptr 134 write_ptr 
154
[252849.328192] iwlwifi :03:00.0: set power fail, ret = -110
[252851.328203] iwlwifi :03:00.0: Error sending REPLY_RXON: time out after 
2000ms.
[252851.328214] iwlwifi :03:00.0: Current CMD queue read_ptr 134 write_ptr 
155
[252851.328222] iwlwifi :03:00.0: Error clearing ASSOC_MSK on BSS (-110)
[252853.328234] iwlwifi :03:00.0: Error sending REPLY_ADD_STA: time out 
after 2000ms.
[252853.328245] iwlwifi :03:00.0: Current CMD queue read_ptr 134 write_ptr 
156
[252853.328258] ieee80211 phy0: failed to remove key (1, ff:ff:ff:ff:ff:ff) 
from hardware (-110)
[252853.328380] cfg80211: Calling CRDA to update world regulatory domain

Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2012-02-11 Thread Jonathan Nieder
Hi Juha,

Juha Jäykkä wrote:

 What is the status of the fix from Intel? I have the same problem since 
 upgrading to 3.x series and it is VERY annoying - only way I can fix it is 
 rebooting the kernel, so it really seems a kernel bug. Hibernating the system 
 and doing a full power-off (unplug AC, remove battery, wait 60 seconds – this 
 should do it) the laptop does NOT fix it, so I concur: the kernel must retain 
 some bogus information somewhere since the hw has definitely lost its status 
 info.

Please provide:

 - steps to reproduce, assuming I had the same hardware
 - expected result, actual result, and how the difference indicates a
   bug (should be simple enough in this case)
 - how reproducible it is (100% of the time?  50%?)
 - which kernel versions you have tested, and results with each
 - full dmesg output from booting and reproducing the bug, as an
   attachment
 - any other weird observations or workarounds

I believe the best way to move forward would be to take this report
upstream to a public mailing list.  So the purpose of these questions
is to collect data on what's known so far as a starting point.

Thanks,
Jonathan



--
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: http://lists.debian.org/20120211193827.GE6944@burratino



Bug#628444: iwlagn - MAC is in deep sleep, cannot restore wifi operation

2011-11-25 Thread Shannon Dealy

On Wed, 23 Nov 2011, Jonathan Nieder wrote:


Shannon Dealy wrote:


A developer at Intel contacted me regarding this bug the other day (he was
following up on a similar bug report from another source) and I am working
with him on the problem (currently doing a debug build of the module to
collect data on what is happening).


That's good to hear.  Did it bear any fruit?  (Any test results or
public mailing list messages we can look at?)


Nothing so far, both he and I are very busy so it is a slow process, 
waiting for each of us to work the next step into our schedules. 
Currently I am waiting for his response to a set of data I captured from 
the driver surrounding one failure.


Shannon C. Dealy  |   DeaTech Research Inc.
de...@deatech.com |  - Custom Software Development -
Phone: (800) 467-5820 |  - Natural Building Instruction -
   or: (541) 929-4089 |  www.deatech.com



--
To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org
with a subject of unsubscribe. Trouble? Contact listmas...@lists.debian.org
Archive: 
http://lists.debian.org/alpine.deb.2.00.250814390.11...@nashapur.deatech.com