Hi,

are you running these devices through a powered hub? If not, can you test that?

I've read the above emails and the first email still rings worrysome.
You don't want to be under-powering the NIC in any way or things may
get extremely pissed off.

Are you using an AR9170, or an actual ath9k_htc part?

Can you post a dmesg?


-a


On 10 May 2014 23:40, Aaron Hamilton <aa...@logicdatasystems.net> wrote:
> I'll give the patches/config a try and see if it helps anything. Also,
> the line "supported_rates=10 20 55" doesn't seem to be working. When I
> do a station dump, the tx rate is reported as 6.5 Mbit/s.
>
> On the device that is currently locked up and not accepting
> connections, what are some options for obtaining useful data on the
> current state of the device (i.e. queue status, etc)? I know I can
> restart hostapd to fix it, but that doesn't help me find the root
> cause (and thus how to fix it).
>
> I've gone through an incredible amount of iterations of kernel
> configurations, hostapd changes, etc and I'm pulling my hair out not
> getting any closer to finding the problem. I really appreciate all the
> help thus far, but it would be awesome to be able to see the state of
> the queues and see if/where anything is locked up or pending.
>
> On Sat, May 10, 2014 at 2:26 AM, Oleksij Rempel <li...@rempel-privat.de> 
> wrote:
>> Am 09.05.2014 00:57, schrieb Aaron Hamilton:
>>> Did further testing and we still seem to have issues with clients
>>> connecting. Here's our scenario:
>>>
>>> ** Problem 1 - Extreme Latency **
>>>
>>> 1) Connect a Panasonic Toughbook laptop to the WiFi AP. Connection
>>> appears to come up without any issues. We initiate ongoing pings to
>>> the computer from the AP with consistent 3ms to 10ms responses.
>>>
>>> 2) Connect an embedded device to the AP (dnsmasq reports vendor class
>>> udhcp 1.18.5). When we initiate pings from the AP to the device,
>>> responses take between 500ms and 1000ms.
>>>
>>> We then powered down both client devices and reconnected only the
>>> embedded client. This time the pings started at 32ms and increased in
>>> latency for every subsequent ping. The following is a capture from the
>>> AP. It appears as though each subsequent ping is further delayed by
>>> approximately 20ms. During this time, only the one client is
>>> connected. Also, the only traffic coming across the interface are
>>> pings, which leads me to believe this is not a bandwidth problem.
>>
>> I'm 100% sure, it is about bandwidth.
>> Right now i did test with one AP (ath9k_htc) + 2 STAs.
>> AR9271 adapter is connected to USB 2.0 in high speed mode.
>>
>> Ping test:
>> - both STAs provide stable PING with ~2ms if they are in one room.
>> - After one STA was moved behind two walls it got less stable ping which
>> was variating 2-100ms.
>>
>> Bench test:
>> - Each STA run two stream netperf. AP is in HT20 since there is another
>> AP with HT40 in same room.
>> - The STA behind two walls had almost no chance. It got some 100KB/s
>> - The closest STA had about 4MB/s
>>
>> Well, for this kind of device it is acceptable:
>> https://wikidevi.com/wiki/ThinkPenguin_TPE-N150USB
>>
>>> # ping 192.168.2.192
>>> PING 192.168.2.192 (192.168.2.192): 56 data bytes
>>> 64 bytes from 192.168.2.192: seq=32 ttl=64 time=32.043 ms
>>> 64 bytes from 192.168.2.192: seq=33 ttl=64 time=155.090 ms
>>> 64 bytes from 192.168.2.192: seq=34 ttl=64 time=1013.031 ms
>>> 64 bytes from 192.168.2.192: seq=35 ttl=64 time=21.302 ms
>>> 64 bytes from 192.168.2.192: seq=36 ttl=64 time=6.622 ms
>>> 64 bytes from 192.168.2.192: seq=37 ttl=64 time=8.240 ms
>>> 64 bytes from 192.168.2.192: seq=38 ttl=64 time=79.743 ms
>>> 64 bytes from 192.168.2.192: seq=39 ttl=64 time=103.089 ms
>>> 64 bytes from 192.168.2.192: seq=40 ttl=64 time=121.613 ms
>>> 64 bytes from 192.168.2.192: seq=41 ttl=64 time=143.677 ms
>>> 64 bytes from 192.168.2.192: seq=42 ttl=64 time=167.053 ms
>>> 64 bytes from 192.168.2.192: seq=43 ttl=64 time=190.735 ms
>>> 64 bytes from 192.168.2.192: seq=44 ttl=64 time=215.027 ms
>>> 64 bytes from 192.168.2.192: seq=45 ttl=64 time=236.206 ms
>>> 64 bytes from 192.168.2.192: seq=46 ttl=64 time=259.461 ms
>>> 64 bytes from 192.168.2.192: seq=47 ttl=64 time=283.142 ms
>>> 64 bytes from 192.168.2.192: seq=48 ttl=64 time=302.704 ms
>>> 64 bytes from 192.168.2.192: seq=49 ttl=64 time=325.379 ms
>>> 64 bytes from 192.168.2.192: seq=50 ttl=64 time=364.105 ms
>>> 64 bytes from 192.168.2.192: seq=51 ttl=64 time=372.955 ms
>>> 64 bytes from 192.168.2.192: seq=52 ttl=64 time=394.073 ms
>>> 64 bytes from 192.168.2.192: seq=53 ttl=64 time=433.075 ms
>>> 64 bytes from 192.168.2.192: seq=54 ttl=64 time=436.615 ms
>>> 64 bytes from 192.168.2.192: seq=55 ttl=64 time=462.372 ms
>>> 64 bytes from 192.168.2.192: seq=56 ttl=64 time=484.283 ms
>>> 64 bytes from 192.168.2.192: seq=57 ttl=64 time=510.132 ms
>>> 64 bytes from 192.168.2.192: seq=58 ttl=64 time=534.637 ms
>>> 64 bytes from 192.168.2.192: seq=59 ttl=64 time=552.154 ms
>>> 64 bytes from 192.168.2.192: seq=60 ttl=64 time=571.411 ms
>>> 64 bytes from 192.168.2.192: seq=61 ttl=64 time=594.605 ms
>>> 64 bytes from 192.168.2.192: seq=62 ttl=64 time=616.638 ms
>>> 64 bytes from 192.168.2.192: seq=63 ttl=64 time=535.492 ms
>>> 64 bytes from 192.168.2.192: seq=64 ttl=64 time=661.377 ms
>>> 64 bytes from 192.168.2.192: seq=65 ttl=64 time=685.821 ms
>>> 64 bytes from 192.168.2.192: seq=66 ttl=64 time=708.862 ms
>>> ^C
>>> --- 192.168.2.192 ping statistics ---
>>> 68 packets transmitted, 35 packets received, 48% packet loss
>>> round-trip min/avg/max = 6.622/359.507/1013.031 ms
>>> #
>>
>> I would expect this kind of behaviour with high packet loss.
>>
>>> ** Problem 2 - Total Loss of Connectivity **
>>>
>>> Another issue we have is that WiFi clients loose their ability to
>>> connect to the AP after a period of time. I have remote access into an
>>> AP currently exhibiting this behavior. Here's what we're seeing:
>>>
>>> 1) WiFi beacon is being broadcast and is visible to clients
>>>
>>> 2) Client connection attempt fails and nothing appears in log
>>> indicating an attempt is made. Typically we would at least see
>>> association/authentication messages in the syslog.
>>>
>>> 3) Nothing unusual is reported by dmesg
>>>
>>> 4) If hostapd is restarted, connections will come back up
>>>
>>> Any ideas?  Is there anything I can gather from debugfs or other means?
>>
>> Firmware is defiantly not oopsed.
>> In some cases i noticed that firmware was not able to provide
>> notification about send or field TX packets because
>>  event queue was full. With slow usb i would assume that this queue will
>> often make some problems. But kernel driver was able to recover
>> connection even in this case. So, i don't think it will stall forever.
>>
>> You can try to add "disassoc_low_ack=1" to hostapd.conf which may be
>> will refresh some stalled connections.
>>
>> Other idea is to disbale ani. Try this workaround:
>>
>> diff --git a/drivers/net/wireless/ath/ath9k/htc_drv_main.c
>> b/drivers/net/wireless/ath/ath9k/htc_drv_main.c
>> index f46cd02..e89f85c 100644
>> --- a/drivers/net/wireless/ath/ath9k/htc_drv_main.c
>> +++ b/drivers/net/wireless/ath/ath9k/htc_drv_main.c
>> @@ -744,6 +744,8 @@ void ath9k_htc_start_ani(struct ath9k_htc_priv *priv)
>>         struct ath_common *common = ath9k_hw_common(priv->ah);
>>         unsigned long timestamp = jiffies_to_msecs(jiffies);
>>
>> +       return;
>> +
>>         common->ani.longcal_timer = timestamp;
>>         common->ani.shortcal_timer = timestamp;
>>         common->ani.checkani_timer = timestamp;
>>
>>
>>
>>
>>> On Tue, May 6, 2014 at 12:21 AM, Oleksij Rempel <li...@rempel-privat.de> 
>>> wrote:
>>>>
>>>> Am 06.05.2014 03:57, schrieb Aaron Hamilton:
>>>>> Oh ok. Is this not already handled by hostapd or the wifi drivers?
>>>>
>>>> No. hostapd suggest which rutaes should be used and driver, btw.
>>>> mac80211 should fallow this suggestion. ip layer is not touched.
>>>>
>>>>>
>>>>> Also, I reverted back to backports-3.12.8-1 and now trying to see if
>>>>> there's a difference when using sch_codel.ko and sch_fq_codel.ko
>>>>> (previously these two modules were not used as I was trying to minimize
>>>>> the number of moving parts). Which by the way, am I gaining or loosing
>>>>> anything with these? I'm not quiet sure what their purpose is.
>>>>
>>>> Scheduling is good for many reasons. For example, if you know what
>>>> bandwidth you have (in your case you know it) it is possible to use
>>>> priority for critical applications. DNS and ICMP traffic will have
>>>> higher priority then HTTP, and so on. Read more about QoS.
>>>> I would suggest to set scheduler to bandwidth lover then your USB
>>>> bandwidth. It should reduces usage of ath9k_htc_fw buffer. If you
>>>> configure scheduler, please try remove "supported_rates=10 20 55" from
>>>> you config.
>>>>
>>>> Don't forget. It is not enough to add scheduler module. You will need
>>>> configure it.
>>>>
>>>>> I'm also using the attached hostapd.conf file. Previously, when two
>>>>> devices were on the WiFi, one would always have ping latency of several
>>>>> hundred milliseconds despite minimal traffic on either host. Now latency
>>>>> only seems to spike when a large continuous file is moved across the
>>>>> WiFi. Streaming of music for example doesn't seem to have much effect on
>>>>> the other WiFi clients.
>>>>
>>>> How about filed tests? Do you still have stability issues?
>>>>
>>>>> # Begin hosatpd.conf
>>>>> interface=wlan0
>>>>> driver=nl80211
>>>>>
>>>>> hw_mode=g
>>>>>
>>>>> dump_file=/tmp/hostapd.dump
>>>>> ctrl_interface=/var/run/hostapd
>>>>> ctrl_interface_group=0
>>>>>
>>>>> logger_syslog=-1
>>>>> logger_syslog_level=2
>>>>> beacon_int=500
>>>>> dtim_period=2
>>>>>
>>>>> supported_rates=10 20 55
>>>>>
>>>>> max_num_sta=5
>>>>> rts_threshold=2347
>>>>> fragm_threshold=2346
>>>>>
>>>>> macaddr_acl=0
>>>>> eapol_version=1
>>>>> eapol_key_index_workaround=0
>>>>>
>>>>> # Attempting max time-outs for increased reliability
>>>>> wpa_group_rekey=0
>>>>> wpa_gmk_rekey=86400
>>>>> # wmm_enabled=1
>>>>> ieee80211n=1
>>>>> ieee80211d=1
>>>>> country_code=DE
>>>>> ht_capab=[HT40+][RX-STBC1][DSSS_CCK-40][SHORT-GI-40]
>>>>> ignore_broadcast_ssid=0
>>>>> channel=1
>>>>> ssid=TestSSID
>>>>>
>>>>> auth_algs=1
>>>>> wpa=2
>>>>> wpa_key_mgmt=WPA-PSK
>>>>>
>>>>> wpa_pairwise=CCMP
>>>>> rsn_pairwise=CCMP
>>>>>
>>>>> wpa_passphrase=fixmeplease
>>>>> # end hostapd.conf
>>>>>
>>>>>
>>>>> On Mon, May 5, 2014 at 12:32 PM, Oleksij Rempel <li...@rempel-privat.de
>>>>> <mailto:li...@rempel-privat.de>> wrote:
>>>>>
>>>>>     Am 05.05.2014 20:09, schrieb Aaron Hamilton:
>>>>>     > I'm sorry, what's TC?
>>>>>
>>>>>     http://linux.die.net/man/8/tc
>>>>>
>>>>>     > On Sat, May 3, 2014 at 2:07 AM, Oleksij Rempel
>>>>>     <li...@rempel-privat.de <mailto:li...@rempel-privat.de>
>>>>>     > <mailto:li...@rempel-privat.de <mailto:li...@rempel-privat.de>>>
>>>>>     wrote:
>>>>>     >
>>>>>     >     Am 02.05.2014 12:11, schrieb Aaron Hamilton:
>>>>>     >     > Ok, I updated the drivers to backports 3.14-1 and configured 
>>>>> the
>>>>>     >     > following hostapd settings. I connected an iPad and a
>>>>>     Windows PC, then
>>>>>     >     > ran continuous pings. For the first couple seconds
>>>>>     everything was
>>>>>     >     > returning in a few milliseconds. Within 30 seconds, the
>>>>>     pings started
>>>>>     >     > getting into the several hundred ms range (or timing out)
>>>>>     and remained
>>>>>     >     > there (for both the iPad and PC).
>>>>>     >     >
>>>>>     >     > After I disconnected the PC from the WiFi, the iPad's pings
>>>>>     dropped to
>>>>>     >     > an average of 15ms (about 30s to a minute after the PC was
>>>>>     moved to
>>>>>     >     > another AP).
>>>>>
>>>>>     --
>>>>>     Regards,
>>>>>     Oleksij
>>>>>
>>>>>
>>>>
>>>>
>>>> --
>>>> Regards,
>>>> Oleksij
>>>>
>>
>>
>> --
>> Regards,
>> Oleksij
>>
> _______________________________________________
> ath9k-devel mailing list
> ath9k-devel@lists.ath9k.org
> https://lists.ath9k.org/mailman/listinfo/ath9k-devel
_______________________________________________
ath9k-devel mailing list
ath9k-devel@lists.ath9k.org
https://lists.ath9k.org/mailman/listinfo/ath9k-devel

Reply via email to