On 19 maj 2012, at 20:09, Per-Olov Sjvholm wrote:

> On 19 maj 2012, at 17:58, Garry Dolley <gdol...@arpnetworks.com> wrote:
>
>> On Sat, May 19, 2012 at 04:40:08PM +0200, Per-Olov SjC6holm wrote:
>>>
>>>
>>> On 19 maj 2012, at 08:11, Garry Dolley <gdol...@arpnetworks.com> wrote:
>>>
>>>> On Sat, May 19, 2012 at 01:54:54AM +0200, Per-Olov Sjvholm wrote:
>>>>> On 17 maj 2012, at 12:53, Garry Dolley wrote:
>>>>>
>>>>>> On Thu, May 17, 2012 at 03:19:07AM -0700, Garry Dolley wrote:
>>>>>>> On Fri, May 11, 2012 at 09:13:30AM -0400, Simon Perreault wrote:
>>>>>>>> On 2012-05-11 04:15, Garry Dolley wrote:
>>>>>>>>> I now have an amd64 test VM set up, where I installed stock 5.0.
>>>>>>>>>
>>>>>>>>> I ran a lot of traffic over em0 without any timeouts.
>>>>>>>>
>>>>>>>> That's expected. 5.0 has been running without issue for me for a
long
>>>>> time.
>>>>>>>>
>>>>>>>>> I also have been trying several -current kernels.
>>>>>>>>>
>>>>>>>>> As of:
>>>>>>>>>
>>>>>>>>> OpenBSD 5.1-current (GENERIC) #205: Wed Mar 28 21:40:45 MDT 2012
>>>>>>>>>
>>>>>>>>> I don't see any em0 timeouts.
>>>>>>>>>
>>>>>>>>> I will continue to try newer ones and report back here...
>>>>>>>>
>>>>>>>> Why not just test 5.1? Problems have been reported against 5.1, not
>>>>>>>> -current.
>>>>>>>
>>>>>>> I now have a stock 5.1 test VM set up.
>>>>>>>
>>>>>>> OpenBSD 5.1 (GENERIC) #181: Sun Feb 12 09:35:53 MST 2012
>>>>>>>   dera...@amd64.openbsd.org:/usr/src/sys/arch/amd64/compile/GENERIC
>>>>>>>
>>>>>>> I don't see any timeouts.  I grabbed the ports tree via curl several
>>>>>>> times and have been slaving away at it over SSH.  I don't notice
>>>>>>> anything wrong.
>>>>>>>
>>>>>>> So, perhaps this issue does not appear in stock 5.1, but in a newer
>>>>>>> kernel.  I'll try something newer soon...
>>>>>>
>>>>>> I have tried the following newer kernels:
>>>>>>
>>>>>> bsd.20120330
>>>>>> bsd.20120419
>>>>>> bsd.20120427
>>>>>> bsd.20120516
>>>>>>
>>>>>> I still can't reproduce the problem.
>>>>>>
>>>>>> I have disabled mpbios on all these kernels, forgot to mention that.
>>>>>>
>>>>>> I will leave this be for now; will pick it up again if any new
>>>>>> information should arise.
>>>>>>
>>>>>> --
>>>>>> Garry Dolley
>>>>>> ARP Networks, Inc. | http://www.arpnetworks.com | (818) 206-0181
>>>>>> Data center, VPS, and IP Transit solutions
>>>>>> Member Los Angeles County REACT, Unit 336 | WQGK336
>>>>>> Blog http://scie.nti.st
>>>>>>
>>>>>
>>>>>
>>>>> I have a running 4.9 release + patches ( i.e 4.9 stable) working
perfect.
> When
>>>>> Updated to 5.1 release + patches I have real problems with watchdog
> timeout
>>>>> resets on my intel nic:s. Same hardware, but just different OpenBSD
> version.
>>>>>
>>>>> I have tried a bunch of kernels from Stuart Henderson (Broken after
> 4.9.....).
>>>>> I have also recompiled the 5.1 stable kernel with most  versions of the
>>>>> if_em.c driver. I have compiled and tried the following...
>>>>> (note that the userland was 5.1 stable with all kernel tests)
>>>>>
>>>>> bsd-5.1-stable
>>>>> bsd-5.1-stable_plus_if_em.c-1.249
>>>>> bsd-5.1-stable_plus_if_em.c-1.250
>>>>> bsd-5.1-stable_plus_if_em.c-1.251
>>>>> bsd-5.1-stable_plus_if_em.c-1.252
>>>>> bsd-5.1-stable_plus_if_em.c-1.253
>>>>> bsd-5.1-stable_plus_if_em.c-1.254
>>>>> bsd-5.1-stable_plus_if_em.c-1.263
>>>>>
>>>>> Watchdog timeout resets on all versions.....
>>>>>
>>>>> NOTE that the Watchdog timeout reset appears in version 1.249 of
if_em.c
> as
>>>>> well. And that version is default in 4.9 stable which works fantastic.
So
> if I
>>>>> haven't done anything totally wrong it must be related to something
else
> in
>>>>> the kernel. So.... my nic hardware and the kvm bios is the same. And an
>>>>> if_em.c version that works in 4.9 is tried. ????????
>>>>>
>>>>>
>>>>> I can see above that you got rid of the problem by testing the same
> version as
>>>>> me.. But you use AMD and I use i386.
>>>>> Also... I have a firewall with 2 nic:s. Often ONE nic works but the
> other
>>>>> gives watchdog timeout resets and wont work.
>>>>>
>>>>> Any clues?
>>>>
>>>> I don't have any clues.  I wasn't able to reproduce the problem,
>>>> even though one customer I have who also upgraded experienced this
>>>> behavior.  They did not do a fresh install (that I'm aware), but
>>>> upgraded (similar to you).  I'm not sure what the previous version
>>>> was.  They have one NIC and I believe run amd64.
>>>>
>>>> The only difference that I can see is that on a fresh 5.1 install,
>>>> there is no issue.  But if you upgrade from a previous release, then
>>>> the issue *might* appear.
>>>>
>>>> --
>>>> Garry Dolley
>>>> ARP Networks, Inc. | http://www.arpnetworks.com | (818) 206-0181
>>>> Data center, VPS, and IP Transit solutions
>>>> Member Los Angeles County REACT, Unit 336 | WQGK336
>>>> Blog http://scie.nti.st
>>>>
>>>
>>> I have a fresh 5.1 rel plus stable patches. No upgrade...
>>
>> What happened before you applied the stable patches?  On the fresh
>> 5.1 release without any changes, that is...
>>
>> --
>> Garry Dolley
>> ARP Networks, Inc. | http://www.arpnetworks.com | (818) 206-0181
>> Data center, VPS, and IP Transit solutions
>> Member Los Angeles County REACT, Unit 336 | WQGK336
>> Blog http://scie.nti.st
>
> That i have not tried..
>
> Per-Olov
>


Problem seems to be found....

A change of  "int   intr_shared_edge;" to "int   intr_shared_edge = 1;" in
i386/machdep.c  plus kernel recompile solves the problem.

This seems to have changed between the working and non working kernels...
http://www.openbsd.org/cgi-bin/cvsweb/src/sys/arch/i386/i386/machdep.c.diff?f
=h&r1=text&tr1=1.487&r2=text&tr2=1.506




Per-Olov

Reply via email to