On Sat, Apr 4, 2015, at 06:02 PM, Adam Wolk wrote:
> On Sat, Apr 4, 2015, at 03:36 PM, Adam Wolk wrote:
> > On Sat, Apr 4, 2015, at 03:19 PM, Adam Wolk wrote:
> > > > > You mentioned earlier some watchdog timeout.  Do you know if you 
> > > > > always
> > > > > see one when the pool corruption triggers?  You can type "dmesg" at 
> > > > > the
> > > > > ddb prompt to check if there's any weird message before the panic.
> > > > > 
> > > > 
> > > > No. I only see the watchdog timeout if I don't try to start netstart by
> > > > hand.
> > > > If it's only started in the boot process due to the existence of
> > > > /etc/hostname.re
> > > > then after a while I see a blue watchdog timeout for re0 but it does not
> > > > result
> > > > in the kernel panicking.
> > > > 
> > > > If I start it manually then I immediately get a kernel panic. By
> > > > manually I mean:
> > > >  - init tries to netstart, passes without crash but doesn't get an
> > > >  address
> > > >  - I run sh /etc/netstart immediately after logging in
> > > > 
> > > > I just did that again to check dmesg from ddb. There's no watchdog entry
> > > > there
> > > > in this case. Just last boot message (WARNING: / was not properly
> > > > unmounted as I
> > > > had a hard crash) followed by the ddb 'panic' entry.
> > > > 
> > > > I can reproduce this issue every time so can help with debugging it with
> > > > some
> > > > newbie guidance ;)
> > > > 
> > > 
> > > One more thing. It's worth to note that this only happens when re0 is
> > > set to dhcp.
> > > If I manually assign an address the kernel doesn't panic and once I even
> > > had proper
> > > connectivity with manual ip setup.
> > > 
> > > Regards,
> > > Adam
> > > 
> > 
> > Ok I think I found a really interesting thing with this. I removed
> > /etc/hostname.re0
> > booted up and did the test again.
> > 
> > # echo dhcp > /etc/hotname.re0
> > # sh /etc/netstart
> > WARNING: /etc/hostname.re0 is insecure, fixing permisions
> > DHCPDISCOVER on req0 - interval 3
> > DHCPDISCOVER on req0 - interval 8
> > DHCPDISCOVER on req0 - interval 15
> > DHCPDISCOVER on req0 - interval 10
> > DHCPDISCOVER on req0 - interval 15
> > DHCPDISCOVER on req0 - interval 10
> > No acceptable DHCPOFFERS received.
> > No working leases in persistent database - sleeping.
> > # sh /etc/netstart
> > panic: pool_do_get: mc12k free list modified: page
> > 0xffffff00a569d000;......
> > SNIP
> > 
> > So as you can see. The issue is determined by the second run of netstart
> > and only
> > when it's set to dhcp. Could it be the dhclient holding on to something?
> > ddb ps
> > shows that two dhclient's are running.
> > 
> > PS.
> > attached the acpidump.
> > 
> > Regards,
> > Adam
> > Email had 1 attachment:
> > + acpidump.tar.gz
> >   1k (application/x-tar-gz)
> 
> Final data point. I now believe that the 'freezes' I'm experiencing are
> actually
> caused by the watchdog tmeout for re0 (even with manual inet config).
> Tried several times, each time the "re0: watchdog timedout" is present
> in /var/log/messages as the last entry before reboot.
> 
> Though I am not dropped into  ddb session, the system becomes just
> unresponsive
> forcing me into a hard reboot.
> 
> I also can't reliably ping my gate with the re0 driver. I managed to
> have it
> working only once. The router sees the box but the box is unable to ping
> the router ('host down').
> 

I can now confirm that the issues are caused by the re0 driver. When
/etc/hostname.re0 doesn't exist the 'freezes' don't happen. I have
the box running for 52 minutes now with bsd.mp when normally it
hangs after the first timeout (somewhere between 5-15m - did not
measure).

With the re0 driver up:
 - kernel panic for dhcp when ran more then once
 - system freezes for static ip
 - system freezes for dhcp when ran only once
 - almost impossible to achieve connectivity with static ip (happened
   only once) the driver keeps reporting that the gateway host is down
   when I try to ping it (incomplete arp entry)

I would appreciate if anyone has some input/ideas on this. Internet
connectivity is the last (serious) problem I have with this box and
the wifi card doesn't seem to be supported at all.

Regards,
Adam

Reply via email to