Re: advice sought: workflow with -CURRENT and amd GPU [Re: -CURRENT hangs since at least 2022-04-04]

2022-04-19 Thread Paul Mather
On Apr 19, 2022, at 4:49 AM, Michael Schuster wrote:

> Hi,
> 
> I'm hijacking and re-purposing the previous thread, I hope that's OK
> (I did change the subject ;-)) - I'm keeping some of the previous
> contents for reference.
> 
> I have similar HW to OP (Ryzen 7 4700 w. Renoir Graphics), and have
> been using a similar approach to keep the machine up to date - or so I
> suspect. Still, after a while (several months), I end up with one or
> more of these:
> - I get some sort of panic in DRM (at startup or, currently, at shutdown)
> - when I boot into a previous BE to attempt a fix and then reboot
>   into the current one again, I get tons of messages like this:
>   "... kernel: KLD iic.ko: depends on kernel - not available or version mismatch
>    ... kernel: linker_load_file: /boot/kernel/iic.ko - unsupported file type"
>   and the computer refuses to accept input (let alone start X)
> 
> and some others I don't recall right now.
> 
> Before I ask for advice (see below), let me explain the approaches
> I've taken so far. I install with ZFS from the beginning, current boot
> env is "N". These are outlines, not exact commands:
> 
> I) never touch the current BE, always update a new one:
>  1) given current BE N, I create a new BE N+1 and mount it on /mnt,
>  2) 'cd /usr/src; git pull; sudo make DESTDIR=/mnt ... (build, install, etc)'
>  3) 'cd /usr/ports/graphics/drm-devel-kmod; sudo make DESTDIR=/mnt install'
>  4) beadm activate BE N+1; reboot
> 
> II) keep a "new" BE as backup/fallback, update current BE:
>  1) given current BE N, I create a new BE N+1 (mounting not required)
> (this is the intended 'fallback')
>  2) 'cd /usr/src; git pull'; then "make" as described in the Handbook
> "24.6. Updating FreeBSD from Source"
>  3) 'cd /usr/ports/graphics/drm-devel-kmod; sudo make install'
>  4) reboot
> 
> In both scenarios, I do "pkg update; pkg upgrade" from time to
> time (also following the respective approach shown above).
> 
> I suspect that I'm missing something fundamental in my approaches -
> does anyone have a (for them) foolproof approach along these lines, or
> can someone show me what I'm missing in either of mine (in private, if
> you prefer)?


I don't know whether you're missing anything, but I wanted to mention a
tool I recently found useful for my own BE-based source upgrades:
/usr/src/tools/build/beinstall.sh

It helps with the build/upgrade steps.  See the beinstall(8) manual page
for details.

Cheers,

Paul.




advice sought: workflow with -CURRENT and amd GPU [Re: -CURRENT hangs since at least 2022-04-04]

2022-04-19 Thread Michael Schuster
Hi,

I'm hijacking and re-purposing the previous thread, I hope that's OK
(I did change the subject ;-)) - I'm keeping some of the previous
contents for reference.

I have similar HW to OP (Ryzen 7 4700 w. Renoir Graphics), and have
been using a similar approach to keep the machine up to date - or so I
suspect. Still, after a while (several months), I end up with one or
more of these:
- I get some sort of panic in DRM (at startup or, currently, at shutdown)
- when I boot into a previous BE to attempt a fix and then reboot
  into the current one again, I get tons of messages like this:
  "... kernel: KLD iic.ko: depends on kernel - not available or version mismatch
   ... kernel: linker_load_file: /boot/kernel/iic.ko - unsupported file type"
  and the computer refuses to accept input (let alone start X)

and some others I don't recall right now.
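
To see which modules trip the "version mismatch" check, the log lines quoted above can be filtered with a small helper. This is only a sketch: the function name `mismatched_klds` is made up for illustration, and it assumes the messages look exactly like the two lines quoted in this mail.

```sh
#!/bin/sh
# Hypothetical helper: read kernel log lines on stdin and print the name of
# every module reported with "version mismatch", one per line, no duplicates.
mismatched_klds() {
    sed -n 's/.*KLD \([^:]*\): depends on .*version mismatch.*/\1/p' | sort -u
}

# Possible use on a FreeBSD box (the log path is an assumption):
#   mismatched_klds < /var/log/messages
```

A module that shows up here was built for a different kernel than the one that is running, which is what one would expect if /boot/kernel of one BE is combined with the kernel of another.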

Before I ask for advice (see below), let me explain the approaches
I've taken so far. I install with ZFS from the beginning, current boot
env is "N". These are outlines, not exact commands:

I) never touch the current BE, always update a new one:
  1) given current BE N, I create a new BE N+1 and mount it on /mnt,
  2) 'cd /usr/src; git pull; sudo make DESTDIR=/mnt ... (build, install, etc)'
  3) 'cd /usr/ports/graphics/drm-devel-kmod; sudo make DESTDIR=/mnt install'
  4) beadm activate BE N+1; reboot

II) keep a "new" BE as backup/fallback, update current BE:
  1) given current BE N, I create a new BE N+1 (mounting not required)
(this is the intended 'fallback')
  2) 'cd /usr/src; git pull'; then "make" as described in the Handbook
"24.6. Updating FreeBSD from Source"
  3) 'cd /usr/ports/graphics/drm-devel-kmod; sudo make install'
  4) reboot

In both scenarios, I do "pkg update; pkg upgrade" from time to
time (also following the respective approach shown above).

I suspect that I'm missing something fundamental in my approaches -
does anyone have a (for them) foolproof approach along these lines, or
can someone show me what I'm missing in either of mine (in private, if
you prefer)?

TIA for all and any advice
Michael
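
For reference, approach I above could be written down roughly as follows. This is a sketch, not a tested recipe: it only prints the commands instead of running them, it uses bectl(8) where the outline uses beadm, the function name `plan_be_update` is made up, and the make targets are the usual buildworld/buildkernel/installkernel/installworld ones from the Handbook procedure. Merging configuration files and the drm-devel-kmod step are left out.

```sh
#!/bin/sh
# Hypothetical helper: print (do not run) the steps for updating a fresh BE.
# $1 = name of the new boot environment, $2 = mount point (default /mnt).
plan_be_update() {
    be="$1"
    mnt="${2:-/mnt}"
    cat <<EOF
bectl create ${be}
bectl mount ${be} ${mnt}
cd /usr/src && git pull
make -C /usr/src buildworld buildkernel
make -C /usr/src installkernel DESTDIR=${mnt}
make -C /usr/src installworld DESTDIR=${mnt}
bectl umount ${be}
bectl activate ${be}
EOF
}

# Print the plan for a BE called "N+1" mounted on /mnt.
plan_be_update "N+1" /mnt
```

Printing the plan first makes it easy to check that every install step really carries DESTDIR before anything touches the running BE.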

On Mon, Apr 18, 2022 at 9:33 PM Pete Wright wrote:
>
>
>
> On 4/18/22 12:23, filis+fbsdcurr...@filis.org wrote:
> > Hi,
> >
> > I'm running -CURRENT on this one desktop box which is a "Ryzen 7 4800U
> > with Radeon Graphics", since it didn't work on 13R.
> > I use Boot environments and on 2022-04-04 I updated it and it started
> > to completely freeze under X (I haven't tried letting it run without
> > X) after a few dozen minutes.
> [...]
>
>
> After updating your CURRENT environment did you rebuild the drm-kmod
> package?  that's usually required as the LKPI is much more of a moving
> target on that branch compared to STABLE or RELEASE.  i have a pretty
> much identical setup and building/installing drm-devel-kmod has been
> working flawlessly for quite a while.
>
> after building/installing my latest world i do the following (this is from a
> local script i use when rebuilding):
>
> cd $PORTS/graphics/drm-devel-kmod
> sudo pkg unlock -y drm-devel-kmod
> sudo make package
> sudo pkg upgrade -y work/pkg/*.pkg
> sudo pkg lock -y drm-devel-kmod
>
> -pete
>
> --
> Pete Wright
> p...@nomadlogic.org
> @nomadlogicLA
>
>


-- 
Michael Schuster
http://recursiveramblings.wordpress.com/
recursion, n: see 'recursion'



Re: -CURRENT hangs since at least 2022-04-04

2022-04-19 Thread Evilham

On Mon, Apr 18 2022, Pete Wright wrote:

> On 4/18/22 12:23, filis+fbsdcurr...@filis.org wrote:
> > Hi,
> >
> > I'm running -CURRENT on this one desktop box which is a "Ryzen 7 4800U
> > with Radeon Graphics", since it didn't work on 13R.
> > I use Boot environments and on 2022-04-04 I updated it and it started
> > to completely freeze under X (I haven't tried letting it run without
> > X) after a few dozen minutes.
> > I went on vacation and came back today and updated it again to see if
> > the issue went away, but it froze again. I went back to the latest BE
> > before 2022-04-04, which is from 2022-03-21 and so far it works fine
> > again. I use a different machine to build and then rsync /usr/src and
> > /usr/obj over and run make installworld, etc locally and also pkg
> > upgrade (I use FreeBSD -latest packages) everything, so I can't quite
> > tell if this is related to base or drm-kmod and I'm not too familiar
> > with changes in the timeframe between 2022-03-21 and 2022-04-04 that
> > would affect my setup.
> > Is there anything I can try and/or find or collect info to shed more
> > light on this?
>
> After updating your CURRENT environment did you rebuild the drm-kmod
> package?  that's usually required as the LKPI is much more of a moving
> target on that branch compared to STABLE or RELEASE.  i have a pretty
> much identical setup and building/installing drm-devel-kmod has been
> working flawlessly for quite a while.
>
> after building/installing my latest world i do the following (this is
> from a local script i use when rebuilding):
>
> cd $PORTS/graphics/drm-devel-kmod
> sudo pkg unlock -y drm-devel-kmod
> sudo make package
> sudo pkg upgrade -y work/pkg/*.pkg
> sudo pkg lock -y drm-devel-kmod
>
> -pete


I too have recently noticed some freezes after a few hours on -CURRENT
that were not happening before. This is with a matching drm-devel-kmod
package (built from matching source on a matching kernel).

The hardware is an AMD Ryzen 7 PRO 2700U w/ Radeon Vega Mobile Gfx.
--
Evilham
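
A quick sanity check for the "matching" part is to compare the __FreeBSD_version of the running kernel with that of the installed userland, which uname(1) on FreeBSD reports via -K and -U. The helper below is only a sketch with a made-up name (`check_versions`); equal numbers do not prove that a kmod was built from the same sources, they only rule out the most obvious mismatch.

```sh
#!/bin/sh
# Hypothetical helper: compare two __FreeBSD_version values.
# Prints a one-line verdict; returns 0 only when the values are equal.
check_versions() {
    kernel_ver="$1"
    user_ver="$2"
    if [ "$kernel_ver" = "$user_ver" ]; then
        echo "match: kernel and userland are both at $kernel_ver"
        return 0
    fi
    echo "mismatch: kernel=$kernel_ver userland=$user_ver"
    return 1
}

# On FreeBSD the two values would come from uname(1):
#   check_versions "$(uname -K)" "$(uname -U)"
```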



Re: -CURRENT hangs since at least 2022-04-04

2022-04-18 Thread Pete Wright




On 4/18/22 12:23, filis+fbsdcurr...@filis.org wrote:

Hi,

I'm running -CURRENT on this one desktop box which is a "Ryzen 7 4800U 
with Radeon Graphics", since it didn't work on 13R.
I use Boot environments and on 2022-04-04 I updated it and it started 
to completely freeze under X (I haven't tried letting it run without 
X) after a few dozen minutes.
I went on vacation and came back today and updated it again to see if 
the issue went away, but it froze again. I went back to the latest BE 
before 2022-04-04, which is from 2022-03-21 and so far it works fine 
again. I use a different machine to build and then rsync /usr/src and 
/usr/obj over and run make installworld, etc locally and also pkg 
upgrade (I use FreeBSD -latest packages) everything, so I can't quite 
tell if this is related to base or drm-kmod and I'm not too familiar 
with changes in the timeframe between 2022-03-21 and 2022-04-04 that 
would affect my setup.
Is there anything I can try and/or find or collect info to shed more 
light on this?




After updating your CURRENT environment did you rebuild the drm-kmod 
package?  that's usually required as the LKPI is much more of a moving 
target on that branch compared to STABLE or RELEASE.  i have a pretty 
much identical setup and building/installing drm-devel-kmod has been 
working flawlessly for quite a while.


after building/installing my latest world i do the following (this is from a
local script i use when rebuilding):


cd $PORTS/graphics/drm-devel-kmod
sudo pkg unlock -y drm-devel-kmod
sudo make package
sudo pkg upgrade -y work/pkg/*.pkg
sudo pkg lock -y drm-devel-kmod

-pete

--
Pete Wright
p...@nomadlogic.org
@nomadlogicLA
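
On the question of what changed between the last working BE (2022-03-21) and the first freezing one (2022-04-04): one way to narrow it down is to list the commits in that window that touch the LinuxKPI code in the src tree. The sketch below only prints a git command; the function name `changes_between_cmd` is made up, and the default path sys/compat/linuxkpi and source directory /usr/src are assumptions about where to look, based on the LKPI remark above.

```sh
#!/bin/sh
# Hypothetical helper: print a git command that lists commits touching a
# path between two dates. Arguments: since, until, [path], [source dir].
changes_between_cmd() {
    since="$1"
    until_date="$2"
    path="${3:-sys/compat/linuxkpi}"
    src="${4:-/usr/src}"
    echo "git -C $src log --oneline --since=$since --until=$until_date -- $path"
}

# Print the command for the window mentioned in the report.
changes_between_cmd 2022-03-21 2022-04-04
```

The same window can be checked for any other path by passing it as the third argument.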




Re: Current hangs in reboot

2002-12-20 Thread Taavi Talvik
On Wed, 11 Dec 2002, Taavi Talvik wrote:

After testing, it turns out that the 'hang on reboot' problem may be related
to the mainboard (Gigabyte GA-8PE667 Ultra) and/or the USB driver for the
Intel 82801DB.

After turning off onboard USB, rebooting works even if ACPI is enabled.
It was necessary to disable only the Intel 82801DB (ICH4) devices.

kernel: uhci0: Intel 82801DB (ICH4) USB controller USB-A port 0xb800-0xb81f irq 14 at device 29.0 on pci0
kernel: usb0: Intel 82801DB (ICH4) USB controller USB-A on uhci0
kernel: usb0: USB revision 1.0
kernel: uhub0: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
kernel: uhub0: 2 ports with 2 removable, self powered
kernel: uhci1: Intel 82801DB (ICH4) USB controller USB-B port 0xb000-0xb01f irq 14 at device 29.1 on pci0
kernel: usb1: Intel 82801DB (ICH4) USB controller USB-B on uhci1
kernel: usb1: USB revision 1.0
kernel: uhub1: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
kernel: uhub1: 2 ports with 2 removable, self powered
kernel: uhci2: Intel 82801DB (ICH4) USB controller USB-C port 0xb400-0xb41f irq 15 at device 29.2 on pci0
kernel: usb2: Intel 82801DB (ICH4) USB controller USB-C on uhci2
kernel: usb2: USB revision 1.0
kernel: uhub2: Intel UHCI root hub, class 9/0, rev 1.00/1.00, addr 1
kernel: uhub2: 2 ports with 2 removable, self powered
kernel: pci0: serial bus, USB at device 29.7 (no driver attached)

kernel: ohci0: NEC uPD 9210 USB controller mem 0xdf00-0xdf000fff irq 14 at device 7.0 on pci2
kernel: usb3: OHCI version 1.0
kernel: usb3: NEC uPD 9210 USB controller on ohci0
kernel: usb3: USB revision 1.0
kernel: uhub3: NEC OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
kernel: uhub3: 3 ports with 3 removable, self powered
kernel: ohci1: NEC uPD 9210 USB controller mem 0xdf001000-0xdf001fff irq 15 at device 7.1 on pci2
kernel: usb4: OHCI version 1.0
kernel: usb4: NEC uPD 9210 USB controller on ohci1
kernel: usb4: USB revision 1.0
kernel: uhub4: NEC OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
kernel: uhub4: 2 ports with 2 removable, self powered


best regards,
taavi
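
To pick out just the host controllers from boot messages like the ones above (for deciding which ones to disable in the BIOS), something like the following could be used. It is a sketch with a made-up function name (`usb_controllers`) and assumes the lines have the "kernel: uhciN: ... USB controller ..." shape shown in this mail.

```sh
#!/bin/sh
# Hypothetical helper: read boot messages on stdin and print one line per
# UHCI/OHCI host controller in the form "<driver unit> <chip description>".
usb_controllers() {
    sed -n 's/^kernel: \([uo]hci[0-9]*\): \(.*\) USB controller.*/\1 \2/p'
}

# Possible use (the log path is an assumption):
#   usb_controllers < /var/log/messages
```

The usbN and uhubN lines are skipped on purpose, since they describe buses and hubs hanging off the controllers rather than the controllers themselves.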

 On Wed, 11 Dec 2002, Nate Lawson wrote:
  On Wed, 11 Dec 2002, Taavi Talvik wrote:
   On Wed, 11 Dec 2002, Taavi Talvik wrote:
    Current as of yesterday (actually the problem has persisted for about
    a month) hangs on reboot.

    Only message:

    Waiting (max 60 seconds) for system process `vnlru' to stop...stopped

   Turning off ACPI makes it reboot. Thanks for the suggestion, Gerrit!

   But how is ACPI related to the problem? Why does rebooting hang between
   stopping `vnlru' and `bufdaemon' in the ACPI case?

  The next message you'd see if it was working is Stopping ACPI.  So acpi
  shutdown is probably what is hanging.


 No, Stopping ACPI normally appears only after stopping `syncer' as well. But
 in my situation the hang happens between `vnlru' and `bufdaemon'.

 If console output is not buffered, why are the last two prompts not
 displayed? Any hints on how I can debug it further?

 Waiting (max 60 seconds) for system process `vnlru' to stop...stopped
 Waiting (max 60 seconds) for system process `bufdaemon' to stop...stopped
 Waiting (max 60 seconds) for system process `syncer' to stop...stopped

 syncing disks, buffers remaining... 2 2
 done
 Uptime: 9m6s


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Re: Current hangs in reboot

2002-12-11 Thread Taavi Talvik
On Wed, 11 Dec 2002, Taavi Talvik wrote:

 Current as of yesterday (actually the problem has persisted for about
 a month) hangs on reboot.

 Only message:

 Waiting (max 60 seconds) for system process `vnlru' to stop...stopped

Turning off ACPI makes it reboot. Thanks for the suggestion, Gerrit!

But how is ACPI related to the problem? Why does rebooting hang between
stopping `vnlru' and `bufdaemon' in the ACPI case?

best regards,
taavi





Re: Current hangs in reboot

2002-12-11 Thread Gerrit Kühn
On Wed, Dec 11, 2002 at 03:35:52PM +0200, Taavi Talvik wrote:

  Waiting (max 60 seconds) for system process `vnlru' to stop...stopped

 Turning off ACPI makes it reboot. Thanks for the suggestion, Gerrit!

You're welcome. I just noticed I forgot to include the mailing list in
my answers to you. :)

 But how is ACPI related to the problem? Why does rebooting hang between
 stopping `vnlru' and `bufdaemon' in the ACPI case?

I'd like to know that, too. :-)


cu
  Gerrit
-- 




Re: Current hangs in reboot

2002-12-11 Thread Nate Lawson
On Wed, 11 Dec 2002, Taavi Talvik wrote:
 On Wed, 11 Dec 2002, Taavi Talvik wrote:
  Current as of yesterday (actually the problem has persisted for about
  a month) hangs on reboot.

  Only message:

  Waiting (max 60 seconds) for system process `vnlru' to stop...stopped

 Turning off ACPI makes it reboot. Thanks for the suggestion, Gerrit!

 But how is ACPI related to the problem? Why does rebooting hang between
 stopping `vnlru' and `bufdaemon' in the ACPI case?

The next message you'd see if it was working is Stopping ACPI.  So acpi
shutdown is probably what is hanging.

-Nate





Re: Current hangs in reboot

2002-12-11 Thread Taavi Talvik
On Wed, 11 Dec 2002, Nate Lawson wrote:

 On Wed, 11 Dec 2002, Taavi Talvik wrote:
  On Wed, 11 Dec 2002, Taavi Talvik wrote:
   Current as of yesterday (actually the problem has persisted for about
   a month) hangs on reboot.

   Only message:

   Waiting (max 60 seconds) for system process `vnlru' to stop...stopped

  Turning off ACPI makes it reboot. Thanks for the suggestion, Gerrit!

  But how is ACPI related to the problem? Why does rebooting hang between
  stopping `vnlru' and `bufdaemon' in the ACPI case?

 The next message you'd see if it was working is Stopping ACPI.  So acpi
 shutdown is probably what is hanging.


No, Stopping ACPI normally appears only after stopping `syncer' as well. But
in my situation the hang happens between `vnlru' and `bufdaemon'.

If console output is not buffered, why are the last two prompts not
displayed? Any hints on how I can debug it further?

Waiting (max 60 seconds) for system process `vnlru' to stop...stopped
Waiting (max 60 seconds) for system process `bufdaemon' to stop...stopped
Waiting (max 60 seconds) for system process `syncer' to stop...stopped

syncing disks, buffers remaining... 2 2
done
Uptime: 9m6s

best regards,
taavi
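
For comparing a captured console log against the normal sequence above (vnlru, then bufdaemon, then syncer), the last process that was reported as stopped can be pulled out like this. It is a sketch: the function name `last_stopped` is made up, and it assumes the "Waiting ... for system process ... to stop...stopped" wording shown in this mail.

```sh
#!/bin/sh
# Hypothetical helper: read shutdown console output on stdin and print the
# name of the last system process that was reported as stopped.
last_stopped() {
    sed -n 's/.*for system process .\([a-z0-9_]*\). to stop\.\.\.stopped.*/\1/p' | tail -n 1
}

# Possible use with a saved serial console capture (file name is hypothetical):
#   last_stopped < console.log
```

If this prints vnlru, the hang happened before bufdaemon was reported as stopped, which is the situation described here.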





Re: -current hangs with SMP enabled

2002-02-26 Thread Andrew Gallatin


Kenneth D. Merry writes:
   Wow, I didn't actually expect my config would make things work
   differently on your box.  I'm very interested in whatever you
   turn up.
  
  The answer is...the USB code.
  
  [ Nick and Joe CCed ]
  
  If I comment out the following lines in my standard config for this
  machine, SMP works:
  
  # USB support
  device          uhci            # UHCI PCI-USB interface
  device          ohci            # OHCI PCI-USB interface
  device          usb             # USB Bus (required)
  #device         udbp            # USB Double Bulk Pipe devices
...


To add another datapoint, the USB code will prevent some alphas from
rebooting cleanly in -stable.  With USB compiled in, the machine will
not get any ATA interrupts after a warm reboot.  This change happened
somewhere between 4.4 and 4.5.

It doesn't seem to matter whether the machine actually has USB enabled or
not.  E.g., it happens on a UP1000, which has USB hardware but has it
disabled in the firmware:

chip0@pci0:0:0: class=0x06 card=0x chip=0x70061022 rev=0x25 hdr=0x00
    vendor   = 'Advanced Micro Devices (AMD)'
    device   = 'AMD-751 Processor-to-PCI Bridge / Memory Ctrlr'
    class    = bridge
    subclass = HOST-PCI
pcib1@pci0:1:0: class=0x060400 card=0x chip=0x70071022 rev=0x01 hdr=0x01
    vendor   = 'Advanced Micro Devices (AMD)'
    device   = 'AMD-751 AGP and PCI-to-PCI Bridge (1x/2x AGP)'
    class    = bridge
    subclass = PCI-PCI
isab0@pci0:7:0: class=0x060100 card=0x chip=0x153310b9 rev=0xc3 hdr=0x00
    vendor   = 'Acer Labs Incorporated (ALi)'
    device   = 'ALI M1533 Aladdin IV ISA Bridge'
    class    = bridge
    subclass = PCI-ISA
sym0@pci0:8:0: class=0x01 card=0x chip=0x000f1000 rev=0x04 hdr=0x00
    vendor   = 'Symbios Logic (NCR) (LSI Logic)'
    device   = '53C875, 53C876 Ultra-Wide SCSI (53C876 is dual-channel)'
    class    = mass storage
    subclass = SCSI
rp0@pci0:10:0: class=0x078000 card=0x chip=0x000511fe rev=0x04 hdr=0x00
    vendor   = 'Comtrol Corp'
    device   = 'RocketPort 8-port'
    class    = simple comms
xl0@pci0:11:0: class=0x02 card=0x905510b7 chip=0x905510b7 rev=0x64 hdr=0x00
    vendor   = '3COM Corp, Networking Division'
    device   = '3C905B-TX Fast Etherlink 10/100 PCI TX NIC'
    class    = network
    subclass = ethernet
atapci0@pci0:16:0: class=0x0101fa card=0x chip=0x522910b9 rev=0xc1 hdr=0x00
    vendor   = 'Acer Labs Incorporated (ALi)'
    device   = 'M1543 Southbridge EIDE Controller'
    class    = mass storage
    subclass = ATA
chip1@pci0:17:0: class=0x00 card=0x chip=0x710110b9 rev=0x00 hdr=0x00
    vendor   = 'Acer Labs Incorporated (ALi)'
    device   = 'ALI M7101 Power Management Controller'
    class    = old
    subclass = non-VGA display device
none0@pci2:5:0: class=0x03 card=0x0a351048 chip=0x3d07104c rev=0x11 hdr=0x00
    vendor   = 'Texas Instruments (TI)'
    device   = 'TVP4020 AGP Permedia 2'
    class    = display
    subclass = VGA


Cheers,

Drew





Re: -current hangs with SMP enabled

2002-02-25 Thread Andrew Kenneth Milton

+---[ Glenn Gombert ]--
| There is a problem in -Current now between SCSI card(s) and the ATA
| drivers. On my Dell 410 Workstation at work I commented out the ATA driver
| stuff (it was not used on that machine) and it booted fine. I think that
| might just cure the SMP problem you are seeing too.

I can normally boot fine, it's only under load that it hangs.

I think it might actually be the AHC driver, not CAM itself. I don't have 
any other SCSI cards to test that theory though.

-- 
Totally Holistic Enterprises Internet|  | Andrew Milton
The Internet (Aust) Pty Ltd  |  |
ACN: 082 081 472 ABN: 83 082 081 472 |  M:+61 416 022 411   | Carpe Daemon
PO Box 837 Indooroopilly QLD 4068|[EMAIL PROTECTED]| 




Re: -current hangs with SMP enabled

2002-02-25 Thread Kenneth D. Merry

On Sun, Feb 24, 2002 at 23:15:34 -0800, Matthew Dillon wrote:
 
 :...
 : stuff (it was not used on that machine) and it booted fine, I think that
 : might just cure the SMP problem you are seeing too.
 :
 :Thanks for the suggestion.
 :
 :Unfortunately it still hangs with SMP enabled and the ATA drivers commented
 :out of the GENERIC config.
 :
 :Ken
 :-- 
 :Kenneth Merry
 
 Just as a data point, I've been running -current on a 2xCPU SMP
 system (DELL2550) for a few weeks and it's always booted fine.
 
 For the last few months I have noticed occasional freezes occurring
 at odd times long after boot.  I have no idea why it happens.

Your config works!

I've attached the dmesg.  Now the question is, what is the difference
between GENERIC and your config that is causing the problem?

(I suppose it makes sense that your config would work, since I think that
Dell is a Serverworks-based system too.)

I'll see if I can figure it out...

Ken
-- 
Kenneth Merry
[EMAIL PROTECTED]


Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 5.0-CURRENT #0: Mon Feb 25 22:01:01 MST 2002

[EMAIL PROTECTED]:/usr/home/ken/perforce/FreeBSD-ken/src/sys/i386/compile/dilloncfg
Preloaded elf kernel /boot/kernel.dillon/kernel at 0xc03c6000.
Preloaded elf module /boot/kernel.dillon/acpi.ko at 0xc03c60b0.
Timecounter i8254  frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (1266.07-MHz 686-class CPU)
  Origin = GenuineIntel  Id = 0x6b1  Stepping = 1
  Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory  = 2684289024 (2621376K bytes)
avail memory = 2610569216 (2549384K bytes)
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 - irq 0
Programming 16 pins in IOAPIC #1
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): apic id:  0, version: 0x00040011, at 0xfee0
 cpu1 (AP):  apic id:  1, version: 0x00040011, at 0xfee0
 io0 (APIC): apic id:  4, version: 0x000f0011, at 0xfec0
 io1 (APIC): apic id:  5, version: 0x000f0011, at 0xfec01000
Pentium Pro MTRR support enabled
Using $PIR table, 10 entries at 0xc00f52e0
ACPI-0204: *** Error: AcpiLoadTables: Could not load namespace: AE_NOT_FOUND
ACPI-0213: *** Error: AcpiLoadTables: Could not load tables: AE_NOT_FOUND
ACPI: table load failed: AE_NOT_FOUND
npx0: math processor on motherboard
npx0: INT 16 interface
pcib0: ServerWorks host to PCI bridge at pcibus 0 on motherboard
IOAPIC #1 intpin 10 - irq 2
IOAPIC #1 intpin 11 - irq 5
IOAPIC #1 intpin 15 - irq 9
pci0: PCI bus on pcib0
pcib1: PCI-PCI bridge at device 0.1 on pci0
IOAPIC #1 intpin 14 - irq 11
pci1: PCI bus on pcib1
pci1: display, VGA at device 0.0 (no driver attached)
ahc0: Adaptec aic7899 Ultra160 SCSI adapter port 0xd000-0xd0ff mem 0xfeafc000-0xfeafcfff irq 2 at device 5.0 on pci0
aic7899: Ultra160 Wide Channel A, SCSI Id=7, 32/255 SCBs
ahc1: Adaptec aic7899 Ultra160 SCSI adapter port 0xd800-0xd8ff mem 0xfeaff000-0xfeaf irq 5 at device 5.1 on pci0
aic7899: Ultra160 Wide Channel B, SCSI Id=7, 32/255 SCBs
fxp0: Intel Pro 10/100B/100+ Ethernet port 0xd400-0xd43f mem 0xfe90-0xfe9f,0xfeafd000-0xfeafdfff irq 9 at device 6.0 on pci0
fxp0: Ethernet address 00:30:48:21:bb:74
inphy0: i82555 10/100 media interface on miibus0
inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
isab0: PCI-ISA bridge port 0x580-0x58f at device 15.0 on pci0
isa0: ISA bus on isab0
atapci0: ServerWorks ROSB4 ATA33 controller port 0xffa0-0xffaf at device 15.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
pci0: serial bus, USB at device 15.2 (no driver attached)
pcib2: ServerWorks host to PCI bridge at pcibus 2 on motherboard
IOAPIC #1 intpin 8 - irq 16
pci2: PCI bus on pcib2
pci2: network, ethernet at device 2.0 (no driver attached)
ata: ata0 already exists; skipping it
ata: ata1 already exists; skipping it
sc: sc0 already exists; skipping it
vga: vga0 already exists; skipping it
orm0: Option ROMs at iomem 0xd1800-0xd27ff,0xc-0xcbfff on isa0
fdc0: enhanced floppy controller (i82077, NE72065 or clone) at port 0x3f7,0x3f0-0x3f5 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: 1440-KB 3.5 drive on fdc0 drive 0
atkbdc0: Keyboard controller (i8042) at port 0x64,0x60 on isa0
atkbd0: AT Keyboard flags 0x1 irq 1 on atkbdc0
psm0: PS/2 Mouse irq 12 on atkbdc0
psm0: model Generic PS/2 mouse, device ID 0
vga0: Generic ISA VGA at port 0x3c0-0x3df iomem 0xa-0xb on isa0
sc0: System console at flags 0x100 on isa0
sc0: VGA 16 virtual consoles, flags=0x300
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1 at port 0x2f8-0x2ff irq 3 on isa0
sio1: type 16550A
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: Broken MP table detected: 8254 is not connected to IOAPIC #0 intpin 2
APIC_IO: routing 8254 via 8259 and 

Re: -current hangs with SMP enabled

2002-02-25 Thread Matthew Dillon


: Just as a data point, I've been running -current on a 2xCPU SMP
: system (DELL2550) for a few weeks and it's always booted fine.
: 
: For the last few months I have noticed occasional freezes occurring
: at odd times long after boot.  I have no idea why it happens.
:
:Your config works!
:
:I've attached the dmesg.  Now the question is, what is the difference
:between GENERIC and your config that is causing the problem?
:
:(I suppose it makes sense that your config would work, since I think that
:Dell is a Serverworks-based system too.)
:
:I'll see if I can figure it out...
:
:Ken
:-- 
:Kenneth Merry

Wow, I didn't actually expect my config would make things work
differently on your box.  I'm very interested in whatever you
turn up.

-Matt





Re: -current hangs with SMP enabled

2002-02-25 Thread Kenneth D. Merry

On Mon, Feb 25, 2002 at 21:29:44 -0800, Matthew Dillon wrote:
 
 : Just as a data point, I've been running -current on a 2xCPU SMP
 : system (DELL2550) for a few weeks and it's always booted fine.
 : 
 : For the last few months I have noticed occasional freezes occurring
 : at odd times long after boot.  I have no idea why it happens.
 :
 :Your config works!
 :
 :I've attached the dmesg.  Now the question is, what is the difference
 :between GENERIC and your config that is causing the problem?
 :
 :(I suppose it makes sense that your config would work, since I think that
 :Dell is a Serverworks-based system too.)
 :
 :I'll see if I can figure it out...
 :
 :Ken
 :-- 
 :Kenneth Merry
 
 Wow, I didn't actually expect my config would make things work
 differently on your box.  I'm very interested in whatever you
 turn up.

The answer is...the USB code.

[ Nick and Joe CCed ]

If I comment out the following lines in my standard config for this
machine, SMP works:

# USB support
device          uhci            # UHCI PCI-USB interface
device          ohci            # OHCI PCI-USB interface
device          usb             # USB Bus (required)
#device         udbp            # USB Double Bulk Pipe devices
device          ugen            # Generic
device          uhid            # Human Interface Devices
device          ukbd            # Keyboard
device          ulpt            # Printer
device          umass           # Disks/Mass storage - Requires scbus and da
device          ums             # Mouse
device          urio            # Diamond Rio 500 MP3 player
device          uscanner        # Scanners
# USB Ethernet, requires mii
device          aue             # ADMtek USB ethernet
device          cue             # CATC USB ethernet
device          kue             # Kawasaki LSI USB ethernet

The USB module is still loaded when usbd cranks up, but it doesn't hang the
machine when it is loaded as a module.

I've tried two kernels, with the only difference being whether the usb code
(as listed above) is enabled or not.  The kernel with the USB stuff hangs.
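
For anyone who wants to repeat the experiment, commenting out those lines could be scripted roughly as follows. This is a sketch rather than the procedure used here: the function name `disable_usb_devices` is made up, the device list is copied from the config fragment above, and it relies on sed's -E (extended regex) option.

```sh
#!/bin/sh
# Hypothetical helper: read a kernel config on stdin and comment out the
# USB "device" lines listed in this mail; everything else passes through.
disable_usb_devices() {
    sed -E 's/^(device[[:space:]]+(uhci|ohci|usb|ugen|uhid|ukbd|ulpt|umass|ums|urio|uscanner|aue|cue|kue)([[:space:]#]|$))/#\1/'
}

# Possible use (file names are hypothetical):
#   disable_usb_devices < MYKERNEL > MYKERNEL.nousb
```

Lines that are already commented out, such as the udbp entry, are left untouched because they do not start with "device".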

I've attached dmesg output from the working boot. 

Ken
-- 
Kenneth Merry
[EMAIL PROTECTED]


Copyright (c) 1992-2002 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 5.0-CURRENT #4: Mon Feb 25 22:23:37 MST 2002

[EMAIL PROTECTED]:/usr/home/ken/perforce/FreeBSD-ken/src/sys/i386/compile/gondolin
Preloaded elf kernel /boot/kernel.gondolin.smp/kernel at 0xc04c7000.
Preloaded elf module /boot/kernel.gondolin.smp/acpi.ko at 0xc04c70b8.
Timecounter i8254  frequency 1193182 Hz
CPU: Pentium III/Pentium III Xeon/Celeron (1266.07-MHz 686-class CPU)
  Origin = GenuineIntel  Id = 0x6b1  Stepping = 1
  Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE>
real memory  = 2684289024 (2621376K bytes)
avail memory = 2609504256 (2548344K bytes)
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 - irq 0
Programming 16 pins in IOAPIC #1
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): apic id:  0, version: 0x00040011, at 0xfee0
 cpu1 (AP):  apic id:  1, version: 0x00040011, at 0xfee0
 io0 (APIC): apic id:  4, version: 0x000f0011, at 0xfec0
 io1 (APIC): apic id:  5, version: 0x000f0011, at 0xfec01000
Pentium Pro MTRR support enabled
Using $PIR table, 10 entries at 0xc00f52e0
ACPI-0204: *** Error: AcpiLoadTables: Could not load namespace: AE_NOT_FOUND
ACPI-0213: *** Error: AcpiLoadTables: Could not load tables: AE_NOT_FOUND
ACPI: table load failed: AE_NOT_FOUND
npx0: math processor on motherboard
npx0: INT 16 interface
pcib0: ServerWorks host to PCI bridge at pcibus 0 on motherboard
IOAPIC #1 intpin 10 - irq 2
IOAPIC #1 intpin 11 - irq 5
IOAPIC #1 intpin 15 - irq 9
pci0: PCI bus on pcib0
pcib1: PCI-PCI bridge at device 0.1 on pci0
IOAPIC #1 intpin 14 - irq 11
pci1: PCI bus on pcib1
pci1: display, VGA at device 0.0 (no driver attached)
ahc0: Adaptec aic7899 Ultra160 SCSI adapter port 0xd000-0xd0ff mem 0xfeafc000-0xfeafcfff irq 2 at device 5.0 on pci0
aic7899: Ultra160 Wide Channel A, SCSI Id=7, 32/255 SCBs
ahc1: Adaptec aic7899 Ultra160 SCSI adapter port 0xd800-0xd8ff mem 0xfeaff000-0xfeaf irq 5 at device 5.1 on pci0
aic7899: Ultra160 Wide Channel B, SCSI Id=7, 32/255 SCBs
fxp0: Intel Pro 10/100B/100+ Ethernet port 0xd400-0xd43f mem 0xfe90-0xfe9f,0xfeafd000-0xfeafdfff irq 9 at device 6.0 on pci0
fxp0: Ethernet address 00:30:48:21:bb:74
inphy0: i82555 10/100 media interface on miibus0
inphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
isab0: PCI-ISA bridge port 0x580-0x58f at device 15.0 on pci0
isa0: ISA bus on isab0
atapci0: ServerWorks ROSB4 ATA33 controller port 0xffa0-0xffaf at device 15.1 on pci0
ata0: at 0x1f0 irq 14 on atapci0
ata1: at 0x170 irq 15 on atapci0
pci0: serial bus, USB at device 15.2 (no driver 

Re: -current hangs with SMP enabled

2002-02-25 Thread John Baldwin


On 26-Feb-02 Kenneth D. Merry wrote:
 On Mon, Feb 25, 2002 at 21:29:44 -0800, Matthew Dillon wrote:
 
 : Just as a data point, I've been running -current on a 2xCPU SMP
 : system (DELL2550) for a few weeks and it's always booted fine.
 : 
 : For the last few months I have noticed occasional freezes occurring
 : at odd times long after boot.  I have no idea why it happens.
 :
 :Your config works!
 :
 :I've attached the dmesg.  Now the question is, what is the difference
 :between GENERIC and your config that is causing the problem?
 :
 :(I suppose it makes sense that your config would work, since I think that
 :Dell is a Serverworks-based system too.)
 :
 :I'll see if I can figure it out...
 :
 :Ken
 :-- 
 :Kenneth Merry
 
 Wow, I didn't actually expect my config would make things work
 differently on your box.  I'm very interested in whatever you
 turn up.
 
 The answer is...the USB code.
 
 [ Nick and Joe CCed ]
 
 If I comment out the following lines in my standard config for this
 machine, SMP works:

Ah yes!  We had a test machine at TWC that came in that we couldn't install 4.5
on (even with the UP kernel on the boot disk) due to getting an interrupt storm
of USB interrupts.
 
# USB support
 device  uhci     # UHCI PCI-USB interface
 device  ohci     # OHCI PCI-USB interface
 device  usb      # USB Bus (required)
#device  udbp     # USB Double Bulk Pipe devices
 device  ugen     # Generic
 device  uhid     # Human Interface Devices
 device  ukbd     # Keyboard
 device  ulpt     # Printer
 device  umass    # Disks/Mass storage - Requires scbus and da
 device  ums      # Mouse
 device  urio     # Diamond Rio 500 MP3 player
 device  uscanner # Scanners
# USB Ethernet, requires mii
 device  aue      # ADMtek USB ethernet
 device  cue      # CATC USB ethernet
 device  kue      # Kawasaki LSI USB ethernet
 
 The USB module is still loaded when usbd cranks up, but it doesn't hang the
 machine when it is loaded as a module.
 
 I've tried two kernels, with the only difference being whether the usb code
 (as listed above) is enabled or not.  The kernel with the USB stuff hangs.
 
 I've attached dmesg output from the working boot. 
 
 Ken
 -- 
 Kenneth Merry
 [EMAIL PROTECTED]

-- 

John Baldwin [EMAIL PROTECTED]  http://www.FreeBSD.org/~jhb/
Power Users Use the Power to Serve!  -  http://www.FreeBSD.org/

To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Re: -current hangs with SMP enabled

2002-02-25 Thread Andrew Kenneth Milton

+---[ Kenneth D. Merry ]--
|
| The answer is...the USB code.

I have also solved my problem, and my answer is SMB code..

When booting an SMP kernel with SMB enabled, trying to access /dev/smb0
returns device not configured (works fine in UP mode).

After removing SMB (and related options) from my kernel config, I got through
a buildworld. I'm going to do a few more to make sure it's not a fluke, but
somehow I think that's it.

So can I take it the smbbus on my mobo is hosed?

-- 
Totally Holistic Enterprises Internet|  | Andrew Milton
The Internet (Aust) Pty Ltd  |  |
ACN: 082 081 472 ABN: 83 082 081 472 |  M:+61 416 022 411   | Carpe Daemon
PO Box 837 Indooroopilly QLD 4068|[EMAIL PROTECTED]| 




Re: -current hangs with SMP enabled

2002-02-24 Thread Alfred Perlstein

* Kenneth D. Merry [EMAIL PROTECTED] [020224 16:56] wrote:
 
 I've got a SMP machine with a Supermicro P3TDE6 motherboard.  (Serverworks
 HE-SL chipset, dual 1.26GHz Pentium III's.)
 
 It boots just fine with a GENERIC -current kernel (sources cvsupped
 yesterday at ~1500 MST), but hangs (at the Waiting 15 seconds for SCSI
 devices to settle message) when SMP and APIC_IO are enabled.
 
 Those two options are the only things different between the broken and
 working GENERIC kernels.
 
 I've attached dmesg output from the stock GENERIC kernel.
 
 Anyone have any ideas on how to get SMP working?

I've had the same problem for a while now.  Best of luck. :)

-Alfred





Re: -current hangs with SMP enabled

2002-02-24 Thread Andrew Kenneth Milton

+---[ Kenneth D. Merry ]--
| 
| I've got a SMP machine with a Supermicro P3TDE6 motherboard.  (Serverworks
| HE-SL chipset, dual 1.26GHz Pentium III's.)
| 
| It boots just fine with a GENERIC -current kernel (sources cvsupped
| yesterday at ~1500 MST), but hangs (at the Waiting 15 seconds for SCSI
| devices to settle message) when SMP and APIC_IO are enabled.

I have a similar problem with -current and SMP.
Heavy I/O on the SCSI system (say a buildworld or even an FSCK) causes a hang 
(no panic). I think interrupts go off and stay off.

This happens with both the onboard and UFW PCI card, both Adaptec with ACPI
on and off (it's happened since SMPng started, I figured it was one of those
things that'd get fixed eventually). Although Matt's latest patches make me
think it might be sooner rather than later, so I'll be crash testing again
next weekend methinks.

ahc0: Adaptec aic7895 Ultra SCSI adapter port 0xd400-0xd4ff mem 
0xedfee000-0xedfeefff irq 10 at device 9.0 on pci0
aic7895C: Ultra Wide Channel A, SCSI Id=7, 32/255 SCBs

ahc1: Adaptec aic7895 Ultra SCSI adapter port 0xd800-0xd8ff mem 
0xedfef000-0xedfe irq 10 at device 9.1 on pci0
aic7895C: Ultra Wide Channel B, SCSI Id=7, 32/255 SCBs

ahc2: Adaptec 2940 Ultra SCSI adapter port 0xd000-0xd0ff mem 0xedfed000-0xedfedfff 
irq 9 at device 15.0 on pci0
aic7880: Ultra Wide Channel A, SCSI Id=7, 16/255 SCBs

I try an SMP kernel about once a month.

-- 
Totally Holistic Enterprises Internet|  | Andrew Milton
The Internet (Aust) Pty Ltd  |  |
ACN: 082 081 472 ABN: 83 082 081 472 |  M:+61 416 022 411   | Carpe Daemon
PO Box 837 Indooroopilly QLD 4068|[EMAIL PROTECTED]| 




Re: -current hangs with SMP enabled

2002-02-24 Thread Glenn Gombert

There is a problem in -Current now between SCSI card(s) and the ATA
drivers. On my Dell 410 Workstation at work I commented out the ATA driver
stuff (it was not used on that machine) and it booted fine; I think that
might just cure the SMP problem you are seeing too.



At 11:15 AM 2/25/2002 +1000, Andrew Kenneth Milton wrote:
+---[ Kenneth D. Merry ]--
| 
| I've got a SMP machine with a Supermicro P3TDE6 motherboard.  (Serverworks
| HE-SL chipset, dual 1.26GHz Pentium III's.)
| 
| It boots just fine with a GENERIC -current kernel (sources cvsupped
| yesterday at ~1500 MST), but hangs (at the Waiting 15 seconds for SCSI
| devices to settle message) when SMP and APIC_IO are enabled.

I have a similar problem with -current and SMP.
Heavy I/O on the SCSI system (say a buildworld or even an FSCK) causes a
hang 
(no panic). I think interrupts go off and stay off.

This happens with both the onboard and UFW PCI card, both Adaptec with ACPI
on and off (it's happened since SMPng started, I figured it was one of those
things that'd get fixed eventually). Although Matt's latest patches make me
think it might be sooner rather than later, so I'll be crash testing again
next weekend methinks.

ahc0: Adaptec aic7895 Ultra SCSI adapter port 0xd400-0xd4ff mem
0xedfee000-0xedfeefff irq 10 at device 9.0 on pci0
aic7895C: Ultra Wide Channel A, SCSI Id=7, 32/255 SCBs

ahc1: Adaptec aic7895 Ultra SCSI adapter port 0xd800-0xd8ff mem
0xedfef000-0xedfe irq 10 at device 9.1 on pci0
aic7895C: Ultra Wide Channel B, SCSI Id=7, 32/255 SCBs

ahc2: Adaptec 2940 Ultra SCSI adapter port 0xd000-0xd0ff mem
0xedfed000-0xedfedfff irq 9 at device 15.0 on pci0
aic7880: Ultra Wide Channel A, SCSI Id=7, 16/255 SCBs

I try an SMP kernel about once a month.

-- 
Totally Holistic Enterprises Internet|  | Andrew Milton
The Internet (Aust) Pty Ltd  |  |
ACN: 082 081 472 ABN: 83 082 081 472 |  M:+61 416 022 411   | Carpe Daemon
PO Box 837 Indooroopilly QLD 4068|[EMAIL PROTECTED]| 


Glenn Gombert
[EMAIL PROTECTED]





Re: -current hangs with SMP enabled

2002-02-24 Thread Kenneth D. Merry

On Sun, Feb 24, 2002 at 21:02:02 -0500, Glenn Gombert wrote:
 There is a problem in -Current now between SCSI card(s) and the ATA
 drivers. On my Dell 410 Workstation at work I commented out the ATA driver
 stuff (it was not used on that machine) and it booted fine; I think that
 might just cure the SMP problem you are seeing too.

Thanks for the suggestion.

Unfortunately it still hangs with SMP enabled and the ATA drivers commented
out of the GENERIC config.

Ken
-- 
Kenneth Merry
[EMAIL PROTECTED]




Re: -current hangs with SMP enabled

2002-02-24 Thread David O'Brien

On Sun, Feb 24, 2002 at 09:02:02PM -0500, Glenn Gombert wrote:
 There is a problem in -Current now between SCSI card(s) and the ATA
 drivers,

For how long has this problem existed?

This is being typed from a dual Athlon system with 5 SCSI busses
(AHC,SYM,ISP), several disks, CD burner; and with two IDE disks + ATAPI
CDROM drive.

FreeBSD dragon.nuxi.com 5.0-CURRENT FreeBSD #238: Fri Feb 22 21:56:02 PST 2002

I would really have noticed if my p0rn IDE disk and my SCSI system disk
were not usable at the same time.




Re: -current hangs with SMP enabled

2002-02-24 Thread Matthew Dillon


:...
: stuff (it was not used on that machine) and it booted fine, I think that
: might just cure the SMP problem you are seeing too.
:
:Thanks for the suggestion.
:
:Unfortunately it still hangs with SMP enabled and the ATA drivers commented
:out of the GENERIC config.
:
:Ken
:-- 
:Kenneth Merry

Just as a data point, I've been running -current on a 2xCPU SMP
system (DELL2550) for a few weeks and it's always booted fine.

For the last few months I have noticed occasional freezes occurring
at odd times long after boot.  I have no idea why it happens.

-Matt

#
# GENERIC -- Generic kernel configuration file for FreeBSD/i386
#
# For more information on this file, please read the handbook section on
# Kernel Configuration Files:
#
#http://www.FreeBSD.org/handbook/kernelconfig-config.html
#
# The handbook is also available locally in /usr/share/doc/handbook
# if you've installed the doc distribution, otherwise always see the
# FreeBSD World Wide Web server (http://www.FreeBSD.org/) for the
# latest information.
#
# An exhaustive list of options and more detailed explanations of the
# device lines is also present in the ./LINT configuration file. If you are
# in doubt as to the purpose or necessity of a line, check first in LINT.
#
# $FreeBSD: src/sys/i386/conf/GENERIC,v 1.260 2000/06/13 22:28:46 peter Exp $

machine         i386
#cpu            I386_CPU
#cpu            I486_CPU
#cpu            I586_CPU
cpu             I686_CPU
ident           DELL2550
maxusers        0

hints           TEST.hints              #Default places to look for devices.

options         SCSI_DELAY=5000
#options        MAXMEM=(64*1024)
options         DDB
options         INVARIANTS
options         INVARIANT_SUPPORT
options         SOFTUPDATES
#options        WITNESS
#options        SHMALL=65537
#options        SHMMAXPGS=65537
#options        GDB_REMOTE_CHAT
options         ALT_BREAK_TO_DEBUGGER

# -Os broken
#
makeoptions     DEBUG=-g                #Build kernel with gdb(1) debug symbols
makeoptions     COPTFLAGS=-O -pipe

#options        MATH_EMULATE            #Support for x87 emulation
options         INET                    #InterNETworking
#options        INET6                   #IPv6 communications protocols
options         FFS                     #Berkeley Fast Filesystem
options         MD_ROOT                 #MD is a potential root device
options         NFSCLIENT               #Network Filesystem
options         NFSSERVER               #Network Filesystem
options         NFS_ROOT                #NFS usable as root device, NFS required
options         MSDOSFS                 #MSDOS Filesystem
options         CD9660                  #ISO 9660 Filesystem
options         PSEUDOFS                #Pseudofs
options         PROCFS                  #Process filesystem
options         COMPAT_43               #Compatible with BSD 4.3 [KEEP THIS!]
#options        SCSI_DELAY=15000        #Delay (in ms) before probing SCSI
options         UCONSOLE                #Allow users to grab the console
options         KTRACE                  #ktrace(1) support
options         SYSVSHM                 #SYSV-style shared memory
options         SYSVMSG                 #SYSV-style message queues
options         SYSVSEM                 #SYSV-style semaphores
#options        P1003_1B                #Posix P1003_1B real-time extensions
#options        _KPOSIX_PRIORITY_SCHEDULING
#options        KBD_INSTALL_CDEV        # install a CDEV entry in /dev

# To make an SMP kernel, the next two are needed
#
options         SMP                     # Symmetric MultiProcessor Kernel
options         APIC_IO                 # Symmetric (APIC) I/O
# Optionally these may need tweaked, (defaults shown):
#options        NCPU=2                  # number of CPUs
#options        NBUS=8                  # number of busses
#options        NAPIC=1                 # number of IO APICs
#options        NINTR=24                # number of INTs

device          isa
#device         eisa
device          pci
#options        COMPAT_OLDISA           # compatability shims for lnc, fe, le
#options        COMPAT_OLDPCI           # compatability shims for lnc, vx

# Floppy drives
device          fdc

# ATA and ATAPI devices
device          ata
device          atadisk                 # ATA disk drives
device          atapicd                 # ATAPI CDROM drives
device          atapifd                 # ATAPI floppy drives
device          atapist                 # ATAPI tape drives
options         ATA_STATIC_ID           #Static device numbering
#options        ATA_ENABLE_ATAPI_DMA    #Enable DMA on ATAPI devices

# SCSI Controllers
#device         ahb                     # EISA AHA1742 family
device          ahc                     # AHA2940 and onboard AIC7xxx devices
#device         amd                     # AMD 53C974 (Teckram DC-390(T))
#device         dpt                     # DPT Smartcache - See LINT for options!
#device isp  

Re: current hangs...

2001-01-21 Thread Poul-Henning Kamp

In message [EMAIL PROTECTED], John Baldwin writes:

On 20-Jan-01 Poul-Henning Kamp wrote:
 In message [EMAIL PROTECTED], John Baldwin writes:

On 20-Jan-01 The Hermit Hacker wrote:
 On Sat, 20 Jan 2001, Mark Murray wrote:
 
 
  on a 2xPII/350, 256M, two scsi disks on ahc, and ccd I have three times
  now hung the machine so that only reset got any attention simply by
 make -j 128 world

 Do you have an easy way to narrow it down to CCD by doing the same
 thing but without ccd involvement?
 
 I don't have CCD, and got home last night from the office and mine was
 hung also, on a kernel from the day before ... being in X, pretty much
 nothing I could do to try and debug it ... new laptop gets in this week,
 so will be setting up the whole serial console debugging env ...

Is it SMP, and does it have multiple SCSI disks hanging off of the same
device?
 
 SMP, one scsi disk on each controller, /usr and /home ccd'ed.

Is there any code dealing with disk I/O in the kernel that does the equivalent
of this:

while (!io_done)
  /* spin */ ;

That assumes an interrupt will set io_done?

Using DELAY() in places might explain this.

Not that I know of.
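
For reference, a minimal userland sketch of the pattern being asked about,
not taken from any kernel source: a busy-wait on a flag that only an
interrupt handler would set, with an iteration bound added so that a lost
interrupt shows up as a timeout instead of a hard hang.  The names
spin_wait() and max_spins are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Poll *io_done up to max_spins times.  Returns 0 as soon as the flag is
 * seen set, or -1 if the bound is reached first.  The flag is volatile
 * because, in the scenario discussed, something outside this loop (an
 * interrupt handler) is expected to set it.
 */
static int
spin_wait(const volatile int *io_done, unsigned long max_spins)
{
	unsigned long i;

	assert(io_done != NULL);
	for (i = 0; i < max_spins; i++) {
		if (*io_done)
			return (0);	/* the "interrupt" has fired */
	}
	return (-1);			/* gave up: flag never set */
}
```

With the bound removed this is the loop quoted above: if the interrupt is
never delivered, the CPU spins forever.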

--
Poul-Henning Kamp   | UNIX since Zilog Zeus 3.20
[EMAIL PROTECTED] | TCP/IP since RFC 956
FreeBSD committer   | BSD since 4.3-tahoe
Never attribute to malice what can adequately be explained by incompetence.





Re: current hangs...

2001-01-20 Thread Mark Murray

 
 on a 2xPII/350, 256M, two scsi disks on ahc, and ccd I have three times
 now hung the machine so that only reset got any attention simply by
   make -j 128 world

Do you have an easy way to narrow it down to CCD by doing the same
thing but without ccd involvement?

M
-- 
Mark Murray
Warning: this .sig is umop ap!sdn





Re: current hangs...

2001-01-20 Thread Poul-Henning Kamp

In message [EMAIL PROTECTED], Mark Murray writes:
 
 on a 2xPII/350, 256M, two scsi disks on ahc, and ccd I have three times
 now hung the machine so that only reset got any attention simply by
  make -j 128 world

Do you have an easy way to narrow it down to CCD by doing the same
thing but without ccd involvement?

No, /usr is on ccd on this one :-(

--
Poul-Henning Kamp   | UNIX since Zilog Zeus 3.20
[EMAIL PROTECTED] | TCP/IP since RFC 956
FreeBSD committer   | BSD since 4.3-tahoe
Never attribute to malice what can adequately be explained by incompetence.





Re: current hangs...

2001-01-20 Thread Mike Meyer

Mark Murray [EMAIL PROTECTED] types:
  on a 2xPII/350, 256M, two scsi disks on ahc, and ccd I have three times
  now hung the machine so that only reset got any attention simply by
  make -j 128 world
 
 Do you have an easy way to narrow it down to CCD by doing the same
 thing but without ccd involvement?

I don't think it's ccd. I get similar hangs with "make -j 20 buildworld"
on a 2xPII-Xeon/450, 256M, two scsi disks on ahc without ccd.

I think more likely it's related to the problems that sos is currently
having/investigating.

mike






Re: current hangs...

2001-01-20 Thread Thomas D. Dean

I have a recent -current that hangs in 'make -j36 world'.

I am now running a UP kernel of the same date:
# uname -a
FreeBSD celebris 5.0-CURRENT FreeBSD 5.0-CURRENT #0: \
  Mon Jan 15 18:42:30 PST 2001 \
  root@celebris:/usr/src/sys/compile/CELEBRIS  i386

dmesg attached.

tomdean

 dmesg from a UP kernel ===
96 blocks, 0.3% fragmentation)
IP Filter: already initialized
de0: enabling BNC port
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 5.0-CURRENT #0: Mon Jan 15 18:42:30 PST 2001
root@celebris:/usr/src/sys/compile/CELEBRIS
Timecounter "i8254"  frequency 1193029 Hz
Timecounter "TSC"  frequency 131996485 Hz
CPU: Pentium/P54C (132.00-MHz 586-class CPU)
  Origin = "GenuineIntel"  Id = 0x525  Stepping = 5
  Features=0x3bf<FPU,VME,DE,PSE,TSC,MSR,MCE,CX8,APIC>
real memory  = 100663296 (98304K bytes)
avail memory = 94597120 (92380K bytes)
Preloaded elf kernel "kernel" at 0xc0334000.
Preloaded userconfig_script "/boot/kernel.conf" at 0xc033409c.
Intel Pentium detected, installing workaround for F00F bug
npx0: math processor on motherboard
npx0: INT 16 interface
pcib0: Host to PCI bridge at pcibus 0 on motherboard
pci0: PCI bus on pcib0
ncr0: ncr 53c810 fast10 scsi port 0xec00-0xecff mem 0xfedfbf00-0xfedfbfff irq 11 at 
device 1.0 on pci0
isab0: PCI-ISA bridge at device 2.0 on pci0
isa0: ISA bus on isab0
pci0: display, VGA at 6.0 (no driver attached)
sis0: NatSemi DP83815 10/100BaseTX port 0xe800-0xe8ff mem 0xfedfa000-0xfedfafff irq 
10 at device 7.0 on pci0
sis0: Ethernet address: 00:a0:cc:74:63:b4
miibus0: MII bus on sis0
ukphy0: Generic IEEE 802.3u media interface on miibus0
ukphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
de0: Digital 21041 Ethernet port 0xe480-0xe4ff mem 0xfedfbe80-0xfedfbeff irq 15 at 
device 8.0 on pci0
de0: DEC DE450-CA 21041 [10Mb/s] pass 1.1
de0: address 00:00:f8:02:76:db
atkbdc0: Keyboard controller (i8042) at port 0x60,0x64 on isa0
atkbd0: AT Keyboard irq 1 on atkbdc0
psm0: PS/2 Mouse irq 12 on atkbdc0
psm0: model Generic PS/2 mouse, device ID 0
fdc0: NEC 72065B or clone at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: 1440-KB 3.5" drive on fdc0 drive 0
ppc0: Parallel port at port 0x378-0x37f irq 7 on isa0
ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
lpt0: Printer on ppbus0
lpt0: Interrupt-driven port
sc0: System console on isa0
sc0: VGA 16 virtual consoles, flags=0x200
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1 at port 0x2f8-0x2ff irq 3 on isa0
sio1: type 16550A
vga0: Generic ISA VGA at port 0x3c0-0x3df iomem 0xa-0xb on isa0
unknown: IBM Enhanced (101/102-key) KC can't assign resources
unknown: Microsoft PS/2 Mouse can't assign resources
unknown: 16550 compatible COM device can't assign resources
unknown: 16550 compatible COM device can't assign resources
unknown: LPT printer port can't assign resources
unknown: Floppy Controller can't assign resources
IP Filter: v3.4.13 initialized.  Default = pass all, Logging = enabled
Waiting 10 seconds for SCSI devices to settle
Mounting root from ufs:/dev/da1s1a
da0 at ncr0 bus 0 target 0 lun 0
da0: QUANTUM FIREBALL1080S 1Q09 Fixed Direct Access SCSI-2 device 
da0: 10.000MB/s transfers (10.000MHz, offset 8)
da0: 1042MB (2134305 512 byte sectors: 255H 63S/T 132C)
cd0 at ncr0 bus 0 target 5 lun 0
cd0: TOSHIBA CD-ROM XM-5401TA 3605 Removable CD-ROM SCSI-2 device 
cd0: 4.237MB/s transfers (4.237MHz, offset 8)
cd0: cd present [326227 x 2048 byte records]
da2 at ncr0 bus 0 target 2 lun 0
da2: QUANTUM FIREBALL ST3.2S 0F0C Fixed Direct Access SCSI-2 device 
da2: 10.000MB/s transfers (10.000MHz, offset 8), Tagged Queueing Enabled
da2: 3090MB (6328861 512 byte sectors: 255H 63S/T 393C)
da1 at ncr0 bus 0 target 1 lun 0
da1: IBM DNES-309170 SAH0 Fixed Direct Access SCSI-3 device 
da1: 10.000MB/s transfers (10.000MHz, offset 8), Tagged Queueing Enabled
da1: 8748MB (17916240 512 byte sectors: 255H 63S/T 1115C)
IP Filter: already initialized
de0: enabling BNC port
stray irq 7
stray irq 7
stray irq 7
stray irq 7
got 5 stray irq 7's: not logging anymore
pid 27080 (communicator-4.7), uid 1001: exited on signal 10 (core dumped)
microuptime() went backwards (272675.344496 - 272675.-694570216)
microuptime() went backwards (287740.708251 - 287740.-694206432)
pid 61804 (m6811-elf-gdb), uid 1001: exited on signal 11 (core dumped)
pid 61824 (m6811-elf-gdb), uid 1001: exited on signal 11 (core dumped)
pid 61834 (m6811-elf-gdb), uid 1001: exited on signal 11 (core dumped)





Re: current hangs...

2001-01-20 Thread The Hermit Hacker

On Sat, 20 Jan 2001, Mark Murray wrote:

 
  on a 2xPII/350, 256M, two scsi disks on ahc, and ccd I have three times
  now hung the machine so that only reset got any attention simply by
  make -j 128 world

 Do you have an easy way to narrow it down to CCD by doing the same
 thing but without ccd involvement?

I don't have CCD, and got home last night from the office and mine was
hung also, on a kernel from the day before ... being in X, pretty much
nothing I could do to try and debug it ... new laptop gets in this week,
so will be setting up the whole serial console debugging env ...







Re: current hangs...

2001-01-20 Thread John Baldwin


On 20-Jan-01 The Hermit Hacker wrote:
 On Sat, 20 Jan 2001, Mark Murray wrote:
 
 
  on a 2xPII/350, 256M, two scsi disks on ahc, and ccd I have three times
  now hung the machine so that only reset got any attention simply by
 make -j 128 world

 Do you have an easy way to narrow it down to CCD by doing the same
 thing but without ccd involvement?
 
 I don't have CCD, and got home last night from the office and mine was
 hung also, on a kernel from the day before ... being in X, pretty much
 nothing I could do to try and debug it ... new laptop gets in this week,
 so will be setting up the whole serial console debugging env ...

Is it SMP, and does it have multiple SCSI disks hanging off of the same device?

-- 

John Baldwin [EMAIL PROTECTED] -- http://www.FreeBSD.org/~jhb/
PGP Key: http://www.baldwin.cx/~john/pgpkey.asc
"Power Users Use the Power to Serve!"  -  http://www.FreeBSD.org/





Re: current hangs...

2001-01-20 Thread Poul-Henning Kamp

In message [EMAIL PROTECTED], John Baldwin writes:

On 20-Jan-01 The Hermit Hacker wrote:
 On Sat, 20 Jan 2001, Mark Murray wrote:
 
 
  on a 2xPII/350, 256M, two scsi disks on ahc, and ccd I have three times
  now hung the machine so that only reset got any attention simply by
 make -j 128 world

 Do you have an easy way to narrow it down to CCD by doing the same
 thing but without ccd involvement?
 
 I don't have CCD, and got home last night from the office and mine was
 hung also, on a kernel from the day before ... being in X, pretty much
 nothing I could do to try and debug it ... new laptop gets in this week,
 so will be setting up the whole serial console debugging env ...

Is it SMP, and does it have multiple SCSI disks hanging off of the same device?

SMP, one scsi disk on each controller, /usr and /home ccd'ed.

dmesg:

Booting [/boot/kernel/kernel]...   
stray irq 7
Copyright (c) 1992-2001 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 5.0-CURRENT #0: Fri Jan 19 23:21:40 CET 2001
[EMAIL PROTECTED]:/usr/src/sys/compile/SMP
Timecounter "i8254"  frequency 1193182 Hz
CPU: Pentium II/Pentium II Xeon/Celeron (333.06-MHz 686-class CPU)
  Origin = "GenuineIntel"  Id = 0x651  Stepping = 1
  
  Features=0x183fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR>
real memory  = 268435456 (262144K bytes)
avail memory = 258048000 (252000K bytes)
Programming 24 pins in IOAPIC #0
IOAPIC #0 intpin 2 - irq 0
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id:  1, version: 0x00040011, at 0xfee0
 cpu1 (AP):  apic id:  0, version: 0x00040011, at 0xfee0
 io0 (APIC): apic id:  2, version: 0x00170011, at 0xfec0
Preloaded elf kernel "kernel" at 0xc031b000.
Pentium Pro MTRR support enabled
Using $PIR table, 9 entries at 0xc00fdf20
npx0: math processor on motherboard
npx0: INT 16 interface
pcib0: Intel 82443LX (440 LX) host to PCI bridge at pcibus 0 on motherboard
pci0: PCI bus on pcib0
pcib1: PCI-PCI bridge at device 1.0 on pci0
pci1: PCI bus on pcib1
isab0: PCI-ISA bridge at device 2.0 on pci0
isa0: ISA bus on isab0
atapci0: Intel PIIX4 ATA33 controller at device 2.1 on pci0
atapci0: Busmastering DMA not supported
pci0: serial bus, USB at 2.2 (no driver attached)
Timecounter "PIIX"  frequency 3579545 Hz
pci0: bridge, PCI-unknown at 2.3 (no driver attached)
fxp0: Intel Pro 10/100B/100+ Ethernet port 0xfcc0-0xfcdf mem 
0xfed0-0xfedf,0xfecdf000-0xfecd irq 19 at device 3.0 on pci0
fxp0: Ethernet address 00:a0:c9:82:bb:30
ahc0: Adaptec aic7895 Ultra SCSI adapter port 0xf800-0xf8ff mem 
0xfecff000-0xfecf irq 17 at device 9.0 on pci0
aic7895: Wide Channel A, SCSI Id=7, 32/255 SCBs
ahc1: Adaptec aic7895 Ultra SCSI adapter port 0xf400-0xf4ff mem 
0xfecfe000-0xfecfefff irq 17 at device 9.1 on pci0
aic7895: Wide Channel B, SCSI Id=7, 32/255 SCBs
pci0: display, VGA at 13.0 (no driver attached)
atkbdc0: Keyboard controller (i8042) at port 0x60,0x64 on isa0
atkbd0: AT Keyboard flags 0x1 irq 1 on atkbdc0
kbd0 at atkbd0
fdc0: NEC 72065B or clone at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: 1440-KB 3.5" drive on fdc0 drive 0
ppc0: Parallel port at port 0x378-0x37f irq 7 on isa0
ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
ppi0: Parallel I/O on ppbus0
plip0: PLIP network interface on ppbus0
lpt0: Printer on ppbus0
lpt0: Interrupt-driven port
sc0: System console at flags 0x100 on isa0
sc0: VGA 16 virtual consoles, flags=0x100
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A, console
sio1: configured irq 3 not in bitmap of probed irqs 0
vga0: Generic ISA VGA at port 0x3c0-0x3df iomem 0xa-0xb on isa0
unknown: PNP0303 can't assign resources
unknown: PNP0501 can't assign resources
unknown: PNP0501 can't assign resources
unknown: PNP0700 can't assign resources
unknown: PNP0400 can't assign resources
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: routing 8254 via IOAPIC #0 intpin 2
Waiting 2 seconds for SCSI devices to settle
Mounting root from ufs:/dev/da0s1a
da0 at ahc0 bus 0 target 0 lun 0
da0: IBM DDRS-34560W S97B Fixed Direct Access SCSI-2 device 
da0: 40.000MB/s transfers (20.000MHz, offset 8, 16bit), Tagged Queueing Enabled
da0: 4357MB (8925000 512 byte sectors: 255H 63S/T 555C)
da1 at ahc1 bus 0 target 0 lun 0
da1: IBM DDRS-34560W S97B Fixed Direct Access SCSI-2 device 
da1: 20.000MB/s transfers (20.000MHz, offset 15), Tagged Queueing Enabled
da1: 4357MB (8925000 512 byte sectors: 255H 63S/T 555C)
WARNING: / was not properly dismounted
SMP: AP CPU #1 Launched!
ccd0-3: Concatenated disk drivers
swapon: adding /dev/da1s1b as swap device
swapon: adding /dev/da0s1b as swap device
Automatic boot in progress...


--
Poul-Henning Kamp   | UNIX since Zilog Zeus 3.20
[EMAIL PROTECTED] | TCP/IP since RFC 956
FreeBSD committer   | BSD 

Re: current hangs...

2001-01-20 Thread The Hermit Hacker

On Sat, 20 Jan 2001, John Baldwin wrote:


 On 20-Jan-01 The Hermit Hacker wrote:
  On Sat, 20 Jan 2001, Mark Murray wrote:
 
  
   on a 2xPII/350, 256M, two scsi disks on ahc, and ccd I have three times
   now hung the machine so that only reset got any attention simply by
  make -j 128 world
 
  Do you have an easy way to narrow it down to CCD by doing the same
  thing but without ccd involvement?
 
  I don't have CCD, and got home last night from the office and mine was
  hung also, on a kernel from the day before ... being in X, pretty much
  nothing I could do to try and debug it ... new laptop gets in this week,
  so will be setting up the whole serial console debugging env ...

 Is it SMP, and does it have multiple SCSI disks hanging off of the same device?

yes and yes ...

ahc1: Adaptec 2940 Ultra SCSI adapter port 0xcc00-0xccff mem 0xef10-0xef100fff 
irq 17 at device 13.0 on pci0
da0 at ahc1 bus 0 target 0 lun 0
da0: SEAGATE ST36530N 1281 Fixed Direct Access SCSI-2 device
da0: 20.000MB/s transfers (20.000MHz, offset 15), Tagged Queueing Enabled
da0: 6208MB (12715920 512 byte sectors: 255H 63S/T 791C)
da1 at ahc1 bus 0 target 1 lun 0
da1: WDIGTL WD91 ULTRA2 1.00 Fixed Direct Access SCSI-2 device
da1: 20.000MB/s transfers (20.000MHz, offset 15), Tagged Queueing Enabled
da1: 8727MB (17873040 512 byte sectors: 255H 63S/T 1112C)








Re: current hangs...

2001-01-20 Thread Matthew Jacob


I'll confirm a hang on both alpha & i386 (both SCSI, qlogic & ahc resp). The
i386 was SMP. The Qlogic wasn't. Both were hung with phk's make -j 256
foolery.






Re: Current hangs...

2001-01-01 Thread Poul-Henning Kamp

In message [EMAIL PROTECTED], Matt Dillon writes:
:
:Why not this:
:
:s = splbio();
:TAILQ_FOREACH(bp, &vp->v_dirtyblkhd, b_vnbufs) {

First rule when making simple bug fixes by copying working code from one
source file to another is:  Don't try to optimize the code on the
fly.

Personally speaking, I don't find the FOREACH macros any more readable
vs an explicit for loop.  They hide too much... like for example the
fact that you are dependent on the current pointer remaining valid to
get the next pointer in the loop.


Is that any different from

for (i = 0; i < N; i++) {
	bla;
}

?

--
Poul-Henning Kamp   | UNIX since Zilog Zeus 3.20
[EMAIL PROTECTED] | TCP/IP since RFC 956
FreeBSD committer   | BSD since 4.3-tahoe
Never attribute to malice what can adequately be explained by incompetence.





Re: Current hangs...

2000-12-31 Thread John Baldwin


On 30-Dec-00 Matt Dillon wrote:
  #include <sys/param.h>
 @@ -352,12 +352,25 @@
   return (0);
  
   /*
 +  * MARK/SCAN initialization to avoid infinite loops
 +  */
 + s = splbio();
 + for (bp = TAILQ_FIRST(&vp->v_dirtyblkhd); bp;
 +     bp = TAILQ_NEXT(bp, b_vnbufs)) {
 +         bp->b_flags &= ~B_SCANNED;
 + }
 + splx(s);

Why not this:

s = splbio();
TAILQ_FOREACH(bp, &vp->v_dirtyblkhd, b_vnbufs) {
	bp->b_flags &= ~B_SCANNED;
}

We do have foreach queue(3) macros. :)

-- 

John Baldwin [EMAIL PROTECTED] -- http://www.FreeBSD.org/~jhb/
PGP Key: http://www.Baldwin.cx/~john/pgpkey.asc
"Power Users Use the Power to Serve!"  -  http://www.FreeBSD.org/





Re: Current hangs...

2000-12-31 Thread Matt Dillon

:
:Why not this:
:
:s = splbio();
:TAILQ_FOREACH(bp, &vp->v_dirtyblkhd, b_vnbufs) {

First rule when making simple bug fixes by copying working code from one
source file to another is:  Don't try to optimize the code on the
fly.

Personally speaking, I don't find the FOREACH macros any more readable
vs an explicit for loop.  They hide too much... like for example the
fact that you are dependent on the current pointer remaining valid to
get the next pointer in the loop.

-Matt






Re: Current hangs...

2000-12-31 Thread John Baldwin


On 01-Jan-01 Matt Dillon wrote:
:
:Why not this:
:
:s = splbio();
:TAILQ_FOREACH(bp, &vp->v_dirtyblkhd, b_vnbufs) {
 
 First rule when making simple bug fixes by copying working code from one
 source file to another is:  Don't try to optimize the code on the
 fly.

That works.

 Personally speaking, I don't find the FOREACH macros any more readable
 vs an explicit for loop.  They hide too much... like for example the
 fact that you are dependant on the current pointer remaining valid to
 get the next pointer in the loop.

Erm, I find one succint line easier to read than something sprawled across 2-3
lines of code.  Also, I was never under the assumption that you could do a
*_FOREACH() loop while the list changed out from under you.  I doubt anyone
else is either. :)

   -Matt

-- 

John Baldwin [EMAIL PROTECTED] -- http://www.FreeBSD.org/~jhb/
PGP Key: http://www.Baldwin.cx/~john/pgpkey.asc
"Power Users Use the Power to Serve!"  -  http://www.FreeBSD.org/





Re: Current hangs...

2000-12-30 Thread Matt Dillon

A bug in specfs's fsync dating back to Kirk's original softupdates work 
(which required a similar mark/scan fix to the FFS fsync) appears to 
have been exposed by recent pageout performance commits I made.

I've committed a mark/scan fix to specfs's fsync, which appears
to solve the lockups Poul was getting doing a 'cvs update -PdA' under
-current.  It should solve the problem for the other two people who
reported the same lockup.

I'm not sure why -stable isn't affected.  The bug is in -stable as well.
I'll MFC it in two days unless I see complaints sooner.  It's a simple
bug fix.

At some point I need to go through all the fsync implementations...
they need the same sort of placemarker fix that I threw into the
pageout daemon scan.  The current code uses the 'goto loop' hack, which
is terribly inefficient when combined with a heavily loaded 
softupdates-enabled system.
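
The mark/scan idea can be sketched in userland C.  This is only an
illustration under stated assumptions, not the committed kernel code:
struct dbuf, F_SCANNED, F_STICKY, try_write() and flush_mark_scan() are
hypothetical names.  F_STICKY models a buffer whose write gets deferred,
so it stays on the dirty list the way a bawrite() turned into a bdwrite()
would.  The sketch keeps the restart-from-the-top structure; the mark is
what guarantees that each buffer is attempted at most once per flush.

```c
#include <assert.h>
#include <stddef.h>
#include <sys/queue.h>

#define F_SCANNED 0x1	/* hypothetical analogue of B_SCANNED */
#define F_STICKY  0x2	/* hypothetical: write is deferred, buffer stays dirty */

struct dbuf {
	int			flags;
	int			writes;	/* write attempts seen by this buffer */
	TAILQ_ENTRY(dbuf)	link;
};
TAILQ_HEAD(dirtylist, dbuf);

/* Pretend write: a normal buffer leaves the dirty list, a sticky one stays. */
static int
try_write(struct dirtylist *head, struct dbuf *bp)
{
	bp->writes++;
	if (bp->flags & F_STICKY)
		return (0);
	TAILQ_REMOVE(head, bp, link);
	return (1);
}

/* Flush the list once; returns the number of write attempts made. */
int
flush_mark_scan(struct dirtylist *head)
{
	struct dbuf *bp;
	int attempts = 0;

	assert(head != NULL);

	/* MARK: clear the scanned flag on every buffer before starting. */
	TAILQ_FOREACH(bp, head, link)
		bp->flags &= ~F_SCANNED;

restart:
	/* SCAN: skip anything already attempted during this flush. */
	TAILQ_FOREACH(bp, head, link) {
		if (bp->flags & F_SCANNED)
			continue;
		bp->flags |= F_SCANNED;
		attempts++;
		try_write(head, bp);
		/* The list may have changed under us, so start over. */
		goto restart;
	}
	return (attempts);
}
```

Without the F_SCANNED test, a sticky buffer at the head of the list would
be picked again after every restart and the loop would never finish.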

-Matt


Index: spec_vnops.c
===================================================================
RCS file: /home/ncvs/src/sys/miscfs/specfs/spec_vnops.c,v
retrieving revision 1.147
retrieving revision 1.148
diff -u -r1.147 -r1.148
--- spec_vnops.c        2000/12/26 19:41:37     1.147
+++ spec_vnops.c        2000/12/30 23:32:24     1.148
@@ -31,7 +31,7 @@
  * SUCH DAMAGE.
  *
  *      @(#)spec_vnops.c        8.14 (Berkeley) 5/21/95
- * $FreeBSD: src/sys/miscfs/specfs/spec_vnops.c,v 1.147 2000/12/26 19:41:37 dillon Exp $
+ * $FreeBSD: src/sys/miscfs/specfs/spec_vnops.c,v 1.148 2000/12/30 23:32:24 dillon Exp $
  */
 
 #include <sys/param.h>
@@ -352,12 +352,25 @@
                return (0);
 
        /*
+        * MARK/SCAN initialization to avoid infinite loops
+        */
+       s = splbio();
+       for (bp = TAILQ_FIRST(&vp->v_dirtyblkhd); bp;
+            bp = TAILQ_NEXT(bp, b_vnbufs)) {
+               bp->b_flags &= ~B_SCANNED;
+       }
+       splx(s);
+
+       /*
         * Flush all dirty buffers associated with a block device.
         */
 loop:
        s = splbio();
        for (bp = TAILQ_FIRST(&vp->v_dirtyblkhd); bp; bp = nbp) {
                nbp = TAILQ_NEXT(bp, b_vnbufs);
+               if ((bp->b_flags & B_SCANNED) != 0)
+                       continue;
+               bp->b_flags |= B_SCANNED;
                if (BUF_LOCK(bp, LK_EXCLUSIVE | LK_NOWAIT))
                        continue;
                if ((bp->b_flags & B_DELWRI) == 0)





Re: Current hangs...

2000-12-28 Thread Matt Dillon

:
:I'm seeing this kind of hang about twice a day on my build-box.
:
:Any clues ? insights ?

When did this start occurring?  I committed some pageout 
buffer-cache-related I/O pipelining a day or two ago to -current
(which has been well tested under -stable and reasonably well tested
under -current).  But -current has generally been very unstable for me
for months now and that makes it extremely difficult to isolate problems.

What's the filesystem configuration being hit by the compile?  Are you
running any MFS or VN partitions?  Is softupdates turned on on the
underlying filesystem?

On the face of it it looks like a bawrite() is being turned into a
bdwrite() somehow and that spec_fsync() is hitting its 'goto loop'
restart code and looping on the same buffer... which can happen if 
the bawrite() is being turned into a bdwrite().  It looks like
spec_fsync() is in an infinite loop.

-Matt

:login: [halt - sent]
:Stopped at  siointr1+0xb1:  jmp siointr1+0x1b7
:db> ps
:  pid   proc     addr    uid  ppid  pgrp  flag  stat wmesg  wchan    cmd
:39121 d56c7200 d581b000    0 39120 24496 004006  2                   ld
:...
:  121 d2a0a420 d568c000    0     1   121 84      3  select c02bbed0 syslogd
:    5 d2a0aa80 d2f5c000    0     0     0 000204  2                   syncer
:    4 d2a0aca0 d2f5a000    0     0     0 100204  3  psleep c02a4924 bufdaemon
:    3 d2a0aec0 d2f58000    0     0     0 000204  3  psleep c02b02e0 vmdaemon
:    2 d2a0b0e0 d2f56000    0     0     0 100204  3  psleep c0288018 pagedaemon
:   21 d2a0b300 d2e2b000    0     0     0 000204  6                   swi0: tty:sio
:   20 d2a0b520 d2e29000    0     0     0 000204  6                   irq1: atkbd0
:   19 d2a0b740 d2e27000    0     0     0 000204  6                   irq6: fdc0
:   18 d2a0b960 d2e25000    0     0     0 000204  2                   irq9: fxp0
:...
:   14 d2a0c1e0 d2e1c000    0     0     0 000204  3  rndslp c0295008 random
:...
:   13 d2a0c400 d2e1a000    0     0     0 000204  6                   swi4: vm
:   12 d2a0c620 d2e18000    0     0     0 00020c  2                   swi6: clock
:   11 d2a0c840 d2e16000    0     0     0 000204  6                   swi1: net
:   10 d2a0ca60 d2a13000    0     0     0 00020c  2                   idle
:    1 d2a0cc80 d2a11000    0     0     1 004284  3  wait   d2a0cc80 init
:    0 c02baf40 c0331000    0     0     0 010204  3  sched  c02baf40 swapper
:db> where
:No such command
:db> trace
:siointr1(c104e400,cbb336b0,1,0,d2f5de78) at siointr1+0xb1
:siointr(c104e400,10,10,10,0) at siointr+0x4e
:Xfastintr4(cbb336b0,cbb336b0,cbb336b0,cbb336b0,1) at Xfastintr4+0x25
:bdwrite(cbb336b0,0,0,cbb336b0,d2f54400) at bdwrite+0xb9
:bwrite(cbb336b0,d2f5def8,c0182c2d,d2f5df10,d2f5df04) at bwrite+0x76
:vop_stdbwrite(d2f5df10,d2f5df04,c0193695,d2f5df10,d2f5df1c) at vop_stdbwrite+0xe
:vop_defaultop(d2f5df10,d2f5df1c,c017d49a,d2f5df10,c0274460) at vop_defaultop+0x15
:spec_vnoperate(d2f5df10,c0274460,d2f54400,cbb336b0,d2f5df48) at spec_vnoperate+0x15
:bawrite(cbb336b0,0,cbb336b0) at bawrite+0x32
:spec_fsync(d2f5df7c,d2f5df9c,c01857be,d2f5df7c,0) at spec_fsync+0x141
:spec_vnoperate(d2f5df7c) at spec_vnoperate+0x15
:sched_sync(0) at sched_sync+0x13e
:fork_trampoline() at fork_trampoline+0x15
:db>
:
:--
:Poul-Henning Kamp   | UNIX since Zilog Zeus 3.20
:[EMAIL PROTECTED] | TCP/IP since RFC 956
:FreeBSD committer   | BSD since 4.3-tahoe





Re: Current hangs...

2000-12-28 Thread Matt Dillon


If possible, 'print *bp' from a gdb'd kernel dump.

I suspect this may be related to 'bp->b_xflags & BX_BKGRDINPROG'.
If a bitmap is undergoing a background write and is then dirtied
a second time and bawrite()n, the bawrite() will be turned into a
bdwrite() (because a prior write is already in progress).

If this is the case we may have to implement the same MARK/SCAN code
for spec_fsync() that we implemented for FFS's fsync code.

FFS's fsync code got into infinite loops trying to fsync() buffers with
softupdates dependencies -- whose bawrite()'s were being turned into
bdwrite()'s due to the dependencies.

-Matt





Re: Current hangs...

2000-12-28 Thread Poul-Henning Kamp

In message [EMAIL PROTECTED], Matt Dillon writes:
>:
>:I'm seeing this kind of hang about twice a day on my build-box.
>:
>:Any clues ? insights ?
>
>When did this start occurring?

Can't say for sure, it's certainly noticeable today, but I have not
had much FreeBSD time recently so I would be hard pressed to say
more definitively when it started.

>But -current has generally been very unstable for me
>for months now and that makes it extremely difficult to isolate problems.

<OFF TOPIC>
Yeah, I'm increasingly in favour of a "bake-out" period where no new
subsystems are SMPng'ed until -current is stabilized.  Sos tells me that
running SMP is an instant death request these days.  My SMP systems are
not even powered on right now.
</OFF TOPIC>

>What's the filesystem configuration being hit by the compile?  Are you
>running any MFS or VN partitions?  Is softupdates turned on on the
>underlying filesystem?

Two ATA disks, one CCD striped, softupdates.

>On the face of it it looks like a bawrite() is being turned into a
>bdwrite() somehow and that spec_fsync() is hitting its 'goto loop'
>restart code and looping on the same buffer... which can happen if 
>the bawrite() is being turned into a bdwrite().  It looks like
>spec_fsync() is in an infinite loop.

There are no other signs of trouble; processes hang when they attempt
some (but maybe not all) kinds of I/O.

--
Poul-Henning Kamp   | UNIX since Zilog Zeus 3.20
[EMAIL PROTECTED] | TCP/IP since RFC 956
FreeBSD committer   | BSD since 4.3-tahoe
Never attribute to malice what can adequately be explained by incompetence.





Re: Current hangs...

2000-12-28 Thread Thomas D. Dean

I am seeing a couple of problems with -current.

Running gdb on a recent -current,

# uname -a
FreeBSD dsl081-020-229-sea1.dsl-isp.net 5.0-CURRENT FreeBSD 5.0-CURRENT \
  #1: Wed Dec 27 16:20:32 PST 2000  \
  root@celebris:/usr/src/sys/compile/CELEBRIS  i386

causes the console to go out into left field somewhere.

# gdb hello
(gdb) b main
(gdb) r

and all xterms stop responding, although keyboard input is echoed.
Ctrl-Alt-F1 switches to the console.  Keyboard input is echoed, but not
responded to.  Ctrl-Alt-Esc breaks into the debugger, but everything looks
the same as breaking into the debugger under normal operation.

With -current SMP, I get Fatal Trap 12's during 'make world' and
transactions to lpd.

I have posted the SMP problem to -smp, and, am hiding and watching.
So far, no resolution.

tomdean





Re: -current hangs during boot

2000-11-06 Thread Mark Murray

> Did I miss something on the /dev/random hang?

I don't know...

> During a `shutdown -r now`, the boot process hangs for more than an
> hour.  I thought this was supposed to work.  However, jwd's recipe
> for recovery works.  I repeated this three times, although I only
> waited an hour the last time (dinner!).
> 
> I am running a recent SMP system, uname and dmesg below.

Are you _completely_ up to date with /etc/*? Do you run mergemaster
after each make world/make kernel?

M
--
Mark Murray
Join the anti-SPAM movement: http://www.cauce.org





Re: -current hangs during boot

2000-11-06 Thread Thomas D. Dean

I cannot make mergemaster work.  Tried twice earlier in the year and
took several hours to recover...  Don't know what my problem is.

However, I have a script that compares and lists diffs in /etc/rc* and
/etc/defaults/* to those in src/etc.  Normally, I manually copy those
files to /etc.

# grep '$FreeBSD' /etc/rc /etc/rc.shutdown /etc/defaults/rc.conf 
/etc/rc:# $FreeBSD: src/etc/rc,v 1.239 2000/10/22 19:10:13 phk Exp $
/etc/rc.shutdown:# $FreeBSD: src/etc/rc.shutdown,v 1.15 2000/10/20 20:26:05 ache Exp $
/etc/defaults/rc.conf:# $FreeBSD: src/etc/defaults/rc.conf,v 1.83 2000/10/29 19:59:04 ume Exp $

These are the proper versions.

After looking through rc.shutdown, 'Writing entropy file.' is not
displayed.  In /etc/defaults/rc.conf and /usr/src/etc/defaults/rc.conf,
I have

...
entrop^%]0y_file="/entropy" # Set to NO to disable caching entropy through reboots.
...

Garble?

Fixing this fixes the problem.

Thanks Mark.

tomdean





Re: -current hangs during boot

2000-11-03 Thread Thomas D. Dean

Did I miss something on the /dev/random hang?

During a `shutdown -r now`, the boot process hangs for more than an
hour.  I thought this was supposed to work.  However, jwd's recipe
for recovery works.  I repeated this three times, although I only
waited an hour the last time (dinner!).

I am running a recent SMP system, uname and dmesg below.

tomdean

= from the console =
...
Doing additional network setup: portmap.
Starting final network daemons: mountd nfsd rpc.statd nfsiod \
  NFS access cache time=2.
setting ELF ldconfig path: /usr/lib /usr/lib/compat \
   /usr/X11R6/lib /usr/local/lib

--- ^t at this point, after waiting an hour

load: 0.00  cmd: ldconfig 137 [rndblk] 0.00u 0.00s 0% 104k
^\Reboot interrupted
Nov  3 17:38:40 celebris /boot/kernel/kernel: pid 137 (ldconfig), \
 uid 0: exited on signal 3 (core dumped)
Enter full pathname of shell or RETURN for /bin/sh:
# ls -al /etc > /dev/random
# exit
Skipping disk checks ...
Doing initial network setup:.
...

= uname -a =

FreeBSD celebris 5.0-CURRENT FreeBSD 5.0-CURRENT #1: Fri Nov 3 \
16:30:23 PST 2000 root@celebris:/usr/src/sys/compile/CELEBRIS-SMP i386


= dmesg =
Copyright (c) 1992-2000 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 5.0-CURRENT #1: Fri Nov  3 16:30:23 PST 2000
root@celebris:/usr/src/sys/compile/CELEBRIS-SMP
Timecounter "i8254"  frequency 1193182 Hz
CPU: Pentium/P54C (122.68-MHz 586-class CPU)
  Origin = "GenuineIntel"  Id = 0x525  Stepping = 5
  Features=0x3bf<FPU,VME,DE,PSE,TSC,MSR,MCE,CX8,APIC>
real memory  = 100663296 (98304K bytes)
avail memory = 94740480 (92520K bytes)
Programming 16 pins in IOAPIC #0
IOAPIC #0 intpin 2 - irq 0
FreeBSD/SMP: Multiprocessor motherboard
 cpu0 (BSP): apic id:  0, version: 0x00030010, at 0xfee0
 cpu1 (AP):  apic id:  1, version: 0x00030010, at 0xfee0
 io0 (APIC): apic id:  2, version: 0x000f0011, at 0xfec0
Preloaded elf kernel "kernel" at 0xc0314000.
Preloaded userconfig_script "/boot/kernel.conf" at 0xc031409c.
Intel Pentium detected, installing workaround for F00F bug
npx0: math processor on motherboard
npx0: INT 16 interface
pcib0: Host to PCI bridge at pcibus 0 on motherboard
pci0: PCI bus on pcib0
ncr0: ncr 53c810 fast10 scsi port 0xec00-0xecff mem 0xfedfbf00-0xfedfbfff irq 11 at device 1.0 on pci0
isab0: Intel 82378IB PCI to ISA bridge at device 2.0 on pci0
isa0: ISA bus on isab0
pci0: Matrox MGA Millennium 2064W graphics accelerator at 6.0 irq 9
de0: Digital 21041 Ethernet port 0xe880-0xe8ff mem 0xfedfbe80-0xfedfbeff irq 10 at device 8.0 on pci0
de0: DEC DE450-CA 21041 [10Mb/s] pass 1.1
de0: address 00:00:f8:02:76:db
atkbdc0: Keyboard controller (i8042) at port 0x60,0x64 on isa0
atkbd0: AT Keyboard irq 1 on atkbdc0
psm0: PS/2 Mouse irq 12 on atkbdc0
psm0: model Generic PS/2 mouse, device ID 0
fdc0: NEC 72065B or clone at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0
fdc0: FIFO enabled, 8 bytes threshold
fd0: 1440-KB 3.5" drive on fdc0 drive 0
ppc0: Parallel port at port 0x378-0x37f irq 7 on isa0
ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
lpt0: Printer on ppbus0
lpt0: Interrupt-driven port
sc0: System console on isa0
sc0: VGA 16 virtual consoles, flags=0x200
sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0
sio0: type 16550A
sio1 at port 0x2f8-0x2ff irq 3 on isa0
sio1: type 16550A
vga0: Generic ISA VGA at port 0x3c0-0x3df iomem 0xa-0xb on isa0
unknown: IBM Enhanced (101/102-key) KC can't assign resources
unknown: Microsoft PS/2 Mouse can't assign resources
unknown: 16550 compatible COM device can't assign resources
unknown: 16550 compatible COM device can't assign resources
unknown: LPT printer port can't assign resources
unknown: Floppy Controller can't assign resources
APIC_IO: Testing 8254 interrupt delivery
APIC_IO: routing 8254 via IOAPIC #0 intpin 2
Waiting 10 seconds for SCSI devices to settle
Mounting root from ufs:/dev/da1s1a
da0 at ncr0 bus 0 target 0 lun 0
da0: QUANTUM FIREBALL1080S 1Q09 Fixed Direct Access SCSI-2 device 
da0: 10.000MB/s transfers (10.000MHz, offset 8)
da0: 1042MB (2134305 512 byte sectors: 255H 63S/T 132C)
da2 at ncr0 bus 0 target 2 lun 0
da2: QUANTUM EMPIRE_1080S 1240 Fixed Direct Access SCSI-2 device 
da2: 10.000MB/s transfers (10.000MHz, offset 8), Tagged Queueing Enabled
da2: 1029MB (2109376 512 byte sectors: 255H 63S/T 131C)
cd0 at ncr0 bus 0 target 5 lun 0
cd0: TOSHIBA CD-ROM XM-5401TA 3605 Removable CD-ROM SCSI-2 device 
cd0: 4.237MB/s transfers (4.237MHz, offset 8)
cd0: Attempt to query device size failed: NOT READY, Medium not present
da1 at ncr0 bus 0 target 1 lun 0
da1: IBM DNES-309170 SAH0 Fixed Direct Access SCSI-3 device 
da1: 10.000MB/s transfers (10.000MHz, offset 8), Tagged Queueing Enabled
da1: 8748MB (17916240 512 byte sectors: 255H 63S/T 1115C)
SMP: AP CPU #1 Launched!
de0: enabling BNC port
pid 137 (ldconfig), uid 0: exited on signal 

Re: current hangs when boot

2000-10-25 Thread Johan Kruger

In message [EMAIL PROTECTED] Bigbear writes:
: i update my system from 4.1 to current, when system boot, it hangs when:
: start elf ldconfig: /usr/lib /usr/lib/compat /usr/X11R6/lib
: why?


This is not a hang, otherwise you would not have been able to boot by
pressing ^C or ^T.
The files, especially the rc files in /etc, are not the newest ones. The
rc script is supposed to use ldconfig to update the library database,
but when it starts it waits. If you bypass this by pressing keys, then
you may get errors with certain libraries, since
they are not loaded.

So, all you need to do is go to /usr/src/etc and copy rc* to /etc.
This solved the problem for me. I did a make buildworld and installworld
yesterday and I got the problem on the second reboot.
I had a look in the /etc directory and saw that the files did not install ...
which makes sense, since buildworld/installworld by design does not update
the etc dir.

I started a make release 25 minutes ago, will see if the end result of a
snapshot will give the same problem.

P.S. After updating your files in /etc, do a shutdown -r now; the shutdown rc
script will take care of the entropy factor next time around.


 : ^C also works.
 : 
 : ^T is generally useful if you suspect something is hanging on bootup but
 : don't know what it is.
 
 We also found at bsdcon that lots of keystrokes would also make the
 system boot.
 
 Which reminds me of the Sun software problem report that had as the
 engineering reply: "Don't hit on the keyboard like a wild monkey" .. :-)
 
--
Unix Software Developer/Engineer
E-Mail: Johan Kruger [EMAIL PROTECTED]
Date: 25-Oct-00
Time: 11:02:51
OS: FreeBSD 5.0-2724-SNAP
All good things come to those who ... run FreeBSD
--





re: -current hangs during boot (UPDATING entry)

2000-10-25 Thread Mark Hittinger


It does look like an updating entry is needed for this badly.

I did the following things, some of which may not be needed, and now my 
-current boxes boot OK.

1.  update MAKEDEV from /usr/src/etc, run MAKEDEV all

2.  update /etc/rc /etc/rc.* /etc/defaults/rc.conf from /usr/src/etc

3.  add random_load="YES" to /boot/loader.conf

4.  update /etc/login.conf from /usr/src/etc

5.  do a "shutdown -r now" which creates the entropy file and reboots

Now the system boot does not hang at the ldconfig spots.  It does look like
there is a lot to do to get back on course, enough to justify adding to
UPDATING.

Later

Mark Hittinger
Earthlink
[EMAIL PROTECTED]





Re: -current hangs during boot (UPDATING entry)

2000-10-25 Thread John W. De Boskey

Thanks for the updates. A few questions below.

-John

- Mark Hittinger's Original Message -
 
> It does look like an updating entry is needed for this badly.
> 
> I did the following things, some of which may not be needed, and now my 
> -current boxes boot OK.
> 
> 1.  update MAKEDEV from /usr/src/etc, run MAKEDEV all
> 
> 2.  update /etc/rc /etc/rc.* /etc/defaults/rc.conf from /usr/src/etc
> 
> 3.  add random_load="YES" to /boot/loader.conf

or add the random device to the kernel config file.

> 4.  update /etc/login.conf from /usr/src/etc

I don't remember having to do this... was there a
specific reason?

> 5.  do a "shutdown -r now" which creates the entropy file and reboots
> 
> Now the system boot does not hang at the ldconfig spots.  It does look like
> there is a lot to do to get back on course, enough to justify adding to
> UPDATING.
> 
> Later
> 
> Mark Hittinger
> Earthlink
> [EMAIL PROTECTED]
 





Re: -current hangs during boot (UPDATING entry)

2000-10-24 Thread void

On Mon, Oct 23, 2000 at 06:30:29PM -0400, John W. De Boskey wrote:
 
> 5. At this time, remove ALL MFS filesystems from /etc/fstab.
>    They can be hand mounted after bootup or via a local rc
>    startup script.

You can leave them in, with option "noauto", and mount them later with
"mount -a -t mfs".

-- 
 Ben

220 go.ahead.make.my.day ESMTP Postfix





Re: -current hangs during boot

2000-10-24 Thread Warner Losh

In message [EMAIL PROTECTED] Leif Neland writes:
:   Hi there,
:  
:   I've done a very recent week's make world(S) on -current, making
:   and installing world and kernel go fine, but all hang on boot,
:   with no error codes or msgs.
:  
: Solution: Due to changes in the random/entropy stuff, you have to reboot
: the machine with "shutdown -r now" or Ctrl-alt-del, not "reboot".
: 
: (Why do we have a separate reboot command? Couldn't it just be an alias
: for shutdown -r now?)

reboot should be a separate command.  It should, however, cause the
shutdown scripts to run unless it is run -f.  shutdown implies sending 
all messages to all users of NFS filesystems, which isn't desirable in 
many cases.  At least that's why my fingers type reboot to this day:
On sunos 4.0 (really Solbourne OS/MP 4.0D) in a large company these
messages would scare more people than they informed...

Warner





Re: current hangs when boot

2000-10-24 Thread Warner Losh

In message [EMAIL PROTECTED] "Matthew N. Dodd" 
writes:
: On Mon, 23 Oct 2000, Brian O'Shea wrote:
:  I am also having this problem.  If you interrupt it (with ^\ to send
:  SIGQUIT), ldconfig generates a core.  Then ldconfig will hang while
:  setting a.out ldconfig path:
: 
: ^C also works.
: 
: ^T is generally useful if you suspect something is hanging on bootup but
: don't know what it is.

We also found at bsdcon that lots of keystrokes would also make the
system boot.

Warner





Re: -current hangs during boot (UPDATING entry)

2000-10-24 Thread David O'Brien

On Mon, Oct 23, 2000 at 06:30:29PM -0400, John W. De Boskey wrote:
> 2. Make sure your kernel includes:
> 
> device          random          # Entropy device

Are you implying the random.ko module is broken?
 





Re: -current hangs during boot (UPDATING entry)

2000-10-24 Thread John W. De Boskey

It didn't work without the device line when I tested it
last week(Thursday/Friday).

-John

- David O'Brien's Original Message -
> On Mon, Oct 23, 2000 at 06:30:29PM -0400, John W. De Boskey wrote:
> > 2. Make sure your kernel includes:
> > 
> > device          random          # Entropy device
> 
> Are you implying the random.ko module is broken?
  





BSDcon Was: Re: current hangs when boot

2000-10-24 Thread Andy Farkas


On Tue, 24 Oct 2000, Warner Losh wrote:

 We also found at bsdcon that lots of keystrokes would also make the
 system boot.
 
 Warner

For those of us that couldn't go, has anybody posted pictures from the con
yet?  ...any kind of pictures - not just ones of keyboard bashing...

--
 
 :{ [EMAIL PROTECTED]
  
Andy Farkas
System Administrator
   Speednet Communications
 http://www.speednet.com.au/
  







Re: -current hangs during boot (UPDATING entry)

2000-10-24 Thread Steve Kargl

You need to put

random_load="YES"

into /boot/loader.conf.

Optionally, you can load random.ko during the boot process.

John W. De Boskey wrote:
> It didn't work without the device line when I tested it
> last week (Thursday/Friday).
> 
> - David O'Brien's Original Message -
> > On Mon, Oct 23, 2000 at 06:30:29PM -0400, John W. De Boskey wrote:
> > > 2. Make sure your kernel includes:
> > > 
> > > device          random          # Entropy device
> > 
> > Are you implying the random.ko module is broken?
   


-- 
Steve





Re: current hangs when boot

2000-10-24 Thread Wilko Bulte

On Tue, Oct 24, 2000 at 12:38:41PM -0600, Warner Losh wrote:
 In message [EMAIL PROTECTED] "Matthew N. 
Dodd" writes:
 : On Mon, 23 Oct 2000, Brian O'Shea wrote:
 :  I am also having this problem.  If you interrupt it (with ^\ to send
 :  SIGQUIT), ldconfig generates a core.  Then ldconfig will hang while
 :  setting a.out ldconfig path:
 : 
 : ^C also works.
 : 
 : ^T is generally useful if you suspect something is hanging on bootup but
 : don't know what it is.
 
 We also found at bsdcon that lots of keystrokes would also make the
 system boot.

Which reminds me of the Sun software problem report that had as the
engineering reply: "Don't hit on the keyboard like a wild monkey" .. :-)

-- 
Wilko Bulte Arnhem, the Netherlands
[EMAIL PROTECTED]   http://www.freebsd.org  http://www.nlfug.nl






Re: -current hangs during boot (UPDATING entry)

2000-10-24 Thread Doug Barton

On Mon, 23 Oct 2000, John W. De Boskey wrote:

> I'm beginning to think we need an updating entry.
> 
> 1. Make sure /dev/random exists 'cd /dev && sh MAKEDEV std'

Unless you are using devfs.

> 2. Make sure your kernel includes:
> 
> device          random          # Entropy device

Or the appropriate line in /boot/mumble.conf

> 3. Make sure /etc/rc is at rev 1.237 or higher.
> 
> 4. Make sure /etc/rc.shutdown is at rev 1.13 or 1.15
> 
> 5. At this time, remove ALL MFS filesystems from /etc/fstab.
>    They can be hand mounted after bootup or via a local rc
>    startup script.

A better solution would be to increase the granularity of the file
system mounts so that the MFS systems were mounted after the entropy
gathering. There are a couple of reasons I changed the ordering on the two
reseeding attempts, one of the big ones was that for the "manual" reseed
(i.e., no entropy file) we want more parts of the system mounted so that
we have more potential entropy gathering tools. 

Rest sounds good...

Doug
-- 
"The dead cannot be seduced."
- Kai, "Lexx"








Re: current hangs when boot

2000-10-23 Thread Brian O'Shea

(Yikes, my message turned out to be a bit long, sorry)
I did a little poking around.

I'm running -current as of last Saturday:

# uname -a
FreeBSD panic.localdomain 5.0-CURRENT FreeBSD 5.0-CURRENT #0: Sat Oct 21 22:20:11 PDT 2000 \
  [EMAIL PROTECTED]:/usr/obj/usr/local/cvsup/current/src/sys/PANIC  i386


On Mon, Oct 23, 2000 at 12:27:25AM +, Bigbear wrote:
 i update my system from 4.1 to current, when system boot, it hangs when:
 start elf ldconfig: /usr/lib /usr/lib/compat /usr/X11R6/lib
 why?

I am also having this problem.  If you interrupt it (with ^\ to send
SIGQUIT), ldconfig generates a core.  Then ldconfig will hang while
setting a.out ldconfig path:

^Csetting a.out ldconfig path: /usr/lib/aout /usr/lib/compat/aout

This can be interrupted too, and then it hangs while starting sshd.
Interrupting sshd allows the boot to proceed.

I got a core from each program during the hang, and here's what I
found:

Here's the backtrace from the core obtained from ldconfig (rebuilt
with -g) the first time around:
(starting elf ldconfig)

(gdb) bt
#0  0x8054340 in read ()
#1  0x804c966 in mktemp ()
#2  0x804ca33 in arc4random_stir ()
#3  0x804cad9 in arc4random ()
#4  0x804c791 in mktemp ()
#5  0x804c692 in mkstemp ()
#6  0x804886a in write_elf_hints ()
#7  0x8048818 in update_elf_hints ()
#8  0x8048c61 in main ()
#9  0x8048139 in _start ()


And the second time around:
(setting a.out ldconfig path)

(gdb) bt
#0  0x8054340 in read ()
#1  0x804c966 in mktemp ()
#2  0x804ca33 in arc4random_stir ()
#3  0x804cad9 in arc4random ()
#4  0x804c791 in mktemp ()
#5  0x804c692 in mkstemp ()
#6  0x8049590 in buildhints ()
#7  0x8048e39 in main ()
#8  0x8048139 in _start ()


And from sshd:

(gdb) bt
#0  0x28208784 in read () from /usr/lib/libc.so.4
#1  0x282081ce in __sread () from /usr/lib/libc.so.4
#2  0x281f67a6 in __srefill () from /usr/lib/libc.so.4
#3  0x281f23bd in fread () from /usr/lib/libc.so.4
#4  0x281217c1 in RAND_SSLeay () from /usr/lib/libcrypto.so.1
#5  0x28121869 in RAND_SSLeay () from /usr/lib/libcrypto.so.1
#6  0x281212cc in RAND_bytes () from /usr/lib/libcrypto.so.1
#7  0x28146099 in DSA_OpenSSL () from /usr/lib/libcrypto.so.1
#8  0x28146151 in BN_rand () from /usr/lib/libcrypto.so.1
#9  0x280e4561 in BN_is_prime_fasttest () from /usr/lib/libcrypto.so.1
#10 0x280e3e03 in BN_generate_prime () from /usr/lib/libcrypto.so.1
#11 0x280da4a8 in RSA_generate_key () from /usr/lib/libcrypto.so.1
#12 0x8059437 in getsockname ()
#13 0x804c35b in getsockname ()
#14 0x804b76d in getsockname ()


Running ldconfig manually, 'top' shows ldconfig sleeping on 'rndblk':

  PID USERNAME PRI NICE  SIZE    RES STATE    TIME   WCPU    CPU COMMAND
...
  228 root      46    0  216K   104K rndblk   0:00  0.00%  0.00% ldconfig


More investigation:

# fstat /dev/urandom
USER     CMD          PID   FD MOUNT      INUM MODE         SZ|DV R/W NAME
root     ldconfig     228    3 /          7973 crw-r--r--  urandom  r  /dev/urandom

# ps auxw | grep ldconfig
root 228  0.0  0.4   216  104  d0  I 2:18AM   0:00.00 ldconfig -elf /usr/lib


This commit from Peter Wemm on Oct 18 might shed some light:
 
 peter   2000/10/18 03:39:18 PDT 
 
   Modified files:   
 sys/dev/random   randomdev.c
   Log:  
   Attempt to fix the random read blocking.  The old code slept at   
   priority "0" and without PCATCH, so it was uninterruptable.  And  
   even when it did wake up after entropy arrived, it exited after the   
   wakeup without actually reading the freshly arrived entropy.  I   
   sent this to Mark before but it seems he is in transit.   
   Mark: feel free to replace this if it gets in your way.   
 
   Revision  Changes    Path
   1.16      +14 -15    src/sys/dev/random/randomdev.c


Maybe this is a related problem (except that random read blocking is
now interruptible?)
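
For anyone who wants to check whether a read on a descriptor would block
without getting stuck in it, here is a minimal userland sketch using
poll(2).  read_with_timeout() is a hypothetical helper written for this
note, not anything from libc or the tree, and whether a given random
device supports poll(2) is an assumption to verify on the system at hand.

```c
#include <assert.h>
#include <errno.h>
#include <poll.h>
#include <unistd.h>

/*
 * Wait up to timeout_ms for fd to become readable, then read once.
 * Returns the number of bytes read, 0 if the wait timed out (or the read
 * hit end of file), and -1 on error.
 */
ssize_t
read_with_timeout(int fd, void *buf, size_t len, int timeout_ms)
{
	struct pollfd pfd = { .fd = fd, .events = POLLIN };
	int ready;

	assert(buf != NULL);

	/* Retry if a signal interrupts the wait. */
	do {
		ready = poll(&pfd, 1, timeout_ms);
	} while (ready == -1 && errno == EINTR);

	if (ready <= 0)
		return (ready);	/* 0: timed out, -1: poll failed */
	return (read(fd, buf, len));
}
```

A caller would open the device read-only, pass the descriptor with a short
timeout, and treat a return of 0 as "a plain read() would have slept here".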

-- 
Brian O'Shea
[EMAIL PROTECTED]





Re: -current hangs during boot (UPDATING entry)

2000-10-23 Thread John W. De Boskey

I'm beginning to think we need an updating entry.

1. Make sure /dev/random exists 'cd /dev && sh MAKEDEV std'

2. Make sure your kernel includes:

device  random  # Entropy device

3. Make sure /etc/rc is at rev 1.237 or higher.

4. Make sure /etc/rc.shutdown is at rev 1.13 or 1.15

5. At this time, remove ALL MFS filesystems from /etc/fstab.
   They can be hand mounted after bootup or via a local rc
   startup script.
   
6. Reboot your system via 'init 6' or 'shutdown -r now' (or similar).
   Do not use 'reboot'.

7. If at any time during the boot process a 'cntrl-t' shows a
   process hung in 'rndblk', hit 'cntrl-\', 'ls -al /etc > /dev/random',
   and then 'exit'. This should get your machine to boot for this
   session until you figure out which of the above are not done.

-John

- Brian O'Shea's Original Message -
 On Sun, Oct 22, 2000 at 07:35:39PM -0400, John W. De Boskey wrote:
  - David O'Brien's Original Message -
   On Sun, Oct 22, 2000 at 04:14:08PM +0800, Donny Lee wrote:
 with the mfs enable, it hangs there right after FILESYSTEM
 checking, and with random_load set to YES, it hangs at ldconfig.
   
   mount_mfs is blocking on "rndblk".  Our /dev/*random is fubar'ed *again*.
  
 I saw this the other day...
  
 I've been examining the mfs code, and I'm not really sure why
  it needs to use random values...
 
 When ldconfig hangs, it is in a mkstemp() call.  The mkstemp()
 function probably uses random numbers to generate unique file names:
 
 #0  0x8054340 in read ()
 #1  0x804c966 in mktemp ()
 #2  0x804ca33 in arc4random_stir ()
 #3  0x804cad9 in arc4random ()
 #4  0x804c791 in mktemp ()
 #5  0x804c692 in mkstemp ()
 #6  0x804886a in write_elf_hints ()
 #7  0x8048818 in update_elf_hints ()
 #8  0x8048c61 in main ()
 #9  0x8048139 in _start ()
 
 
 -brian
 
 -- 
 Brian O'Shea
 [EMAIL PROTECTED]
 
 





Re: current hangs when boot

2000-10-23 Thread Matthew N. Dodd

On Mon, 23 Oct 2000, Brian O'Shea wrote:
> I am also having this problem.  If you interrupt it (with ^\ to send
> SIGQUIT), ldconfig generates a core.  Then ldconfig will hang while
> setting a.out ldconfig path:

^C also works.

^T is generally useful if you suspect something is hanging on bootup but
don't know what it is.

-- 
| Matthew N. Dodd  | '78 Datsun 280Z | '75 Volvo 164E | FreeBSD/NetBSD  |
| [EMAIL PROTECTED] |   2 x '84 Volvo 245DL| ix86,sparc,pmax |
| http://www.jurai.net/~winter | This Space For Rent  | ISO8802.5 4ever |






Re: -current hangs during boot (UPDATING entry)

2000-10-23 Thread Makoto MATSUSHITA


jwd> 5. At this time, remove ALL MFS filesystems from /etc/fstab.
jwd>    They can be hand mounted after bootup or via a local rc
jwd>    startup script.

Is there any chance to mount MFS filesystem listed in /etc/fstab just
after the /dev/random reseeding is done ? I cannot put up with that we
cannot put 'mfs' line to /etc/fstab forever. Moreover, /tmp (common
MFS candidate) is already used by X server after rebooting; you may
not want to mount /tmp at hand later.

Or, it's only for upgrading procedure, and we can put 'mfs' lines back?

-- -
Makoto `MAR' MATSUSHITA





Re: -current hangs during boot (UPDATING entry)

2000-10-23 Thread John W. De Boskey

- Makoto MATSUSHITA's Original Message -
 
> jwd> 5. At this time, remove ALL MFS filesystems from /etc/fstab.
> jwd>    They can be hand mounted after bootup or via a local rc
> jwd>    startup script.
>
> Is there any chance to mount MFS filesystem listed in /etc/fstab just
> after the /dev/random reseeding is done ? I cannot put up with that we
> cannot put 'mfs' line to /etc/fstab forever. Moreover, /tmp (common
> MFS candidate) is already used by X server after rebooting; you may
> not want to mount /tmp at hand later.
>
> Or, it's only for upgrading procedure, and we can put 'mfs' lines back?

   #5 above represents a bug in the current code. It needs to be
fixed, but I don't know what the 'correct' thing to do is just
yet.

   Does the FSIRAND code in mfs require crypto strength randomness?

-John





Re: -current hangs during boot

2000-10-22 Thread Leif Neland



On Sun, 22 Oct 2000, Donny Lee wrote:

 
  Hi there,
 
  I've done a very recent week's make world(S) on -current, making
  and installing world and kernel go fine, but all hang on boot,
  with no error codes or msgs.
 
  When boot, all go fine at the beginning, and stop right after
  showing the msg:
 
  :
  :
  /dev/da0s1e: FILESYSTEM CLEAN, SKIPPING CHECK
  /dev/da0s1e: clean, 92565 free (125 frags, 11555 blocks,
 0.1%fragmentation) 
  
  then no more, I got to power off and on, and reboot using an 
  old working kernel.
 
<metoo>
My hangs at loading ldconf. I can continue with ^C.
It then hangs at loading aout-ldconf. I can continue with ^C
It hangs a 3. time, where I again can continue with ^C.
It then complains for missing .so's, naturally because the ldconf was
interrupted, and some commands won't work because of that, but the system
is at least somewhat working...
</metoo>

I'll see if I can narrow it down somewhat.

Leif






Re: -current hangs during boot

2000-10-22 Thread Donny Lee

   
 John W. De Boskey noticed me some points to check out,
 I now can boot my fbsd box with the recent built kernel.

 I disabled mfs, and random_load, these 2 made my fbsd box
 hangs (not really hangs, it looks like waiting for some 
 jobs done) 2 times at boot. 

 with the mfs enable, it hangs there right after FILESYSTEM
 checking, and with random_load set to YES, it hangs at ldconfig.

--
 // Donny

Leif Neland wrote:
   When boot, all go fine at the beginning, and stop right after
   showing the msg:
   :
   /dev/da0s1e: FILESYSTEM CLEAN, SKIPPING CHECK
   /dev/da0s1e: clean, 92565 free (125 frags, 11555 blocks,
  0.1%fragmentation)
 <metoo>
 My hangs at loading ldconf. I can continue with ^C.
 It then hangs at loading aout-ldconf. I can continue with ^C
 It hangs a 3. time, where I again can continue with ^C.
 It then complains for missing .so's, naturally because the ldconf was
 interrupted, and some commands won't work because of that, but the system
 is at least somewhat working...
 </metoo>
 
 I'll see if I can narrow it down somewhat.





Re: -current hangs during boot

2000-10-22 Thread Makoto MATSUSHITA


leifn> My hangs at loading ldconf. I can continue with ^C.

Are you using MFS ? I suppose you are not.

I've found that:

- if /etc/fstab has 'mfs' line, system stops during mounting
  filesystem. Interrupting with CTRL-C works (and system goes to
  single user mode).

- if /etc/fstab does not have 'mfs' line, system stops during ldconfig.
  Also interruptable. System is working (you can login from console),
  but something goes wrong (network interface doesn't work).

Your case is the latter one, and previous email you have mentioned is
the former one.

I can reproduce both cases with a -current box cvsuped two days before.

-- -
Makoto `MAR' MATSUSHITA





Re: -current hangs during boot

2000-10-22 Thread Donny Lee

Leif Neland wrote:
> > I've done a very recent week's make world(S) on -current, making
> > and installing world and kernel go fine, but all hang on boot,
> > with no error codes or msgs.
> Solution: Due to changes in the random/entropy stuff, you have to reboot
> the machine with "shutdown -r now" or Ctrl-alt-del, not "reboot".

  not quite understand this...

  do you mean one should use "shutdown -r now" or Ctrl-Alt-Del
  instead of "reboot" right after make world and kernel?

> (Why do we have a separate reboot command? Couldn't it just be an alias
> for shutdown -r now?)

  I don't know, it's there. :)

  try man reboot, it gives me lots.  no it's not alias for any other
  commands.

--
 // Donny





Re: -current hangs during boot

2000-10-22 Thread John W. De Boskey

- Leif Neland's Original Message -
 
 
 On Sun, 22 Oct 2000, Donny Lee wrote:
 
  
   Hi there,
  
   I've done a very recent week's make world(S) on -current, making
   and installing world and kernel go fine, but all hang on boot,
   with no error codes or msgs.
  
 Solution: Due to changes in the random/entropy stuff, you have to reboot
 the machine with "shutdown -r now" or Ctrl-alt-del, not "reboot".
 
 (Why do we have a separate reboot command? Couldn't it just be an alias
 for shutdown -r now?)

   See pr 14829 and:

http://www.FreeBSD.org/cgi/cvsweb.cgi/src/etc/rc

   revision 1.235 for a few additional comments.

-John

 Leif





Re: -current hangs during boot

2000-10-22 Thread Leif Neland


- Original Message -
From: "Donny Lee" [EMAIL PROTECTED]
To: "Leif Neland" [EMAIL PROTECTED]
Cc: [EMAIL PROTECTED]
Sent: Sunday, October 22, 2000 10:59 AM
Subject: Re: -current hangs during boot


> Leif Neland wrote:
> > > I've done a very recent week's make world(S) on -current, making
> > > and installing world and kernel go fine, but all hang on boot,
> > > with no error codes or msgs.
> > Solution: Due to changes in the random/entropy stuff, you have to reboot
> > the machine with "shutdown -r now" or Ctrl-alt-del, not "reboot".
>
>   not quite understand this...
>
>   do you mean one should use "shutdown -r now" or Ctrl-Alt-Del
>   instead of "reboot" right after make world and kernel?

Exactly. Except not just after make world/kernel, every time you want to
reboot/halt.

Leif







Re: -current hangs during boot

2000-10-22 Thread Donny Lee

Leif Neland wrote:
> > do you mean one should use "shutdown -r now" or Ctrl-Alt-Del
> > instead of "reboot" right after make world and kernel?
> Exactly. Except not just after make world/kernel, every time you
> want to reboot/halt.

  Does this make any different? 

  shutdown is fine, but ctrl-alt-del is not so good,
  whenever I did this, the next boot takes a long time on
  fixing filesystems.

--
 // Donny





Re: -current hangs during boot

2000-10-22 Thread Leif Neland



> Leif Neland wrote:
> > > do you mean one should use "shutdown -r now" or Ctrl-Alt-Del
> > > instead of "reboot" right after make world and kernel?
> > Exactly. Except not just after make world/kernel, every time you
> > want to reboot/halt.
>
>   Does this make any different?
>
>   shutdown is fine, but ctrl-alt-del is not so good,
>   whenever I did this, the next boot takes a long time on
>   fixing filesystems.

You mean your system doesn't unmount the disks at ctrl-alt-del? Does it a
hard reset instead? Strange...

Leif








Re: -current hangs during boot

2000-10-22 Thread Donny Lee

Leif Neland wrote:
> > Does this make any different?
> > shutdown is fine, but ctrl-alt-del is not so good,
> > whenever I did this, the next boot takes a long time on
> > fixing filesystems.
> You mean your system doesn't unmount the disks at ctrl-alt-del?
> Does it a hard reset instead? Strange...

  i might remember wrong, i haven't done so for a long time.  
   
  Ok, i'm wrong... i'm just back from ctrl-alt-del.   :) 

--
 // Donny





Re: -current hangs during boot

2000-10-22 Thread David O'Brien

On Sun, Oct 22, 2000 at 04:14:08PM +0800, Donny Lee wrote:
> with the mfs enable, it hangs there right after FILESYSTEM
> checking, and with random_load set to YES, it hangs at ldconfig.

mount_mfs is blocking on "rndblk".  Our /dev/*random is fubar'ed *again*.
 
-- 
-- David  ([EMAIL PROTECTED])
  GNU is Not Unix / Linux Is Not UniX





Re: -current hangs during boot

2000-10-22 Thread John W. De Boskey

- David O'Brien's Original Message -
 On Sun, Oct 22, 2000 at 04:14:08PM +0800, Donny Lee wrote:
   with the mfs enable, it hangs there right after FILESYSTEM
   checking, and with random_load set to YES, it hangs at ldconfig.
 
 mount_mfs is blocking on "rndblk".  Our /dev/*random is fubar'ed *again*.

   I saw this the other day...

   I've been examining the mfs code, and I'm not really sure why
it needs to use random values...

   Anyone know the history on the '#ifdef FSIRAND' code?

-john

ps: break out of the boot sequence, 'ls -al /etc/ > /dev/random'
and then 'exit' will get around this and other problems, but
requires hands on reboots.
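
The workaround in the postscript above can be wrapped in a small shell function, sketched here under assumptions: the function name is hypothetical, and both arguments simply default to the directory and device named in the message. It does nothing beyond writing a directory listing to the device.

```sh
# Hypothetical wrapper; the name and the argument order are assumptions.
# Writes a long directory listing into the entropy device so that readers
# blocked on it (shown as 'rndblk' by ^T) can make progress.
stir_entropy() {
    dir=${1:-/etc}
    dev=${2:-/dev/random}
    ls -al "$dir" > "$dev"
}
```

Run from the shell obtained by breaking out of the boot sequence, `stir_entropy` followed by `exit` mirrors the two commands given in the message.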

  
 -- 
 -- David  ([EMAIL PROTECTED])
   GNU is Not Unix / Linux Is Not UniX





Re: -current hangs during boot

2000-10-22 Thread sthaug

> Anyone know the history on the '#ifdef FSIRAND' code?

Randomized file generation numbers to make NFS attacks more difficult,
as far as I can remember.

Steinar Haug, Nethelp consulting, [EMAIL PROTECTED]





Re: current hangs in make world (softupdates on ccd on ata)

2000-01-10 Thread Marcel Moolenaar

Poul-Henning Kamp wrote:
 
> Make world hangs reliably for me right now:
[snip]
> |yacc -d -o c-parse.c c-parse.y
> +
>
> 0 19921 1  83  -2  0  1276  932 getblk D p00:00.09 yacc -d -o c-parse.c c-parse.y
>
> This is a softupdates filesystem on a ccd on ata...

The format used for mkstemp has been changed (4 chars. longer). Does the
following patch work?

Index: main.c
===================================================================
RCS file: /home/ncvs/src/usr.bin/yacc/main.c,v
retrieving revision 1.11
diff -u -r1.11 main.c
--- main.c  2000/01/10 08:54:09 1.11
+++ main.c  2000/01/10 17:47:33
@@ -304,7 +304,7 @@
     if (tmpdir == 0) tmpdir = "/tmp";
 
     len = strlen(tmpdir);
-    i = len + 13;
+    i = len + 17;
     if (len && tmpdir[len-1] != '/')
         ++i;
 
-- 
Marcel Moolenaar                    mailto:[EMAIL PROTECTED]
SCC Internetworking & Databases     http://www.scc.nl/
The FreeBSD project                 mailto:[EMAIL PROTECTED]





Re: current hangs in make world (softupdates on ccd on ata)

2000-01-10 Thread Matthew Dillon


:Make world hangs reliably for me right now:
:+
:|...
:|/* starting time is 16:18:8 */
:|/* ending time is 16:18:8 */
:|ln -sf gxx-hash.h hash.h
:|echo '#include "cp/cp-tree.def"' > gencheck.h
:|echo '#include "objc/objc-tree.def"' >> gencheck.h
:|sed -e "/^ifobjc$/,/^end ifobjc$/d"  -e "/^ifc$/d" -e "/^end ifc$/d"  
:/syv/src/gnu/usr.bin/cc/cc_tools/../../../../contrib/gcc/c-parse.in > c-parse.y
:|yacc -d -o c-parse.c c-parse.y
:+
:
:0 19921 1  83  -2  0  1276  932 getblk D p00:00.09 yacc -d -o 
:c-parse.c c-parse.y
:
:This is a softupdates filesystem on a ccd on ata...
:
:--
:Poul-Henning Kamp FreeBSD coreteam member
:[EMAIL PROTECTED]   "Real hackers run -current on their laptop."
:FreeBSD -- It will take a long time before progress goes too far!

No vinum?

Please ps axl and pick out any other processes stuck in odd places.

Then gdb -k kernel.debug /dev/mem   (assuming you have kernel.debug
sitting around somewhere).

proc 19921
back
proc ...other processes...
back ...

note: if you have a custom .gdbinit that sets the base to hex you may
have to type the process number in hex.
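
For the `ps axl` step above, a minimal sketch of a filter that picks out processes in uninterruptible ("D") sleep together with their wait channel. The function name is hypothetical, and it assumes the `ps` output starts with a header line containing PID, WCHAN and STAT columns, whose positions it reads from that header rather than hard-coding them.

```sh
# Hypothetical filter; reads 'ps axl' style output on standard input.
# Prints "PID WCHAN" for every process whose STAT field starts with D.
stuck_procs() {
    awk '
        NR == 1 {
            # Locate the columns of interest from the header line.
            for (i = 1; i <= NF; i++) {
                if ($i == "PID")   pid = i
                if ($i == "WCHAN") wchan = i
                if ($i == "STAT")  stat = i
            }
            next
        }
        stat && $stat ~ /^D/ { print $pid, $wchan }
    '
}
```

Typical use would be `ps axl | stuck_procs`; the PIDs it prints are candidates for the `proc` and `back` commands in the debugger session described above.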

-Matt
Matthew Dillon 
[EMAIL PROTECTED]





Re: current hangs in make world (softupdates on ccd on ata)

2000-01-10 Thread Kris Kennaway

On Mon, 10 Jan 2000, Marcel Moolenaar wrote:

> The format used for mkstemp has been changed (4 chars. longer). Does the
> following patch work?

Sorry, but FWIW I did test the change by recompiling cvs (which uses yacc)
before I committed it :( I guess it was just the gods of stack which
allowed that one to build.

Kris






Re: Current hangs when dump is run?

1999-12-11 Thread Christian Gusenbauer

Hello!

Since Nov 12th, I've been having this problem, too. Is there a
solution/workaround available yet? I'm using a SCSI DAT drive and making
backups using dump or tar works, but amanda still manages to hang the machine!

Many thanks,
Christian.


AFAIR Taavi Talvik wrote on Sat, 20 Nov 1999:
 On Fri, 19 Nov 1999, Poul-Henning Kamp wrote:
 
  If you're using the ata driver, please remove the ad_sleep() call
  at the bottom of the adopen() in ata-disk.c and see if that
  helps you.
 
 Yes, i am using ata driver, but your suggestion didn't help. Still
 same story. Crash dumps are available..
 
  Current seems to hangs, when amanda tries to run dump.
  This appeared approximately 2 weeks ago, and is present in
  yesterdays current also.
 
 #0  0xc013daa4 in boot ()
 #1  0xc013de41 in panic ()
 #2  0xc011d739 in db_panic ()
 #3  0xc011d6d9 in db_command ()
 #4  0xc011d79e in db_command_loop ()
 #5  0xc011f82f in db_trap ()
 #6  0xc02011ef in kdb_trap ()
 #7  0xc020f434 in trap ()
 #8  0xc020144b in Debugger ()
 #9  0xc01fcb06 in scgetc ()
 #10 0xc01f8271 in sckbdevent ()
 #11 0xc01f1a8f in atkbd_intr ()
 #12 0xc021cd74 in atkbd_isa_intr ()
 #13 0xc0217c66 in splx ()
 #14 0xc0140a55 in tsleep ()
 #15 0xc0146889 in diskopen ()
 #16 0xc0170251 in spec_open ()
 #17 0xc017014d in spec_vnoperate ()
 #18 0xc01bb579 in ufs_vnoperatespec ()
 #19 0xc016a57c in vn_open ()
 #20 0xc01669dd in open ()
 #21 0xc020fd02 in syscall ()
 #22 0xc0201af6 in Xint0x80_syscall ()
 #23 0x804b91a in ?? ()
 #24 0x804b6b3 in ?? ()
 
 [109:0] taavi@tt:/var/crash #ps -raxwu -M vmcore.5 -N kernel.5
 USER PID %CPU %MEM   VSZ  RSS  TT  STAT STARTED  TIME COMMAND
 amanda  1365 99.0  0.0   6280  ??  R 1Jan70   0:00.00  (dump)
 root   1  0.0  0.0   5160  ??  ILs   1Jan70   0:00.02 /sbin/init
 --
 root   2  0.0  0.0 00  ??  RL1Jan70   0:00.03
 (pagedaemon)
 root   3  0.0  0.0 00  ??  DL1Jan70   0:00.00  (vmdaemon)
 root   4  0.0  0.0 00  ??  RL1Jan70   0:00.03  (bufdaemon)
 root   5  0.0  0.0 00  ??  RL1Jan70   0:00.88  (syncer)
 
 
 ---
 Taavi Talvik| Internet: [EMAIL PROTECTED] 
 Unineti Andmeside AS| phone: +372 6405150
 Ravala pst. 10  | fax: +372 6405151
 Tallinn 10143, Estonia  |
 
 
 





Re: Current hangs when dump is run?

1999-11-20 Thread Taavi Talvik

On Fri, 19 Nov 1999, Poul-Henning Kamp wrote:

 If you're using the ata driver, please remove the ad_sleep() call
 at the bottom of the adopen() in ata-disk.c and see if that
 helps you.

Yes, i am using ata driver, but your suggestion didn't help. Still
same story. Crash dumps are available..

 Current seems to hangs, when amanda tries to run dump.
 This appeared approximately 2 weeks ago, and is present in
 yesterdays current also.

#0  0xc013daa4 in boot ()
#1  0xc013de41 in panic ()
#2  0xc011d739 in db_panic ()
#3  0xc011d6d9 in db_command ()
#4  0xc011d79e in db_command_loop ()
#5  0xc011f82f in db_trap ()
#6  0xc02011ef in kdb_trap ()
#7  0xc020f434 in trap ()
#8  0xc020144b in Debugger ()
#9  0xc01fcb06 in scgetc ()
#10 0xc01f8271 in sckbdevent ()
#11 0xc01f1a8f in atkbd_intr ()
#12 0xc021cd74 in atkbd_isa_intr ()
#13 0xc0217c66 in splx ()
#14 0xc0140a55 in tsleep ()
#15 0xc0146889 in diskopen ()
#16 0xc0170251 in spec_open ()
#17 0xc017014d in spec_vnoperate ()
#18 0xc01bb579 in ufs_vnoperatespec ()
#19 0xc016a57c in vn_open ()
#20 0xc01669dd in open ()
#21 0xc020fd02 in syscall ()
#22 0xc0201af6 in Xint0x80_syscall ()
#23 0x804b91a in ?? ()
#24 0x804b6b3 in ?? ()

[109:0] taavi@tt:/var/crash #ps -raxwu -M vmcore.5 -N kernel.5
USER PID %CPU %MEM   VSZ  RSS  TT  STAT STARTED  TIME COMMAND
amanda  1365 99.0  0.0   6280  ??  R 1Jan70   0:00.00  (dump)
root   1  0.0  0.0   5160  ??  ILs   1Jan70   0:00.02 /sbin/init
--
root   2  0.0  0.0 00  ??  RL1Jan70   0:00.03
(pagedaemon)
root   3  0.0  0.0 00  ??  DL1Jan70   0:00.00  (vmdaemon)
root   4  0.0  0.0 00  ??  RL1Jan70   0:00.03  (bufdaemon)
root   5  0.0  0.0 00  ??  RL1Jan70   0:00.88  (syncer)


---
Taavi Talvik| Internet: [EMAIL PROTECTED] 
Unineti Andmeside AS| phone: +372 6405150
Ravala pst. 10  | fax: +372 6405151
Tallinn 10143, Estonia  |






Re: Current hangs when dump is run?

1999-01-17 Thread Christian Weisgerber

Taavi Talvik [EMAIL PROTECTED] wrote:

> Current seems to hangs, when amanda tries to run dump.
> This appeared approximately 2 weeks ago, and is present in
> yesterdays current also.

Not for me. I made the world yesterday, and this morning's backup
(with dump) ran fine.

-- 
Christian "naddy" Weisgerber  [EMAIL PROTECTED]






Re: Current hangs when dump is run?

1999-01-16 Thread Poul-Henning Kamp


If you're using the ata driver, please remove the ad_sleep() call
at the bottom of the adopen() in ata-disk.c and see if that
helps you.

Poul-Henning

In message [EMAIL PROTECTED], Taavi Talvik writes:

Current seems to hangs, when amanda tries to run dump.
This appeared approximately 2 weeks ago, and is present in
yesterdays current also.

#0  0xc013daa4 in boot ()
(kgdb) bt
#0  0xc013daa4 in boot ()
#1  0xc013de41 in panic ()
#2  0xc011d739 in db_panic ()
#3  0xc011d6d9 in db_command ()
#4  0xc011d79e in db_command_loop ()
#5  0xc011f82f in db_trap ()
#6  0xc02011fb in kdb_trap ()
#7  0xc020f444 in trap ()
#8  0xc0201457 in Debugger ()
#9  0xc01fcb12 in scgetc ()
#10 0xc01f827d in sckbdevent ()
#11 0xc01f1a9b in atkbd_intr ()
#12 0xc021cd84 in atkbd_isa_intr ()
#13 0xc0202fe2 in vec1 ()
#14 0xc0140a55 in tsleep ()
#15 0xc0146889 in diskopen ()
#16 0xc0170251 in spec_open ()
#17 0xc017014d in spec_vnoperate ()
#18 0xc01bb579 in ufs_vnoperatespec ()
#19 0xc016a57c in vn_open ()
#20 0xc01669dd in open ()
#21 0xc020fd12 in syscall ()
#22 0xc0201b06 in Xint0x80_syscall ()
#23 0x804b91a in ?? ()
#24 0x804b6b3 in ?? ()
#25 0x80491c0 in ?? ()

and ps shows
[127:0] taavi@tt:/var/crash #ps -raxwu -M vmcore.4 -N kernel.4
USER PID %CPU %MEM   VSZ  RSS  TT  STAT STARTED  TIME COMMAND
amanda   380 99.0  0.0   6280  ??  R 1Jan70   0:00.00  (dump)
root   1  0.0  0.0   5160  ??  ILs   1Jan70   0:00.02 /sbin/init
--
root   2  0.0  0.0 00  ??  RL1Jan70   0:00.01
(pagedaemon)
root   3  0.0  0.0 00  ??  DL1Jan70   0:00.00  (vmdaemon)
root   4  0.0  0.0 00  ??  RL1Jan70   0:00.01  (bufdaemon)
root   5  0.0  0.0 00  ??  RL1Jan70   0:00.28  (syncer)
.

I have kernel and core saved, if someone needs them.

best regards,
taavi
---
Taavi Talvik| Internet: [EMAIL PROTECTED] 
Unineti Andmeside AS| phone: +372 6405150
Ravala pst. 10  | fax: +372 6405151
Tallinn 10143, Estonia  |





--
Poul-Henning Kamp FreeBSD coreteam member
[EMAIL PROTECTED]   "Real hackers run -current on their laptop."
FreeBSD -- It will take a long time before progress goes too far!

