Re: Wierd su problem still there

1999-09-26 Thread Benjamin Lewis


[EMAIL PROTECTED] said:
 adams@nightfall(18:06:51)$ su
 Password:load: 0.37  cmd: su 381 [ttyin] 0.00u 0.00s 0% 668k
 load: 0.37  cmd: su 381 [ttyin] 0.00u 0.00s 0% 668k
 load: 0.37  cmd: su 381 [ttyin] 0.00u 0.00s 0% 668k
 
 each "load" line is an enter press.

[snip]

 As soon as I get this the app becomes totaly useless, I can't control-z or
 control-c it, I have to kill the term.
 
 VNC (on the password prompt) and other random apps do this too, but you
 get the idea, I just rebuild/installed world from an cvsup from about 30
 minutes ago, this has been going on for over a week (I cvsup every day).
 
 Any ideas?

I too have encountered the same symptoms.  I have no idea why this started 
happening, but I have found a couple ways around it.  First, two Control-Ds 
will act as a Return if you find the status lines appearing.  Second, the
stty command will fix it totally.  I use `stty status ^T' to remap the status 
command to Control-T rather than Control-J (the ^T is a carat, T):

  akira:/home/bhlewis [123]su
  Password:load: 0.03  cmd: su 24921 [ttyin] 0.00u 0.00s 0% 624k
  [2 ^D's here]
  akira:/home/bhlewis [101]#exit
  exit
  akira:/home/bhlewis [124]stty
  speed 38400 baud;
  lflags: echoe echok echoke echoctl pendin
  iflags: -ixany ignpar
  oflags: -oxtabs
  cflags: cs8 -parenb -hupcl
  status
  ^J  
  akira:/home/bhlewis [125]stty status ^T
  akira:/home/bhlewis [126]stty
  speed 38400 baud;
  lflags: echoe echok echoke echoctl pendin
  iflags: -ixany ignpar
  oflags: -oxtabs
  cflags: cs8 -parenb -hupcl
  akira:/home/bhlewis [127]su
  Password:
  akira:/home/bhlewis [101]#

I hope this helps,

-Ben
-- 
Benjamin Lewis
[EMAIL PROTECTED] -or-   [EMAIL PROTECTED]




To Unsubscribe: send mail to [EMAIL PROTECTED]
with "unsubscribe freebsd-current" in the body of the message



Re: cvs commit: src/sbin/dump dump.8 main.c

2002-12-04 Thread Benjamin Lewis
On Tue, 2002-12-03 at 13:21, Kirk McKusick wrote:
 mckusick2002/12/03 10:21:09 PST
 
   Modified files:
 sbin/dumpdump.8 main.c 
   Log:
   Add the `L' option [...]

Aha! I've been waiting for that ever since it was hinted at in the
original snapshots readme.

I do wonder, however, if there is any way to use the feature without
running it as root? I normally run dump as operator (via amanda); when
I hacked up amdump to use L, the dumps failed, like so:

?/-- akira.woss /usr/local lev 1 STRANGE
sendbackup: start [akira.wossname.net:/usr/local level 1]
sendbackup: info BACKUP=/sbin/dump
sendbackup: info RECOVER_CMD=/sbin/restore -f... -
sendbackup: info end
| dump: Cannot create /usr/local/.dump_snapshot: Operation not permitted
| 
??error [/sbin/dump returned 1]? dumper: strange [missing size line from
sendbackup]
? dumper: strange [missing end line from sendbackup]
\

I've played with sysctl vfs.usermount and various permissions for the
mount points but just haven't gotten it to work for operator.  Does
anyone have any hints? Is it even possible to create a snapshot without
being the super-user?

I think I can work around this by configuring amanda to run as root but
I did want to point out that this might be an unexpected problem if the
silent snapshot semantics are ever adopted (as foreshadowed by the
commit message).  I can't be the only one running dump as operator, 
can I?

-Ben

-- 
Benjamin Lewis [EMAIL PROTECTED]


To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Re: Building -STABLE on -CURRENT

2001-01-22 Thread Benjamin Lewis


Robert,

You wrote:

 For the last few days (not sure when it started) I've been unable to build
 -STABLE on a -CURRENT machine.  This has proven a problem for recent
 RELENG_3 MFC's of security fixes; I've tried upgrading to the most recent
 -CURRENT on the box, making sure /usr/include is updated, et al.  I'm
 guessing this is /usr/include pollution in the /usr/src build, but won't
 speculate too much more as I'm travelling tomorrow.  Attached below is the
 breakage from buildworld.

[...]
 
 cd /usr/src/share/syscons/scrnmaps; make build-tools
 cc -static -O -pipe   -I/usr/src/share/syscons/scrnmaps
 -DFIL=\"koi8-r2cp866\"  
  -o koi8-r2cp866.mk /usr/src/share/syscons/scrnmaps/mkscrfil.c
 In file included from /usr/src/share/syscons/scrnmaps/mkscrfil.c:28:
 /usr/include/machine/console.h:3: #error "this file includes
 machine/console.h
  which is deprecated, use sys/{kb,cons,fb}io.h instead"

I've seen the same thing.  I hacked up the -stable source to include the
correct -current include files to get past this, but that strikes me as
extremely non-optimal.  I tried hacking up the Makefile in 
/usr/src/share/syscons/scrnmaps to use a different include path, but don't
have the know-how or mojo to get that working.

Have you had any replies to this message or further insights? If so, I'd 
very much appreciate hearing them.

Thank you,

-Ben










To Unsubscribe: send mail to [EMAIL PROTECTED]
with "unsubscribe freebsd-current" in the body of the message



Re: Is anyone else having trouble with dump(8) on -current?

2002-08-11 Thread Benjamin Lewis

On Fri, 2002-08-09 at 18:07, Ian Dowse wrote:
 
 [replying to an old message]
 
 In message [EMAIL PROTECTED], Alexander Leidi
 nger writes:
 On  7 Mai, Benjamin Lewis wrote:
 |   DUMP: slave couldn't reopen disk: Interrupted system call
 
 Try the attached patch.

[...]

 I was just looking at PR bin/18319 when I remembered this message.
 Many of the changes in your patch are not necessary I believe, as
 read(2) will restart after a signal by default. How about just
 fixing the open call that actually triggers the reported error? I
 suspect that many of the other cases are either impossible or
 extremely unlikely in practice. Could someone who can reproduce the
 couldn't reopen disk error try the following?

[...]

My apologies for not keeping up on this.  Since buying a house, time
for computer-related hobbies has evaporated! Nearly three months later,
some less essential parts of the old home network still remain in boxes.

In any case, I have been using Alex Leidinger's first patch since he
sent it to me.  At various times I have tried an unpatched dump and
the results have varied from fairly good (only 1 out of 6 filesystems
failing each night) to miserable (thousands of errors while reading
the disk) depending on the general state of -current.  As of today's
build, things were behaving on the fairly good side.

I have rebuilt dump with Ian Dowse's patch and things look good so far.
Sometimes it takes several full backup runs by Amanda before a problem
surfaces, so I will report back later in the week.  I wish I knew why
I seem to be the only one seeing this on -current!

-Ben




To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message



Compaq built-in ncr tl controllers with 4.0

1999-02-17 Thread Benjamin Lewis
Hello-

I've been trying to get 3.1 or 4.0 to run on a Compaq Professional Workstation
6000 with dual PII-300s, using the built-in symbios 53c875 SCSI controller, and
the built-in ThunderLan ethernet adapter.  The machine works perfectly with
2.2.8, but we'd like to get it running 3.1 (or 4.0 if necessary) to take 
advantage of the second CPU.

We have been able to complete a make upgrade with freshly cvsupped 3.1 source,
to the point where the new kernel boots.  While booting, the GENERIC kernel
does not find the ncr or tl devices, and of course fails to mount root.  I'd
include dmesg from those boots, but it doesn't get far enough to write it
anywhere.  The new bootblocks seem to be working fine, and are able to boot
the kernel, but it fails with a cannot mount root message and then panics.

We then tried to boot with the 3.1 and the 4.0 boot floppies.  Neither was
able to find the SCSI controller or the ethernet device.  Of course, we find
it odd that 2.2.8 found the devices ok, but newer releases do not.  As far
as I can tell, the hardware is supported by CAM, etc. (I have a Tekram 390F
53c875-based card in another 4.0 machine that works great).  The installs
failed with the complaint that no disks could be found to install on.

A search through the mailing list archives yielded little information that
still seemed relevant (apparently, the tl0 driver wasn't around in 2.2.6 or
earlier but that obviously changed before 2.2.8).  

I've included the 2.2.8 dmesg output below.  I'm hoping that someone out there
will see something in them that I cannot and will provide us with the magic
incantation needed to get this thing running 3.1 or 4.0.  Our suspicions are
on the PCI bridge, since both the unfound devices are on pci bus 1, while
the detected devices reside on bus 0, but we don't know what to do about that.

By the way, the unidentified storage device that doesn't get a driver assigned
on pci1:10 is a Jaz Jet card, apparently with an Advansys chipset, that 2.2.8
doesn't grok, but 3.1+ should find ok.  It doesn't get detected by 3.1 or 4.0
kernels either.

Thank you in advance,

-Ben

Copyright (c) 1992-1998 FreeBSD Inc.
Copyright (c) 1982, 1986, 1989, 1991, 1993
The Regents of the University of California.  All rights reserved.

FreeBSD 2.2.8-STABLE #0: Wed Feb 17 09:02:09 GMT 1999
bhle...@server2.mediumlook.com:/usr/src/sys/compile/CALVIN
CPU: Pentium II (299.53-MHz 686-class CPU)
  Origin = GenuineIntel  Id = 0x633  Stepping=3
  Features=0x80fbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMO
V,MMX
real memory  = 536870912 (524288K bytes)
avail memory = 524316672 (512028K bytes)
Probing for devices on PCI bus 0:
chip0 Ross (?) host to PCI bridge rev 1 on pci0:0:0
pci0:9:Compaq, device=0xa0f8, class=serial, subclass=0x03 int a irq 11 [no d
river assigned]
vga0 VGA-compatible display device rev 1 int a irq 11 on pci0:13:0
chip1 generic PCI bridge (vendor=0e11 device=a0f3 subclass=1) rev 12 on pci0:1
5:0
pci0:15:1: Compaq, device=0xae33, class=storage (ide) int a irq 15 [no driver as
signed]
chip2 Ross (?) host to PCI bridge rev 1 on pci0:17:0
Probing for devices on PCI bus 1:
tl0 Compaq Netelligent 10/100 Proliant rev 16 int a irq 11 on pci1:7:0
tl0: Ethernet address: 00:80:5f:85:23:8b
tl0: autoneg complete, link status good (half-duplex, 10Mbps)
ncr0 ncr 53c875 fast20 wide scsi rev 4 int a irq 11 on pci1:9:0
ncr0 waiting for scsi devices to settle
(ncr0:0:0): WIDE SCSI (16 bit) enabled(ncr0:0:0): 10.0 MB/s (200 ns, offset 15)
(ncr0:0:0): COMPAQ WDE4360W 1.52 type 0 fixed SCSI 2
sd0(ncr0:0:0): Direct-Access 
sd0(ncr0:0:0): WIDE SCSI (16 bit) enabled
sd0(ncr0:0:0): 40.0 MB/s (50 ns, offset 15)
4094MB (8386000 512 byte sectors)
pci1:10:vendor=0x10cd, device=0x1300, class=storage (scsi) int a irq 11 [no 
driver assigned]
Probing for devices on the ISA bus:
sc0 at 0x60-0x6f irq 1 on motherboard
sc0: VGA color 16 virtual consoles, flags=0x0
sio0 at 0x3f8-0x3ff irq 4 on isa
sio0: type 16550A
sio1 at 0x2f8-0x2ff irq 3 on isa
sio1: type 16550A
lpt0 at 0x378-0x37f irq 7 on isa
lpt0: Interrupt-driven port
lp0: TCP/IP capable interface
lpt1 not found at 0x
mse0 not found at 0x23c
psm0 at 0x60-0x64 irq 12 on motherboard
psm0: model Generic PS/2 mouse, device ID 0
fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
fdc0: FIFO enabled, 8 bytes threshold
fd0: 1.44MB 3.5in
wdc0 at 0x1f0-0x1f7 irq 14 on isa
wdc0: unit 0 (atapi): CD-ROM CDU571-Q/1.1a, removable, accel, dma, iordis
wcd0: 1378KB/sec, 128KB cache, audio play, 256 volume levels, ejectable tray
wcd0: door open, unlocked
wdc1 not found at 0x170
uha0 not found at 0x330
aha0 not found at 0x330
aic0 not found at 0x340
nca0 not found at 0x1f88
nca1 not found at 0x350
sea0 not found
wt0 not found at 0x300
mcd0 not found at 0x300
matcdc0 not found at 0x230
scd0 not found at 0x230
npx0 flags 0x1 on motherboard
npx0: INT 16 interface

-- 
Benjamin Lewis
bhle...@gte.net -or-   bhle...@purdue.edu




To Unsubscribe: send mail

Re: Compaq built-in ncr tl controllers with 4.0

1999-02-21 Thread Benjamin Lewis
Mr. Rabson-

Sorry it's been so long for me to get back to you about the patch you sent.  
The machine is located accross country  3 time zones away, so coordinating 
with the people at the console has been tedious.

In any case, the patch worked brilliantly.  The machine is now running a 4.0
generic kernel and is in the process of building and installing the SMP kernel.
I would suggest that the patches make their way into the 4.0 and 3.1 source 
trees, although I have yet to test them on a machine that was not affected by 
the PCI bus probe problem.  I'll incorporate the patch into a more 
conventional 4.0 machine later today and let you know the results.  

We want to thank you so much for your quick response.  The owners of the 
machine are quite gratified that they will be able to take advantage of the 
second processor after all.

The dmesg from the machine appears below, in case you find it interesting.

-Ben


Copyright (c) 1992-1999 FreeBSD Inc.
Copyright (c) 1982, 1986, 1989, 1991, 1993
The Regents of the University of California. All rights reserved.
FreeBSD 4.0-CURRENT #0: Sat Feb 20 16:51:12 PST 1999
bhle...@server.mediumlook.com:/usr/src/sys/compile/GENERIC
Timecounter i8254  frequency 1193182 Hz
CPU: Pentium II (299.53-MHz 686-class CPU)
  Origin = GenuineIntel  Id = 0x633  Stepping=3
  Features=0x80fbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMO
V,MMX
real memory  = 16777216 (16384K bytes)
avail memory = 13316096 (13004K bytes)
Preloaded elf kernel kernel at 0xf0342000.
Probing for devices on PCI bus 0:
chip0: Ross (?) host to PCI bridge rev 0x01 on pci0.0.0
vga0: S3 Trio graphics accelerator rev 0x43 int a irq 11 on pci0.13.0
chip1: PCI to ISA bridge (vendor=0e11 device=a0f3) rev 0x0c on pci0.15.0
ide_pci0: PCI IDE controller (busmaster capable) rev 0x0a int a irq 15 on pci0
.15.1
chip2: Ross (?) host to PCI bridge rev 0x01 on pci0.17.0
Probing for devices on PCI bus 1:
tl0: Compaq Netelligent 10/100 Proliant rev 0x10 int a irq 11 on pci1.7.0
tl0: Ethernet address: 00:80:5f:85:97:2e
tl0: autoneg complete, link status good (half-duplex, 10Mbps)
ncr0: ncr 53c875 fast20 wide scsi rev 0x04 int a irq 11 on pci1.9.0
adv0: AdvanSys ASC3150 Ultra SCSI controller rev 0x02 int a irq 11 on pci1.10.
0
adv0: AdvanSys Ultra SCSI Host Adapter, SCSI ID 0, queue depth 16
Probing for devices on PCI bus 2:
Probing for PnP devices:
CSN 1 Vendor ID: ESS1868 [0x68187316] Serial 0x Comp ID: @@@ [0x
]
Probing for devices on the ISA bus:
sc0 on isa
sc0: VGA color 16 virtual consoles, flags=0x0
ed0 not found at 0x280
fe0 not found at 0x300
atkbdc0 at 0x60-0x6f on motherboard
atkbd0 irq 1 on isa
psm0 irq 12 on isa
psm0: model Generic PS/2 mouse, device ID 0
sio0 at 0x3f8-0x3ff irq 4 flags 0x10 on isa
sio0: type 16550A
sio1 at 0x2f8-0x2ff irq 3 on isa
sio1: type 16550A
ppc0 at 0x378 irq 7 on isa
ppc0: Generic chipset (ECP/PS2/NIBBLE) in COMPATIBLE mode
ppc0: FIFO with 16/16/8 bytes threshold
plip0: PLIP network interface on ppbus 0
lpt0: generic printer on ppbus 0
lpt0: Interrupt-driven port
ppi0: generic parallel i/o on ppbus 0
fdc0 at 0x3f0-0x3f7 irq 6 drq 2 on isa
fdc0: FIFO enabled, 8 bytes threshold
fd0: 1.44MB 3.5in
wdc0 at 0x1f0-0x1f7 irq 14 on isa
wdc0: unit 0 (atapi): CD-ROM CDU571-Q/1.1a, removable, accel, dma, iordis
acd0: drive speed 1378KB/sec, 128KB cache
acd0: supported read types: CD-DA
acd0: Audio: play, 256 volume levels
acd0: Mechanism: ejectable tray
acd0: Medium: no/blank disc inside, unlocked
wdc1 not found at 0x170
wt0 not found at 0x300
mcd0 not found at 0x300
matcdc0 not found at 0x230
scd0 not found at 0x230
ie0: unknown board_id: f000
ie0 not found at 0x300
ep0 not found at 0x300
ex0 not found
le0 not found at 0x300
lnc0 not found at 0x280
ze0 not found at 0x300
zp0 not found at 0x300
cs0 not found at 0x300
adv0 not found at 0x330
bt0 not found at 0x134
aha0 not found at 0x134
vga0 at 0x3b0-0x3df maddr 0xa msize 131072 on isa
npx0 on motherboard
npx0: INT 16 interface
Waiting 15 seconds for SCSI devices to settle
changing root device to da0s1a
da1 at adv0 bus 0 target 4 lun 0
da1: iomega jaz 1GB J.83 Removable Direct Access SCSI-2 device 
da1: 10.000MB/s transfers (10.000MHz, offset 15)
da1: Attempt to query device size failed: NOT READY, Medium not present
da0 at ncr0 bus 0 target 0 lun 0
da0: COMPAQ WDE4360W 1.52 Fixed Direct Access SCSI-2 device 
da0: 40.000MB/s transfers (20.000MHz, offset 15, 16bit), Tagged Queueing Enabled
da0: 4094MB (8386000 512 byte sectors: 255H 63S/T 522C)


-- 
Benjamin Lewis
bhle...@gte.net -or-   bhle...@purdue.edu




To Unsubscribe: send mail to majord...@freebsd.org
with unsubscribe freebsd-current in the body of the message



Recent -current hangs on Tyan S2460 before finishing boot

2003-11-12 Thread Benjamin Lewis
Hello,

I'm having trouble getting recent (post- device apic, pre- turnstile) 
kernels to boot on my Tyan S2460 (Tiger MP) system with dual AMD 
Athlons.  What happens is that the machine seems to get stuck soon
after the Waiting for SCSI devices to settle message is printed -- it
appears to be willing to wait forever rather than the SCSI_DELAY time.

Disabling ACPI in the BIOS has no apparent effect on the hang.  Using
SCHED_4BSD or SCHED_ULE likewise makes no difference.  I've been 
following the current@ list hoping to see someone else report a problem
similar to mine but haven't seen anything yet.

I do have a serial console attached to the machine and DDB enabled so
I'm able to provide some information and get more if needed.  I'm
including a copy of the boot messages from my last attempt to boot
FreeBSD 5.1-CURRENT #2: Tue Nov 11 17:35:40 EST 2003 which was 
cvsup'ed shortly prior to the build date.  Included in the messages are
the output of ps and trace once I broke into ddb.

I'm also including output from acpidump -t and mptable -verbose 
since I've seen that information requested in the past.

Some details about the system that may be pertinent:
1. It has two 1Ghz Athlon Thunderbird (Not MP) processors.
   That hasn't been a problem so far.
2. The BIOS is version 1.04 (latest is 1.05).  The last time I
   tried updating to 1.05 (some time ago) I saw lots of error
   messagess complaining about undefined ACPI stuff so I
   reverted.
3. There is a Tekram 390F (I think that's the model -- it uses
   the sym driver) and an Adaptec 3944 SCSI controller.  A
   single internal SCSI drive is connected to the Tekram and 10
   external drives are connected to the two ports on the 3944.
   The external drives are configured as a Vinum Raid10 array.
   There's also a single IDE drive connected to one of the
   built-in IDE controllers.
Please let me know if there is anything more you want to know.

Thanks,

-Ben

Type '?' for a list of commands, 'help' for more detailed help.
OK boot -sv
-\|/-\|SMAP type=01 base= len=0009f400
SMAP type=02 base=0009f400 len=0c00
SMAP type=02 base=000e4800 len=0001b800
SMAP type=01 base=0010 len=0fef
SMAP type=03 base=0fff len=fc00
SMAP type=04 base=0c00 len=0400
SMAP type=02 base=fec0 len=0001
SMAP type=02 base=fee0 len=1000
SMAP type=02 base=fff8 len=0008
Copyright (c) 1992-2003 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 5.1-CURRENT #2: Tue Nov 11 17:35:40 EST 2003
[EMAIL PROTECTED]:/export/obj/usr/src-all/current/src/sys/AKIRA.ULE
Preloaded elf kernel /boot/kernel/kernel at 0xc089d000.
Preloaded elf module /boot/kernel/linux.ko at 0xc089d250.
Preloaded elf module /boot/kernel/snd_pcm.ko at 0xc089d2fc.
Preloaded elf module /boot/kernel/snd_es137x.ko at 0xc089d3a8.
Preloaded elf module /boot/kernel/usb.ko at 0xc089d458.
Preloaded elf module /boot/kernel/ums.ko at 0xc089d500.
Preloaded elf module /boot/kernel/agp.ko at 0xc089d5a8.
Preloaded elf module /boot/kernel/random.ko at 0xc089d650.
Calibrating clock(s) ... i8254 clock: 1192972 Hz
CLK_USE_I8254_CALIBRATION not specified - using default frequency
Timecounter i8254 frequency 1193182 Hz quality 0
Calibrating TSC clock ... TSC clock: 141373 Hz
CPU: AMD Athlon(tm) Processor (1000.04-MHz 686-class CPU)
  Origin = AuthenticAMD  Id = 0x642  Stepping = 2
  
Features=0x183fbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR
  AMD Features=0xc044RSVD,AMIE,DSP,3DNow!
Data TLB: 24 entries, fully associative
Instruction TLB: 16 entries, fully associative
L1 data cache: 64 kbytes, 64 bytes/line, 1 lines/tag, 2-way associative
L1 instruction cache: 64 kbytes, 64 bytes/line, 1 lines/tag, 2-way associative
L2 internal cache: 256 kbytes, 64 bytes/line, 1 lines/tag, 8-way associative
real memory  = 268369920 (255 MB)
Physical memory chunk(s):
0x1000 - 0x0009efff, 647168 bytes (158 pages)
0x0010 - 0x003f, 3145728 bytes (768 pages)
0x00c29000 - 0x0fb3dfff, 250695680 bytes (61205 pages)
avail memory = 251088896 (239 MB)
ACPI APIC Table: PTLTDAPIC  
APIC ID: physical 0, logical 0:0
APIC ID: physical 1, logical 0:1
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): APIC ID:  1
 cpu1 (AP): APIC ID:  0
APIC: CPU 0 has ACPI ID 0
APIC: CPU 1 has ACPI ID 1
bios32: Found BIOS32 Service Directory header at 0xc00f7480
bios32: Entry = 0xfd6c0 (c00fd6c0)  Rev = 0  Len = 1
pcibios: PCI BIOS entry at 0xfd6c0+0x120
pnpbios: Found PnP BIOS data at 0xc00f74d0
pnpbios: Entry = 

Re: Recent -current hangs on Tyan S2460 before finishing boot

2003-11-13 Thread Benjamin Lewis
On November 13 2003, John Baldwin wrote:

 
 On 13-Nov-2003 Benjamin Lewis wrote:

[trouble booting -current on Tyan Tiger MP]

 Can you do a 'show intrcnt' from the ddb prompt?  It sounds like you
 may be getting an interrupt storm due to a mis-routed PCI interrupt.

I didn't see any unexpectedly large numbers, but IRQ 15 isn't in the
list:

[...]
GEOM: create disk ad0 dp=0xc2e19370
ad0: 57241MB WDC WD600AB-00BVA0 [116301/16/63] at ata0-master UDMA100
Waiting 10 seconds for SCSI devices to settle
[halt - sent]
Stopped at  siointr1+0xec:  jmp siointr1+0x220
db show intrcnt
irq4: sio0  2
irq8: rtc   1725
irq13: npx0 1
stray irq13 1
irq14: ata0 17
irq19: ohci0 ahc0   1
irq0: clk   1
db cont
[halt - sent]
Stopped at  siointr1+0xec:  jmp siointr1+0x220
db show intrcnt
irq4: sio0  4
irq8: rtc   3407
irq13: npx0 1
stray irq13 1
irq14: ata0 17
irq19: ohci0 ahc0   1
irq0: clk   1
db cont
[halt - sent]
Stopped at  siointr1+0xec:  jmp siointr1+0x220
db show intrcnt
irq4: sio0  6
irq8: rtc   7418
irq13: npx0 1
stray irq13 1
irq14: ata0 17
irq19: ohci0 ahc0   1
irq0: clk   1
db 

 Actually, I think the problem is in the ata driver.  Well, there are
 possibly bugs in the interrupt code in that interrupts that don't exist
 in the mptable (IRQ's 11 and 15) still get created, but, the fact that
 the mptable has no IRQ 15 to me means that there is no IRQ 15 and thus
 there should not be an ata1.  Note that in your dmesg, ata1 does say
 that it doesn't do DMA because it has been disabled.  Perhaps the ata
 driver needs to disable ata1 altogether on that chipset if it sees that
 condition.  My guess is that the ata driver is waiting forever for an
 interrupt from ata1 which is never going to arrive, hence the hang.
 Do you have a boot -v dmesg from a working kernel?

Unfortunately, I'll have to wait until I get home to try enabling the
ata1 controller in the BIOS.  dmesg output from a boot -v with my working
kernel is below.

Thanks for looking at this!

-Ben

Copyright (c) 1992-2003 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD 5.1-CURRENT #1: Sat Oct 18 13:47:46 EST 2003
[EMAIL PROTECTED]:/export/obj/usr/src-all/current/src/sys/AKIRA
Preloaded elf kernel /boot/kernel.ok/kernel at 0xc086.
Preloaded elf module /boot/kernel.ok/snd_pcm.ko at 0xc0860254.
Preloaded elf module /boot/kernel.ok/snd_es137x.ko at 0xc0860304.
Preloaded elf module /boot/kernel.ok/usb.ko at 0xc08603b8.
Preloaded elf module /boot/kernel.ok/ums.ko at 0xc0860464.
Preloaded elf module /boot/kernel.ok/agp.ko at 0xc0860510.
Preloaded elf module /boot/kernel.ok/random.ko at 0xc08605bc.
Preloaded elf module /boot/kernel.ok/acpi.ko at 0xc086066c.
Calibrating clock(s) ... i8254 clock: 1192965 Hz
CLK_USE_I8254_CALIBRATION not specified - using default frequency
Timecounter i8254 frequency 1193182 Hz quality 0
Calibrating TSC clock ... TSC clock: 138815 Hz
CPU: AMD Athlon(tm) Processor (1000.04-MHz 686-class CPU)
  Origin = AuthenticAMD  Id = 0x642  Stepping = 2
  
Features=0x183fbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR
  AMD Features=0xc044RSVD,AMIE,DSP,3DNow!
Data TLB: 24 entries, fully associative
Instruction TLB: 16 entries, fully associative
L1 data cache: 64 kbytes, 64 bytes/line, 1 lines/tag, 2-way associative
L1 instruction cache: 64 kbytes, 64 bytes/line, 1 lines/tag, 2-way associative
L2 internal cache: 256 kbytes, 64 bytes/line, 1 lines/tag, 8-way associative
real memory  = 268369920 (255 MB)
Physical memory chunk(s):
0x1000 - 0x0009efff, 647168 bytes (158 pages)
0x0010 - 0x003f, 3145728 bytes (768 pages)
0x00c29000 - 0x0fb3dfff, 250695680 bytes (61205 pages)
avail memory = 251080704 (239 MB)
Programming 24 pins in IOAPIC #0
IOAPIC #0 intpin 2 - irq 0
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
 cpu0 (BSP): apic id:  1, version: 0x00040010, at 0xfee0
 cpu1 (AP):  apic id:  0, version: 0x00040010, at 0xfee0
 io0 (APIC): apic id:  2, version: 0x00170011, at 0xfec0
bios32: Found BIOS32 Service Directory header at 0xc00f7480
bios32: Entry = 0xfd6c0 (c00fd6c0)  Rev = 0  Len = 1
pcibios: PCI BIOS entry at 0xfd6c0+0x120
pnpbios: Found PnP BIOS data at 0xc00f74d0
pnpbios: Entry = f:9ece  Rev = 1.0
Other BIOS signatures found:
null: null device, zero device
mem: memory  I/O
Pentium Pro MTRR support enabled
random: entropy source
SMP: CPU0 bsp_apic_configure():
 lint0: 0x00010700 lint1: 0x0400 TPR: 0x SVR: 0x01ff
npx0: math processor on motherboard
npx0: INT 16 interface
acpi0: PTLTDRSDT   on motherboard
pci_open(1):mode

Re: Recent -current hangs on Tyan S2460 before finishing boot

2003-11-13 Thread Benjamin Lewis
On November 13 2003, John Baldwin wrote:

  On 13-Nov-2003 Benjamin Lewis wrote:
  [trouble booting -current on Tyan Tiger MP]

  Can you do a 'show intrcnt' from the ddb prompt?  It sounds like you
  may be getting an interrupt storm due to a mis-routed PCI interrupt.


  db show intrcnt
[...]
  irq0: clk   1

 You aren't getting any clock interrupts it seems.  Very weird.  Can
 you try using NO_MIXED_MODE instead?  I still think the ata1 is bogus,
 but I don't think that is causing your hang now.

NO_MIXED_MODE did the trick -- an otherwise identical kernel to the one
that was hanging now boots to multiuser successfully.

Thank you so much for your help!

-Ben

-- 
Benjamin Lewis [EMAIL PROTECTED]
Database Analyst/Programmer
IT Security and Policy - Identity and Access Management
Purdue University

___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-current
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Is anyone else having trouble with dump(8) on -current?

2002-05-15 Thread Benjamin Lewis

On Tue, 2002-05-14 at 05:34, Alexander Leidinger wrote:
 On  7 Mai, Benjamin Lewis wrote:
 
  Now, on to the problem.  I use amanda for backups, and since mid-April
  I've been seeing items like the following in the backup report:

[edited for brevity]

  |   DUMP: slave couldn't reopen disk: Interrupted system call
  |   DUMP: The ENTIRE dump is aborted.
  sendbackup: error [/sbin/dump returned 3]
  \
 
 Try the attached patch. I also have a similar patch for restore. I don't
 like the patch, I think I should use SA_RESTART with sigaction(), so
 think about this patch as a proof of concept (if it solves your
 problem).

For those that are interested and those that come across this in the
archives, I would like to thank Alexander Leidinger very much.  The
patch he sent me seems to have solved the problem illustrated above.

I ran several test dumps immediately after rebuilding /sbin/dump, and
it was very promising in that no errors were reported.  Prior to the
patch, I was consistently able to generate an error within 3 attempts.

Since then, Amanda has run and there were no reported issues with the
run.  Last night was the first entirely successful run in a long time.
I pulled all of the dumps back from tape, restored them to a secondary
location, and used cmp to compare them to the originals.  The only
differences found were the expected logfiles and other ephemeral data.

In short, the patch seems to have completely solved my problem

My thanks to M. Leidinger,

-Ben



To Unsubscribe: send mail to [EMAIL PROTECTED]
with unsubscribe freebsd-current in the body of the message