Re: Patch: sym(4) VTOBUS FAILED panics on amd64, amd64/89550

2006-09-21 Thread Doug White

On Fri, 22 Sep 2006, Jan Mikkelsen wrote:


Quick summary:  sym(4) assumes on amd64 that virtual addresses provided by
bus_dmamem_alloc() have the same alignment as the physical addresses (in
this case, 2*PAGE_SIZE).  They don't, and stuff breaks.  This patch works
around that.


Why is this? busdma supports alignment constraints; why not just set the 
alignment to what you need it set at? I realize sym has its own hand 
rolled DMA management craziness but alignment is something busdma can take 
care of easily.


Also the changes to MEMO_CLUSTER_SIZE seems to be already compensated for 
by the code blocks that calculate it:


#define MEMO_SHIFT  4   /* 16 bytes minimum memory chunk */
#ifndef __amd64__
#define MEMO_PAGE_ORDER 0   /* 1 PAGE  maximum */
#else
#define MEMO_PAGE_ORDER 1   /* 2 PAGEs maximum on amd64 */
#endif
#if 0
#define MEMO_FREE_UNUSED/* Free unused pages immediately */
#endif
#define MEMO_WARN   1
#define MEMO_CLUSTER_SHIFT  (PAGE_SHIFT+MEMO_PAGE_ORDER)
#define MEMO_CLUSTER_SIZE   (1UL  MEMO_CLUSTER_SHIFT)
#define MEMO_CLUSTER_MASK   (MEMO_CLUSTER_SIZE-1)

This results in 2*PAGE_SIZE clusters on amd64 and PAGE_SIZE clusters on 
other platforms. Since you seem to like doing MEMO_CLUSTER_SIZE * 2, why 
not just increase MEMO_PAGE_ORDER to 2 (and get 4*PAGE_SIZE clusters) ?


Oh dear, I didn't notice that the call to bus_dma_tag_create() has bad 
arguments. From the (RELENG_6) source:


if (!bus_dma_tag_create(dev_dmat, 1, MEMO_CLUSTER_SIZE,
   BUS_SPACE_MAXADDR_32BIT,
   BUS_SPACE_MAXADDR_32BIT,
   NULL, NULL, MEMO_CLUSTER_SIZE, 1,
   MEMO_CLUSTER_SIZE, 0,
   busdma_lock_mutex, Giant, mp-dmat)) {

As you fixed, that second BUS_SPACE_MAXADDR_32BIT should be 
BUS_SPACE_MAXADDR since its an exclusion zone. I'm suprised that doesn't 
fix it right there.


If increasing MEMO_PAGE_ORDER fixes the issue, then the #ifs are 
extraneous.


Also we generally prefer diffs in unidiff (-u) format, its a little easier 
to figure out exactly what changed just by looking at the diff. Thanks!


--
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Dual Opteron system will not run SMP

2006-06-09 Thread Doug White

On Wed, 7 Jun 2006, Pete French wrote:


If just non-ACPI isnt sufficient, the other thing SAFE does is turn
off disk DMA. I have an as-yet unreleased system that has this
same type of issue, and the problem is that two PCI device ID's
are not recognized, so maybe that will be your problem.


So, I got around to booting the system without ACPI and there are quite
a number of 'unknown' reports at boot, viz:

unknown: PNP0c01 can't assign resources (memory)
unknown: PNP0303 can't assign resources (port)
unknown: PNP0f13 can't assign resources (irq)
unknown: PNP0501 can't assign resources (port)
unknown: PNP0501 can't assign resources (port)
unknown: PNP0700 can't assign resources (port)
unknown: PNP0400 can't assign resources (port)


These are normal motherboard resources that FreeBSD reserves via other 
methods. They can be safely ignored.



pciconf -l gives me 5 devices without drivers attached, these being:

[EMAIL PROTECTED]:7:2: class=0x0c0500 card=0x13101462 chip=0x746a1022 rev=0x02 
hdr=0x00
   vendor   = 'Advanced Micro Devices (AMD)'
   device   = 'AMD-8111 SMBus 2.0 Controller'
   class= serial bus
   subclass = SMBus


You need to load a device driver to attach to this function; I believe its 
called amdpm. It is only needed if you want to inquiry devices on SMBus.



[EMAIL PROTECTED]:7:3: class=0x068000 card=0x13101462 chip=0x746b1022 rev=0x05 
hdr=0x00
   vendor   = 'Advanced Micro Devices (AMD)'
   device   = 'AMD-8111 ACPI System Management Controller'
   class= bridge
[EMAIL PROTECTED]:10:1:class=0x080010 card=0x13101462 chip=0x74511022 
rev=0x01 hdr=0x00
   vendor   = 'Advanced Micro Devices (AMD)'
   device   = 'AMD-8131 PCI-X IOAPIC'
   class= base peripheral
   subclass = interrupt controller
[EMAIL PROTECTED]:11:1:class=0x080010 card=0x13101462 chip=0x74511022 
rev=0x01 hdr=0x00
   vendor   = 'Advanced Micro Devices (AMD)'
   device   = 'AMD-8131 PCI-X IOAPIC'
   class= base peripheral
   subclass = interrupt controller


FreeBSD detects and attaches these through ACPI and not through the PCI 
bus. In the non-ACPI case (which BTW is not exactly kosher for amd64, 
since ACPI is a required part of the platform spec), IOAPICs are 
enumerated and attached through MPTable.


The fact that the IOAPICs have PCI bus attachments at all is an 
implementation detail. They didn't back in the Pentium Pro days.



[EMAIL PROTECTED]:6:0: class=0x03 card=0x13101462 chip=0x47521002 rev=0x27 
hdr=0x00
   vendor   = 'ATI Technologies Inc'
   device   = 'Rage XL PCI'
   class= display
   subclass = VGA


This usually gets grabbed either by the agp driver or by the DRM (Direct 
Rendering Module) appropriate for the chip. The console uses generic VGA 
register accesses and doesn't need the PCI attachment to figure that out. 
If you don't run X then this is normal.


My recommendation for problems with amd64 boards:

1. Upgrade to the latest release BIOS from the vendor. This is really, 
really, really important; the latest BIOS often has revised memory and bus 
timings that improve compatibility. The Tyan S2892 in particular needs it 
to avoid lockups if any PCI cards are installed.


2. Lockups at the SCSI detection phase are usually due to interrupt 
routing issues (an interrupt gets stuck on and won't clear since we can't 
figure out who's triggering it). If you have PCI cards installed try 
moving them to different slots or removing them entirely. Also disconnect 
any USB peripherals you may have connected.


3. You could have a defective IOAPIC, or other device, on your system 
board. Very rare, but possible.


--
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 6.1-RELEASE: WARNING - WRITE_DMA soft error

2006-06-09 Thread Doug White

On Mon, 5 Jun 2006, srwadleigh wrote:


I have 6.1-RELEASE installed on a Supermicro SuperServer 6014P-TR
with Supermicro motherboard: Super X6DHP-TG 
http://supermicro.com/products/motherboard/Xeon800/E7520/X6DHP-TG.cfm


I have four SATA drives attached to the internal backplane which uses
the following controller, I am Not using the onboard RAID features:

   Marvell 88SX6081 4-port SATA Controller with 3rd-Party Adaptec
   AIC-8110(4x drive), RAID 0, 1, JBOD support

The problem I am seeing occurs with the fourth drive, ad10, and appears
on all read/write operations:

ad10: WARNING - WRITE_DMA48 soft error (ECC corrected) LBA=293046767
kernel:ad10: WARNING - WRITE_DMA soft error (ECC corrected) LBA=12393823
kernel:ad10: WARNING - READ_DMA soft error (ECC corrected) LBA=12480567

This same warning message appears on 6.0-RELEASE and 6.1-STABLE


I have problems with a similar chassis w/ SCSI and bay 3 throwing spurious 
errors on a number of systems, so I think its just poor backplane design 
on SuperMicro's part. Try getting them to replace your backplane board.


--
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Suggestions for Adaptec SAS AIC9410

2006-06-09 Thread Doug White

On Mon, 5 Jun 2006, Andy Dills wrote:



I'm trying to get a Supermicro 6024H-32R setup with 6.1, but I'm
discovering that the Adaptec SAS AIC9410 controller isn't yet supported.

I see one brief thread about it on this mailing list back in March, but no
further mention or conclusive details. There was talk of MFCing the mpt
driver...is anybody using this in production?


Can you get a pciconf -lv off this system? It might just be a case of 
needing to add an ID to an existing driver (aac?).


--
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 6.1 kernel unable to find /dev ?

2006-06-03 Thread Doug White

On Sat, 3 Jun 2006, Brian Tao wrote:


   I had a very stable 6.1-R amd64 server (once I swapped out some
bad RAM, that is) that needed a couple more hard drives installed.
There were some problems with the upgrade (device renumbering woes,
basically...  topic of another thread), and it had to be rolled back.

   Upon rolling back, the previously-good kernel would no longer
complete the boot after the device probe.  I saw two types of panics
on the serial console:

| Trying to mount root from ufs:/dev/ad4s1a
| Lookup of /dev for devfs, error: 20


Error 20 is ENOTDIR which means something along the requested path exists, 
but it is not a directory. From this output it looks the root directory 
entry is somehow corrupted or being misinterpeted.



| exec /sbin/init: error 20
| exec /sbin/oinit: error 20
| exec /sbin/init.bak: error 20
| exec /rescue/init: error 20
| exec /stand/sysinstall: error 20
| init: not found in path
| /sbin/init:/sbin/oinit:/sbin/init.bak:/rescue/init:/stand/sysinstall
| panic: no init
| Uptime: 8s
| Cannot dump. No dump device defined.
| Automatic reboot in 15 seconds - press a key on the console to abort
| -- Press a key on the console to reboot,
| -- or switch off the system now.

... and:

| Trying to mount root from ufs:/dev/ad4s1a
| pid 47 (sh), uid 0: exited on signal 11
| TPTE at 0x840028e0  IS ZERO @ VA 80051c000
| panic: bad pte
| Uptime: 8s


This is usually indicative of bad RAM or a faulty processor. Since you 
seem to be having disk problems, it may just be due to the disk returning 
faulty data. Or there is a bad kernel module in the mix that is randomly 
corrupting data.



   The first one is suggesting that /dev does not exist (or is not a
directory)... I'm thinking this means that devfs is somehow
unavailable, but I did not think it is even possible to disable devfs
via the kernel config file these days.

   The second one leaves me clueless... I have not been able to find
any useful information on that panic during boot.  Granted, I've only
see the bad pte panic twice... all other reboot attempts result in
the first type of problem.

   Fortunately, I did happen to keep an old 6.0-RELEASE-p6 kernel
around (Apr 15 2006 build).  That kernel boots fine, using the same
filesystem as newer kernels on that drive.  I am up-to-date with the
RELENG_6_1 tag.  Should I perhaps to a make installkernel installworld
before rebooting?  The installed binaries on the server are from an
early 6.1-RELEASE (which *was* successfully booted by this server).  I
am running into a few minor but surmountable problems because of the
older kernel version, but I obviously would like to get my world and
kernel back in sync ASAP.


My gut feeling is that there is still a disconnect on what the root 
filesystem is. That or there is hidden corruption that 6.0 isn't noticing 
that 6.1 is.  Here's what I'd do next:


1. Capture the boot output from both the working 6.0 kernel and your 
broken 6.1 kernel and compare the two. If there are differences or errors 
being returned from the ATA controller or disks then those will need to be 
addressed.


2. Try a splat-over reinstall of 6.1-R from CD to force everything to 
match up. Mount the filesystems but don't mark them to be newfs'd. Install 
the GENERIC kernel only.


If you are going to be tracking a branch, please read the instructions at 
the end of src/UPDATING on how to perform the build. There is a specific 
procedure and not following it can cause significant issues. While 
unlikely, it is possible to irreparibly damage the system by not following 
the instructions to the letter.


--
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 6.1 buildkernel question

2006-06-03 Thread Doug White

On Fri, 2 Jun 2006, [EMAIL PROTECTED] wrote:

My system works, however the buildkernel process had about 2000 
warnings, with 1/2 of those compiling aic7xxx (see below). This was 
discussed on BSDForums but as far as I can tell, different compiler 
options were used; the conclusion was using -O3 was the problem.


Using compilation options other than the defaults (except for options 
enabled through the CPUTYPE make.conf option) is not recommended or 
supported. Bug reports about compile errors or warnings while non-standard 
options are enabled will be ignored.


That being said:


./machine/bus.h:221: warning: inlining failed in call to 'bus_space_read_1': --p
aram inline-unit-growth limit reached


This is normal. We're not entirely sure how we're hitting the limit in 
this component, but the warning is harmless. You may safely ignore this 
message.


--
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: How to disable libcom_err from being built?

2006-05-06 Thread Doug White
On Fri, 5 May 2006, Peter Losher wrote:

 I have an install base of machines running MIT Krb5 (which have their
 own com_err implementation), and I have always used NO_KERBEROS=true so
 that the integrated Heimdal stuff wouldn't be built during a buildworld.
  However libcom_err does, and that causes issues when trying to link in
 programs that are linked to MIT Krb5.

 What I am asking is - can NO_KERBEROS be extended to cover com_err?

According to src/lib/Makefile, libcom_err is also needed for PAM. Grepping
around it looks like its just picked up for the pam_krb5 and pam_ksu
modules, so this seems like a reasonable request thats easy to implement
in the current framework.

There doesn't appear to be a make.conf option to inhibit libcom_err from
building right now, though. You could always edit it out of
src/lib/Makefile.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 6.1-RC and the audit group

2006-05-06 Thread Doug White
On Sat, 6 May 2006, Mark Kirkwood wrote:

 I found that installworld stops, because the 'audit' group has not been
 created. Now I just pressed 'return' for the default actions during
 mergemaster -p, but I didn't notice any mention of the audit group.

The default action is to do nothing, so its possible you just missed it as
you were blazing through. :-)

Also, if you didn't delete the temporary directory last time, mergemaster
will ask you if you want to re-use it. 9 times out of 10, the answer is
'no', but the default may be to save it.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: jail issue question

2006-05-06 Thread Doug White
On Sat, 6 May 2006, Gergely CZUCZY wrote:

 hi all

 i've got this issue:
 http://www.freebsd.org/cgi/query-pr.cgi?pr=96729
 http://www.freebsd.org/cgi/query-pr.cgi?pr=kern/89528

 i'd like to look into it, because i'm not experience with the
 development of the freebsd (or any other) kernel, yet i'd like
 to create a temporary solution. it would consist of a userspace
 utility that invokes a kernelspace function that cleans out the
 unused jail entry from the jail registry, and perhaps cleans out
 other related entries(such as processes inside the jail).

 my question is, on which kernel tree should I try to manage this,
 on my desktop currently i run 6-STABLE, first i though this could
 be good, but maybe i should do it on -CURRENT. so, which tree should
 i examine, check and maybe modify a bit?

If you look at the audit trail in kern/89528 (96729 was closed as a
duplicate) its related to the known pty leak issue. Writing a tool to
brute-force free those resources will not help at all and likely lead to
an immediate panic when something tries to reference the object you
forcibly detached from the jail.

If you can come up with a reliable test case that causes this pty leak,
and be willing to test patches, then it would greatly help the debugging
effort.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: USB Flash reader under RELENG_6: force GEOM rescan

2006-02-09 Thread Doug White
On Thu, 9 Feb 2006, Dmitry Morozovsky wrote:

 OF  
 OF   dd if=/dev/da0 of=/dev/da0 count=1
 OF  
 OF   which results in error, but actually create GEOMs
 OF
 OF The following should work as well, without giving an error:
 OF
 OF dd if=/dev/null of=/dev/da0 count=0
 OF
 OF It opens the device for writing (without actually writing
 OF anything) and immediately closes it again, which causes
 OF devfs to be triggered.

 Aha, actually, it works. Thanks.

How about 'fdisk da0'? :)

I have a usb drive that takes a while to become ready after being
inserted. It initially generates errors when attempting to access it.
Yours may have a similar issue.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: FreeBSD6 on PowerEdge 4200

2006-02-04 Thread Doug White
On Thu, 2 Feb 2006, Enrique Ayesta Perojo wrote:

 Hello, i'm trying to install FreeBSD6 on an old PowerEdge 4200. But i cannot
 even get to sysinstall, when it's booting from the cd it gets to a point
 where the system reboots, these are the messages i can see on the console:

 ...
 amr0: LSILogic MegaRAID 1.51 port 0xf480-0xf4ff irq 9 at device 19.0 on pci0
 amr0: busmaster bit not set, enabling

Try updating the system BIOS and the PERC card's firmware to the latest,
and make sure the card is enabled in the BIOS (if its onboard).
This indicates the device isn't being configured properly by the BIOS.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: PXE Installation

2006-01-26 Thread Doug White
On Fri, 20 Jan 2006, Marten Vijn wrote:

 On Fri, 2006-01-20 at 03:36 +0100, Karel Miklav wrote:
  Hi,
 
  I'm trying to install FreeBSD 6.0 to a sub-notebook without
  floppy or optical unit.

  Please give me a hand, I'm playing with this for whole week.
 
 nice game aint it? sure of your friends is tcpdump :)


 steps to follow (more below), create:

 1 pxeboot
 2 exports a nfs bootdir
 3 populate bootdir
 4 config dhcpd.conf
 5 add hosts to dns or /etc/hosts
 6 proper config for the bootdir

[...]

If the goal is to simply start sysinstall on the target then it is not
necessary to construct a fully-populated root filesystem for the purpose.
Just tell loader to use the mfsroot disk image that's in the boot/
directory and sysinstall will start up like it does when booting directly
from CD.

Instructions for this are at:

http://people.freebsd.org/~dwhite/pxeboot.html

Its fundamentally the same, except step 3 takes about 1 minute (cp -Rp)
instead of an hour.

Admittedly these instructions assume proficiency with configuring NFS,
DHCP, and so forth.  Its mainly a note to myself on how to do it. :)

The addition of vfs.root.mountfrom=ufs:/dev/md0c line to loader.conf is
the key. The CDROM loader.rc already pulls in mfsroot.gz, you just have to
point the kernel at that image instead of the NFS root that pxeboot will
pass on.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 6.0 on Dell 1850 with PERC4e/DC RAID?

2006-01-05 Thread Doug White
On Thu, 5 Jan 2006, Scott Mitchell wrote:

 Hi all,

 I may be getting a new Dell PE1850 soon, to replace our ancient CVS server
 (still running 4-STABLE).  The new machine will ideally run 6.0 and have a
 PERC4e/DC RAID card - the one with battery-backed cache.  This is listed as
 supported by amr(4), but I'm wondering how well it actually works in the
 case of a disk failure.  Will the driver tell me that disk has failed (a
 syslog message would be enough) or will I have to make a daily trip into
 the server room to check the front panel lights?  Presumably it handles
 hot-swapping a replacement drive OK?

From what I remember, you will receive status-change kernel messages when
disks disappear, rebuilds start, and so forth. So for most day-to-day
manipulation you should be fine.

You may want to make sure the auto rebuild option is enabled in the
controller's BIOS since no working control programs from userland are
generally available at this time. That also means you can't create new
volumes at runtime, but thats not so horrible...

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: ACPI not work

2005-12-20 Thread Doug White
On Tue, 20 Dec 2005, Paul.LKW wrote:

 Hello All:
 I have a problem about ACPI not work with the following error from the
 message log

 Dec 21 20:22:24 amd-64 kernel: ioapic0: Assuming intbase of 0
 Dec 21 20:22:24 amd-64 kernel: ioapic0 Version 1.1 irqs 0-23 on
 motherboard
 Dec 21 20:22:24 amd-64 kernel: ACPI-0397: *** Error: NsSearchAndEnter: Bad
 character in ACPI Name: 43005350
 Dec 21 20:22:24 amd-64 kernel: ACPI-0381: *** Error: Looking up [0x43005350]
 (NON-ASCII)
 Dec 21 20:22:24 amd-64 kernel: in namespace, AE_BAD_CHARACTER
 Dec 21 20:22:24 amd-64 kernel: ACPI-0204: *** Error: AcpiLoadTables: Could
 not load namespace: AE_BAD_CHARACTER
 Dec 21 20:22:24 amd-64 kernel: ACPI-0213: *** Error: AcpiLoadTables: Could
 not load tables: AE_BAD_CHARACTER
 Dec 21 20:22:24 amd-64 kernel: ACPI: table load failed: AE_BAD_CHARACTER

 Any idea to fix it.

Try upgrading or reflashing your BIOS. The DSDT in the BIOS image is
corrupted.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Problems with ata RAID?

2005-12-15 Thread Doug White
On Wed, 14 Dec 2005, Steven Hartland wrote:

 Just spotted the following in the logs of one of our machines
 There seems to errors across too many disks for it to be disk issues.
 Any one seen this before / got any advice?

I'd say ad4 went kaboom and is corrupting traffic on its bus, thus the ad5
CRC warnings.  (Remember that Parallel ATA drives are master/slave.)

 ad4: timeout waiting to issue command
 ad4: error issueing WRITE_DMA command
 ar0: WARNING - mirror protection lost. RAID0+1 array in DEGRADED mode
 ad4: timeout waiting to issue command
 ad4: error issueing WRITE_DMA command
 ad4: write metadata failed
 ad6: req=0xc49b0578 WRITE_DMA semaphore timeout !! DANGER Will Robinson !!
 ad5: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=2325663
 ad5: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=2325695
 ad5: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=2325695
 ad5: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=2325727
 ad5: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=2325632
 ad5: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=393344
 ad5: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=393344
 ad5: WARNING - WRITE_DMA UDMA ICRC error (retrying request) LBA=393344
 ad5: FAILURE - WRITE_DMA status=51READY,DSC,ERROR error=84ICRC,ABORTED 
 LBA=393344
 ad4: req=0xc8585258 WRITE_DMA semaphore timeout !! DANGER Will Robinson !!
 ad4: req=0xc8585258 WRITE_DMA semaphore timeout !! DANGER Will Robinson !!
 ad4: req=0xc8585258 WRITE_DMA semaphore timeout !! DANGER Will Robinson !!
 ad4: req=0xc8585258 WRITE_DMA semaphore timeout !! DANGER Will Robinson !!
 ad5: FAILURE - SETFEATURES SET TRANSFER MODE status=51READY,DSC,ERROR 
 error=4ABORTED
 ad5: FAILURE - SETFEATURES SET TRANSFER MODE status=51READY,DSC,ERROR 
 error=4ABORTED
 ad5: FAILURE - SETFEATURES ENABLE RCACHE status=51READY,DSC,ERROR 
 error=4ABORTED
 ad5: FAILURE - SETFEATURES ENABLE WCACHE status=51READY,DSC,ERROR 
 error=4ABORTED
 ad5: FAILURE - SET_MULTI status=51READY,DSC,ERROR error=4ABORTED
 ad4: FAILURE - WRITE_DMA timed out LBA=234441585
 ad4: write metadata failed
 ad5: FAILURE - WRITE_DMA status=59READY,DSC,DRQ,ERROR error=4ABORTED 
 dma=0x06 LBA=234441585
 ad5: write metadata failed
 ad6: req=0xc853c190 WRITE_DMA semaphore timeout !! DANGER Will Robinson !!

 [dmesg]
 atapci0: Promise PDC20267 UDMA100 controller port 
 0xa400-0xa407,0xa800-0xa803,0xac00-0xac07,0xb000-0xb003,0xb400-0xb43f mem
 0xf812-0xf813 irq 18 at device 12.0 on pci0
 ata2: ATA channel 0 on atapci0
 ata3: ATA channel 1 on atapci0
 ad4: 114473MB WDC WD1200JB-00DUA3 75.13B75 at ata2-master UDMA100
 ad5: 114473MB WDC WD1200JB-00DUA3 75.13B75 at ata2-slave UDMA100
 ad6: 114473MB WDC WD1200JB-00GVA0 08.02D08 at ata3-master UDMA100
 ad7: 114473MB WDC WD1200JB-00GVA0 08.02D08 at ata3-slave UDMA100
 ar0: 228946MB Promise Fasttrak RAID0+1 (stripe 64 KB) status: READY
 ar0: disk0 READY (master) using ad4 at ata2-master
 ar0: disk1 READY (master) using ad5 at ata2-slave
 ar0: disk2 READY (mirror) using ad6 at ata3-master
 ar0: disk3 READY (mirror) using ad7 at ata3-slave
 [/dmesg]



 
 This e.mail is private and confidential between Multiplay (UK) Ltd. and the 
 person or entity to whom it is addressed. In the event of misdirection, the 
 recipient is prohibited from using, copying, printing or otherwise 
 disseminating it or any information contained in it.

 In the event of misdirection, illegible or incomplete transmission please 
 telephone (023) 8024 3137
 or return the E.mail to [EMAIL PROTECTED]

 ___
 freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to [EMAIL PROTECTED]


-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Trouble with amd64

2005-11-30 Thread Doug White
Since this references 6.0-RELEASE it should have been sent to -stable, or
since it references amd64 specifically it should have been sent to -amd64.
Rerouting to -stable.

On Wed, 23 Nov 2005, Nils Berzins wrote:

 Hi !

 Few days ago I downloaded release 6.0 ISOs, in hope that I will finally be
 able to run FreeBSD on my home computer. Unfortunately, booting from CD dies
 with:

  panic: sym: VTOBUS FAILED !

 Is there something I can do to get around this error, or do I just wait for
 more less stable 7.0 :-(

According to the code this panic occurs when there is a problem with the
DMA memory management. Typically we see this when the driver is not 64-bit
clean, but I'd find that hard to believe with amd64 since the same driver
runs on sparc64 fine (and must since many Sun machines have embedded sym
controllers).

Unfortunately it doesn't help that the sym driver has its own bizarre DMA
management. It uses busdma but its kinda bolted on over the old
memory-block management. Not going to make it any easier to debug.

 P.S. I have ASUS A8V-E Deluxe with Athlon 64D.

Can we get the dmesg from this system?

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: RELENG_6: ACPI-0698: *** Warning: Type override:

2005-11-22 Thread Doug White
On Sat, 19 Nov 2005, Doug White wrote:

 On Fri, 18 Nov 2005, Ricardo A. Reis wrote:

  Hi all,
 
 
Updating proxy server running 5.4, i resolve enable acpi,smp
  kernel  to use HTT.

 You need to set

 machdep.hyperthreading_enabled=1

Sorry, this should be:

machdep.hyperthreading_allowed=1

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Mount related panic with FreeBSD 6?

2005-11-22 Thread Doug White
On Sun, 20 Nov 2005, [UTF-8] V??clav Haisman wrote:

 Hi,
 I got this panic on freshly installed FreeBSD 6. I did this df -h and
 noticed that /mnt/oldroot/home is somewhat mangled. The /mnt/oldroot is
 root of FreeBSD 4.11 system. I successfully copied some settings and all
 user accounts from that /mnt/oldroot/home earlier today. This is what I
 did before the panic:

[...]

I discovered this by accident with a CDROM the other day. In 6.0 you can
overlay read-only mounts (i.e., mount the same R/O FS on top of itself)
but unmounting it will cause GEOM to tear down the underlying device while
leaving the first mount behind. Next access to the mountpoint will panic
the system.

You can't mount a read/write mount on top of itself, or a r/o mount on a
r/w mount -- you get an error. A quick discussion with phk points to a
faulty or missing access check in GEOM. I'm not familiar with the VFS
operations required to mount a filesystem, though, so I'm not sure where
to look to put in the fix.

In the interim, be careful not to mount a read-only FS multiple times.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.4-RELEASE: integer divide panic (trap 18) during install boot

2005-11-22 Thread Doug White
On Mon, 21 Nov 2005 [EMAIL PROTECTED] wrote:


 Hi.  I've got a system that's dying during the scsi probe of the
 initial install boot.  The console messages are along the lines of:

   [...usual boot stuff...]
   vga0 Generic ISA VGA
   unknown: PNP0303 can't assign resources (port)
   unknown: PNP0c02 can't assign resources (port)
   unknown: PNP0501 can't assign resources (port)
   unknown: PNP0501 can't assign resources (port)
   unknown: PNP0400 can't assign resources (port)
   unknown: PNP0700 can't assign resources (port)
   Timecounter [...]
   Timecounters [...]
   Waiting 15 seconds for SCSI
   md0 preloaded image /mfsroot 4423680 bytes at 0xc0a34270
   da0 at bt0 bus 0 target 0 lun 0
   da0: SEAGATE ST39173LW 6246 Fixed Direct Access SCSI-2 device
   fatal trap 18: integer divide fault while in kernel mode

Random guess: the disk is broken and is reporting its size as 0 bytes and
CAM divides that value by 1024*1024 to get megabytes.

Try removing or replacing the disk.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: freebsd-stable Digest, Vol 136, Issue 3

2005-11-22 Thread Doug White
On Tue, 22 Nov 2005, Rutger Bevaart wrote:


 There is no doubt that a lot of people have stable FreeBSD systems on Dell
 hardware. The strange thing is though that there are also a lot of people
 having problems with 1750's and 1850's (and therefore 2850's as well, as
 they have an identical board).

 We have installed about 4 1750's and 8 1850/2850's in the last year and
 _all_ except 1 of them have the automatic reboot feature (using 4_10,
 4_11, 5_3 and 5_4). We've had 1 lockup on the 1750 but have been unable to
 trace that to a cause, might be unrelated. Strangely, the 1750 that does
 ok is the one with the highest load (pushing several Mbits and an average
 load 2) and running 5.3-RC3.

I ran -CURRENT on a 1750 during performance tests in March -- that
eventually became 6.0. Solid as a rock.

You might look into putting RedHat or Windows on the machine and
installing the Dell OpenManage Server Administrator and checking if its
complaining about anything.  These reboots could be caused by multibit ECC
errors which would be logged to the hardware log.

You can download OMSA from Dell's site if you go to the downloads page for
the machine and select Server Administration as the category (I think --
they move that around).

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: CF card and /dev filesystem entries

2005-11-19 Thread Doug White
On Thu, 17 Nov 2005, Oliver Fromme wrote:

 Brian Candler [EMAIL PROTECTED] wrote:
   However, when I insert a CF card with normal partioning I need /dev/da0s1,
   and this is not present in the /dev filesystem because the partition table
   has not been read.
  
   # mount -t msdos /dev/da0s1 /mnt/cf
   mount_msdosfs: /dev/da0s1: No such file or directory
  
   Just reading the first block is not sufficient:
  
   # dd if=/dev/da0 of=/dev/null count=1
   1+0 records in
   1+0 records out
   512 bytes transferred in 0.040984 secs (12493 bytes/sec)
   # mount -t msdos /dev/da0s1 /mnt/cf
   mount_msdosfs: /dev/da0s1: No such file or directory

 I think devfs is updated when a descriptor on the device
 which was opended for writing is closed.  But you don't
 actually have to write anything.  That means, the following
 command should do it:

 # dd if=/dev/null of=/dev/da0 count=0

Or fdisk da0 or bsdlabel da0 or something like that :-)

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: message too long when sending broadcasts

2005-11-19 Thread Doug White
On Wed, 16 Nov 2005, Michael Voucko wrote:

 Hi,

 I'm trying to install the latest version of heartbeat (linux-ha), which
 unfortunately is not available from the ports tree yet.
 Setup is two boxes running (5.4-STABLE FreeBSD from Oct 17 2005) both 
 connected
 to a hub with lnc NICs on a private 10.0.0.0/24 subnet (actually its only a
 'simulated hub' provided by VMWare if this matters).

 After some initial struggles configuration, build and installations works as
 expected - but now the most basic parts of this application won't work 
 anymore.
 When trying to use UDP broadcast messages for the heartbeat it would error out
 with Message too long for packets larger than 1472 bytes, in other words as
 soon as fragmentation of the package would be necessary (MTU is 1500).

 I received a code snippet from the heartbeat developers to isolate the problem
 (create sender and receiver socket, send broadcast packets of a certain size).
 To rule out VMware as the basic problem I tried real boxes but the problem
 persists.

 Is there any size restriction for UDP broadcast messages?
 Is there anything which prevents UDP broadcast from being fragmented?

 I searched RFCs, man pages for socket, setsockopt, ioctl, sendto and other
 places but did not find anything that could explain the behaviour (which seems
 to be no problem on other OS).

 Any pointers, comments?

From sendmsg(2):

 The address of the target is given by to with tolen specifying its size.
 The length of the message is given by len.  If the message is too long to
 pass atomically through the underlying protocol, the error EMSGSIZE is
 returned, and the message is not transmitted.

I remember debugging some funky behavior regarding fragmenting packets and
UDPv6 some time ago. I don't remember if a fix was committed or not.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: RELENG_6: ACPI-0698: *** Warning: Type override:

2005-11-19 Thread Doug White
On Fri, 18 Nov 2005, Ricardo A. Reis wrote:

 Hi all,


   Updating proxy server running 5.4, i resolve enable acpi,smp
 kernel  to use HTT.

You need to set

machdep.hyperthreading_enabled=1

in /boot/loader.conf, and make sure HyperThreading is enabled in your
BIOS.

The ACPI messages noted in the subject line are harmless.

 dmesg output,

 Copyright (c) 1992-2005 The FreeBSD Project.
 Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
 The Regents of the University of California. All rights reserved.
 FreeBSD 6.0-STABLE #0: Fri Nov 18 12:53:50 BRST 2005
 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/SMP
 ACPI APIC Table: INTEL  SWV25   
 Timecounter i8254 frequency 1193182 Hz quality 0
 CPU: Intel(R) Xeon(TM) CPU 2.66GHz (2658.11-MHz 686-class CPU)
   Origin = GenuineIntel  Id = 0xf27  Stepping = 7

 Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
   Features2=0x400CNTX-ID
   Hyperthreading: 2 logical CPUs
 real memory  = 2147418112 (2047 MB)
 avail memory = 2093625344 (1996 MB)
 FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
  cpu0 (BSP): APIC ID:  0
  cpu1 (AP): APIC ID:  1
 ACPI-0698: *** Warning: Type override - [DEB_] had invalid type
 (Integer) for Scope operator, changed to (Scope)
 ACPI-0698: *** Warning: Type override - [MLIB] had invalid type
 (Integer) for Scope operator, changed to (Scope)
 ACPI-0698: *** Warning: Type override - [DATA] had invalid type
 (String) for Scope operator, changed to (Scope)
 ACPI-0698: *** Warning: Type override - [SIO_] had invalid type
 (String) for Scope operator, changed to (Scope)
 ACPI-0698: *** Warning: Type override - [LEDP] had invalid type
 (String) for Scope operator, changed to (Scope)
 ACPI-0698: *** Warning: Type override - [GPEN] had invalid type
 (String) for Scope operator, changed to (Scope)
 ACPI-0698: *** Warning: Type override - [GPST] had invalid type
 (String) for Scope operator, changed to (Scope)
 ACPI-0698: *** Warning: Type override - [WUES] had invalid type
 (String) for Scope operator, changed to (Scope)
 ACPI-0698: *** Warning: Type override - [WUSE] had invalid type
 (String) for Scope operator, changed to (Scope)
 ACPI-0698: *** Warning: Type override - [SBID] had invalid type
 (String) for Scope operator, changed to (Scope)
 ACPI-0698: *** Warning: Type override - [SWCE] had invalid type
 (String) for Scope operator, changed to (Scope)

[snipped]

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Panic: ad0: WARNING - removed from configuration

2005-11-19 Thread Doug White
On Fri, 18 Nov 2005, Tom Jensen wrote:

 Hi

 Seen this panic twice, there seems to be no pattern and I have no idea what
 triggers this, is it failing hardware?

Its certainly possible.  Or a bad cable, or a bad firmware interaction.

 System: FreeBSD 5.4-STABLE FreeBSD 5.4-STABLE #10: Sat Nov  5 17:14:46 CET
 2005

 db show msgbuf
 msgbufp = 0xc101cfe4
 magic = 63062, size = 32740, r= 33501, w = 63892, ptr = 0xc1015000, cksum=
 3187082
 ad0: WARNING - removed from configuration
 swap_pager: I/O error - pagein failed; blkno 5212,size 4096, error 0
 vm_fault: pager read error, pid 282 (syslogd)
 6pid 282 (syslogd), uid 0: exited on signal 11
 ata0-master: FAILURE - WRITE_DMA timed out
 initiate_write_filepage: already started
 swap_pager: I/O error - pagein failed; blkno 6496,size 12288, error 6
 vm_fault: pager read error, pid 657 (courierlogger)

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Promise TX4300 boot problems

2005-11-15 Thread Doug White
On Tue, 15 Nov 2005, Lawrence Farr wrote:

 I've stuck a Promise TX4300 controller in to replace
 the marvell onboard SATA that I can't boot off on a
 Supermicro server. If I set 2 discs up as Raid 0
 or Raid 1 it's fine, But 4 discs as Raid 0+1 is detected,
 installs without any problems, but hangs at the boot
 loader with elf32_loadimage: read failed. I'm guesssing
 the boot loader thinks it's a mirror, not a stripe and
 fails to read past the first stripe on the disk?

If this is the case then its a bug in the controller firmware; loader is
just doing BIOS I/O calls. Check for a firmware update for your
controller.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Consistent 6.0-STABLE Hang.

2005-11-14 Thread Doug White
On Mon, 14 Nov 2005, Robert Watson wrote:

  I think you got it.  After rebooting under a fresh 6.0-STABLE (build on
  Mon Nov 14 05:39:01 CET 2005), the problem didn't appear again...  no
  more need for a serial console on this machine, i think :)

 Sounds like a good MFC candidate, assuming it can be MFC'd
 non-disruptively.

It's been MFC'd to RELENG_6 already, but I need to poll re@ on whether
this merits an errata for 6.0-R.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Strange boot messages under 6.0-STABLE

2005-11-14 Thread Doug White
On Mon, 14 Nov 2005, Kris Kennaway wrote:

  That reboot after panic message is strange, because the box is running
  normally.
  Can someone plese tell me why I'm geting those messages.

 Your system panicked at some point in the past, but it can't save the
 dump for the reason specified.

Note that if you're never going to attempt to recover this crashdump you
can clear it by running 'savecore -c'.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


umounting overlapping mount panics

2005-11-14 Thread Doug White
Hey folks,

I accidentally mounted a CDROM over itself when installing X. OK, no
problem, umounted it and then started browsing the CD. Panic.

The panic output:
Fatal trap 12: page fault while in kernel mode
fault virtual address   = 0x0
fault code  = supervisor read, page not present
instruction pointer = 0x20:0xc05ffe4e
stack pointer   = 0x28:0xda969ad4
frame pointer   = 0x28:0xda969ae8
code segment= base 0x0, limit 0xf, type 0x1b
= DPL 0, pres 1, def32 1, gran 1
processor eflags= interrupt enabled, resume, IOPL = 0
current process = 485 (csh)
trap number = 12
panic: page fault

The trace:

#0  doadump () at pcpu.h:165
#1  0xc0638202 in boot (howto=260) at
/usr/src/sys/kern/kern_shutdown.c:399
#2  0xc0638498 in panic (fmt=0xc084e5a2 %s)
at /usr/src/sys/kern/kern_shutdown.c:555
#3  0xc0807c30 in trap_fatal (frame=0xda969a94, eva=0)
at /usr/src/sys/i386/i386/trap.c:831
#4  0xc080799b in trap_pfault (frame=0xda969a94, usermode=0, eva=0)
at /usr/src/sys/i386/i386/trap.c:742
#5  0xc08075d9 in trap (frame=
  {tf_fs = 8, tf_es = 40, tf_ds = 40, tf_edi = -1040124992, tf_esi =
0, tf_ebp = -627664152, tf_isp = -627664192, tf_ebx = -1027565108, tf_edx
= 2048, tf_ecx = 0, tf_eax = 1, tf_trapno = 12, tf_err = 0, tf_eip =
-1067450802, tf_cs = 32, tf_eflags = 66182, tf_esp = 1, tf_ss = 0})
at /usr/src/sys/i386/i386/trap.c:432
#6  0xc07f6dca in calltrap () at /usr/src/sys/i386/i386/exception.s:139
#7  0xc05ffe4e in g_io_request (bp=0xc2c099cc, cp=0xc200f3c0)
at /usr/src/sys/geom/geom_io.c:259
#8  0xc0602399 in g_vfs_strategy (bo=0x1, bp=0xcbed1d68)
at /usr/src/sys/geom/geom_vfs.c:106
#9  0xc060a2d9 in cd9660_strategy (ap=0x1)
at /usr/src/sys/isofs/cd9660/cd9660_vnops.c:755
#10 0xc0817a49 in VOP_STRATEGY_APV (vop=0xc08bd320, a=0xda969b3c)
at vnode_if.c:1796
#11 0xc0681de8 in bufstrategy (bo=0xc1f883f0, bp=0x1) at vnode_if.h:928
#12 0xc067c78d in breadn (vp=0xc1f88330, blkno=0, size=2048, rablkno=0x0,
rabsize=0x0, cnt=0, cred=0x0, bpp=0x1) at buf.h:415
#13 0xc067c6d0 in bread (vp=0xc1f88330, blkno=0, size=2048, cred=0x0,
bpp=0xda969bc8) at /usr/src/sys/kern/vfs_bio.c:719
#14 0xc0606be5 in cd9660_blkatoff (vp=0x800, offset=0, res=0x0,
bpp=0xda969c28)
at /usr/src/sys/isofs/cd9660/cd9660_lookup.c:406
#15 0xc0609db2 in cd9660_readdir (ap=0xda969c90)
at /usr/src/sys/isofs/cd9660/cd9660_vnops.c:513
#16 0xc0817754 in VOP_READDIR_APV (vop=0x1, a=0x800) at vnode_if.c:1427
#17 0xc0696e67 in getdirentries (td=0xc200e180, uap=0xda969d04)
at vnode_if.h:746
#18 0xc0807f47 in syscall (frame=
  {tf_fs = 59, tf_es = 59, tf_ds = 59, tf_edi = 136507528, tf_esi =
-1078307728, tf_ebp = -1078347096, tf_isp = -627663516, tf_ebx =
672974052, tf_edx = 0, tf_ecx = 0, tf_eax = 196, tf_trapno = 22, tf_err =
2, tf_eip = 672447091, tf_cs = 51, tf_eflags = 582, tf_esp = -1078347140,
tf_ss = 59})
at /usr/src/sys/i386/i386/trap.c:976
#19 0xc07f6e1f in Xint0x80_syscall () at
/usr/src/sys/i386/i386/exception.s:200
#20 0x0033 in ?? ()
Previous frame inner to this frame (corrupt stack?)

Reproduction case:

mount /cdrom
mount /cdrom
ls /cdrom
umount /cdrom
ls /cdrom
*panic*

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 6.0 on VMWare 5.0: `calcru: negative runtime of -12728437 usec for pid 28 (irq17: lnc0)'

2005-11-13 Thread Doug White
Adjusted cc: to remove private list.

On Wed, 9 Nov 2005, Lev Serebryakov wrote:

 Hello freebsd-stable,

   FreeBSD 6.0 on VMWare 5.0 prints such messages every 3-4 minute.
   Also, messages like

 calcru: runtime went backwards from 40644816 to 40005944 usec for pid 3 (g_up)

   is shown routinely.

Get used to it. VMWare does evil, evil things with the virtual machine
clocks. We have problems with massive clock drift with Linux as the host
and guest OSen.

Comment out the printf and rebuild your kernel :-)

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


RE: Adaptec SATA 1210SA+ Freebsd 6.0

2005-11-13 Thread Doug White
On Thu, 10 Nov 2005, Filip Wuytack wrote:

 Hi,

 I also tried to create the label via the command line (following the
 handbook).

 web2# dd if=/dev/zero of=/dev/ar0 count=2
 2+0 records in
 2+0 records out
 1024 bytes transferred in 0.000757 secs (1352746 bytes/sec)
 web2# disklabel /dev/ar0 | disklabel -B -R -r ar0 /dev/stdin
 disklabel: /dev/ar0: no valid label found

 And the system freezes and shows 'ata2: DISCONNECT requested'

I'd take a first guess that the disk attached to ata2 is having issues.
Reseat and perhaps replace the cables. If that doesn't work then download
and run the manufacturer's diagnostic tool and see if you can get that to
error out.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Consistent 6.0-STABLE Hang

2005-11-13 Thread Doug White
On Thu, 10 Nov 2005, Cy Schubert wrote:

 Running the Citrix ICA wfcmgr (from ports) to change settings, e.g.
 password, for connection to a Windows terminal server hangs FreeBSD 6.0 (as
 of Nov 4) quite consistently. I've tried this on one of my 6.0 systems at
 home and a 6.0 system here at work. I suspect it may have something to do
 with Linux emulation however until I dig into this more I won't know for
 sure.

 The only way to unhang the system is to press the reset switch.

For fun, try updating to RELENG_6. I committed a fix recently for a hang
thats caused by doing 'ls /dev' in the linuxulator.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Crashing after umounting unavailable filesystems

2005-11-13 Thread Doug White
On Thu, 10 Nov 2005, William Denton wrote:

 On 10 November 2005, Michael C. Shultz wrote:

  Opps, I crashed the machine. When I moved it, I unplugged the
  USB DVD-RW and I had mounted one of the dist discs on there.
  When I did a umount it paniced. My bad.

 That happened to me the other day too.  I'd mounted a USB flash drive,
 unplugged the USB hub, replugged it, remounted the drive, and then of
 course I couldn't umount the first mount.  I did a umount -f and the
 machine crashed instantly.

 Is this the sort of thing that's really hard to handle, or might it be
 safer one day?  I don't know how serious it is to do bad things to mounted
 file systems, but I'm curious.

Its Hard To Handle.

:-)

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Hyperthreading issues.

2005-11-13 Thread Doug White
On Fri, 11 Nov 2005, kama wrote:

 Just upgraded from 5.4 to 6.0 and hyperthreading stoped working.
 Everything looks ok, but it doesn't use two of the logical CPU's.

This is disabled by default due to a information-leak vulnerability across
the hyperthreaded cores. The details from the release notes:

Because of an information disclosure vulnerability on processors using
Hyper-Threading Technology (HTT), the machdep.hyperthreading_allowed
sysctl variable has been added. It defaults to 1 (HTT enabled) on FreeBSD
CURRENT, and 0 (HTT disabled) on the 4-STABLE and 5-STABLE development
branches and supported security fix branches. More information can be
found in security advisory FreeBSD-SA-05:09.htt. [MERGED]

If you don't care about this, add

machdep.hyperthreading_allowed=1

to /boot/loader.conf and reboot.

 One other thing is when I try to switch off hyperthreading in BIOS, it
 will hang at bootup when it are settling the scsi drives. After awhile it
 will give me scsi timeouts. This only happens when I have two cpu enabled
 and hyperthreading off. If I disable one cpu w ht off it will boot wo
 problems, or two cpus w ht. But booting with ht + two cpu's gives me the
 other problem.

Sounds like the BIOS is not rerouting the interrupts correctly. Check for
a BIOS update.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 6.0 freezes shortly after logging in.

2005-11-13 Thread Doug White
On Sun, 13 Nov 2005, Timm Florian Gloger wrote:

 Hi list,

 I get some strange freezes with my fresh 6.0 installation.

 After booting for a several times now i got 2 more freezes in not
 coherent periods of time after logging in.

 It is a complete freeze as far as i can explore, neither any keyboard
 nor mouse input is noticed and keyboard hangs also (no numblock/
 shiftlock switching possible).

 I am able to access the fs through the other os so i checked the message
 and other logfiles, but i cannot find any panic or other bad logentries.

Try turning off background fsck by adding this to your rc.conf and
rebooting:

background_fsck=NO

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.4 fails to find boot drive, then panics.

2005-07-02 Thread Doug White
On Wed, 29 Jun 2005, Mike Meyer wrote:

 In [EMAIL PROTECTED], Doug White [EMAIL PROTECTED] typed:
  On Fri, 24 Jun 2005, Mike Meyer wrote:
 
   I'm trying to install 5.4-RELEASE off of CDROM on an ASUS P5S800 
   motherboard
   with a SATA drive in it. The SATA drive shows up as ad4 during the install
   process, and the install goes just fine.
  
   But the resulting system fails to boot. It goes through the boot sequence,
   draws the cute 5.x booto menu, then issues the messages:
  
   Can't work out which disk we are booting from.
   Guessed BIOS device 0x not found by probes, defaulting to disk0:
  
   panic: free: guard1 fail @0x5195c from 
   /usr/src/sys/boot/i386/loader/../../common/module.c:957
   -- Press a key on the console to reboot --
  
   Booting with debug messages doesn't change this at all.
 
  This is in the loader. Your machine is seriously messed up. I'd suggest
  checking for a BIOS update. Also try rearranging your ATA channels and
  disabling DMA mode for the ATA controllers in the BIOS setup.

 I figured that out. Since writing the original message, I managed to
 get the thing (mostly) working by hacking the loader to wire down the
 values for the boot drive rather than having it probe for them.

  Unfortunately ASUS boards of this type are known to have braindamaged
  ACPI, so this is only the beginning of a long, painful journey. :(

 How far does turning off ACPI in the BIOS go towards solving these
 problems? I'd be surprised if it does much, if anything.

Reportedly, the machine doesn't boot.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.4-p1 crash

2005-06-29 Thread Doug White
On Sat, 25 Jun 2005, Mitch Parks wrote:

 On Sun, 19 Jun 2005, Robert Watson wrote:

  there is a PR for it : kern/74319
 
  This sounds very similar to a serial console related tty bug I was
  experiencing on -STABLE a few months ago, and that is believed may have been
  worked around in 5.4 tweaks before release.  In particular, that there are
  reference counting related bugs in the 5.x tty code that are fixed by a
  partial rewrite of the tty code in 6.x, but that are too large and 
  disruptive
  to merge to RELENG_5.  If the problem is persisting, it may be worth trying
  to merge anyway, but it is a pretty big change and would break device driver
  binary compatibility, etc.  What we might want to do here is wait until 6.x
  has settled out a bit more, then consider merging it to 5.x once 6.x has
  gotten burned in with similar workloads and continued to not illustrate the
  5.x tty reference bugs.

 On Fri, 24 Jun 2005, Doug White wrote:

  I've run out of time to debug this, unfortunately...

 I went back to reports I made in January about 5.3:
 http://lists.freebsd.org/pipermail/freebsd-stable/2005-January/010898.html
 which appears to be the same issue. I _thought_ this was resolved when I
 disabled HT, but maybe I was just lucky between the last reboot and the 5.4
 upgrade.

 It sounds like there's a reasonable chance this has been squashed in code
 for 6.0? Since this box is already unstable, I'd be tempted to be an early 6
 adopter to see if it is actually resolved. Especially so if that would be
 helpful to the cause.

6.x is not affected by the tty-related problems that 5.x is having.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.4 Installer + Promise FT100TX2 = Loader crash

2005-06-29 Thread Doug White
On Sun, 26 Jun 2005, Mark Kirkwood wrote:

 Daniel O'Connor wrote:

 
  I find even disconnecting the drives so no RAID is detected works - ie it's
  not the presence of the card per se that is a problem.
 

 Good test - I had not tried that.

 In addition, I can confirm your observation that merely deleting the
 array makes the loader work reliably again.

Try:

. Zero off the first megabyte or so of the subdisks with dd or similar
  tool.
. Force an array initialize from the controller BIOS. Wait for it to
  finish.
. Install some other OS that recognizes the array, write the MBR and
  partition table, then install FreeBSD over it.

Some controller BIOSen have been known to peek at the DOS partition table,
and it may be jumping off into space if its seeing half a table from one
disk, or something like that. These actions should blow away any bogus
underlying data.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.4 fails to find boot drive, then panics.

2005-06-29 Thread Doug White
On Fri, 24 Jun 2005, Mike Meyer wrote:

 I'm trying to install 5.4-RELEASE off of CDROM on an ASUS P5S800 motherboard
 with a SATA drive in it. The SATA drive shows up as ad4 during the install
 process, and the install goes just fine.

 But the resulting system fails to boot. It goes through the boot sequence,
 draws the cute 5.x booto menu, then issues the messages:

 Can't work out which disk we are booting from.
 Guessed BIOS device 0x not found by probes, defaulting to disk0:

 panic: free: guard1 fail @0x5195c from 
 /usr/src/sys/boot/i386/loader/../../common/module.c:957
 -- Press a key on the console to reboot --

 Booting with debug messages doesn't change this at all.

This is in the loader. Your machine is seriously messed up. I'd suggest
checking for a BIOS update. Also try rearranging your ATA channels and
disabling DMA mode for the ATA controllers in the BIOS setup.

Unfortunately ASUS boards of this type are known to have braindamaged
ACPI, so this is only the beginning of a long, painful journey. :(

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.4-p1 crash

2005-06-24 Thread Doug White
On Mon, 20 Jun 2005, Philippe PEGON wrote:

 Philippe PEGON wrote:
  Mitch Parks wrote:
 
  On Sun, 19 Jun 2005, Doug White wrote:
 
  On Fri, 17 Jun 2005, Mitch Parks wrote:
 
  Below are details regarding another crash on a Dell 2600 SMP (HTT
  and USB
  disabled). It has been 9 days since the last crash. I didn't have
  the serial
  console in place for this last crash, but it is now.
 
 
 
  As noted, the ttwakeup() panic is a known bug. The best thing we have
  for
  a fix is this patch:
 
  http://people.freebsd.org/~mlaier/tty.t_pgrp.diff
 
  Please give it a try and report back if you have any more panics (or
  don't :-) ).
 
 
 
  Thanks! This patch appears to be for 5.3, but I manually applied the
  chunk of the patch that didn't apply cleanly and the countdown is on.
 
  I'll report back in 10 days unless something bad happens before then.
 
  Below is the patch chunk #10 that I actually applied rather than the
  one given. If I've done something bad here by removing the PGRP_LOCK
  please let me know.
 
 
  I'm not a kernel developper, but if you remove
 
  PGRP_LOCK(tp-t_pgrp);
 
  and the PGRP_UNLOCK(tp-t_pgrp) in the if condition (removed by the
  orginal patch)
 
  there is maybe another PGRP_UNLOCK(tp-t_pgrp); to remove if the if
  condition doesn't match, line 2528 in the original 5.4-p1 tty.c ?

 after having applied the patch (with your modification), there is no
 sx_sunlock(proctree_lock) in the ttyinfo function if the three
 conditions failed. Maybe we have just to replace
 PGRP_UNLOCK(tp-t_pgrp); line 2528 by sx_sunlock(proctree_lock) ?
 I think that we need the helps of a kernel developper.

No, that would be a leaked lock, which would cause hangs.  More likely its
some other case that got missed that needs locks extended to it, or the
aliased pgrp isn't the underlying problem.

I've run out of time to debug this, unfortunately...


 
 
  
  Hunk #6 succeeded at 1154 (offset -51 lines).
  Hunk #7 succeeded at 1215 (offset -6 lines).
  Hunk #8 succeeded at 1203 (offset -51 lines).
  Hunk #9 succeeded at 1946 (offset -5 lines).
  Hunk #10 failed at 2562.
  Hunk #11 succeeded at 2847 (offset -212 lines).
  1 out of 11 hunks failed--saving rejects to tty.c.rej
 
 
  @@ -2495,19 +2511,21 @@
   * On return following a ttyprintf(), we set tp-t_rocount to
  0 so
   * that pending input will be retyped on BS.
   */
  +   sx_slock(proctree_lock);
  if (tp-t_session == NULL) {
  +   sx_sunlock(proctree_lock);
  ttyprintf(tp, not a controlling terminal\n);
  tp-t_rocount = 0;
  return;
  }
  if (tp-t_pgrp == NULL) {
  +   sx_sunlock(proctree_lock);
  ttyprintf(tp, no foreground process group\n);
  tp-t_rocount = 0;
  return;
  }
  -   PGRP_LOCK(tp-t_pgrp);
  -   if ((p = LIST_FIRST(tp-t_pgrp-pg_members)) == 0) {
  -   PGRP_UNLOCK(tp-t_pgrp);
  +   if ((p = LIST_FIRST(tp-t_pgrp-pg_members)) == NULL) {
  +   sx_sunlock(proctree_lock);
  ttyprintf(tp, empty foreground process group\n);
  tp-t_rocount = 0;
  return;
 
  Or the complete patch:
  http://kuoi.asui.uidaho.edu/~mitch/crash/tty_5.4.patch
 
  Mitch Parks
  [EMAIL PROTECTED]
  ___
  freebsd-stable@freebsd.org mailing list
  http://lists.freebsd.org/mailman/listinfo/freebsd-stable
  To unsubscribe, send any mail to [EMAIL PROTECTED]
 
 




-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.4-p1 crash

2005-06-19 Thread Doug White
On Fri, 17 Jun 2005, Mitch Parks wrote:

 Below are details regarding another crash on a Dell 2600 SMP (HTT and USB
 disabled). It has been 9 days since the last crash. I didn't have the serial
 console in place for this last crash, but it is now.

As noted, the ttwakeup() panic is a known bug. The best thing we have for
a fix is this patch:

http://people.freebsd.org/~mlaier/tty.t_pgrp.diff

Please give it a try and report back if you have any more panics (or
don't :-) ).

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Weird fdisk behavior

2005-06-19 Thread Doug White
On Sat, 18 Jun 2005, Baldur Gislason wrote:

 I am trying to add another partition to my root drive, it has a few gigabytes 
 of unpartitioned space.
 Whenever I try to run fdisk -u it says cannot open disk /dev/ad0: No such 
 file or directory
 ad0 does exist, why does fdisk say otherwise? fdisk can display the partition 
 table but it can't alter it.
 securelevel is -1 and this is FreeBSD 5.4-STABLE from the beginning of May 
 this year.

Be aware that fdisk will fake up a partition table if the volume has no
table. Look for a message like:

warning: invalid partition table found

If you see that then there is no FDISK partition table.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.4 RELEASE atkbdc0 problem

2005-06-06 Thread Doug White
On Sun, 5 Jun 2005, Remi Regruson wrote:

 Hi,

 I put the folowing line in /boot/loader.conf :
 usb_load=yes
 ukbd_load=yes
 This haven't done what I exepted : my usb keyboard don't work during the 
 loader prompt and now I can't boot. The kernel message stop with this line:

If you want your USB keyboard to work in loader you need to enable USB
Legacy in the BIOS.

 atkbdc0: Keyboard controller (i8042) port 0x64,0x60 irq 1 on acpi0
 I have the same log with or without : any keyboard(ps2 or usb), usb or apm 
 activating in the bios.

FreeBSD can only handle input from one keyboard at a time. By default the
first keyboard attached wins, which is atkbd.  You have a few options:

1. Edit /boot/device.hints and remove the hint.atkbd.0.flags line, reboot,
and unplug the PS/2 keyboard before the kernel starts. Unless your BIOS
fakes up a keyboard it should keep atkbd0 from attaching and therefore let
the USB keyboard take over kbd0.

2. Add a line to /etc/usbd.conf that calls

kbdcontrol -k kbd1  /dev/console

when a USB keyboard is attached. This selects the USB keyboard as the
active keyboard. If kbd1 goes away (by unplugging the USB keyboard) it'll
fall back to the PS/2 keyboard.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 4.11 panic #2, need help decoding

2005-06-06 Thread Doug White
On Sun, 5 Jun 2005, Charles Sprickman wrote:

 Hello,

 Second panic on this box in less than two weeks.  Can someone help me
 figure out whether this looks to be a hardware issue or not?  Am I on the
 correct list for this type of problem?

You're in the right spot all right...

 Previous info is at:
 http://thread.gmane.org/gmane.os.freebsd.stable/28428 (no followups)

 Here's what gdb has for me this time:

Can you go to frame 38 and print the value of source?  Comparing it to
NULL shouldn't cause a fault, but I suspect its another one of the cases
in that if statement.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Diskless boot problem

2005-05-23 Thread Doug White
On Mon, 23 May 2005, [ISO-8859-2] Sawek ak wrote:

 
   Hi,
 
I have a problem with booting Dell 2850 over network. The machine reads
kernel over net, boots upto mounting / from NFS and then crashes.
  
   What is the NFS server? It seems to think the NFS handle we pulled the
   kernel with is no longer valid.

  FreeBSD 5.3/5.4-STABLE.

 Hm ... dunno then ... does the server complain about the client at all in
 the log?

No complaints whatsoever.The exact os version on NFS server is 5.4-RC2.

All I can suggest is trying rebooting the NFS server.  Something in it
must have lost communication.  If that doesn't fix it then I have no idea,
it must be specific to your situation.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


RE: RELENG_5 panic

2005-05-23 Thread Doug White
On Mon, 23 May 2005, Robin P. Blanchard wrote:

 Here's what I could get out of dmesg, and looking again at the dump

 # dmesg -M /usr/local/var/adm/crash/vmcore.44 -N /boot/kernel/kernel
 kernel trap 12 with interrupts disabled

 Fatal trap 12: page fault while in kernel mode
 cpuid = 0; apic id = 00
 fault virtual address   = 0x24
 fault code  = supervisor read, page not present
 instruction pointer = 0x8:0xc0504808
 stack pointer   = 0x10:0xc7ac0c08
 frame pointer   = 0x10:0xc7ac0c3c
 code segment= base 0x0, limit 0xf, type 0x1b
 = DPL 0, pres 1, def32 1, gran 1
 processor eflags= resume, IOPL = 0
 current process = 27 (swi5: clock sio)
 trap number = 12
 panic: page fault
 cpuid = 0
 Uptime: 3d6h59m25s
 Dumping 127 MB
  16 32 48 64 80 96 112

 [EMAIL PROTECTED] [/usr/obj/usr/src/sys/fastipsec]# kgdb kernel.debug
 /usr/local/var/adm/crash/vmcore.44
 [GDB will not be able to debug user-mode threads: /usr/lib/libthread_db.so:
 Undefined symbol ps_pglobal_lookup]
 GNU gdb 6.1.1 [FreeBSD]
 Copyright 2004 Free Software Foundation, Inc.
 GDB is free software, covered by the GNU General Public License, and you are
 welcome to change it and/or distribute copies of it under certain conditions.
 Type show copying to see the conditions.
 There is absolutely no warranty for GDB.  Type show warranty for details.
 This GDB was configured as i386-marcel-freebsd.
 #0  doadump () at pcpu.h:160
 160 __asm __volatile(movl %%fs:0,%0 : =r (td));
 (kgdb) l *0xc0504808
 0xc0504808 is in turnstile_wait (/usr/src/sys/kern/subr_turnstile.c:245).
 240 /*
 241  * Pick up the lock that td is blocked on.
 242  */
 243 ts = td-td_blocked;
 244 MPASS(ts != NULL);
 245 tc = TC_LOOKUP(ts-ts_lockobj);
 246 mtx_lock_spin(tc-tc_lock);
 247
 248 /*
 249  * This thread may not be blocked on this turnstile
 anymore
 (kgdb)

Oh another of these wonderful races... can you go to that frame and print
ts?  If its NULL then someone has ripped out the ts out from under us
since it was checked for NULL in the previous line!


  -Original Message-
  From: Doug White [mailto:[EMAIL PROTECTED]
  Sent: Sunday, May 22, 2005 3:20 PM
  To: Robin P. Blanchard
  Cc: [EMAIL PROTECTED]
  Subject: Re: RELENG_5 panic
 
  On Sat, 21 May 2005, Robin P. Blanchard wrote:
 
   # uname -a
   FreeBSD robinpb.homeip.net 5.4-STABLE FreeBSD 5.4-STABLE
  #0: Tue May
   17
   00:30:47 EDT 2005
   [EMAIL PROTECTED]:/usr/obj/usr/src/sys/fastipsec  i386
  
   # kgdb kernel.debug /usr/local/var/adm/crash/vmcore.44
   [GDB will not be able to debug user-mode threads:
  /usr/lib/libthread_db.so:
   Undefined symbol ps_pglobal_lookup]
   GNU gdb 6.1.1 [FreeBSD]
   Copyright 2004 Free Software Foundation, Inc.
   GDB is free software, covered by the GNU General Public
  License, and
   you are welcome to change it and/or distribute copies of it
  under certain conditions.
   Type show copying to see the conditions.
   There is absolutely no warranty for GDB.  Type show
  warranty for details.
   This GDB was configured as i386-marcel-freebsd.
   #0  doadump () at pcpu.h:160
   160 __asm __volatile(movl %%fs:0,%0 : =r (td));
   (kgdb) bt full
   #0  doadump () at pcpu.h:160
   No locals.
   #1  0xc04dd58c in boot (howto=260) at
  /usr/src/sys/kern/kern_shutdown.c:410
   first_buf_printf = 1
   #2  0xc04ddccd in panic (fmt=0xc066e594 %s) at
   /usr/src/sys/kern/kern_shutdown.c:566
   bootopt = 260
   newpanic = 0
   buf = page fault, '\0' repeats 245 times
 
  can you try to fish the trap output from msgbuf?  That or use
  dmesg's -N and -M options to extract it from the crashdump.
 
   #3  0xc0641e92 in trap_fatal (frame=0xc7ac0bc8, eva=36) at
   /usr/src/sys/i386/i386/trap.c:817
   code = 16
   type = 12
   ss = 16
   esp = 0
   softseg = {ssd_base = 0, ssd_limit = 1048575,
  ssd_type = 27,
   ssd_dpl = 0, ssd_p = 1,
 ssd_xx = 0, ssd_xx1 = 0, ssd_def32 = 1, ssd_gran = 1}
   #4  0xc0642535 in trap (frame=
 {tf_fs = 24, tf_es = -1066598384, tf_ds =
  -1066532848, tf_edi =
   -1053916800, tf_esi = -1049515008, tf_ebp = -945025988, tf_isp =
   -945026060, tf_ebx = -1053916800, tf_edx = -1053937024,
  tf_ecx = 56,
   tf_eax = 0, tf_trapno = 12, tf_err = 0, tf_eip =
  -1068480504, tf_cs =
   8, tf_eflags = 65683, tf_esp = -1053914880, tf_ss = 582}) at
   /usr/src/sys/i386/i386/trap.c:255
   p = (struct proc *) 0xc12e754c
   sticks = 3241036032
   i = 0
   ucode = 0
   type = 12
   code = 0
   eva = 36
   #5  0xc062da3a in calltrap () at
   /usr/src/sys/i386/i386/exception.s:140
   No locals.
   #6  0x0018 in ?? ()
   No symbol table info

Re: FreeBSD 5.3 forgets some of my users

2005-05-22 Thread Doug White
Stripping -bugs and -current cc; this applies to -stable and isn't in
reference to an existing PR.

On Sat, 21 May 2005, [ISO-8859-1] Kövesdán Gábor wrote:

 Hello there,

 there is a strange thing FreeBSD 5.3 randomly frogets some of my
 users. Sometimes I see in the first column of the ps aux output the uid
 number instead of the login name, but next when I run it, there is the
 login name. Besides, I tried to chown a directory to an existing user,
 but the I get an error message, that there wasn't such user.

What are you using for user lookups?  Are you using nss_ldap or something
other than the default 'files' lookup?  Can you post the contents of
/etc/nsswitch.conf?

I'd also check your master.passwd file for any syntax errors or odd
characters and then force a rebuild with:

pwd_mkdb -p /etc/master.passwd

as root.

 I immediately checked passwd, group and master.passwd files in /etc but
 the entry for that user was present there. The pw userdel was unable to
 delete that user, so I had to manually remove it from those three files
 and create it again. It worked then, but a bit later there was the same
 result. I'm quite annoyed now. This state isn't safe enough, I have to
 do something to get around with this. Do You have similar experiences?
 Or do You now some kinda workaround?

 Cheers,

 Gábor Kövesdán


 ___
 freebsd-current@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-current
 To unsubscribe, send any mail to [EMAIL PROTECTED]


-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: nForce 4, SATA Drive only runs at UDMA33?

2005-05-22 Thread Doug White
On Fri, 20 May 2005, alan bryan wrote:


 --- Doug White [EMAIL PROTECTED] wrote:
  Can you post the output of pciconf -lv? The nForce
  IDE controller is
  properly detected, but it looks like there's another
  one in the system.
  Looking at the spec for the system it may be the
  proprietary nVidia RAID
  controller.  The pciconf output should help us
  identify if thats the
  issue.

 FYI: RAID features are disabled in the BIOS, I'm just
 trying to get a single SATA drive to work at full
 speed.  The other drive in this system is IDE and that
 seems to be working at proper speed.  Thanks for the
 help!

I guess turning off the RAID converts the chips into standard SATA
controllers. I'll have to look into that. An nForce 4 machine recently
appeared at work, so I'll see what I can get it to do.

 [EMAIL PROTECTED]:6:0:   class=0x01018a card=0x50361297
 chip=0x005310de rev=0xa2 hdr=0x00
 vendor   = 'NVIDIA Corporation'
 class= mass storage
 subclass = ATA
 [EMAIL PROTECTED]:7:0:   class=0x010185 card=0x50361297
 chip=0x005410de rev=0xa3 hdr=0x00
 vendor   = 'NVIDIA Corporation'
 class= mass storage
 subclass = ATA
 [EMAIL PROTECTED]:8:0:   class=0x010185 card=0xcb8410de
 chip=0x005510de rev=0xa3 hdr=0x00
 vendor   = 'NVIDIA Corporation'
 class= mass storage
 subclass = ATA

It looks like sos added support for atapci1 and 2 in this listing in the
ATAmkIII patchset.  While that patchset is in -CURRENT you'll have to
apply the -stable patches yourself. Search the list archives for the
location, soren posts it now and again.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Diskless boot problem

2005-05-22 Thread Doug White
On Sat, 21 May 2005, [ISO-8859-2] S³awek ¯ak wrote:

 On 5/20/05, Doug White [EMAIL PROTECTED] wrote:
 Please try to avoid sending your message bodies base64 encoded :) My mail
 client got really confused by it and didn't quote the message properly.

 Also stripping hackers cc:.

I'd like to, but Gmail doesn't offer this option. Sorry :(

 On Thu, 19 May 2005, [ISO-8859-2] Sawek ak wrote:

  Hi,

   I have a problem with booting Dell 2850 over network. The machine reads
   kernel over net, boots upto mounting / from NFS and then crashes.
 
  What is the NFS server? It seems to think the NFS handle we pulled the
  kernel with is no longer valid.

 FreeBSD 5.3/5.4-STABLE.

Hm ... dunno then ... does the server complain about the client at all in
the log?

 Does PXE and the system itself end up pulling different IP addresses?

 No. The IP is the same.

dunno ...

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: RELENG_5 panic

2005-05-22 Thread Doug White
On Sat, 21 May 2005, Robin P. Blanchard wrote:

 # uname -a
 FreeBSD robinpb.homeip.net 5.4-STABLE FreeBSD 5.4-STABLE #0: Tue May 17
 00:30:47 EDT 2005
 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/fastipsec  i386

 # kgdb kernel.debug /usr/local/var/adm/crash/vmcore.44
 [GDB will not be able to debug user-mode threads: /usr/lib/libthread_db.so:
 Undefined symbol ps_pglobal_lookup]
 GNU gdb 6.1.1 [FreeBSD]
 Copyright 2004 Free Software Foundation, Inc.
 GDB is free software, covered by the GNU General Public License, and you are
 welcome to change it and/or distribute copies of it under certain conditions.
 Type show copying to see the conditions.
 There is absolutely no warranty for GDB.  Type show warranty for details.
 This GDB was configured as i386-marcel-freebsd.
 #0  doadump () at pcpu.h:160
 160 __asm __volatile(movl %%fs:0,%0 : =r (td));
 (kgdb) bt full
 #0  doadump () at pcpu.h:160
 No locals.
 #1  0xc04dd58c in boot (howto=260) at /usr/src/sys/kern/kern_shutdown.c:410
 first_buf_printf = 1
 #2  0xc04ddccd in panic (fmt=0xc066e594 %s) at
 /usr/src/sys/kern/kern_shutdown.c:566
 bootopt = 260
 newpanic = 0
 buf = page fault, '\0' repeats 245 times

can you try to fish the trap output from msgbuf?  That or use dmesg's -N
and -M options to extract it from the crashdump.

 #3  0xc0641e92 in trap_fatal (frame=0xc7ac0bc8, eva=36) at
 /usr/src/sys/i386/i386/trap.c:817
 code = 16
 type = 12
 ss = 16
 esp = 0
 softseg = {ssd_base = 0, ssd_limit = 1048575, ssd_type = 27, ssd_dpl
 = 0, ssd_p = 1,
   ssd_xx = 0, ssd_xx1 = 0, ssd_def32 = 1, ssd_gran = 1}
 #4  0xc0642535 in trap (frame=
   {tf_fs = 24, tf_es = -1066598384, tf_ds = -1066532848, tf_edi =
 -1053916800, tf_esi = -1049515008, tf_ebp = -945025988, tf_isp = -945026060,
 tf_ebx = -1053916800, tf_edx = -1053937024, tf_ecx = 56, tf_eax = 0,
 tf_trapno = 12, tf_err = 0, tf_eip = -1068480504, tf_cs = 8, tf_eflags =
 65683, tf_esp = -1053914880, tf_ss = 582}) at
 /usr/src/sys/i386/i386/trap.c:255
 p = (struct proc *) 0xc12e754c
 sticks = 3241036032
 i = 0
 ucode = 0
 type = 12
 code = 0
 eva = 36
 #5  0xc062da3a in calltrap () at /usr/src/sys/i386/i386/exception.s:140
 No locals.
 #6  0x0018 in ?? ()
 No symbol table info available.
 #7  0xc06d0010 in ipq ()
 No symbol table info available.
 #8  0xc06e0010 in sc_buffer.3 ()
 No symbol table info available.
 #9  0xc12e8180 in ?? ()
 No symbol table info available.
 #10 0xc171ac00 in ?? ()
 No symbol table info available.
 #11 0xc7ac0c3c in ?? ()
 No symbol table info available.
 #12 0xc7ac0bf4 in ?? ()
 No symbol table info available.
 #13 0xc12e8180 in ?? ()
 No symbol table info available.
 #14 0xc12e3280 in ?? ()
 No symbol table info available.
 #15 0x0038 in ?? ()
 No symbol table info available.
 #16 0x in ?? ()
 No symbol table info available.
 #17 0x000c in ?? ()
 No symbol table info available.
 #18 0x in ?? ()
 No symbol table info available.
 #19 0xc0504808 in turnstile_wait (ts=0xc12e3280, lock=0xc06d022c,
 owner=0xc171ac00)
 at /usr/src/sys/kern/subr_turnstile.c:243
 tc = (struct turnstile_chain *) 0xc06cb770
 td1 = (struct thread *) 0xc12e8180
 #20 0xc04d2b7f in _mtx_lock_sleep (m=0xc06d022c, td=0xc12e8180, opts=0,
 file=0x0, line=0)
 at /usr/src/sys/kern/kern_mutex.c:552
 ts = (struct turnstile *) 0x0
 owner = (struct thread *) 0xc171ac00
 v = 0
 #21 0xc058a592 in tcp_isn_tick (xtp=0x0) at
 /usr/src/sys/netinet/tcp_subr.c:1380
 projected_offset = 0
 #22 0xc04ed069 in softclock (dummy=0x0) at
 /usr/src/sys/kern/kern_timeout.c:279
 c_func = (void (*)(void *)) 0xc058a4d0 tcp_isn_tick
 c_arg = (void *) 0x0
 c_flags = 14
 c = (struct callout *) 0x0
 bucket = (struct callout_tailq *) 0xc39ba4a8
 steps = 14
 depth = 2
 mpcalls = 2
 gcalls = 0
 wakeup_cookie = 14
 #23 0xc04c460a in ithread_loop (arg=0xc12fd500) at
 /usr/src/sys/kern/kern_intr.c:547
 ih = (struct intrhand *) 0xc12e2c80
 p = (struct proc *) 0xc12e754c
 count = 0
 warming = 5000
 warned = 0
 #24 0xc04c32c2 in fork_exit (callout=0xc04c4550 ithread_loop, arg=0x0,
 frame=0x0)
 at /usr/src/sys/kern/kern_fork.c:791
 p = (struct proc *) 0xc12e754c
 #25 0xc062da9c in fork_trampoline () at
 /usr/src/sys/i386/i386/exception.s:209
 No locals.
 (kgdb)



 ---
 Robin P. Blanchard
 Systems Integration Specialist
 Georgia Center for Continuing Education
 fon: 706.542.2404 - fax: 706.542.6546
 ---
 ___
 freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to [EMAIL PROTECTED]


-- 
Doug White

Re: Highpoint HPT371 support (kern/59624)

2005-05-22 Thread Doug White
On Sun, 22 May 2005, Oliver Fromme wrote:

 Why is kern/59624 still open?  It's about 1,5 years old.
 Would someone please commit it?  Or, if it cannot be
 commited, please tell me why.  If any information is
 missing (pcicon -lv output, verbose dmesg or whatever),
 I'd be happy to provide it.

You may want to track down Doug Ambrisko, who has a big stack of ata
patches he's been carting around. I don't know if he committed that set or
not.

RELENG_4 is pretty much dead development-wise -- there will be no more 4.x
releases -- so finding someone to commit this for you will be difficult.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: can't assign requested address with ntpd on 5-STABLE

2005-05-22 Thread Doug White
On Sun, 22 May 2005, Richard Coleman wrote:

 On 5-STABLE, when I try to start ntpd, I get the following error: bind()
 fd 7, family 28, port 123, addr fe80:1::2a0:c9ff:fec8:ea25,
 in6_is_addr_multicast=0 flags=0 fails: Can't assign requested address

 I've used netstat to check and nothing else is on that port (other than
 sshd, there is nothing else on the box).

This is normal; its trying to bind to the ipv6 address and you probably
don't have one defined or have ipv6 disabled. It can be safely ignored. I
think on later -STABLE its been disabled.

Unless you _want_ to bind to the v6 address :)

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Dell PowerEdge 1855

2005-05-22 Thread Doug White
On Sun, 22 May 2005, Danny Braniss wrote:

   Hi Doug,
  
   Sorry for taking so long to reply on this, been on 3 weeks leave...
  
   On Fri, 8 Apr 2005, Doug White wrote:
  
On Wed, 6 Apr 2005, Carl Makin wrote:
  
 The boot sequence will hang if the USB drive is attached at boot.  
 There
  
It seems to be acting like Uthe USB controller isn't getting interrupts.
Does the problem recur on the installed system?
  
   Yes it did however that chassis and the servers were a test loaner and
   have been returned.  We have decided to go with the 1855 though (our
   windows guy was *very* enthusiastic about them) and when they arrive I'll
   run the tests you suggested.
  
   Thanks!
 
  so now i'm testing one too :-), any nice things to say about this blade?
  while trying out the scsi the blade panics :-)
  and as to cpu/memory power, it seems to be slower than the pe-1750
 
  danny

 nothing like talking to oneself :-)

 i disabled the mirrowing, and now the disk io is nice, and more importanly, 
 the
 system does not panic, i guess the mpt/(aka da0: DELL VIRTUAL DISK  IM 
 1998) needs some work still :-)

The mpt driver has some issues. I have an IBM machine that freaks out
similiarly if the integrated mirroring is enabled.  However IM works
properly on other machines, so YMMV :)

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: FreeBSD 5.3 forgets some of my users

2005-05-22 Thread Doug White
On Sun, 22 May 2005, [ISO-8859-1] Kövesdán Gábor wrote:

 Thanks for your suggestion. I use 'files' lookup. As You wrote I checked
 the syntax with pwd_mkdb -C /etc/passwd' and it returned nothing. Then I
 did a rebuild with -p, and now it seems to be okay, the two accounts
 that had gone away are working now.

Okay, perhaps the password database files had become corrupted.


 Anyway I haven't changed the nsswitch.conf, I have the default one:

 [EMAIL PROTECTED] less /etc/nsswitch.conf
 group: compat
 group_compat: nis
 hosts: files dns
 networks: files
 passwd: compat
 passwd_compat: nis
 shells: files

Looks right. A database problem would make sense.


 Cheers,

 Gábor Kövesdán


 What are you using for user lookups?  Are you using nss_ldap or something
 other than the default 'files' lookup?  Can you post the contents of
 /etc/nsswitch.conf?
 
 I'd also check your master.passwd file for any syntax errors or odd
 characters and then force a rebuild with:
 
 pwd_mkdb -p /etc/master.passwd
 
 as root.
 
 
 


-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: problems with ASR card/5.4

2005-05-20 Thread Doug White
On Wed, 18 May 2005, Bruce Burden wrote:




Hi Doug,

   well, pulling cards demonstrated where the problem lie, but it
was not a hardware problem. By chance, I happened upon the options
ASR_COMPAT directive, and that solved my problem. However, the
GENERIC kernel configuration for i386 does NOT have this option, but
it does have the asr driver configured.

ASR_COMPAT controls some sizes of fields for use with the control ioctl. I
don't think this would case problems on boot unless you are running an old
control program at boot time, or whenever the panic occurs.

   Having one but not the other causes problems, as I discovered.




   Bruce


 On Wed, May 18, 2005 at 04:03:56PM -0700, Doug White wrote:
  On Mon, 16 May 2005, Bruce Burden wrote:
 
  
  
 Hi folks,
  
 I am trying to get my Adaptec 3210S RAID running under 5.4,
 and all I get for my trouble is pmap_ errors. Sorry, no dumps at
 the  moment.
  
 If I compile device asr into the kernel, I get a message
 about could not copy LDT, and a panic about pmap_enter:
 invalid page directory when the system moves from single to multi
 user. (single user is no problem?)
  
 If I do not compile the asr driver, and kldload asr instead,
 I get a panic from pmap_mapdev, instead.
  
 Any ideas? This is annoying, and I am seriously considering
 returning to 4.11, where asr was working...
 
  This sounds like your system has severe memory corruption issues.  I'd try
  a different PCI slot for your asr card first, then try the card in another
  known working system and see if your problems appear there.
 
  Have you tested the system without the asr card installed?
 
  --
  Doug White|  FreeBSD: The Power to Serve
  [EMAIL PROTECTED]  |  www.FreeBSD.org



-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: SATA378 / SATA150 RAID controller supported by 5.4?

2005-05-20 Thread Doug White
On Wed, 18 May 2005, Rob wrote:


 --- Doug White [EMAIL PROTECTED] wrote:
  On Tue, 17 May 2005, Rob wrote:
 
   The 'pciconf -lv' tells me:
  
[EMAIL PROTECTED]:4:0: class=0x010400 card=0x80f51043
  chip=0x3373105a rev=0x02 hdr=0x00
   vendor   = 'Promise Technology Inc'
   device   = 'PDC20378 FastTrak 378/SATA 378
   RAID Controller'
   class= mass storage
   subclass = RAID
  
   But I get this in my dmesg output:
  
atapci0: Promise PDC20378 SATA150 controller
 port 0xd880-0xd8ff,0xdfa0-0xdfaf,0xdf00-0xdf3f
 mem 0xfeac-0xfead,0xfeafe000-0xfeafefff
   irq 10 at device 4.0 on pci2
atapci0: failed: rid 0x20 is memory, requested 4
 
  That's the PCI busmaster register, although it
  seems to be the wrong resource type. It should be
  I/O space, which maps to PCI config space, and
  not memory-mapped.  I can't seem to find what
  function emits that message.
 
  A full dmesg would be useful.

 It's at:
   http://surfion.snu.ac.kr/~lahaye/dmesg.boot

 Does that help?

Yes .. your system has 2 ATA controllers. The RID failure is probably
normal if the other one is present.  The ICH5 southbridge ATA controller
has grabbed the standard ports and the Promise ports stack in behind.

To answer the question, yes the Promise controller is supported. You'll
need to connect disks to it if you want to use it though :)

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: nForce 4, SATA Drive only runs at UDMA33?

2005-05-20 Thread Doug White
-master: stat=0x7f err=0xff lsb=0xff msb=0xff
 ata5-master: stat=0x7f err=0xff lsb=0xff msb=0xff
 ata5-slave:  stat=0x7f err=0xff lsb=0xff msb=0xff
 ata5: reset tp2 stat0=ff stat1=ff devices=0x0
 ata5: [MPSAFE]

 Any hints/ideas for what I can do to make the most of
 my new hardware?

 Thanks,
 Alan Bryan


 __
 Do You Yahoo!?
 Tired of spam?  Yahoo! Mail has the best spam protection around
 http://mail.yahoo.com
 ___
 freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to [EMAIL PROTECTED]


-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Diskless boot problem

2005-05-20 Thread Doug White
Please try to avoid sending your message bodies base64 encoded :) My mail
client got really confused by it and didn't quote the message properly.

Also stripping hackers cc:.

On Thu, 19 May 2005, [ISO-8859-2] S³awek ¯ak wrote:

 Hi,

 I have a problem with booting Dell 2850 over network. The machine reads
 kernel over net, boots upto mounting / from NFS and then crashes.

What is the NFS server? It seems to think the NFS handle we pulled the
kernel with is no longer valid.

Does PXE and the system itself end up pulling different IP addresses?

(rest of original email follows)

Tcpdump output:

12:15:58.919683 arp who-has 10.158.190.73 tell 10.158.190.74
12:15:58.919702 arp reply 10.158.190.73 is-at 00:11:43:d3:6e:e1
12:15:58.920058 IP 10.158.190.74.475209176  10.158.190.73.2049: 92
getattr [|nfs]
12:15:58.920134 IP 10.158.190.73.2049  10.158.190.74.475209176: reply
ok 28 getattr ERROR: Stale NFS file handle
12:15:58.920432 arp who-has 10.158.190.73 tell 10.158.190.74
12:15:58.920442 arp reply 10.158.190.73 is-at 00:11:43:d3:6e:e1
12:15:58.920681 IP 10.158.190.74.475209177  10.158.190.73.2049: 100
lookup [|nfs]
12:15:58.920707 IP 10.158.190.73.2049  10.158.190.74.475209177: reply
ok 28 lookup ERROR: Stale NFS file handle
12:15:58.920932 IP 10.158.190.74.475209178  10.158.190.73.2049: 100
lookup [|nfs]
12:15:58.920963 IP 10.158.190.73.2049  10.158.190.74.475209178: reply
ok 28 lookup ERROR: Stale NFS file handle
12:15:58.952180 IP 10.158.190.74.475209179  10.158.190.73.2049: 100
lookup [|nfs]
12:15:58.952277 IP 10.158.190.73.2049  10.158.190.74.475209179: reply
ok 28 lookup ERROR: Stale NFS file handle
12:15:58.984785 IP 10.158.190.74.475209180  10.158.190.73.2049: 100
lookup [|nfs]
12:15:58.984866 IP 10.158.190.73.2049  10.158.190.74.475209180: reply
ok 28 lookup ERROR: Stale NFS file handle
12:15:59.020500 IP 10.158.190.74.475209181  10.158.190.73.2049: 104
lookup [|nfs]
12:15:59.020573 IP 10.158.190.73.2049  10.158.190.74.475209181: reply
ok 28 lookup ERROR: Stale NFS file handle
12:15:59.054130 IP 10.158.190.74.475209182  10.158.190.73.2049: 104
lookup [|nfs]
12:15:59.054224 IP 10.158.190.73.2049  10.158.190.74.475209182: reply
ok 28 lookup ERROR: Stale NFS file handle

I wonder where the `Stale NFS handle'  error comes from, as the client
doesn't seem to have mounted the filesystem over NFS from what I can
see. On the console of the diskless I have this:

NFS ROOT: 10.158.190.73:/var/www/FreeBSD-5.4-x86-PXE
em0: Link is up 100 Mbps Half Duplex
exec /sbin/init: error 70
exec /sbin/oinit: error 70
exec /sbin/init.bak: error 70
exec /rescue/init: error 70
exec /stand/sysinstall: error 70
init: not found in path
/sbin/init:/sbin/oinit:/sbin/init.bak:/rescue/init:/stand/sysinstall
panic: no init
Uptime: 55s
Cannot dump. No dump device defined.
Automatic reboot in 15 seconds - press a key on the console to abort

The speed for em0 is obviously wrong. Setting on the switch is 100
full-duplex. Our network wizards can f***kup autonegotiation on Cisco
Catalyst, so it must stay that way. Intel em-s tend to hang for a
couple of seconds before getting on the net so it might be the
problem. On the other hand kernel loads just fine over TFTP.

Any thoughts?

Thanks, /S
-- 
S³awek ¯ak / UNIX Systems Administrator


-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


RE: Re: Supermicro superserver 5013S-i

2005-05-18 Thread Doug White
On Sun, 15 May 2005, Balgansuren.B wrote:

 Doug,

 I checked website for BIOS files and current loaded BIOS is latest
 version.

 If I to use continously ACPI aware=NO, is there any problem?

You can, but you may have reduced functionality (particularly if ACPI
supports thermal zones).

 Previously, on server was loaded Fedora Core 2, then I removed it.

 BTW, when I install Fedora Core 2 and it works normal.

Fedora doesn't use ACPI by default I don't think. They may also suppress
the messages.

Again, the messages are largely harmless.  I suspect its due to using a
very old ACPI compiler and specification.


 Balgaa


  On Sat, 14 May 2005, Balgansuren.B wrote:
 
   Yesterday (13 May, 2005), I installed FreeBSD 5.4
  Release on this
   server without problem. This morning (14 May, 2005)
  I did cvsup and now
   it is 5.4-STABLE.
  
   But I saw strange thing when I change settings on
  bios setup.
  
 
  [...]
 
   May 14 10:41:41 altainet kernel: ACPI-0698: ***
  Warning: Type override
   - [DEB_] had invalid type (Integer) for Scope
  operator, changed to
   (Scope)
 
  This is a bug in the ACPI DSDT table in the BIOS. If
  upgrading the BIOS
  causes this then contact your systems vendor and
  complain since the prior
  BIOS table did not have these errors. It obviously
  goes away when ACPI is
  disabled since the table is not evaluated in that
  instance.
 
  Its likely harmless.
 
  --
  Doug White|  FreeBSD: The Power
  to Serve
  [EMAIL PROTECTED]  |  www.FreeBSD.org
  ___
  freebsd-stable@freebsd.org mailing list
  http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 
  To unsubscribe, send any mail to
  [EMAIL PROTECTED]
 
 


 ___
 freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to [EMAIL PROTECTED]


-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: panic in recent RELENG_5 tcp code path

2005-05-18 Thread Doug White
On Sun, 15 May 2005, Jeremie Le Hen wrote:

 Sorry, I couldn't get a dump.

 %%%
 obiwan:tataz$ uname -a
 FreeBSD obiwan.tataz.chchile.org 5.4-STABLE FreeBSD 5.4-STABLE #16: Fri 
 May 13 01:01:50 CEST 2005 [EMAIL 
 PROTECTED]:/usr/src/sys/i386/compile/OBIWAN  i386
 %%%

 %%%
 Fatal trap 12: page fault while in kernel mode
 fault virtual address   = 0xc
 fault code  = supervisor read, page not present
 instruction pointer = 0x8:0xc05aa4e0
 stack pointer   = 0x10:0xd6dbfaa4
 frame pointer   = 0x10:0xd6dbfabc
 code segment= base 0x0, limit 0xf, type 0x1b
 = DPL 0, pres 1, def32 1, gran 1
 processor eflags= interrupt enabled, resume, IOPL = 0
 current process = 25637 (sshd)
 [thread pid 25637 tid 100131 ]
 Stopped at  m_copydata+0x28:movl0xc(%esi),%ebx
 db trace
 Tracing pid 25637 tid 100131 td 0xc23bc180
 m_copydata(c211aa00,0,40,c211aaa8,c21422ec) at m_copydata+0x28
 tcp_output(c1d74534,c211aa00,c211aa30,40,0) at tcp_output+0xb49
 tcp_usr_send(c1ec9144,0,c211aa00,0,0) at tcp_usr_send+0x1ca
 sosend(c1ec9144,0,d6dbfc6c,c211aa00,0) at sosend+0x6dc
 soo_write(c21422ec,d6dbfc6c,c2c2dd89,0,c23bc180) at soo_write+0x9e
 dofilewrite(c23bc180,c21422ec,4,807d000,40) at dofilewrite+0xb6
 write(c23bc180,d6dbfd04,c,c23bc180,c21264b0) at write+0x6a
 syscall(807002f,bfbf002f,bfbf002f,806eca8,40) at syscall+0x340
 Xint0x80_syscall() at Xint0x80_syscall+0x1f
 --- syscall (4, FreeBSD ELF32, write), eip = 0x2826cd0b, esp = 
 0xbfbfe4fc, ebp = 0xbfbfr518 ---
 %%%

 Please Cc: me in replies, I'm not subscribed to this list.

Can you load a kernel.debug into gdb and do l *(tcp_output+0xb49)  and
post the output? that offset isn't a function call in my kernel.
tcp_output() doesn't call m_copypacket directly so the exact spot is
difficult to find.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.4-RC2 freezing - ATA related?

2005-05-18 Thread Doug White
On Wed, 18 May 2005, Jamie Heckford wrote:

 Hi Peter,

 On Thu, May 19, 2005 at 05:53:12AM +1000, Peter Jeremy wrote:
  On Wed, 2005-May-18 16:03:16 +0100, Jamie Heckford wrote:
  Managed to get a dump on our system for a similar prob we are getting:
 
  That traceback looks like a panic, not a deadlock.  What was the panic
  message?

 Only have remote access to the box im afraid, is there anyway I can obtain
 the panic message?

print msgbuf should do it


 
  #2  0xc0513474 in panic (fmt=0xc06c3da5 %s) at 
  /usr/src/sys/kern/kern_shutdown.c:566
  ...
  #7  0xc0510018 in crcopy () at /usr/src/sys/kern/kern_prot.c:1810
  #8  0xc0598c77 in in_pcbdetach (inp=0xc0743a40) at 
  /usr/src/sys/netinet/in_pcb.c:720
  #9  0xc05b21a6 in tcp_close (tp=0x0) at /usr/src/sys/netinet/tcp_subr.c:783
 
  There's something wrong here:  If tcp_close() is passed NULL it will panic
  at this point when it tries to dereference tp.

 Starting to stretch my knowledge a bit now ;)

 If I can provide you with further debug output would you be able to give me 
 some
 pointers?

 Thanks for your help



-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Working Keyboard in 5.x

2005-05-18 Thread Doug White
On Mon, 16 May 2005, Holtor wrote:

 Does anyone know how to make a PS/2 keyboard work when plugged into a
 system booted without a keyboard on FreeBSD 5.x? It doesn't seem to
 work.

This is a system-specific thing. As noted previously clearing flag 0x1 on
atkbd0 can help this, but it doesn't always work on all motherboards.  To
do this, edit /boot/device.hints and make sure there is no
hint.atkbd.0.flags line. Reboot to have the change take effect.

_Technically_, ps/2 is not hotplug -- you risk severe electrical damage to
the system by hotplugging ps/2 keyboards and mice.  If you do this
frequently you may consider investing in a KVM.

 For example in 4.x the default GENERIC kernel line is:
 device  atkbd0  at atkbdc? irq 1 flags 0x1

 If you leave that line alone, the ps/2 keyboard will not work if a system
 booted without one plugged in. But if you remove the flags 0x1 it will
 work fine.

 Does anyone know how to make the PS/2 work properly in 5.x?

 Thank you,

 Holt G.




 __
 Yahoo! Mail Mobile
 Take Yahoo! Mail with you! Check email on your mobile phone.
 http://mobile.yahoo.com/learn/mail
 ___
 freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to [EMAIL PROTECTED]


-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: problems with ASR card/5.4

2005-05-18 Thread Doug White
On Mon, 16 May 2005, Bruce Burden wrote:



   Hi folks,

   I am trying to get my Adaptec 3210S RAID running under 5.4,
   and all I get for my trouble is pmap_ errors. Sorry, no dumps at
   the  moment.

   If I compile device asr into the kernel, I get a message
   about could not copy LDT, and a panic about pmap_enter:
   invalid page directory when the system moves from single to multi
   user. (single user is no problem?)

   If I do not compile the asr driver, and kldload asr instead,
   I get a panic from pmap_mapdev, instead.

   Any ideas? This is annoying, and I am seriously considering
   returning to 4.11, where asr was working...

This sounds like your system has severe memory corruption issues.  I'd try
a different PCI slot for your asr card first, then try the card in another
known working system and see if your problems appear there.

Have you tested the system without the asr card installed?

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: CUPDS reboot the whole system

2005-05-18 Thread Doug White
On Wed, 18 May 2005, Todor Dragnev wrote:

 Hello,

 Before a couple of days ago I started /usr/local/sbin/cupsd manualy from
 console. When I press CTRL+C to interrupt a program, the system change
 runlevel and going to reboot. This was on FreeBSD V5.3, today I
 installed fresh new 5.4 but the problem is the same.

FreeBSD doesn't have runlevels so I'm not sure what you're referring to
here.

Its also known that certain old Linux binaries that accidentally get run
as FreeBSD binaries can call reboot() when trying a Linux SYSV syscall.
If you copied cupsd from a linux box, use brandelf on it to make sure it
gets run under the linuxulator.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.4-RC3 crash report

2005-05-18 Thread Doug White
On Tue, 17 May 2005, Cedric Tabary wrote:

 On 17/05/2005 17:54, Cedric Tabary wrote:
  FreeBSD 5.4-RC3 i386, no SMP compiled in kernel
  dmesg atached

 Oops forgot the dmesg ;)

Panic  trap messages would be nice too, so we know what address is
exploded on.


 Copyright (c) 1992-2005 The FreeBSD Project.
 Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
   The Regents of the University of California. All rights reserved.
 FreeBSD 5.4-RC3 #1: Thu Apr 21 10:01:03 CEST 2005
 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/CED
 Timecounter i8254 frequency 1193182 Hz quality 0
 CPU: Intel(R) Xeon(TM) CPU 2.80GHz (2793.01-MHz 686-class CPU)
   Origin = GenuineIntel  Id = 0xf41  Stepping = 1
   
 Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
   Hyperthreading: 2 logical CPUs
 real memory  = 1073479680 (1023 MB)
 avail memory = 1040789504 (992 MB)
 ACPI APIC Table: DELL   PE BKC  
 ioapic0: Changing APIC ID to 2
 ioapic1: Changing APIC ID to 3
 ioapic1: WARNING: intbase 32 != expected base 24
 ioapic2: Changing APIC ID to 4
 ioapic2: WARNING: intbase 64 != expected base 56
 ioapic0 Version 2.0 irqs 0-23 on motherboard
 ioapic1 Version 2.0 irqs 32-55 on motherboard
 ioapic2 Version 2.0 irqs 64-87 on motherboard
 npx0: math processor on motherboard
 npx0: INT 16 interface
 acpi0: DELL PE BKC on motherboard
 acpi0: Power Button (fixed)
 Timecounter ACPI-fast frequency 3579545 Hz quality 1000
 acpi_timer0: 24-bit timer at 3.579545MHz port 0x808-0x80b on acpi0
 cpu0: ACPI CPU on acpi0
 pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
 pci0: ACPI PCI bus on pcib0
 pcib1: ACPI PCI-PCI bridge at device 2.0 on pci0
 pci1: ACPI PCI bus on pcib1
 pcib2: ACPI PCI-PCI bridge at device 0.0 on pci1
 pci2: ACPI PCI bus on pcib2
 amr0: LSILogic MegaRAID 1.51 mem 
 0xdfee-0xdfef,0xd80f-0xd80f irq 46 at device 14.0 on pci2
 amr0: LSILogic PERC 4e/Si Firmware 513O, BIOS H418, 256MB RAM
 pcib3: ACPI PCI-PCI bridge at device 0.2 on pci1
 pci3: ACPI PCI bus on pcib3
 pcib4: ACPI PCI-PCI bridge at device 4.0 on pci0
 pci4: ACPI PCI bus on pcib4
 pcib5: ACPI PCI-PCI bridge at device 5.0 on pci0
 pci5: ACPI PCI bus on pcib5
 pcib6: ACPI PCI-PCI bridge at device 0.0 on pci5
 pci6: ACPI PCI bus on pcib6
 em0: Intel(R) PRO/1000 Network Connection, Version - 1.7.35 port 
 0xecc0-0xecff mem 0xdfbe-0xdfbf irq 64 at device 7.0 on pci6
 em0: Ethernet address: 00:11:43:36:39:e5
 em0:  Speed:N/A  Duplex:N/A
 pcib7: ACPI PCI-PCI bridge at device 0.2 on pci5
 pci7: ACPI PCI bus on pcib7
 em1: Intel(R) PRO/1000 Network Connection, Version - 1.7.35 port 
 0xdcc0-0xdcff mem 0xdf9e-0xdf9f irq 65 at device 8.0 on pci7
 em1: Ethernet address: 00:11:43:36:39:e6
 em1:  Speed:N/A  Duplex:N/A
 pcib8: ACPI PCI-PCI bridge at device 6.0 on pci0
 pci8: ACPI PCI bus on pcib8
 pcib9: ACPI PCI-PCI bridge at device 30.0 on pci0
 pci9: ACPI PCI bus on pcib9
 pci9: display, VGA at device 13.0 (no driver attached)
 isab0: PCI-ISA bridge at device 31.0 on pci0
 isa0: ISA bus on isab0
 atapci0: Intel ICH5 UDMA100 controller port 
 0xfc00-0xfc0f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 31.1 on pci0
 ata0: channel #0 on atapci0
 ata1: channel #1 on atapci0
 fdc0: floppy drive controller port 0x3f7,0x3f0-0x3f5 irq 6 drq 2 on acpi0
 sio0: 16550A-compatible COM port port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
 sio0: type 16550A
 orm0: ISA Option ROMs at iomem 
 0xec000-0xe,0xce800-0xcf7ff,0xcb000-0xcbfff,0xc-0xcafff on isa0
 pmtimer0 on isa0
 atkbdc0: Keyboard controller (i8042) at port 0x64,0x60 on isa0
 atkbd0: AT Keyboard irq 1 on atkbdc0
 kbd0 at atkbd0
 ppc0: parallel port not found.
 sc0: System console at flags 0x100 on isa0
 sc0: VGA 16 virtual consoles, flags=0x300
 sio1: configured irq 3 not in bitmap of probed irqs 0
 sio1: port may not be enabled
 vga0: Generic ISA VGA at port 0x3c0-0x3df iomem 0xa-0xb on isa0
 Timecounter TSC frequency 2793014175 Hz quality 800
 Timecounters tick every 1.000 msec
 IP Filter: v3.4.35 initialized.  Default = pass all, Logging = enabled
 acd0: CDROM SAMSUNG CD-ROM SN-124/N104 at ata0-master PIO4
 amrd0: LSILogic MegaRAID logical drive on amr0
 amrd0: 69880MB (143114240 sectors) RAID 1 (optimal)
 ses0 at amr0 bus 0 target 6 lun 0
 ses0: PE/PV 1x2 SCSI BP 1.0 Fixed Processor SCSI-2 device
 ses0: SAF-TE Compliant Device
 Mounting root from ufs:/dev/amrd0s1a
 em0: Link is up 100 Mbps Full Duplex
 em1: Link is up 100 Mbps Full Duplex
 ___
 freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to [EMAIL PROTECTED]


-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman

Re: SATA378 / SATA150 RAID controller supported by 5.4?

2005-05-18 Thread Doug White
On Tue, 17 May 2005, Rob wrote:


 Hi,

 The 'pciconf -lv' tells me:

  [EMAIL PROTECTED]:4:0: class=0x010400 card=0x80f51043
   chip=0x3373105a rev=0x02 hdr=0x00
 vendor   = 'Promise Technology Inc'
 device   = 'PDC20378 FastTrak 378/SATA 378 RAID
 Controller'
 class= mass storage
 subclass = RAID

 But I get this in my dmesg output:

  atapci0: Promise PDC20378 SATA150 controller
 port 0xd880-0xd8ff,0xdfa0-0xdfaf,0xdf00-0xdf3f
 mem 0xfeac-0xfead,0xfeafe000-0xfeafefff
 irq 10 at device 4.0 on pci2
  atapci0: failed: rid 0x20 is memory, requested 4

That's the PCI busmaster register, although it seems to be the wrong
resource type. It should be I/O space, which maps to PCI config space, and
not memory-mapped.  I can't seem to find what function emits that message.

A full dmesg would be useful.

  rl0: RealTek 8139 10/100BaseTX
  port 0xd000-0xd0ff
  mem 0xfeaff400-0xfeaff4ff
  irq 10 at device 9.0 on pci2

 At present I use a single harddisk on the regular
 IDE connector (atapci1 UDMA100-controller).

 Is the RAID controllor on atapci0 supported by 5.4?
 What is the 'failed' message about in dmesg?

It seems to be a bit error...

 Also note the seemingly interrupt conflict of both,
 rl0 and atapci0, claiming irq 10 ?!?!

Thats normal for PIC mode. PCI cards can share interrupts, but ISA can't
share with anything else, including other ISA cards.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Boot loader cant identify ntfs?

2005-05-18 Thread Doug White
On Wed, 18 May 2005, Matthias Buelow wrote:

 Dan Nelson wrote:

  The next release should:
  RCS file: /home/ncvs/src/sys/boot/i386/boot0/boot0.S,v
 [...]

 Obviously the Right Way of doing such things is to move this into the
 2nd stage boot loader... I can't think of any reason (except quick
 hackery) why this hasn't been done that way.  I mean, one could retain a
 simple choice of which disk/partition to boot in the 1st stage (to cover
 all eventualities), and maybe hide it by having to hit a key, and do the
 real menu in the 2nd stage, perhaps integrated with the kernel options
 boot menu.

Then the space problem just migrates.  There's a limited amount of space
in the disklabel for bootblocks and I think we're pushing that.

If you want pretty menus, there's a large number of boot managers
available.  Someone showed me one at BSDcan and I don't recall the name
offhand. Even had beastie icons.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.4-RELEASE amd64: Panic isadma_start : bad bounce buffer

2005-05-14 Thread Doug White
On Sat, 14 May 2005, Steven Hartland wrote:

 Just been trying to install FreeBSD 5.4 amd64 here and
 when loading the raid control kernel module driver ( hptmv )
 from floppy and booting I was getting:
 Panic isadma_start : bad bounce buffer

This is a known bug; floppy drives don't work on amd64 since the ISA DMA
buffer gets allocated above 16MB.  I don't know if this is something that
needs to be handled by busdma.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Supermicro superserver 5013S-i

2005-05-14 Thread Doug White
On Sat, 14 May 2005, Balgansuren.B wrote:

 Yesterday (13 May, 2005), I installed FreeBSD 5.4 Release on this
 server without problem. This morning (14 May, 2005) I did cvsup and now
 it is 5.4-STABLE.

 But I saw strange thing when I change settings on bios setup.


[...]

 May 14 10:41:41 altainet kernel: ACPI-0698: *** Warning: Type override
 - [DEB_] had invalid type (Integer) for Scope operator, changed to
 (Scope)

This is a bug in the ACPI DSDT table in the BIOS. If upgrading the BIOS
causes this then contact your systems vendor and complain since the prior
BIOS table did not have these errors. It obviously goes away when ACPI is
disabled since the table is not evaluated in that instance.

Its likely harmless.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.4-RELEASE amd64: Panic isadma_start : bad bounce buffer

2005-05-14 Thread Doug White
On Sat, 14 May 2005, Steven Hartland wrote:

 Thanks for the confirmation there Doug at least I have a workaround :)

As another workaround, add this to loader.conf and reboot:

vm.old_contigmalloc=1

This uses the old contigmalloc algorithm which may be more successful as
allocating the bounce buffer area.


 Steve
 - Original Message -
 From: Doug White [EMAIL PROTECTED]
 
  This is a known bug; floppy drives don't work on amd64 since the ISA DMA
  buffer gets allocated above 16MB.  I don't know if this is something that
  needs to be handled by busdma.


 
 This e.mail is private and confidential between Multiplay (UK) Ltd. and the 
 person or entity to whom it is addressed. In the event of misdirection, the 
 recipient is prohibited from using, copying, printing or otherwise 
 disseminating it or any information contained in it.

 In the event of misdirection, illegible or incomplete transmission please 
 telephone (023) 8024 3137
 or return the E.mail to [EMAIL PROTECTED]


-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.4 install disc1 will not find hard drive

2005-05-14 Thread Doug White
On Sat, 14 May 2005, fbsd_user wrote:

 I have seen this problem on the questions list since 5.0 first can out as
 development version. Here 5.4 is now stable release and this problem is
 still not fixed. I have been using the same pc to test installing stable
 versions from miniinst.iso every version since 3.4 without any problems.
 Even 5.3-miniinst.iso worked just 2 weeks ago. The 5.4 stable does not have
 a miniinst.iso file so this time I used the disc1.iso to burn the install cd
 from and when booting from this cd I now get no hard drive found message
 from sysinstall (standard install) menu. This sure looks very strange to me
 as I can put in the cd burned from the 5.3-miniinst.iso file and the system
 installs just fine. Only difference is using disc1 this time. Even though
 the md5 CHECKSUM matched I downloaded and reburned disc1.iso again just to
 verify it was good.

What hardware (motherboard, cpus, memory, etc.) are you using?

 Power management and APIC have been bios disabled since version FreeBSD 3.4
 version so that is not the problem. I tried selecting safe option to boot
 and still get same error 'no hard disk found.   My hard drive is an western
 digital 310100 on ata0 as master. When using this 5.4 install cd on other
 pc/ motherboard/ hard drive combos I get same error.

You can hit Scroll Lock and use the arrow keys or page up/down to look at
the boot messages. Near the bottom of the output should be the disk probe
-- this may have more details. If not, try booting and selecting the
verbose boot option.

 I think there is something wrong with the build process of the disc1.iso
 file. The miniinst.iso must be build using a different canned script that
 does not incorporated the bug that is in the disc1 build.   Maybe the .ISO
 builds team needs to take a look at this problem.

The only difference is the files included in the mkisofs run. The exact
same files are used.

 And to continue on with this thought  why does 5.4 NOT have a miniinst.iso
 file?

That's a good question :-) I didn't realize we hadn't made one.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Problem booting 5.4

2005-05-14 Thread Doug White
On Sat, 14 May 2005, rduffner wrote:

 Doug White writes:

  On Thu, 12 May 2005, Rainer Duffner wrote:
 
  Hi,
 
  I installed FreeBSD5.4 on a server with a 3ware 7006-2 controller and
  two 120GB disks as RAID1
  I cannot boot this install. (I get some kind of panic or endless loop,
  but the display is re-painted so fast I cannot read it).
  What I *can* do is insert my SuSE9.2-pro cd1, boot from that and at the
  grub-menu say boot from harddisk.
  That boots FreeBSD.
 
  On advice from IRC, I tried:
  # boot0cfg -B twed0
 
  -which leads to this output
 
  boot0cfg: write_mbr: /dev/twed0: No such file or directory
 
  This is a erroneous message. The actual problem is:
 
 484 boot0cfg NAMI  /dev/twed0
 484 boot0cfg RET   open -1 errno 1 Operation not permitted
 
  This is a known problem with certain MBR layouts. To work around this
  problem, set:
 
  sysctl kern.geom.debugflags=16
 
  then try your boot0cfg. There's a protection mechanism that sometimes gets
  confused by certain partition table layouts. Flag 16 disables that
  protection.  I don't recommend running this unless you are explicitly
  trying to updating something in a partition table-like area; its very easy
  to destroy your system with the flag set!


 I booted from CD and ran the boot0cfg offline - however, this worked only
 on one server. I have two more identical servers that now just beep
 endlessly at the F1-prompt.
 I was told it is a geometry problem, but what else can I do?

boot0cfg -o packet adX

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Problem booting 5.4

2005-05-13 Thread Doug White
On Thu, 12 May 2005, Rainer Duffner wrote:

 Hi,

 I installed FreeBSD5.4 on a server with a 3ware 7006-2 controller and
 two 120GB disks as RAID1
 I cannot boot this install. (I get some kind of panic or endless loop,
 but the display is re-painted so fast I cannot read it).
 What I *can* do is insert my SuSE9.2-pro cd1, boot from that and at the
 grub-menu say boot from harddisk.
 That boots FreeBSD.

 On advice from IRC, I tried:
 # boot0cfg -B twed0

 -which leads to this output

 boot0cfg: write_mbr: /dev/twed0: No such file or directory

This is a erroneous message. The actual problem is:

   484 boot0cfg NAMI  /dev/twed0
   484 boot0cfg RET   open -1 errno 1 Operation not permitted

This is a known problem with certain MBR layouts. To work around this
problem, set:

sysctl kern.geom.debugflags=16

then try your boot0cfg. There's a protection mechanism that sometimes gets
confused by certain partition table layouts. Flag 16 disables that
protection.  I don't recommend running this unless you are explicitly
trying to updating something in a partition table-like area; its very easy
to destroy your system with the flag set!

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.4 weird transmit problem

2005-05-13 Thread Doug White
On Thu, 12 May 2005, jmc wrote:

 I've got 5.4 amd64 installed on an Opteron server and I cannot get it to
 reliably transmit packets larger than 80 bytes using the bge driver (on a
 BCM5703 NIC). It receives large packets without any problem, but it just
 won't transmit them. (I can tcpdump all day long without a problem - big and
 small packets.)

 For example, I can ping -s 38 ip and it works fine. But if I try ping
 -s 39 ip (or any size larger than 38) it does not work. A 38 byte ping
 creates an 80 byte Ethernet packet.

Random guesses:

1. Make sure your switch agrees with the speed and duplex setting.
Auto-neg problems are common.

2. Replace the cable.

3. Back-to-back two systems and try to reproduce.


 Here's the bge0 info from dmesg:

 bge0: Broadcom BCM5703X Gigabit Ethernet, ASIC rev. 0x1100 mem
 0xf7ef-0xf7
 ef irq 24 at device 1.0 on pci3
 bge0: Reserved 0x1 bytes for rid 0x10 type 3 at 0xf7ef
 miibus0: MII bus on bge0
 brgphy0: BCM5703 10/100/1000baseTX PHY on miibus0
 brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX,
 1000baseTX
 -FDX, auto
 bge0: bpf attached
 bge0: Ethernet address: 00:11:85:fd:8f:f9
 bge0: [MPSAFE]

 Here's ifconfig for bge0:

 bge0: flags=8843UP,BROADCAST,RUNNING,SIMPLEX,MULTICAST mtu 1500
 options=1aTXCSUM,VLAN_MTU,VLAN_HWTAGGING
 inet 16.100.240.165 http://16.100.240.165 netmask 0xfc00 broadcast
 16.100.243.255 http://16.100.243.255
 inet6 fe80::211:85ff:fefd:8ff9%bge0 prefixlen 64 scopeid 0x1
 ether 00:11:85:fd:8f:f9
 media: Ethernet autoselect (1000baseTX full-duplex)
 status: active

 Much to my dismay, I've found that if I use the -l option (preload) for
 ping, I get some large packets through:

 ninox# ping -l10 -c10 -s200 16.100.240.1 http://16.100.240.1
 PING 16.100.240.1 http://16.100.240.1 (16.100.240.1 http://16.100.240.1):
 200 data bytes
 208 bytes from 16.100.240.1 http://16.100.240.1: icmp_seq=2 ttl=128 time=
 1.025 ms
 208 bytes from 16.100.240.1 http://16.100.240.1: icmp_seq=3 ttl=128 time=
 1.306 ms
 208 bytes from 16.100.240.1 http://16.100.240.1: icmp_seq=4 ttl=128 time=
 1.593 ms
 208 bytes from 16.100.240.1 http://16.100.240.1: icmp_seq=5 ttl=128 time=
 2.026 ms
 208 bytes from 16.100.240.1 http://16.100.240.1: icmp_seq=6 ttl=128 time=
 2.314 ms
 208 bytes from 16.100.240.1 http://16.100.240.1: icmp_seq=7 ttl=128 time=
 2.746 ms
 208 bytes from 16.100.240.1 http://16.100.240.1: icmp_seq=8 ttl=128 time=
 3.034 ms
 208 bytes from 16.100.240.1 http://16.100.240.1: icmp_seq=9 ttl=128 time=
 3.468 ms

 --- 16.100.240.1 http://16.100.240.1 ping statistics ---
 10 packets transmitted, 8 packets received, 20% packet loss
 round-trip min/avg/max/stddev = 1.025/2.189/3.468/0.806 ms

 Has anyone else ever seen a problem like this? Any suggestions on where to
 poke around for a solution?

 Thanks,
 John
 ___
 freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to [EMAIL PROTECTED]


-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Experimental ttwwakeup() panic patch

2005-05-11 Thread Doug White
Sorry for the late reply on this.

On Fri, 6 May 2005, Ed Maste wrote:

 On Wed, May 04, 2005 at 08:54:23PM -0700, Doug White wrote:

   http://people.freebsd.org/~dwhite/tty.c.20050503.patch
 
  This patch has been committed and exists as rev 1.228.2.4 of
  src/sys/kern/tty.c.  Please let me know if this fixes the panic for you,
  or causes new problems :)

 For what it's worth, I've come across a similar crash during a test that
 repeatedly ssh'd into the machine.

 This was while opening a pty, not using serial console.  The symptoms
 seem similar; my tty struct has a struct knlist with a null struct mtx *.
 I haven't yet investigated in great detail or tried the patch.  I just
 wanted to offer this as another data point.

 Although kgdb gets confused during the backtrace (frames 7 and 8) the
 tf_eip is a cmpxchg in knote().

This is a problem mlaier and I may have fixed, at least against -CURRENT.
It appears that the process group alias in struct tty is accessed without
locking and its changing out from under us.

Give this a try:

http://people.freebsd.org/~mlaier/tty.t_pgrp.diff

It compiles but I haven't actually booted a kernel with it, so YMMV.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: twa0: INFO: (0x04: 0x000c): Background initialize started: unit=0

2005-05-11 Thread Doug White
On Mon, 9 May 2005, Marc G. Fournier wrote:


 Ya, this looks like it might be a problem ... server just crashed, and
 fsck is once more dog slow, and I suspect its in the 'initialization mode'
 again ...

 Looking at the following that I've also found, things appear to be
 pointing to ACPI as the 'trigger' ... does anyone else have any
 experiences with these cards?

This is normal for twa and twe cards with recent firmware. If it detects
an unclean shutdown it will schedule a unit verify.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: loss of keyboard in single-user mode in RELENG_5

2005-05-08 Thread Doug White
Don't crosspost both -current and -stable; your problem does not affect
-current.

On Fri, 6 May 2005, Thierry Herbelot wrote:

 Hello,

 I have rebuilt the world and kernel this morning.

The commit causing this problem has been backed out. Please cvsup and try
again.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: unexpected panic in idle process (RELENG_5 2005/04/25UTC)

2005-05-08 Thread Doug White
0 0 0 204 [IWAIT] irq10: sis0
20 c0ae81c40 0 0 204 [IWAIT] irq9: sis1
19 c0ae83880 0 0 204 [IWAIT] irq8: rtc
18 c0ac50000 0 0 204 [IWAIT] irq7:
17 c0ac51c40 0 0 204 [IWAIT] irq6:
16 c0ac53880 0 0 204 [IWAIT] irq5:
15 c0ac554c0 0 0 204 [IWAIT] irq4: sio0
14 c0ac57100 0 0 204 [IWAIT] irq3:
13 c0ac58d40 0 0 204 [IWAIT] irq1:
12 c0ac5a980 0 0 204 [IWAIT] irq0: clk
11 c0ac5c5c0 0 0 20c [CPU 0] idle
 1 c0ac5e200 0 1 0004200 [SLPQ wait 0xc0ac5e20][SLP] init
10 c0acc0000 0 0 204 [SLPQ ktrace 0xc066f7d8][SLP] ktrace
 0 c066bb000 0 0 200 [SLPQ sched 0xc066bb00][SLP] swapper
 db

 unfortunately, I won't be able to take a dump because the CF has
 no swap.

 Anyone have an idea what this could be, and what I could type into
 db to get further info?

 One thing that I changed on this HW yesterday was to add a Mini-PCI
 HiFn card:

 hifn0 mem 0x8004-0x80040fff,0x8000-0x8fff irq 12 at device 13.0 
 on pci0
 hifn0: Hifn 7951, rev 0, 128KB sram

 before that change, earlier RELENG_5 systems didn't demonstrate
 any such an idle panic on said HW.

 non-verbose dmesg and kernel config image are available, and I will
 leave the db there, awaiting suggestions.

 Adrian


 ___
 freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to [EMAIL PROTECTED]


-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Experimental ttwwakeup() panic patch

2005-05-04 Thread Doug White
On Tue, 3 May 2005, Doug White wrote:

 Hey folks,

 I've taken a crack at working around the ttwwakeup() panic thats been
 reported now and again.  My early analysis, based on debugging output from
 rwatson, is that a defunct struct tty gets reused without cleaning out the
 associated (stale) knote structures, and the ttwwakeup() at the end of
 sioopen() jumps off into space when it finds them.

 This patch is against RELENG_5 but the logic should apply to -CURRENT,
 although the patch likely won't as ttymalloc() is organized differently
 there.

 I did some basic testing on my UP box and didn't see any abberant behavior
 afterwards. However I can't reproduce the panic in question, so if you're
 good at triggering the panic give this a spin.

 http://people.freebsd.org/~dwhite/tty.c.20050503.patch

This patch has been committed and exists as rev 1.228.2.4 of
src/sys/kern/tty.c.  Please let me know if this fixes the panic for you,
or causes new problems :)

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Experimental ttwwakeup() panic patch

2005-05-03 Thread Doug White
Hey folks,

I've taken a crack at working around the ttwwakeup() panic thats been
reported now and again.  My early analysis, based on debugging output from
rwatson, is that a defunct struct tty gets reused without cleaning out the
associated (stale) knote structures, and the ttwwakeup() at the end of
sioopen() jumps off into space when it finds them.

This patch is against RELENG_5 but the logic should apply to -CURRENT,
although the patch likely won't as ttymalloc() is organized differently
there.

I did some basic testing on my UP box and didn't see any abberant behavior
afterwards. However I can't reproduce the panic in question, so if you're
good at triggering the panic give this a spin.

http://people.freebsd.org/~dwhite/tty.c.20050503.patch

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Experimental ttwwakeup() panic patch

2005-05-02 Thread Doug White
Hey folks,

I've taken a crack at working around the ttwwakeup() panic thats been
reported now and again.  My early analysis, based on debugging output from
rwatson, is that a defunct struct tty gets reused without cleaning out the
associated (stale) knote structures, and the ttwwakeup() at the end of
sioopen() jumps off into space when it finds them.

This patch is against RELENG_5 but the logic should apply to -CURRENT,
although the patch likely won't as ttymalloc() is organized differently
there.

I did some basic testing on my UP box and didn't see any abberant behavior
afterwards. However I can't reproduce the panic in question, so if you're
good at triggering the panic give this a spin.

http://people.freebsd.org/~dwhite/tty.c.20050502.patch

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Opteron deadlock fix committed

2005-05-01 Thread Doug White
Hey folks,

Yesterday I committed a workaround for a deadlock condition in RELENG_5
and RELENG_5_4 caused by errata in AMD Opteron and Athlon64 processors.
(-CURRENT after April 8 is not affected due to changes in critical
sections.)

The deadlock appeared under heavy load, particularly with lots of I/O
interrupts, in SMP environments on both FreeBSD/amd64 and FreeBSD/i386.
Specifically, a spinwait as part of inter-processor TLB flushing triggered
Errata 106, which causes the cache on the spinning processor to not
update. To workaround the issue we enabled interrupts during the spinwait,
which breaks the lock on the cache when an interrupt occurs.  AMD also
offers a workaround that the BIOS can implement, but not all systems have
applied the errata.

This workaround will appear in 5.4-RELEASE.

In addition, I committed a KDB feature written by Stephen Uphoff (ups) to
assist in debugging SMP situations where one processor is stuck with
interrupts disabled. Since the regular cpustop IPI won't get through in
that case, he implemented an NMI handler to perform the stop.  This
feature, compiled in with the kernel option KDB_STOP_NMI and activated by
debug.kdb.stop_cpus_with_nmi, is available on all current branches and
will ship with 5.4-RELEASE.

If you have had problems with deadlock conditions on AMD processors in an
SMP environment, we urge you to update to the latest RELENG_5, or try the
upcoming 5.4-RC4.

I want to thank the following individuals for their help in debugging and
fixing the deadlock issue:

Paul Vixie and Peter Losher at ISC
Stephen Uphoff (ups)
Alan Cox (alc)
John Baldwin (jhb)
The rest of the FreeBSD RE team

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Multithreaded program crashes on SMP (HT enabled) machine

2005-04-29 Thread Doug White
Dropping -hackers and -bugs

On Wed, 27 Apr 2005, Adi Pircalabu wrote:

 Hello,
 I have a multithreaded application ported on FreeBSD 5.3 which crashes
 in a minute or less if hyperthreading in enabled. Without HT there is no
 problem.
 How and where should I start to investigate the problem?


Can you expalin crash? Do you get any messages?

 The backtrace:
 (gdb) bt
 #0  0x2819631b in pthread_testcancel () from /usr/lib/libpthread.so.1
 #1  0x2818e902 in pthread_mutexattr_init () from
 /usr/lib/libpthread.so.1 #2  0x in ?? ()

This looks like a thread library mismatch.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Reproducible panic 5.4-stable when booting with -v

2005-04-29 Thread Doug White
On Wed, 27 Apr 2005, Marc Olzheim wrote:

 The same system about which I just mailed, crashes when booting with
 verbose logging enabled. With a normal boot, everything's ok...

Looks like it perturbs some sort of timing and upsets the SCSI card.  Is
this on a serail console?

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: vm_page_free: freeing wired page

2005-04-27 Thread Doug White
On Tue, 26 Apr 2005, Sergey S. Ropchan wrote:

 Hi, all

 I have problem on high loaded SMP server with FreeBSD 5.3-p10+Apache
 1.3.33+Mysql 4.0 reported in kern/75780 !

 State still open - not fixed as i understand!

 If there any possible way to find workaround of this problem !?

 Maybe cvsup to 5.4 or something else ?!

 This problem exist only in 5.3 or in 5.x at all !?

You might try one of the 5.4-RC release candidates ... there's been a
bunch of VM fixes in that area that would help you.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: mysql40-server with linuxthreads and CFLAGS=-DUSE_OLD_FUNCTIONS

2005-04-25 Thread Doug White
On Sun, 17 Apr 2005 [EMAIL PROTECTED] wrote:

 Does any 1-:) have a compiled mysql4*-server with linuxthreads and CFLAGS=-
 DUSE_OLD_FUNCTIONS

 I went by all the deltas from 5500 up to today ( 5734 ) and always get the 
 same
 error
 
 cc -DDBUG_OFF -DUSE_OLD_FUNCTIONS -DUSE_OLD_FUNCTIONS -felide-constructors 
 -fno-
 rtti -fno-exceptions -fno-implicit-templates -fno-exceptions -fno-rtti -
 DMYSQLD_NET_RETRY_COUNT=100 -o mysqld sql_lex.o sql_handler.o item.o
 item_sum.o item_buff.o item_func.o item_cmpfunc.o item_strfunc.o
 item_timefunc.o thr_malloc.o item_create.o field.o key.o sql_class.o 
 sql_list.o
 net_serv.o net_pkg.o lock.o my_lock.o sql_string.o sql_manager.o sql_map.o
 mysqld.o password.o hash_filo.o hostname.o convert.o set_var.o sql_parse.o
 sql_yacc.o sql_base.o table.o sql_select.o sql_insert.o sql_update.o
 sql_delete.o uniques.o sql_do.o procedure.o item_uniq.o sql_test.o log.o
 log_event.o init.o derror.o sql_acl.o unireg.o des_key_file.o time.o
 opt_range.o opt_sum.o opt_ft.o records.o filesort.o handler.o ha_heap.o
 ha_myisam.o ha_myisammrg.o ha_berkeley.o ha_innodb.o ha_isam.o ha_isammrg.o
 sql_db.o sql_table.o sql_rename.o sql_crypt.o sql_load.o mf_iocache.o
 field_conv.o sql_show.o sql_udf.o sql_analyse.o sql_cache.o slave.o sql_repl.o
 sql_union.o mini_client.o mini_client_errors.o stacktrace.o repl_failsafe.o -
 static -DHAVE_GLIBC2_STYLE_GETHOSTBYNAME_R -D_THREAD_SAFE -
 I/usr/local/include/pthread/linuxthreads -DHAVE_GLIBC2_STYLE_GETHOSTBYNAME_R -
 D_THREAD_SAFE -I/usr/local/include/pthread/linuxthreads  -
 L/usr/ports/databases/mysql40-server/work/mysql-4.0.24/bdb/build_unix -
 ldb ../innobase/usr/libusr.a ../innobase/srv/libsrv.a 
 ../innobase/dict/libdict.a
  ../innobase/que/libque.a ../innobase/srv/libsrv.a ../innobase/ibuf/libibuf.a 
 ..
 /innobase/row/librow.a ../innobase/pars/libpars.a ../innobase/btr/libbtr.a 
 ../in
 nobase/trx/libtrx.a ../innobase/read/libread.a ../innobase/usr/libusr.a 
 ../innob
 ase/buf/libbuf.a ../innobase/ibuf/libibuf.a ../innobase/eval/libeval.a 
 ../innoba
 se/log/liblog.a ../innobase/fsp/libfsp.a ../innobase/fut/libfut.a 
 ../innobase/fi
 l/libfil.a ../innobase/lock/liblock.a ../innobase/mtr/libmtr.a 
 ../innobase/page/
 libpage.a ../innobase/rem/librem.a ../innobase/thr/libthr.a 
 ../innobase/sync/lib
 sync.a ../innobase/data/libdata.a ../innobase/mach/libmach.a 
 ../innobase/ha/libh
 a.a ../innobase/dyn/libdyn.a ../innobase/mem/libmem.a 
 ../innobase/sync/libsync.a
  ../innobase/ut/libut.a ../innobase/os/libos.a ../innobase/ut/libut.a 
 ../isam/li
 bnisam.a ../merge/libmerge.a ../myisam/libmyisam.a 
 ../myisammrg/libmyisammrg.a .
 ./heap/libheap.a ../vio/libvio.a ../mysys/libmysys.a ../dbug/libdbug.a 
 ../regex/
 libregex.a ../strings/libmystrings.a -lwrap -L/usr/local/lib -llthread -
 llgcc_r -lz -lcrypt -lm -llthread -llgcc_r
 *** Error code 1
 
 I will appreciate a compiled “work” directory.

Hm, odd that it didn't actually print an error message. Are you omitting
something?

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: tcpwrappers problem

2005-04-25 Thread Doug White
On Mon, 18 Apr 2005, Didier Wiroth wrote:

 Hi,
 (using freebsd5.4-stable)

 I'm trying to display a ftpd banner with hosts.allow, but it doesn't work.

 I'm using ftpd (/usr/libexec/ftpd) started through inetd.
 Ined is started with standard flags:
 /usr/sbin/inetd -wW -C 60

 In hosts.allow I have:
 ALL : ALL : allow
 ALL : ALL : banners /usr/local/etc/banners/
 ALL : PARANOID : RFC931 20 : deny

I wouldn't use tcpwrapper banners if you're doing more than just bitching
out the user before kicking them off. Some FTP clients will just ignore
your banner since it won't conform to the FTP protocol specification.

If you want it to display when the user logs into your FTP server, put it
in /etc/ftpmotd. If you want your telnet client to see it, put it in
/etc/motd.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: MySQL signal 11

2005-04-25 Thread Doug White
On Mon, 18 Apr 2005, Uzi Klein wrote:

 Uzi Klein wrote:
  Uzi Klein wrote:
 
 
  Hi
 
  I got this message from mysql-error log.
  Nothing visible on system log, but after it happens, mysql starts
  acting strange (shoes double fields values as 0 while is still
  contains the data - visible after restarting mysql)
 
  Hardware should be OK, nothing wrong in boot message and its a brand
  new HP DL380-G4.
 
  Any leads or ideas?
 
 

 eh.. something causing only 1 attachment to appear... sorry...



 * mysql-error.log : *

 mysqld got signal 11;
 This could be because you hit a bug. It is also possible that this binary
 or one of the libraries it was linked against is corrupt, improperly built,
 or misconfigured. This error can also be caused by malfunctioning hardware.
 We will try our best to scrape up some info that will hopefully help
 diagnose
 the problem, but since we have already crashed, something is definitely
 wrong
 and this may fail.

This is generally indicative of bad memory or nonfunctional cooling. Check
the system's environmentals.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: FreeBSD 5.4 RC2 on a Compaq Laptop?

2005-04-25 Thread Doug White
On Thu, 21 Apr 2005, Matt Meola wrote:

 On Wed, 2005-04-20 at 19:25 -0700, Doug White wrote:
  See the freebsd-amd64 list archives for an extensive discussion on the
  R3000 series. (or was it -mobile? :-) )

 Yeah, I saw all that, and the gnats bug about it, too.  It appears that
 what needed to change was changed, and committed to (at least) -CURRENT.

  You will probably need to run at least -STABLE. These machines are wierd
  birds.

 :-)  You're telling me...  BTW, I was under the impression that 5.4-RC*
 was actually released from the -STABLE branch...  Am I mistaken on that?

That is correct. But I was trying to say I don't think 5.3-R will work. :)

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: installkernel succeeds, but old kernel boots

2005-04-25 Thread Doug White
On Thu, 21 Apr 2005, Paco Hope wrote:

 I cvsupped the latest RELENG_5 sources and did make buildworld.  Then I
 built and installed the kernel using

 make -DALWAYS_CHECK_MAKE buildkernel KERNCONF=PACO
 make -DALWAYS_CHECK_MAKE installkernel KERNCONF=PACO

 Then I rebooted.  So far so good.  This is all normal. However, after I
 reboot and run 'uname -a' I see:

 FreeBSD www.provisio.net 5.3-RELEASE-p5 FreeBSD 5.3-RELEASE-p5 #0: Sat Mar
 19 01:28:39 EST 2005 [EMAIL PROTECTED]:/usr/obj/data/src/sys/PACO
 i386

 sysctl agrees with uname:

 kern.ostype: FreeBSD
 kern.osrelease: 5.3-RELEASE-p5
 kern.osrevision: 199506
 kern.version: FreeBSD 5.3-RELEASE-p5 #0: Sat Mar 19 01:28:39 EST 2005
 [EMAIL PROTECTED]:/usr/obj/data/src/sys/PACO

 That's my old kernel that I was trying to replace.  I run 'sysctl
 kern.bootfile' and it says '/boot/kernel/kernel' so I'm a little confused.
 By all indications, the file that's currently in /boot/kernel/kernel is
 5.4-STABLE. I can run 'strings' on /boot/kernel/kernel and see this near
 the bottom:

 FreeBSD 5.4-STABLE #0: Thu Apr 21 00:04:21 EDT 2005
 [EMAIL PROTECTED]:/usr/obj/data/src/sys/PACO
 FreeBSD
 5.4-STABLE
 PACO

 It sure looks like I booted my old kernel somehow, but I can't figure out
 how.  It seems unlikely that I somehow installed from a stale build tree
 because I newfs'd /usr/obj before building anything.

 Thoughts?

loader.conf?

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 5.4-RC3 Kernel Panic: vinvalbuf: dirty bufs (backtrace included)

2005-04-25 Thread Doug White
 was not properly dismounted
 WARNING: /usr was not properly dismounted
 WARNING: /var was not properly dismounted
 WARNING: /big was not properly dismounted
 em0: Link is up 1000 Mbps Full Duplex
 drm0: ATI Radeon If R250 9000 port 0xc000-0xc0ff mem
 0xfe8f-0xfe8f,0xe000-0xe3ff irq 16 at device 0.0 on pci1
 info: [drm] AGP at 0xf800 64MB
 info: [drm] Initialized radeon 1.11.0 20020828 on minor 0
 info: [drm] Loading R200 Microcode
 ___
 freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to [EMAIL PROTECTED]


-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: FreeBSD 5.4 RC2 on a Compaq Laptop?

2005-04-20 Thread Doug White
On Thu, 14 Apr 2005, Matt Meola wrote:

 Has anyone had any success getting 5.4RC2 installed on a Compaq Presario
 laptop?  According to my BIOS, I have a Presario R3200 (Athlon XP
 processor).  I get the splash screen, it begins the boot process
 and then it just shuts off the computer.

See the freebsd-amd64 list archives for an extensive discussion on the
R3000 series. (or was it -mobile? :-) )

You will probably need to run at least -STABLE. These machines are wierd
birds.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Deadlock in 5.3p5

2005-04-20 Thread Doug White
On Sat, 16 Apr 2005, Peter Jeremy wrote:

 My son's computer deadlocked last night.  show lockedvnods in DDB showed:

 Locked vnodes
 0xc1669840: tag ufs, type VDIR, usecount 8, writecount 0, refcount 2, flags 
 (VV_ROOT|VV_OBJBUF), lock type ufs: EXCL (count 1) by thread 0xc18ed000 (pid 
 9666) with 7 pending
   ino 2, on dev ad0s1g (4, 25)
 0xc1682000: tag ufs, type VDIR, usecount 5, writecount 0, refcount 2, flags 
 (VV_OBJBUF), lock type ufs: EXCL (count 1) by thread 0xc16fd000 (pid 15686) 
 with 1 pending
   ino 23552, on dev ad0s1g (4, 25)
 0xc1b30d68: tag ufs, type VDIR, usecount 2, writecount 0, refcount 1, flags   
  (VV_OBJBUF), lock type ufs: EXCL (count 1) by thread 0xc268f000 (pid 9075) 
 with 1 pending
   ino 122986, on dev ad0s1g (4, 25)
 0xc1c3c210: tag ufs, type VDIR, usecount 3, writecount 0, refcount 1, flags   
  (VV_OBJBUF), lock type ufs: EXCL (count 1) by thread 0xc16fe4b0 (pid 9067) 
 with 1 pending
   ino 142022, on dev ad0s1g (4, 25)
 0xc1e2ce70: tag ufs, type VREG, usecount 6, writecount 0, refcount 0, flags 
 (VV_OBJBUF), lock type ufs: SHARED (count 1) with 1 pending
   ino 142169, on dev ad0s1g (4, 25)

 After poking around in the crashdump for a while, I've worked out that
 the process holding each of the above exclusive locks is waiting on
 the next lock in the list.  Unfortunately, there doesn't appear to be
 any way to work out which process is holding the shared lock unless
 DEBUG_LOCKS is set (and even this doesn't work if the lock was implicitly
 downgraded by a process calling lockmgr(LK_SHARED) when it holds an
 exclusive lock).

 FWIW, the affected inodes are:
  2  /usr
  23552  /usr/local
 122986  /usr/local/OpenOffice.org1.1.4
 142022  /usr/local/OpenOffice.org1.1.4/program
 142169  /usr/local/OpenOffice.org1.1.4/program/libpsp645fi.so

 Does anyone have any ideas on how to track this further (or so I just
 write it off as a glitch).

This is the old race to root that happens at the end of the death
spiral. You need to know why the guy holding tte library lock isn't
letting go of it.


-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: reliable panic: isa_dmastart: bad bounce buffer

2005-04-15 Thread Doug White
On Tue, 12 Apr 2005, Mikhail Teterin wrote:

 Hello!

 Whenever I try to mount a floppy disk:

mount -t msdosfs /dev/fd0 /mnt

 I get:

   panic: isa_dmastart: bad bounce buffer

 The OS is FreeBSD-5.4-STABLE from last night, amd64. dmesg attached.

Probably related to:

fdc0: floppy drive controller (FDE) port 0x3f7,0x3f0-0x3f5 irq 6 drq 2
on acpi0
isa_dmainit(2, 40960) failed
fd0: 1440-KB 3.5 drive on fdc0 drive 0

That message is printed by isa_dmainit() in src/sys/amd64/isa/isa_dma.c.
You might instrument that function and see where its blowing up.  I
suspect its getting the buffer from malloc() but the range check below it
fails.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: fxp lockups on 5.4-RC1

2005-04-15 Thread Doug White
0 0 0 204 [IWAIT] irq6:
 16 c15783880 0 0 204 [IWAIT] irq5:
 15 c157854c0 0 0 204 [IWAIT] irq4: sio0
 14 c15787100 0 0 204 [IWAIT] irq3: sio1
 13 c15788d40 0 0 204 [IWAIT] irq1: atkbd0
 12 c1578a980 0 0 204 [IWAIT] irq0: clk
 11 c1578c5c0 0 0 20c [Can run] idle
  1 c1578e200 0 1 0004200 [SLPQ wait 0xc1578e20][SLP] init
 10 c1580 0 0 204 [SLPQ ktrace 0xc0609d18][SLP] ktrace
  0 c06064c00 0 0 200 [SLPQ sched 0xc06064c0][SLP] swapper
 db tr 375
 Tracing pid 375 tid 100038 td 0xc15b8c00
 sched_switch(c15b8c00,0,1) at sched_switch+0x143
 mi_switch(1,0,c15b8c00,0,c15b8c00) at mi_switch+0x1ba
 sleepq_switch(c0c1f460) at sleepq_switch+0xda
 sleepq_wait(c0c1f460,0,c0c45240,0,0) at sleepq_wait+0xb
 msleep(c0c1f460,c0c1f468,44,c05d48df,0) at msleep+0x2ce
 uma_zone_slab(c0c45b40,2,2,80,314c) at uma_zone_slab+0xd2
 uma_zalloc_bucket(c0c45b40,2) at uma_zalloc_bucket+0x14c
 uma_zalloc_arg(c0c45b40,c41c1c00,2) at uma_zalloc_arg+0x274
 mb_init_pack(c41c1c00,100,2) at mb_init_pack+0x1d
 uma_zalloc_bucket(c0c45ba0,3) at uma_zalloc_bucket+0x1b4
 uma_zalloc_arg(c0c45ba0,d5229c0c,2) at uma_zalloc_arg+0x274
 sosend(c1a05000,0,d5229c54,0,0) at sosend+0x33d
 kern_sendit(c15b8c00,3fd,d5229ccc,0,0) at kern_sendit+0x11c
 sendit(c15b8c00,3fd,d5229ccc,0,805a000) at sendit+0x145
 sendto(c15b8c00,d5229d14,6,60,202) at sendto+0x4d
 syscall(2f,805002f,bfbf002f,1a4,80521b0) at syscall+0x27b
 Xint0x80_syscall() at Xint0x80_syscall+0x1f
 --- syscall (133, FreeBSD ELF32, sendto), eip = 0x280c2b9b, esp = 0xbfbfeb4c, 
 ebp = 0xbfbfeb78 ---
 ___
 freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to [EMAIL PROTECTED]


-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Problems with world build in RELENG_5_4

2005-04-14 Thread Doug White
On Mon, 11 Apr 2005, Jose M Rodriguez wrote:

 Hi,

 I found a little problem with RELENG_5_4 buildworld in my env.

 I have a little patched RELENG_5_4 src in a local cvs server, mounted
 ro,-L in the build machine (/usr/src)

 No rpc.lockd or rpc.statd daemon running in both machines.  build
 machine timesync with cvs server by ntpdate before build.

 Every time I do a rm -rf /usr/obj/*  cd /usr/src  make buildworld I
 get:

 === share/info
 === include
 rm -f osreldate.h version vers.c
 rm: version: Read-only file system
 *** Error code 1

 The only thing I change form previous week builds are the -L mount and
 disabling the rpc.lockd and rpc.statd daemons in both machines.

 I can solve the problem doing a make obj before make buildworld.

Sounds like these files may be leftovers from a local build on the NFS
server.  You might try doing 'make cleandir; make cleandir' on the server
to remove any remnants.

Areyou setting MAKEOBJDIRPREFIX in /etc/make.conf?

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Alternate menu logos

2005-04-14 Thread Doug White
(Changing the subject since its not reflective of the tone of the
message. Apologies to the poster, but since this is a touchy subject I'm
going to make this a more technical discussion.)

On Mon, 11 Apr 2005, Hanspeter Roth wrote:

 - or have an convenient flag in loaders.conf that allows to
   run Beastie menu but display some other decoration?

I I have an idea and the patch in PR 74577 is about halfway there. I
suggest providing a script that forth-ifies a provided ASCII logo, and a
loader option to load a banner file from disk.  This way, if, say, an OEM
wanted to put contact information in there, they could put in loader.conf:

banner_enable=YES
banner_file=/boot/oem.banner

and have that displayed instead of the beastie.

The forthifier script would turn the file into a forth function definition
and then it can be included with standard routines in the loader.  Then
your banner function would just call it.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: kernel killing processes when out of swap

2005-04-14 Thread Doug White
On Wed, 13 Apr 2005, Jim C. Nasby wrote:

 It's extremely disappointing that you can't turn this off. I've been
 bashing linux for months now about how they think it's OK to kill random
 processes. But at least they'll let you turn it off.

It's extremely disappointing that you haven't submitted patches yet,
particularly when you have so many testers lined up. :-)

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Hp dc7100 installs 5.4-rc1 from CD but won't boot from HD

2005-04-14 Thread Doug White
On Mon, 11 Apr 2005, John Hawkes-Reed wrote:

 (Which I think covers the problem)

 Boots from CD ok, USB keyboard seems less than reliable, so I'm using a PS2
 item. Running through 'standard' install appears to write data to the (SATA
 ICH6 controller) disk, but on reboot it sits at the F1: FreeBSD prompt
 beeping every ten seconds.

 Is there likely to be anything obvious I've missed? (Or indeed more useful
 data I can provide.)

Your system appears to require packet mode, but sysinstall didn't enable
it.  Two possible fixes:

1. If you have disc 2: Boot the install CD, go to Fixit, start up fixit
off CD, then run

boot0cfg -o packet adX

where adX is the appropriate disk device.

2. Reinstall, but use the standard MBR rather than the boot manager. Once
you get the system booted you can install the boot manager with:

boot0cfg -B -o packet adX

where adX is the appropriate disk device.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: Intel 6300ESB (ICH) SMBus controller not working

2005-04-14 Thread Doug White
On Tue, 12 Apr 2005, Philip Murray wrote:

 Hi,

 I'm running RELENG_5 on a Dell PowerEdge 750 and the ichsmb driver
 doesn't want to work with it, I get the following on boot:

 ichsmb0: Intel 6300ESB (ICH) SMBus controller port 0x8c0-0x8df irq 17
 at device 31.3 on pci0
 device_attach: ichsmb0 attach returned 6

 It then doesn't load smb or smbus. I had a look in the source and it is
 supposed to work with this controller.

 Is this something wrong with the driver? or have I left out some bit of
 configuration?

 Attached is the output from boot -v and my kernel configuration. Is
 there any other debugging output that would be useful?

According to the boot -v messages the I/O range map is getting attached to
the wrong function on that chip:

found- vendor=0x8086, dev=0x25a3, revid=0x02
bus=0, slot=31, func=2
class=01-01-8a, hdrtype=0x00, mfdev=0
cmdreg=0x0005, statreg=0x02a8, cachelnsz=0 (dwords)
lattimer=0x00 (0 ns), mingnt=0x00 (0 ns), maxlat=0x00 (0 ns)
intpin=a, irq=255
map[20]: type 4, range 32, base 08c0, size  5, enabled

but it should be on this:

ichsmb0: Intel 6300ESB (ICH) SMBus controller port 0x8c0-0x8df irq 17 at
device 31.3 on pci0

I'll poke at this a bit, but you should check for a BIOS update.

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: 4.11R panics

2005-04-12 Thread Doug White
On Sun, 10 Apr 2005, Kirill Ponomarew wrote:

 On Fri, Apr 08, 2005 at 10:14:17AM -0700, Doug White wrote:
   Fatal trap 12: page fault while in kernel mode
   fault virtual address = 0x20202020
 
  Hm, something ran into a bunch of ASCII spaces..
 
  Can you jump to frame #6 and print *kbp?  It appears the kernel malloc
  bucket list is corrupted, so I'm curious just how badly that struct is
  spammed.

 #0  dumpsys () at /usr/src/sys/kern/kern_shutdown.c:487
 487   if (dumping++) {
 (kgdb) up 6
 #6  0xc0193533 in malloc (size=324, type=0xc030d780, flags=9)
 at /usr/src/sys/kern/kern_malloc.c:243
 243   va = kbp-kb_next;
 (kgdb) print *kbp
 $1 = {kb_next = 0x20202020 Address 0x20202020 out of bounds,
   kb_last = 0xcc8fa000 , kb_calls = 5704, kb_total = 448, kb_elmpercl = 8,
   kb_totalfree = 13, kb_highwat = 40, kb_couldfree = 0}

Not very, apparently.

Dunno what to say ... I'd guess something coughed up a bad address to a
DMA op or something and it just happened to land there.  If you can
reproduce this with any sort of regularity there might be a bug, although
that seems unlikely since that code hasn't changed in years and this is
the first such report I've seen of this :)

-- 
Doug White|  FreeBSD: The Power to Serve
[EMAIL PROTECTED]  |  www.FreeBSD.org
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


  1   2   3   4   >