Problem detecting Sil3124 SATA controllers off of Sandy Bridge northbridge-connected PCIe slots

2012-08-02 Thread Steve Polyack

Hi,

We're having some trouble with detection of a couple of Sil3124 SATA 
controller cards on newer motherboard and processor combos. 
Specifically, we're running a Supermicro X9SCM-F motherboard (latest 
BIOS) and Intel E3-1220v2 CPU.


What we're seeing:
- Syba Sil3124 PCIe cards are only being detected when installed in PCIe 
Slot 4
-- The motherboard documentation shows that this is the only slot 
connected to the Intel C202/204 chipset on the motherboard
-- Slots 5, 6, and 7 are connected to the integrated northbridge on the 
Ivy Bridge CPU

(there is no slot 1, 2, or 3)

FreeBSD won't detect even a single Sil3124 card installed in PCIe slot 
5, 6, or 7.  If we put an Intel Dual-port NIC in either of one of these 
slots, it is detected just fine.


I've attached a verbose dmesg.boot from this box running FreeBSD 
9.0-RELEASE.  We've also tried 8.1-RELEASE, 8.2-RELEASE, and 9.1-BETA1 
with the same results.  Booting with ACPI disabled results in a kernel 
panic during the boot process.


I'd greatly appreciate any help or suggestions on this matter. We've 
already tried just about every BIOS option on the board.


Steve Polyack
Table 'FACP' at 0xddfb5290
Table 'APIC' at 0xddfb5388
APIC: Found table at 0xddfb5388
APIC: Using the MADT enumerator.
MADT: Found CPU APIC ID 0 ACPI ID 1: enabled
SMP: Added CPU 0 (AP)
MADT: Found CPU APIC ID 2 ACPI ID 2: enabled
SMP: Added CPU 2 (AP)
MADT: Found CPU APIC ID 4 ACPI ID 3: enabled
SMP: Added CPU 4 (AP)
MADT: Found CPU APIC ID 6 ACPI ID 4: enabled
SMP: Added CPU 6 (AP)
Copyright (c) 1992-2012 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 9.0-RELEASE #0: Tue Jan  3 07:46:30 UTC 2012
r...@farrell.cse.buffalo.edu:/usr/obj/usr/src/sys/GENERIC amd64
Table 'FACP' at 0xddfb5290
Table 'APIC' at 0xddfb5388
Table 'FPDT' at 0xddfb5400
Table 'MCFG' at 0xddfb5448
Table 'HPET' at 0xddfb5488
Table 'SSDT' at 0xddfb54c0
Table 'SPMI' at 0xddfb5830
Table 'SSDT' at 0xddfb5870
Table 'SPCR' at 0xddfb62f8
Table 'EINJ' at 0xddfb6348
Table 'ERST' at 0xddfb6478
Table 'HEST' at 0xddfb6688
Table 'BERT' at 0xddfb6730
Table 'BGRT' at 0xddfb6760
ACPI: No SRAT table found
Preloaded elf kernel /boot/kernel/kernel at 0x813cf000.
Calibrating TSC clock ... TSC clock: 3100091416 Hz
CPU: Intel(R) Xeon(R) CPU E3-1220 V2 @ 3.10GHz (3100.09-MHz K8-class CPU)
  Origin = GenuineIntel  Id = 0x306a9  Family = 6  Model = 3a  Stepping = 9
  
Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  
Features2=0x77bae3ffSSE3,PCLMULQDQ,DTES64,MON,DS_CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,PCID,SSE4.1,SSE4.2,x2APIC,POPCNT,TSCDLT,AESNI,XSAVE,AVX,F16C,b30
  AMD Features=0x28100800SYSCALL,NX,RDTSCP,LM
  AMD Features2=0x1LAHF
  TSC: P-state invariant, performance statistics
real memory  = 17179869184 (16384 MB)
Physical memory chunk(s):
0x1000 - 0x00094fff, 606208 bytes (148 pages)
0x0010 - 0x001f, 1048576 bytes (256 pages)
0x0140b000 - 0xdddecfff, 3701350400 bytes (903650 pages)
0xdf056000 - 0xdf056fff, 4096 bytes (1 pages)
0xdf09a000 - 0xdf7f, 7757824 bytes (1894 pages)
0x0001 - 0x000400fa7fff, 12901318656 bytes (3149736 pages)
avail memory = 16506712064 (15742 MB)
Event timer LAPIC quality 600
ACPI APIC Table: SUPERM SMCI--MB
INTR: Adding local APIC 2 as a target
INTR: Adding local APIC 4 as a target
INTR: Adding local APIC 6 as a target
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
FreeBSD/SMP: 1 package(s) x 4 core(s)
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  2
 cpu2 (AP): APIC ID:  4
 cpu3 (AP): APIC ID:  6
x86bios:  IVT 0x00-0x0004ff at 0xfe00
x86bios: SSEG 0x001000-0x001fff at 0xff8000246000
x86bios: EBDA 0x098000-0x09 at 0xfe098000
x86bios:  ROM 0x0a-0x0fefff at 0xfe0a
APIC: CPU 0 has ACPI ID 1
APIC: CPU 1 has ACPI ID 2
APIC: CPU 2 has ACPI ID 3
APIC: CPU 3 has ACPI ID 4
lapic0: CMCI unmasked
ULE: setup cpu 0
ULE: setup cpu 1
ULE: setup cpu 2
ULE: setup cpu 3
ACPI: RSDP 0xf0490 00024 (v02 SUPERM)
ACPI: XSDT 0xddfaa088 00094 (v01 SUPERM SMCI--MB 0001 AMI  00010013)
ACPI: FACP 0xddfb5290 000F4 (v04 SUPERM SMCI--MB 0001 AMI  00010013)
ACPI: DSDT 0xddfaa1b8 0B0D7 (v02 SUPERM SMCI--MB  INTL 20051117)
ACPI: FACS 0xddfb8f80 00040
ACPI: APIC 0xddfb5388 00072 (v03 SUPERM SMCI--MB 0001 AMI  00010013)
ACPI: FPDT 0xddfb5400 00044 (v01 SUPERM SMCI--MB 0001 AMI  00010013)
ACPI: MCFG 0xddfb5448 0003C (v01 SUPERM SMCI--MB 0001 MSFT 0097)
ACPI: HPET 0xddfb5488 00038 (v01 SUPERM SMCI--MB 0001 AMI. 0005)
ACPI: SSDT 0xddfb54c0 0036D (v01 SataRe SataTabl 1000 INTL 20091112)
ACPI: SPMI 0xddfb5830 00040 (v05 A M I   OEMSPMI 

Re: Problem detecting Sil3124 SATA controllers off of Sandy Bridge northbridge-connected PCIe slots

2012-08-02 Thread Steve Polyack

On 08/02/2012 01:58 PM, John Baldwin wrote:

On Thursday, August 02, 2012 10:21:20 am Steve Polyack wrote:

Hi,

We're having some trouble with detection of a couple of Sil3124 SATA
controller cards on newer motherboard and processor combos.
Specifically, we're running a Supermicro X9SCM-F motherboard (latest
BIOS) and Intel E3-1220v2 CPU.

What we're seeing:
- Syba Sil3124 PCIe cards are only being detected when installed in PCIe
Slot 4
-- The motherboard documentation shows that this is the only slot
connected to the Intel C202/204 chipset on the motherboard
-- Slots 5, 6, and 7 are connected to the integrated northbridge on the
Ivy Bridge CPU
(there is no slot 1, 2, or 3)

FreeBSD won't detect even a single Sil3124 card installed in PCIe slot
5, 6, or 7.  If we put an Intel Dual-port NIC in either of one of these
slots, it is detected just fine.

I've attached a verbose dmesg.boot from this box running FreeBSD
9.0-RELEASE.  We've also tried 8.1-RELEASE, 8.2-RELEASE, and 9.1-BETA1
with the same results.  Booting with ACPI disabled results in a kernel
panic during the boot process.

I'd greatly appreciate any help or suggestions on this matter. We've
already tried just about every BIOS option on the board.

Does the device show up in pciconf -l output?

The device itself does NOT show up in pciconf -l output.  However, I ran 
pciconf on two different boots, once with and once without the card 
installed in one of the non-probed slots.  I noticed that the following 
Ivy Bridge PCI Express Root Port showed up when the card was installed:
+pcib2@pci0:0:1:1:class=0x060400 card=0x062415d9 chip=0x01558086 
rev=0x09 hdr=0x01

+vendor = 'Intel Corporation'
+device = 'Ivy Bridge PCI Express Root Port'
+class  = bridge
+subclass   = PCI-PCI
+cap 0d[88] = PCI Bridge card=0x062415d9
+cap 01[80] = powerspec 3  supports D0 D3  current D0
+cap 05[90] = MSI supports 1 message
+cap 10[a0] = PCI-Express 2 root port max data 128(128) link x1(x8)
+ecap 0002[100] = VC 1 max VC0
+ecap 0005[140] = unknown 1
+ecap 0019[d94] = unknown 1

The previous Ivy Bridge Root Port still shows up, and shows the 
following change:

-cap 10[a0] = PCI-Express 2 root port max data 256(256) link x1(x8)
+cap 10[a0] = PCI-Express 2 root port max data 256(256) link x0(x8)

I also meant to note in my initial email that the Option ROM (RAID 
setup, drive probing, etc.) shows up for the Sil3124 SATA card in any 
slot.  Even when the OS is unable to discover the card.


Thanks for the response,
Steve

hostb0@pci0:0:0:0:  class=0x06 card=0x062415d9 chip=0x01588086 rev=0x09 
hdr=0x00
vendor = 'Intel Corporation'
device = 'Ivy Bridge DRAM Controller'
class  = bridge
subclass   = HOST-PCI
cap 09[e0] = vendor (length 12) Intel cap 0 version 1
pcib1@pci0:0:1:0:   class=0x060400 card=0x062415d9 chip=0x01518086 rev=0x09 
hdr=0x01
vendor = 'Intel Corporation'
device = 'Ivy Bridge PCI Express Root Port'
class  = bridge
subclass   = PCI-PCI
cap 0d[88] = PCI Bridge card=0x062415d9
cap 01[80] = powerspec 3  supports D0 D3  current D0
cap 05[90] = MSI supports 1 message 
cap 10[a0] = PCI-Express 2 root port max data 256(256) link x0(x8)
ecap 0002[100] = VC 1 max VC0
ecap 0005[140] = unknown 1
ecap 0019[d94] = unknown 1
pcib2@pci0:0:1:1:   class=0x060400 card=0x062415d9 chip=0x01558086 rev=0x09 
hdr=0x01
vendor = 'Intel Corporation'
device = 'Ivy Bridge PCI Express Root Port'
class  = bridge
subclass   = PCI-PCI
cap 0d[88] = PCI Bridge card=0x062415d9
cap 01[80] = powerspec 3  supports D0 D3  current D0
cap 05[90] = MSI supports 1 message 
cap 10[a0] = PCI-Express 2 root port max data 128(128) link x1(x8)
ecap 0002[100] = VC 1 max VC0
ecap 0005[140] = unknown 1
ecap 0019[d94] = unknown 1
em0@pci0:0:25:0:class=0x02 card=0x150215d9 chip=0x15028086 rev=0x05 
hdr=0x00
vendor = 'Intel Corporation'
device = '82579LM Gigabit Network Connection'
class  = network
subclass   = ethernet
bar   [10] = type Memory, range 32, base 0xf7b0, size 131072, enabled
bar   [14] = type Memory, range 32, base 0xf7b25000, size 4096, enabled
bar   [18] = type I/O Port, range 32, base 0xf020, size 32, enabled
cap 01[c8] = powerspec 2  supports D0 D3  current D0
cap 05[d0] = MSI supports 1 message, 64 bit enabled with 1 message
cap 13[e0] = PCI Advanced Features: FLR TP
ehci0@pci0:0:26:0:  class=0x0c0320 card=0x062415d9 chip=0x1c2d8086 rev=0x05 
hdr=0x00
vendor = 'Intel Corporation'
device = '6 Series/C200 Series Chipset Family USB Enhanced Host 
Controller'
class  = serial bus
subclass   = USB
bar   [10] = type Memory, range 32, base 0xf7b24000, size 1024, enabled
cap 01[50] = powerspec 2  supports D0 D3  current D0
cap 0a[58] = EHCI Debug Port at offset 0xa0 in map 0x14
cap 13

Re: Usling vlan(4) without an actual lan behind it

2011-09-21 Thread Steve Polyack

On 09/21/2011 01:34 PM, Mike Andrews wrote:

On Mon, 19 Sep 2011, Pete French wrote:

Does it specifically have to be a vlan(4), or can you perhaps add 
another

address to lo(4), or perhaps create a lo1 in addition to the lo0?


It can be anything really - I was looking for a generic interface
I can configure with IP addresses. But adding real addresses to
loopback interfaces can cause problems can it not ?

The issue I am trying to address is that I have a whole bunch of IPv6
addresse on a /64, which are oly used as endpoints for a set of
websites - they don't exist on a real ethernet anywhere, and don't
need to. I just want them on an interface on a machine wwhen I can run
up a load balancer to listeon on those addresses and forward them to
the approrpiate actual machines serving the requests.


Sounds like DSR-type load balancing (or in Linux LVS land, DR mode), 
where the load balancer just rewrites the target MAC address in the 
header instead of doing full-blown NAT or proxying.  Putting the IP's 
on lo0 is the way to go here.  We've been doing that for many, many 
years (well, months for v6, years for v4) and it works great.  With 
the IP's on lo0, the load balancers are the only thing that can ARP 
(or NDP) for those addresses... which is what you'd want.


I'll second that this works just fine (at least on the IPv4 side).  
We've always used lo1 to separate things, but that doesn't functionally 
change anything.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Networking - CARP interfaces

2011-06-14 Thread Steve Polyack

On 06/14/2011 01:00 PM, Damien Fleuriot wrote:


I can confirm that this scenario causes problems, see below:

### ON FIREWALL 1 , carp master for carp0, carp1, carp2
carp2: flags=49UP,LOOPBACK,RUNNING  metric 0 mtu 1500
inet 192.168.224.254 netmask 0xff00
carp: MASTER vhid 224 advbase 1 advskew 50


### ON FIREWALL 2 , carp backup for carp0, carp1, carp2
carp2: flags=49UP,LOOPBACK,RUNNING  metric 0 mtu 1500
inet 192.168.234.254 netmask 0xff00
carp: BACKUP vhid 234 advbase 1 advskew 100


Now, I add a dummy IP to carp2 on FIREWALL 2, which is supposedly backup:

ifconfig carp2 inet 192.168.234.207 alias

Result:

### ON FIREWALL 1, carp master for carp0, carp1, carp2
carp2: flags=49UP,LOOPBACK,RUNNING  metric 0 mtu 1500
inet 192.168.224.254 netmask 0xff00
carp: MASTER vhid 224 advbase 1 advskew 50

### ON FIREWALL 2, carp backup for carp0, carp1, but no longer carp2
carp2: flags=49UP,LOOPBACK,RUNNING  metric 0 mtu 1500
inet 192.168.234.254 netmask 0xff00
inet 192.168.234.207 netmask 0xff00
carp: MASTER vhid 234 advbase 1 advskew 100


After I remove the extraneous IP, the interface becomes backup again:


# This was a long time ago
carp0: MASTER -  BACKUP (more frequent advertisement received)
carp0: link state changed to DOWN
carp2: MASTER -  BACKUP (more frequent advertisement received)
carp2: link state changed to DOWN
carp1: MASTER -  BACKUP (more frequent advertisement received)
carp1: link state changed to DOWN
carp2: link state changed to DOWN
# This was when I ran my tests
carp2: INIT -  MASTER (preempting)
carp2: link state changed to UP
carp2: MASTER -  BACKUP (more frequent advertisement received)
carp2: link state changed to DOWN


Did you give this enough time to reasonably settle?  Sometimes when the 
interfaces initially come up, they will become MASTER for a bit before 
backing down.



This entails that hosts in a given carp vhid must have the exact same IP
addresses configured on that interface.

While this is perfectly understandable in a master-backup scenario, this
is a bit more annoying for us in a master-backup + backup-backup
scenario with 2 datacenters.

I'll just have to adapt and ensure they have the same IP addresses then.


I have a suspicion that the important part may be the number of IP 
addresses on the CARP interface.  If CARP sends an advertisement from 
each IP alias on a CARP interface, then I think that would explain what 
you are seeing - and also possibly give you a workaround by adding two 
more bogus IPs on your primary datacenter firewalls (where IPs W and Z 
are normally missing).


- Steve

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Networking - CARP interfaces

2011-06-14 Thread Steve Polyack

On 06/14/2011 04:51 PM, Damien Fleuriot wrote:


On 14 Jun 2011, at 19:33, Steve Polyackkor...@comcast.net  wrote:


On 06/14/2011 01:00 PM, Damien Fleuriot wrote:

I can confirm that this scenario causes problems, see below:

### ON FIREWALL 1 , carp master for carp0, carp1, carp2
carp2: flags=49UP,LOOPBACK,RUNNING   metric 0 mtu 1500
inet 192.168.224.254 netmask 0xff00
carp: MASTER vhid 224 advbase 1 advskew 50


### ON FIREWALL 2 , carp backup for carp0, carp1, carp2
carp2: flags=49UP,LOOPBACK,RUNNING   metric 0 mtu 1500
inet 192.168.234.254 netmask 0xff00
carp: BACKUP vhid 234 advbase 1 advskew 100


Now, I add a dummy IP to carp2 on FIREWALL 2, which is supposedly backup:

ifconfig carp2 inet 192.168.234.207 alias

Result:

### ON FIREWALL 1, carp master for carp0, carp1, carp2
carp2: flags=49UP,LOOPBACK,RUNNING   metric 0 mtu 1500
inet 192.168.224.254 netmask 0xff00
carp: MASTER vhid 224 advbase 1 advskew 50

### ON FIREWALL 2, carp backup for carp0, carp1, but no longer carp2
carp2: flags=49UP,LOOPBACK,RUNNING   metric 0 mtu 1500
inet 192.168.234.254 netmask 0xff00
inet 192.168.234.207 netmask 0xff00
carp: MASTER vhid 234 advbase 1 advskew 100


After I remove the extraneous IP, the interface becomes backup again:


# This was a long time ago
carp0: MASTER -   BACKUP (more frequent advertisement received)
carp0: link state changed to DOWN
carp2: MASTER -   BACKUP (more frequent advertisement received)
carp2: link state changed to DOWN
carp1: MASTER -   BACKUP (more frequent advertisement received)
carp1: link state changed to DOWN
carp2: link state changed to DOWN
# This was when I ran my tests
carp2: INIT -   MASTER (preempting)
carp2: link state changed to UP
carp2: MASTER -   BACKUP (more frequent advertisement received)
carp2: link state changed to DOWN

Did you give this enough time to reasonably settle?  Sometimes when the 
interfaces initially come up, they will become MASTER for a bit before backing 
down.


I think I did but I can do try again tomorrow evening just to make sure.

Oh god, if only dmesg entries were timestamped...



This entails that hosts in a given carp vhid must have the exact same IP
addresses configured on that interface.

While this is perfectly understandable in a master-backup scenario, this
is a bit more annoying for us in a master-backup + backup-backup
scenario with 2 datacenters.

I'll just have to adapt and ensure they have the same IP addresses then.

I have a suspicion that the important part may be the number of IP addresses on 
the CARP interface.  If CARP sends an advertisement from each IP alias on a 
CARP interface, then I think that would explain what you are seeing - and also 
possibly give you a workaround by adding two more bogus IPs on your primary 
datacenter firewalls (where IPs W and Z are normally missing).

- Steve


I'll give it a try, although I think in a scenario where the carp interfaces 
have the same number of IPs and these IPs differ, both interfaces will claim 
mastership.

Will post results.


Now that I look at the spec, it looks like both the count and the 
addresses themselves are provided in VRRP packets.  CARP likely does the 
same.  I can't speak for whether these things are considered along with 
the VHID and password, but it's worth a shot.  I think you are correct, 
though.


http://www.networksorcery.com/enp/protocol/vrrp.htm

- Steve
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: FBSD 8.2R does not probe sound card

2011-03-17 Thread Steve Polyack

On 03/17/11 10:25, Gua Chung Lim wrote:

Hi all,

I have been using FBSD since mid 2005 without any problems.
Recently, I installed 8.2R from DVD on my new machine.
(HP Pavilion p6276l Home PC)
FBSD 8.2R does not probe my sound card.

# kldload snd_driver
ppc0: parallel port not found.
ppc0: parallel port not found.
ppc0: parallel port not found.
ppc0: parallel port not found.
hdac0:NVidia MCP61 High Definition Audio Controller  mem 
0xfbdf8000-0xfbdfbfff irq 23 at device 5.0 on pci0
hdac0: HDA Driver Revision: 20100226_0142
hdac0: [ITHREAD]
hdac0: HDA Codec #0: Realtek ALC662
hdac1:ATI RV730 High Definition Audio Controller  mem 0xfbeec000-0xfbee 
irq 17 at device 0.1 on pci2
hdac1: HDA Driver Revision: 20100226_0142
hdac1: [ITHREAD]
hdac1: HDA Codec #0: ATI R6xx HDMI
pcm0:HDA Realtek ALC662 PCM #0 Analog  at cad 0 nid 1 on hdac0
pcm1:HDA ATI R6xx HDMI PCM #0 HDMI  at cad 0 nid 1 on hdac1
ppc0: parallel port not found.
ppc0: parallel port not found.

I check sound card properties with Win7 (on another partition/slice).
It says ``High Definition Audio Device'' and nothing else.
So I try again.

# kldload snd_hda
hdac0:NVidia MCP61 High Definition Audio Controller  mem 
0xfbdf8000-0xfbdfbfff irq 23 at device 5.0 on pci0
hdac0: HDA Driver Revision: 20100226_0142
hdac0: [ITHREAD]
hdac0: HDA Codec #0: Realtek ALC662
hdac1:ATI RV730 High Definition Audio Controller  mem 0xfbeec000-0xfbee 
irq 17 at device 0.1 on pci2
hdac1: HDA Driver Revision: 20100226_0142
hdac1: [ITHREAD]
hdac1: HDA Codec #0: ATI R6xx HDMI
pcm0:HDA Realtek ALC662 PCM #0 Analog  at cad 0 nid 1 on hdac0
pcm1:HDA ATI R6xx HDMI PCM #0 HDMI  at cad 0 nid 1 on hdac1

Both cases the speakers are still quiet.
Anyone who has a clue or experieces this problem, please point me out.


It looks like your card is getting probed in both cases, unless you have 
an additional device that is not being listed.  The pcm0 device above 
looks to be from onboard sound on your nvidia motherboard, while the 
pcm1 device is the HDMI audio output on your graphics card.  My only 
suggestion here is to play with the sysctl hw.snd.default_unit:

$ sysctl -d hw.snd.default_unit
hw.snd.default_unit: default sound device

It defaults to 0, try 1-5 and see if there is any improvement, as you 
have multiple sound output devices.


What programs are you using to test the sound output?
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: MAC address / per-proto ARP caching in 8.1-RELEASE

2011-03-16 Thread Steve Polyack

On 03/15/11 14:26, Jeremy Chadwick wrote:

On Tue, Mar 15, 2011 at 09:30:39AM -0400, Steve Polyack wrote:

Is anyone aware of some sort of facility in either FreeBSD
8.1-RELEASE or the em(4) driver which would cause it to cache MAC
addresses / ARP entries for hosts on a per-protocol basis?

[snipping remaining details; readers can read it here instead:]
[http://lists.freebsd.org/pipermail/freebsd-stable/2011-March/061908.html]

The only thing I can think of would be flowtable, but I'm not sure
if it's enabled by default on 8.1-RELEASE-p2.  You can try the following
sysctl to disable it (I would recommend setting this in sysctl.conf and
rebooting; I don't know what happens in the case you set it on a live
system that's already experiencing the MAC issue you describe).

net.inet.flowtable.enable=0

Details:

http://conferences.sigcomm.org/sigcomm/2009/workshops/presto/papers/p37.pdf

I gave this a shot again this morning.  It's definitely related to the 
flowtable:


[spolyack@web01 ~]$ time host web00.lab00 ; sudo sysctl 
net.inet.flowtable.enable=0 ; time host web00.lab00

;; connection timed out; no servers could be reached

real0m10.017s
user0m0.000s
sys0m0.008s

net.inet.flowtable.enable: 1 - 0

web00.lab00 has address 10.0.1.129

real0m0.069s
user0m0.000s
sys0m0.003s

I'm still curious as to why this is only breaking new outgoing UDP 
traffic.  New TCP connections aren't affected in the same way at all.  
There also does not seem to be any relevant changes to flowtable code 
between 8.1-RELEASE and 8.2-RELEASE or 8-STABLE.



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


MAC address / per-proto ARP caching in 8.1-RELEASE

2011-03-15 Thread Steve Polyack
Is anyone aware of some sort of facility in either FreeBSD 8.1-RELEASE 
or the em(4) driver which would cause it to cache MAC addresses / ARP 
entries for hosts on a per-protocol basis?  We've been doing some 
testing with new routers, and almost every time we switch them in or out 
our FreeBSD machines get stuck sending UDP DNS requests to the MAC 
address of the old default gateway in complete disregard for the current 
contents of the ARP table.  New TCP connections do not seem to be affected:


[spolyack@web01 ~]$ uname -a
FreeBSD web01 8.1-RELEASE-p2 FreeBSD 8.1-RELEASE-p2 #3: Mon Dec  6 
08:58:21 EST 2010 root@web01:/usr/obj/usr/src/sys/WEB-AMD64  amd64


Current ARP table after the router replacement, the default gateway is 
10.0.1.254, which it has the correct new MAC address for already:

[spolyack@web01 ~]$ arp -an
? (10.0.1.17) at 00:0c:29:47:74:3a on em2 permanent [ethernet]
? (10.0.1.130) at 00:0c:29:47:74:26 on em0 permanent [ethernet]
? (10.0.1.254) at 00:a0:c9:00:01:01 on em0 expires in 915 seconds [ethernet]
? (10.0.0.17) at 00:0c:29:47:74:30 on em1 permanent [ethernet]
? (10.0.0.15) at 00:0c:29:47:74:30 on em1 permanent [ethernet]
? (10.0.0.11) at 00:0c:29:47:74:30 on em1 permanent [ethernet]
? (10.0.0.2) at 00:1f:a0:10:28:70 on em1 expires in 1162 seconds [ethernet]
? (10.0.0.1) at 00:1f:a0:10:28:70 on em1 expires in 943 seconds [ethernet]

tcpdump shows the DNS requests heading to the *old* router's MAC address 
despite the contents of the ARP table:

[spolyack@web01 ~]$ sudo tcpdump -i em0 -s 256 -vvv -e -n -ttt 'port 53'
tcpdump: listening on em0, link-type EN10MB (Ethernet), capture size 256 
bytes
00:00:00.00 00:0c:29:47:74:26  54:75:d0:a3:7c:8c, ethertype IPv4 
(0x0800), length 86: (tos 0x0, ttl 64, id 55590, offset 0, flags [none], 
proto UDP (17), length 72)
10.0.1.130.52419  10.0.2.80.53: [bad udp cksum fdc5!] 52051+ A? 
db-testing-lab. (44)


[spolyack@web01 ~]$ arp -an | grep 54:75:d0
[spolyack@web01 ~]$

Using telnet to form a TCP connection to port 53 on the same DNS server 
which the above UDP requests are headed to works just fine:

[spolyack@web01 ~]$ telnet 10.0.2.80 53
Trying 10.0.2.80...
Connected to 10.0.2.80.
Escape character is '^]'.
00:03:43.272134 00:0c:29:47:74:26  00:a0:c9:00:01:01, ethertype IPv4 
(0x0800), length 74: (tos 0x10, ttl 64, id 24383, offset 0, flags [DF], 
proto TCP (6), length 60)
10.0.2.130.20130  10.0.2.80.53: Flags [S], cksum 0x0353 (incorrect 
- 0xf74d), seq 2674341615, win 65535, options [mss 1460,nop,wscale 
3,sackOK,TS val 60433610 ecr 0], length

...

Even after the above, standard UDP DNS requests are still headed to the 
old default gateway MAC address.  tcpdumping and looking at ARP 
requests/responses doesn't show any traces of the old MAC address 
either.  The switch which connects the server and router doesn't have 
any entries referencing the old router's MAC address anywhere either.  
I'm assuming this isn't related to the new flowtable features included 
in 8.x, since they appear to function on a 4-tuple of src addr, src 
port, dst addr, and dst port, which isn't going to match for new DNS 
requests.


If anyone has any ideas or suggestions on what additional data I can 
grab, I'd be happy to investigate further, but I've run out of ideas here.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: MAC address / per-proto ARP caching in 8.1-RELEASE

2011-03-15 Thread Steve Polyack

On 03/15/11 14:26, Jeremy Chadwick wrote:

On Tue, Mar 15, 2011 at 09:30:39AM -0400, Steve Polyack wrote:

Is anyone aware of some sort of facility in either FreeBSD
8.1-RELEASE or the em(4) driver which would cause it to cache MAC
addresses / ARP entries for hosts on a per-protocol basis?

[snipping remaining details; readers can read it here instead:]
[http://lists.freebsd.org/pipermail/freebsd-stable/2011-March/061908.html]

The only thing I can think of would be flowtable, but I'm not sure
if it's enabled by default on 8.1-RELEASE-p2.  You can try the following
sysctl to disable it (I would recommend setting this in sysctl.conf and
rebooting; I don't know what happens in the case you set it on a live
system that's already experiencing the MAC issue you describe).

net.inet.flowtable.enable=0

Details:

http://conferences.sigcomm.org/sigcomm/2009/workshops/presto/papers/p37.pdf

It looks like it is enabled by default on 8.1-RELEASE.  Here's the 
net.inet.flowtable tree from the box in question:

[spolyack@web00 ~]$ sysctl net.inet.flowtable
net.inet.flowtable.stats:
table name: ipv4
collisions: 1
allocated: 0
misses: 20013
max_depth: 0
free_checks: 377953
frees: 19993
hits: 69519580
lookups: 69539593

net.inet.flowtable.nmbflows: 50176
net.inet.flowtable.tcp_expire: 86400
net.inet.flowtable.fin_wait_expire: 600
net.inet.flowtable.udp_expire: 300
net.inet.flowtable.syn_expire: 300
net.inet.flowtable.enable: 1
net.inet.flowtable.debug: 0

I'm planning on setting net.inet.flowtable.debug=1 next time we see this 
behavior, but from looking at the code, it looks like we might have to 
rebuild with -DFLOWTABLE_DEBUG to get really useful information.  It's 
too bad that there is not yet a method to dump the contents of the 
flowtable.


Thanks for the suggestion.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Keyboard repeat issues with Dell Optiplex 980s

2011-01-19 Thread Steve Polyack

On 01/19/11 08:48, Steve Polyack wrote:

On 1/18/2011 5:56 PM, Jeremy Chadwick wrote:

On Tue, Jan 18, 2011 at 04:40:13PM -0500, Steve Polyack wrote:

We've recently upgraded a few desktop workstations from Dell
Optiplex 960s to Optiplex 980s.  We were running FreeBSD
8.1-RELEASE.  The migration was performed by simply swapping the
drives into the new systems.  Immediately after switching people
over, they all began to report bizarre keyboard issues - things like
infinite key repeats (letters, numbers, enter) for keys they did
not hold down.  The key repeats continue indefinitely until another
key is pressed.  Occasionally, even mouse input will trigger similar
infinite keyboard input repetition.  In addition to the repeat
issue, sometimes physical key-presses are not registered by FreeBSD,
leading to typos and angry developers.

We've tried doing fresh installs of FreeBSD 8.2-RC2 on two of these
systems, and the issue persists.  Because of the observed behavior,
I'm thinking that this is due to new hardware in the 980s which
isn't timing or handling interrupts correctly under the FreeBSD
kernel.

Looking at a 'pciconf -lvb' from each system, I noticed that the 980
has two USB controllers which probe under ehci(4), while the 960
(which does not exhibit this problem), enumerates six uhci(4)
controllers and two ehci(4) controllers.  To cut to the chase here,
the 960 users' keyboards probe under a USB1.0 uhci(4), while the
980s only have ehci(4) devices to attach to.

So, I guess what I'm asking is - has anyone else seen any keyboard
repeat or other USB craziness with ehci(4) ports or otherwise Intel
PCH controllers?Any fellow Optiplex 980 users?  I'd be more than
happy to provide pciconf or other output if requested.

Try adding the following to /boot/loader.conf then reboot and see if
the excessive repeat behaviour changes:

hint.kbdmux.0.disabled=1

It would also help if you would state exactly what brand/model of
keyboard is used.  Yes, believe it or not, it matters.  dmesg output
would be helpful in this case.

The keyboard is also a Dell model - model KB1421, or listed as Dell 
QuiteKey Keyboard under dmesg.  The same keyboard does not exhibit 
the strange behavior when used with the older model of tower (Optiplex 
960).


 I'll reboot today with the loader.conf hint you provided.  I'll let 
you guys know if it helps.  Thanks!



I forgot to attach my dmesg - here it is!
Copyright (c) 1992-2011 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 8.2-RC2 #1: Mon Jan 17 12:10:53 EST 2011
root@galvatron:/usr/obj/usr/src/sys/GENERIC amd64
Timecounter i8254 frequency 1193182 Hz quality 0
CPU: Intel(R) Core(TM) i5 CPU 750  @ 2.67GHz (2660.02-MHz K8-class CPU)
  Origin = GenuineIntel  Id = 0x106e5  Family = 6  Model = 1e  Stepping = 5
  
Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  
Features2=0x98e3fdSSE3,DTES64,MON,DS_CPL,VMX,SMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,SSE4.1,SSE4.2,POPCNT
  AMD Features=0x28100800SYSCALL,NX,RDTSCP,LM
  AMD Features2=0x1LAHF
  TSC: P-state invariant
real memory  = 4294967296 (4096 MB)
avail memory = 4082315264 (3893 MB)
ACPI APIC Table: DELL   B11K   
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
FreeBSD/SMP: 1 package(s) x 4 core(s)
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  2
 cpu2 (AP): APIC ID:  4
 cpu3 (AP): APIC ID:  6
ioapic0: Changing APIC ID to 8
ioapic0 Version 2.0 irqs 0-23 on motherboard
lapic0: Forcing LINT1 to edge trigger
kbd1 at kbdmux0
acpi0: DELL B11Kon motherboard
acpi0: [ITHREAD]
acpi0: Power Button (fixed)
Timecounter ACPI-fast frequency 3579545 Hz quality 1000
acpi_timer0: 24-bit timer at 3.579545MHz port 0x808-0x80b on acpi0
cpu0: ACPI CPU on acpi0
cpu1: ACPI CPU on acpi0
cpu2: ACPI CPU on acpi0
cpu3: ACPI CPU on acpi0
acpi_button0: Power Button on acpi0
pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
pci0: ACPI PCI bus on pcib0
pcib1: PCI-PCI bridge irq 16 at device 3.0 on pci0
pci1: PCI bus on pcib1
vgapci0: VGA-compatible display port 0xdc80-0xdcff mem 
0xf600-0xf6ff,0xe000-0xefff,0xf000-0xf1ff irq 16 at 
device 0.0 on pci1
nvidia0: GeForce GT 330 on vgapci0
vgapci0: child nvidia0 requested pci_enable_busmaster
vgapci0: child nvidia0 requested pci_enable_io
vgapci0: child nvidia0 requested pci_enable_io
nvidia0: [ITHREAD]
hdac0: NVidia (Unknown) High Definition Audio Controller mem 
0xf7dfc000-0xf7df irq 17 at device 0.1 on pci1
hdac0: HDA Driver Revision: 20100226_0142
hdac0: [ITHREAD]
pci0: base peripheral at device 8.0 (no driver attached)
pci0: base peripheral at device 8.1 (no driver attached)
pci0: base peripheral at device 8.2 (no driver attached)
pci0: base peripheral at device 16.0 (no driver attached)
pci0

Re: Keyboard repeat issues with Dell Optiplex 980s

2011-01-19 Thread Steve Polyack

On 1/18/2011 5:56 PM, Jeremy Chadwick wrote:

On Tue, Jan 18, 2011 at 04:40:13PM -0500, Steve Polyack wrote:

We've recently upgraded a few desktop workstations from Dell
Optiplex 960s to Optiplex 980s.  We were running FreeBSD
8.1-RELEASE.  The migration was performed by simply swapping the
drives into the new systems.  Immediately after switching people
over, they all began to report bizarre keyboard issues - things like
infinite key repeats (letters, numbers, enter) for keys they did
not hold down.  The key repeats continue indefinitely until another
key is pressed.  Occasionally, even mouse input will trigger similar
infinite keyboard input repetition.  In addition to the repeat
issue, sometimes physical key-presses are not registered by FreeBSD,
leading to typos and angry developers.

We've tried doing fresh installs of FreeBSD 8.2-RC2 on two of these
systems, and the issue persists.  Because of the observed behavior,
I'm thinking that this is due to new hardware in the 980s which
isn't timing or handling interrupts correctly under the FreeBSD
kernel.

Looking at a 'pciconf -lvb' from each system, I noticed that the 980
has two USB controllers which probe under ehci(4), while the 960
(which does not exhibit this problem), enumerates six uhci(4)
controllers and two ehci(4) controllers.  To cut to the chase here,
the 960 users' keyboards probe under a USB1.0 uhci(4), while the
980s only have ehci(4) devices to attach to.

So, I guess what I'm asking is - has anyone else seen any keyboard
repeat or other USB craziness with ehci(4) ports or otherwise Intel
PCH controllers?Any fellow Optiplex 980 users?  I'd be more than
happy to provide pciconf or other output if requested.

Try adding the following to /boot/loader.conf then reboot and see if
the excessive repeat behaviour changes:

hint.kbdmux.0.disabled=1

It would also help if you would state exactly what brand/model of
keyboard is used.  Yes, believe it or not, it matters.  dmesg output
would be helpful in this case.

The keyboard is also a Dell model - model KB1421, or listed as Dell 
QuiteKey Keyboard under dmesg.  The same keyboard does not exhibit the 
strange behavior when used with the older model of tower (Optiplex 960).


 I'll reboot today with the loader.conf hint you provided.  I'll let 
you guys know if it helps.  Thanks!

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Keyboard repeat issues with Dell Optiplex 980s

2011-01-19 Thread Steve Polyack

On 01/19/11 08:48, Steve Polyack wrote:

On 1/18/2011 5:56 PM, Jeremy Chadwick wrote:

On Tue, Jan 18, 2011 at 04:40:13PM -0500, Steve Polyack wrote:

We've recently upgraded a few desktop workstations from Dell
Optiplex 960s to Optiplex 980s.  We were running FreeBSD
8.1-RELEASE.  The migration was performed by simply swapping the
drives into the new systems.  Immediately after switching people
over, they all began to report bizarre keyboard issues - things like
infinite key repeats (letters, numbers, enter) for keys they did
not hold down.  The key repeats continue indefinitely until another
key is pressed.  Occasionally, even mouse input will trigger similar
infinite keyboard input repetition.  In addition to the repeat
issue, sometimes physical key-presses are not registered by FreeBSD,
leading to typos and angry developers.

We've tried doing fresh installs of FreeBSD 8.2-RC2 on two of these
systems, and the issue persists.  Because of the observed behavior,
I'm thinking that this is due to new hardware in the 980s which
isn't timing or handling interrupts correctly under the FreeBSD
kernel.

Looking at a 'pciconf -lvb' from each system, I noticed that the 980
has two USB controllers which probe under ehci(4), while the 960
(which does not exhibit this problem), enumerates six uhci(4)
controllers and two ehci(4) controllers.  To cut to the chase here,
the 960 users' keyboards probe under a USB1.0 uhci(4), while the
980s only have ehci(4) devices to attach to.

So, I guess what I'm asking is - has anyone else seen any keyboard
repeat or other USB craziness with ehci(4) ports or otherwise Intel
PCH controllers?Any fellow Optiplex 980 users?  I'd be more than
happy to provide pciconf or other output if requested.

Try adding the following to /boot/loader.conf then reboot and see if
the excessive repeat behaviour changes:

hint.kbdmux.0.disabled=1

It would also help if you would state exactly what brand/model of
keyboard is used.  Yes, believe it or not, it matters.  dmesg output
would be helpful in this case.

The keyboard is also a Dell model - model KB1421, or listed as Dell 
QuiteKey Keyboard under dmesg.  The same keyboard does not exhibit 
the strange behavior when used with the older model of tower (Optiplex 
960).


 I'll reboot today with the loader.conf hint you provided.  I'll let 
you guys know if it helps.  Thanks!




The problem still exists with the kbdmux.0.disabled hint.  It definitely 
took effect, as there is no longer a /dev/kbdmux0, and dmesg lists the 
refusal to register the kbdmux module.  Any other ideas?  We've tried 
playing with the hw.usb.ehci.lostinrbug and hw.usb.ehci.no_hs sysctls, 
but they don't make a difference either.


Looking at the ehci(4) man page, this sticks out at me:
BUGS
 The driver is not finished and is quite buggy.

 There is currently no support for isochronous transfers.



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Keyboard repeat issues with Dell Optiplex 980s

2011-01-18 Thread Steve Polyack


We've recently upgraded a few desktop workstations from Dell Optiplex 
960s to Optiplex 980s.  We were running FreeBSD 8.1-RELEASE.  The 
migration was performed by simply swapping the drives into the new 
systems.  Immediately after switching people over, they all began to 
report bizarre keyboard issues - things like infinite key repeats 
(letters, numbers, enter) for keys they did not hold down.  The key 
repeats continue indefinitely until another key is pressed.  
Occasionally, even mouse input will trigger similar infinite keyboard 
input repetition.  In addition to the repeat issue, sometimes physical 
key-presses are not registered by FreeBSD, leading to typos and angry 
developers.


We've tried doing fresh installs of FreeBSD 8.2-RC2 on two of these 
systems, and the issue persists.  Because of the observed behavior, I'm 
thinking that this is due to new hardware in the 980s which isn't timing 
or handling interrupts correctly under the FreeBSD kernel.


Looking at a 'pciconf -lvb' from each system, I noticed that the 980 has 
two USB controllers which probe under ehci(4), while the 960 (which does 
not exhibit this problem), enumerates six uhci(4) controllers and two 
ehci(4) controllers.  To cut to the chase here, the 960 users' keyboards 
probe under a USB1.0 uhci(4), while the 980s only have ehci(4) devices 
to attach to.


So, I guess what I'm asking is - has anyone else seen any keyboard 
repeat or other USB craziness with ehci(4) ports or otherwise Intel PCH 
controllers?Any fellow Optiplex 980 users?  I'd be more than happy 
to provide pciconf or other output if requested.


Thanks,
Steve Polyack
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: out of HDD space - zfs degraded

2010-10-02 Thread Steve Polyack

 On 10/2/2010 10:04 PM, Dan Langille wrote:

On 10/2/2010 7:50 PM, Jeremy Chadwick wrote:

On Sat, Oct 02, 2010 at 07:23:16PM -0400, Dan Langille wrote:

On 10/2/2010 6:36 PM, Jeremy Chadwick wrote:

On Sat, Oct 02, 2010 at 06:09:25PM -0400, Dan Langille wrote:

On 10/2/2010 10:19 AM, Jeremy Chadwick wrote:

On Sat, Oct 02, 2010 at 09:43:30AM -0400, Dan Langille wrote:

Overnight I was running a zfs send | zfs receive (both within the
same system / zpool).  The system ran out of space, a drive went 
off

line, and the system is degraded.

This is a raidz2 array running on FreeBSD 8.1-STABLE #0: Sat Sep 18
23:43:48 EDT 2010.

The following logs are also available at
http://www.langille.org/tmp/zfs-space.txt- no line wrapping

This is what was running:

# time zfs send storage/bac...@transfer | mbuffer | zfs receive
storage/compressed/bacula-mbuffer
in @  0.0 kB/s, out @  0.0 kB/s, 3670 GB total, buffer 100%
fullcannot receive new filesystem stream: out of space
mbuffer: error: outputThread: error writing tostdoutat offset
0x395917c4000: Broken pipe

summary: 3670 GByte in 10 h 40 min 97.8 MB/s
mbuffer: warning: error during output tostdout: Broken pipe
warning: cannot send 'storage/bac...@transfer': Broken pipe

real640m48.423s
user8m52.660s
sys 211m40.862s


Looking in the logs, I see this:

Oct  2 00:50:53 kraken kernel: (ada0:siisch0:0:0:0): lost device
Oct  2 00:50:54 kraken kernel: siisch0: Timeout on slot 30
Oct  2 00:50:54 kraken kernel: siisch0: siis_timeout is 0004 ss
4000 rs 4000 es  sts 801f0040 serr 
Oct  2 00:50:54 kraken kernel: siisch0: Error while READ LOG EXT
Oct  2 00:50:55 kraken kernel: siisch0: Timeout on slot 30
Oct  2 00:50:55 kraken kernel: siisch0: siis_timeout is 0004 ss
4000 rs 4000 es  sts 801f0040 serr 
Oct  2 00:50:55 kraken kernel: siisch0: Error while READ LOG EXT
Oct  2 00:50:56 kraken kernel: siisch0: Timeout on slot 30
Oct  2 00:50:56 kraken kernel: siisch0: siis_timeout is 0004 ss
4000 rs 4000 es  sts 801f0040 serr 
Oct  2 00:50:56 kraken kernel: siisch0: Error while READ LOG EXT
Oct  2 00:50:57 kraken kernel: siisch0: Timeout on slot 30
Oct  2 00:50:57 kraken kernel: siisch0: siis_timeout is 0004 ss
4000 rs 4000 es  sts 801f0040 serr 
Oct  2 00:50:57 kraken kernel: siisch0: Error while READ LOG EXT
Oct  2 00:50:58 kraken kernel: siisch0: Timeout on slot 30
Oct  2 00:50:58 kraken kernel: siisch0: siis_timeout is 0004 ss
4000 rs 4000 es  sts 801f0040 serr 
Oct  2 00:50:58 kraken kernel: siisch0: Error while READ LOG EXT
Oct  2 00:50:59 kraken root: ZFS: vdev I/O failure, zpool=storage
path=/dev/gpt/disk06-live offset=270336 size=8192 error=6

Oct  2 00:50:59 kraken kernel: (ada0:siisch0:0:0:0): Synchronize
cache failed
Oct  2 00:50:59 kraken kernel: (ada0:siisch0:0:0:0): removing 
device entry


Oct  2 00:50:59 kraken root: ZFS: vdev I/O failure, zpool=storage
path=/dev/gpt/disk06-live offset=2000187564032 size=8192 error=6
Oct  2 00:50:59 kraken root: ZFS: vdev I/O failure, zpool=storage
path=/dev/gpt/disk06-live offset=2000187826176 size=8192 error=6

$ zpool status
   pool: storage
  state: DEGRADED
  scrub: scrub in progress for 5h32m, 17.16% done, 26h44m to go
config:

 NAME STATE READ WRITE CKSUM
 storage  DEGRADED 0 0 0
   raidz2 DEGRADED 0 0 0
 gpt/disk01-live  ONLINE   0 0 0
 gpt/disk02-live  ONLINE   0 0 0
 gpt/disk03-live  ONLINE   0 0 0
 gpt/disk04-live  ONLINE   0 0 0
 gpt/disk05-live  ONLINE   0 0 0
 gpt/disk06-live  REMOVED  0 0 0
 gpt/disk07-live  ONLINE   0 0 0

$ zfs list
NAMEUSED  AVAIL  REFER  MOUNTPOINT
storage6.97T  1.91T  1.75G  /storage
storage/bacula 4.72T  1.91T  4.29T  /storage/bacula
storage/compressed 2.25T  1.91T  46.9K  /storage/compressed
storage/compressed/bacula  2.25T  1.91T  42.7K  
/storage/compressed/bacula

storage/pgsql  5.50G  1.91T  5.50G  /storage/pgsql

$ sudo camcontrol devlist
Password:
Hitachi HDS722020ALA330 JKAOA28A at scbus2 target 0 lun 0 
(pass1,ada1)
Hitachi HDS722020ALA330 JKAOA28A at scbus3 target 0 lun 0 
(pass2,ada2)
Hitachi HDS722020ALA330 JKAOA28A at scbus4 target 0 lun 0 
(pass3,ada3)
Hitachi HDS722020ALA330 JKAOA28A at scbus5 target 0 lun 0 
(pass4,ada4)
Hitachi HDS722020ALA330 JKAOA28A at scbus6 target 0 lun 0 
(pass5,ada5)
Hitachi HDS722020ALA330 JKAOA28A at scbus7 target 0 lun 0 
(pass6,ada6)
ST380815AS 4.AABat scbus8 target 0 lun 0 
(pass7,ada7)
TSSTcorp CDDVDW SH-S223C SB01   at scbus9 target 0 lun 0 
(cd0,pass8)
WDC WD1600AAJS-75M0A0 02.03E02  at scbus10 target 0 lun 0 

Re: NFS 75 second stall

2010-09-01 Thread Steve Polyack

 On 07/01/10 15:23, Garrett Cooper wrote:

On Thu, Jul 1, 2010 at 11:51 AM, alan bryanalan.br...@yahoo.com  wrote:


--- On Thu, 7/1/10, Garrett Cooperyanef...@gmail.com  wrote:


From: Garrett Cooperyanef...@gmail.com
Subject: Re: NFS 75 second stall
To: alan bryanalan.br...@yahoo.com
Cc: freebsd-stable@freebsd.org
Date: Thursday, July 1, 2010, 11:13 AM
On Thu, Jul 1, 2010 at 11:01 AM, alan
bryanalan.br...@yahoo.com
wrote:

Setup:

server - FreeBSD 8-stable from today.  2 UFS dirs

exported via NFS.

client - FreeBSD 8.0-Release.  Running a test php

script that copies around various files to/from 2 separate
NFS mounts.

Situation:

script is started (forked to do 20 simultaneous runs)

and 20 1GB files are copied to the NFS dir which works
fine.  When it then switches to reading those files back
and simultaneously writing to the other NFS mount I see a
hang of 75 seconds.  If I do an ls -l on the NFS mount it
hangs too.  After 75 seconds the client has reported:

nfs server 192.168.10.133:/usr/local/export1: not

responding

nfs server 192.168.10.133:/usr/local/export1: is alive

again

nfs server 192.168.10.133:/usr/local/export1: not

responding

nfs server 192.168.10.133:/usr/local/export1: is alive

again

and then things start working again.  The server was

originally FreeBSD 8.0-Release also but was upgraded to the
latest stable to see if this issue could be avoided.

# nfsstat -s -W -w 1
  GtAttr Lookup Rdlink   Read  Write Rename

Access  Rddir

   0  0  0222257

   0  0  0

   0  0  0178135

   0  0  0

   0  0  0 85127

 0  0  0

   0  0  0  0  0

 0  0  0

   0  0  0  0  0

 0  0  0

   0  0  0  0  0

 0  0  0

   0  0  0  0  0

 0  0  0

   0  0  0  0  0

 0  0  0

... for 75 rows of all zeros

   0  0  0272266

   0  0  0

   0  0  0167165

   0  0  0

I also tried runs with 15 simultaneous processes and

25.  15 processes gave only about a 5 second stall but 25
gave again the same 75 second stall.

Further, I tested with 2 mounts to the same server but

from ZFS filesytems with the exact same stall/timeout
periods.  So, it doesn't appear to matter what the
underlying filesystem is - it's something in NFS or
networking code.

Any ideas on what's going on here?  What's causing

the complete stall period of zero NFS activity?   Any flaws
with my testing methods?

Thanks for any and all help/ideas.

What network driver are you using? Have you tried
tcpdumping the packets?
-Garrett


I'm using igb currently but have also used em.  I have not tried tcpdumping the 
packets yet on this test.  Any suggestions on things to look out for (I'm not 
that familiar with that whole process).

Which brings up another point - I'm using TCP connections for NFS, not UDP.

 Is the net.inet.tcp.tso sysctl enabled or not? What about rxcsum and 
txcsum?
Thanks,
-Garrett


We're occaisionally seeing these same types of stalls (+ repeated is 
not responding is alive again messages in quick succession).  We're 
seeing it only on our 8.1-RELEASE systems against a variety of NFS 
servers (6.3-RELEASE, 7.2-RELEASE, and 8-STABLE from before the release 
of 8.1).  We also see it happen with a variety of client hardware and 
network adapters (em, bce, bge); the only common denominator is 
8.1-RELEASE on the clients.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: em(4) duplex problems with 82541EI on RELENG_8, -CURRENT on PowerEdge 1850

2010-07-15 Thread Steve Polyack

On 07/15/10 13:31, Michael Tuexen wrote:

On Jul 15, 2010, at 6:50 PM, Brian A. Seklecki wrote:

   
 

It may have gone in before the RELENG_8_1 tag/branch occurred?  SVN
r209309

Jacks's change went into stable/8 on June 18:

   

Also, did anyone provide feedback on SVN r209959 to
head/sys/dev/e1000/if_em.c ?

It's saying 8.1 MFC, so you might want to ask people to test that on
stable/8 then expedite the 8.1 MFC as well.
 

Maybe Jack can MFC it to stable/8. MFCing to releng/8.1 must come from stable/8 
anyway.
There is no need for re@ approval to MFC it into stable/8.
   


As Brian stated, the change has already been MFC'd into stable/8 (June 
18th) with the following comment from Jack:


MFC to RELENG8.1 asap


Steve

Best regards
Michael
   

~BAS
 

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

   



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Dell PowerEdge Virtual Media

2010-06-09 Thread Steve Polyack

On 12/08/09 12:41, Jeff Blank wrote:

Hi,

I'm having a little trouble using the virtual media function of
Dell's PowerEdge R-series (R710 in this case) iDRAC6 under FreeBSD
(7.1, 8.0).  This is presented as /dev/cd0, a USB/SCSI device, I
guess.  This is in the dmesg buffer when I boot up the existing 7.1
installation with the virtual optical drive mapped to the 8.0-RELEASE
amd64 DVD image:
   
We're having some similar issues with our older PowerEdge 1850s 
(DRAC4).  Things were alright on 6.3-RELEASE, but now there is a large 
hangup when trying to install 8.0-RELEASE:


We see ~ 30x of these in dmesg:
  acd1: WARNING - READ_TOC read data overrun 2012

Followed by this in sysinstall:
  The disc in your drive looks more like an audio disc than a
  FreeBSD Release (1).

The DRAC Virtual Media and physical CDROM drives are probed like this in 
dmesg:

acd1: DVR VIRTUALCDROM DRIVE/ at ata2-slave PIO3
acd0: CDROM TEAC CD-ROM CD-224E/K.9A

This is with the latest DRAC firmware available from Dell.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Freebsd 8.0 kmem map too small

2010-05-10 Thread Steve Polyack

On 05/10/10 11:55, Mike Andrews wrote:

On 5/5/10 11:19 AM, Freddie Cash wrote:
On Tue, May 4, 2010 at 11:32 PM, Giulio Ferroau...@zirakzigil.org  
wrote:



Giulio Ferro wrote:


Thanks, I'll try these settings.

I'll keep you posted.



Nope, it's happened again... Now I've tried to rise vm.kmem_size to 
6G...

I'm really astounded at how unstable zfs is, it's causing me a lot of
problem.
Why isn't it stated in the handbook that zfs isn't up to production 
yet?




As with everything related to computers, it all depends on your uses.


Sorry to semi-hijack this, but...  I'm also running into frequent 
kmem_map too small panics on 8-STABLE, such as:


panic: kmem_malloc(131072): kmem_map too small: 2023780352 total 
allocated
panic: kmem_malloc(131072): kmem_map too small: 2011525120 total 
allocated
panic: kmem_malloc(114688): kmem_map too small: 1849356288 total 
allocated
panic: kmem_malloc(114688): kmem_map too small: 1849356288 total 
allocated
panic: kmem_malloc(114688): kmem_map too small: 1849356288 total 
allocated
panic: kmem_malloc(131072): kmem_map too small: 2020409344 total 
allocated
panic: kmem_malloc(536576): kmem_map too small: 2022957056 total 
allocated


(those are over the course of 3-4 days)

On this specific system, it has 32 GB physical memory and has 
vfs.zfs.arc_max=2G and vm.kmem_size=64G in /boot/loader.conf.  The 
latter was added per earlier suggestions on this list, but appears to 
be ignored as sysctl vm.kmem_size returns about 2 GB (2172452864) 
anyway.



As Artem stated in another reply, you will need to set vm.kmem_size 
slightly under 2x the physical memory.  The kernel will default to 2GB 
if you pass this limit.  1.5x physical memory size should be sufficient, 
so try 48G and verify that it gets set correctly on the next boot.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Freebsd 8.0 kmem map too small

2010-05-05 Thread Steve Polyack

On 05/05/10 10:12, Ben Kelly wrote:

On May 5, 2010, at 9:33 AM, Pawel Jakub Dawidek wrote:

   

On Wed, May 05, 2010 at 10:46:31AM +0200, Giulio Ferro wrote:
 

On 05.05.2010 09:52, Jeremy Chadwick wrote:

Nope, it's happened again... Now I've tried to rise vm.kmem_size to 6G...


   

Did you set both vm.kmem_size and vfs.zfs.arc_max, setting the latter to
something *less* than vm.kmem_size?


 

Yes.
After your suggestion, I set
vfs.zfs.arc_max: 3758096384
vm.kmem_size: 4G

Now:
vfs.zfs.arc_max: 3758096384
vm.kmem_size: 6392119296
   

Could you try to track down the commit that is causing your problems?
Could you try 8-STABLE kernel from before r206815?
 


Are others generally able to run ARC values so close to kmem size?  My 
experience has been that you really need the ARC to be much smaller than the 
kmem limit (like 25% or less) due to fragmentation of kmem_map.

   
On a system here with 8GB of RAM and a very large zpool consisting of 
multiple zdevs, little tuning was needed.  The system is running 
8-STABLE(amd64) as of a month or two ago.  The only things I have set in 
/boot/loader.conf are:

vm.kmem_size=12G
vfs.zfs.arc_max=4G

Setting kmem_size to 12G came from a combination of recommendations I 
saw in various mailing list posts (you can probably dig them up).  
Setting it to the physical memory size was the initial recommendation, 
while others recommended 1.5x physical memory size to help prevent 
fragmentation/wasted space in kmem.


Regardless, this has served us quite well for the ~6 months the system 
has been in use.  It has never crashed, even under intensive 
multi-threaded benchmarking.

-Steve
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: ZFS (zpool) doesn't detect failed drive

2010-05-05 Thread Steve Polyack

On 05/05/10 10:56, Harald Schmalzbauer wrote:

Harald Schmalzbauer schrieb am 05.05.2010 14:41 (localtime):

Hello,

one drive of my mirror failed today, but 'zpool staus' shows it 
online.
Every process using a ZFS mount hangs. Also 'zpool offline /dev/ad1' 
hangs infinitely.

...
Sorry, I made an error with zpool create. Somehow the little word 
mirror must have been lost. So the pool wasn't a mirror but a 
stripe. Then of course I can't make one vdev offline. Sorry for the 
noise.
But I took the opportunity to do some tests with that failing drive 
and created a _real_ mirror. That works without failures, but using 
the mirror again leads to:

ad1: TIMEOUT - FLUSHCACHE48 retrying (1 retry left)
ad1: TIMEOUT - FLUSHCACHE48 retrying (1 retry left)
ad1: TIMEOUT - FLUSHCACHE48 retrying (1 retry left)
ad1: TIMEOUT - FLUSHCACHE48 retrying (1 retry left)
ad1: TIMEOUT - FLUSHCACHE48 retrying (1 retry left)
ad1: TIMEOUT - FLUSHCACHE48 retrying (1 retry left)
ata3: port is not ready (timeout 1ms) tfd = 0080
ata3: hardware reset timeout
ad1: FAILURE - device detached

Now zpool reporsts the vdev ad1 still online although it has been 
detached and 'atacontrol list' doesn't show it anymore:


zpool status
  pool: URUBAmirrorP1
 state: ONLINE
status: One or more devices has experienced an unrecoverable error.  An
attempt was made to correct the error.  Applications are 
unaffected.
action: Determine if the device needs to be replaced, and clear the 
errors

using 'zpool clear' or replace the device with 'zpool replace'.
   see: http://www.sun.com/msg/ZFS-8000-9P
 scrub: none requested
config:

NAMESTATE READ WRITE CKSUM
URUBAmirrorP1  ONLINE   0 0 0
  mirrorONLINE   0 0 0
ad1 ONLINE   3  302K 0
ad2 ONLINE   0 0 0

errors: No known data errors

atacontrol list
ATA channel 2:
Master:  ad0 TRANSCEND/20090520 SATA revision 1.x
Slave:   no device present
ATA channel 3:
Master:  no device present
Slave:   no device present
ATA channel 4:
Master:  ad2 SAMSUNG HD154UI/1AG01118 SATA revision 2.x
Slave:   no device present
ATA channel 5:
Master:  ad3 ST3750640NS/3.AEG SATA revision 1.x
Slave:   no device present

How should such a failure be handled?
Do I have to manually mark the drive offline for zpool?

Thanks,

-Harry

You may want to try newer controller drivers like ahci(4) if possible.  
Otherwise, building the kernel with ATA_CAM may accomplish something 
similar.  I'm not sure, but I'm speculating that the newer ATA/CAM 
system may feed the proper notifications back to the ZFS systems.


I use many drives on the siis(4) driver, which is CAM-enabled, and 
haven't had any issues.  However, I have not had an outright drive 
failure.  I do recall testing situations where we would yank a working 
drive, and I seem to remember it working correctly after the last set of 
CAM improvements.


It may not be something you can try on a production system, but if you 
can experiment, it's worth a shot.  Note that your device names WILL 
change to adaX instead of adX.  I would definitely recommend you 
glabel(8) and create the zpool/zdevs using the glabel devices instead to 
circumvent any future problems associated with device numbering.


Steve
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


rc.d/rc.subr support for multiple FIBs

2010-03-12 Thread Steve Polyack
With multiple FIB support generally available in FreeBSD 8.0-RELEASE it 
would be quite beneficial to have the ability to build routing tables in 
secondary FIBs as well as start certain applications in certain FIBs 
from within rc.conf(5).


I've done some poking around and came across two PRs which implement 
exactly what I'm looking for:

http://www.freebsd.org/cgi/query-pr.cgi?pr=132483
http://www.freebsd.org/cgi/query-pr.cgi?pr=132476

My question is whether there are any plans to commit these into -CURRENT 
and possibly MFC them to a future 8.x-RELEASE.  Having multiple FIBs 
available has been great so far, it goes hand and hand with Multi-IP 
Jails.  The only thing missing are native methods for constructing the 
routing tables on boot (Yes, there is rc.local, but I don't want to go 
there...).


Thanks,
Steve Polyack

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: ZFS hot spares

2010-03-09 Thread Steve Polyack

On 03/09/10 05:11, Ivan Voras wrote:

On 03/08/10 19:06, Steve Polyack wrote:

ZFS in FreeBSD lacks at least one major feature from the Solaris
version: hot spares. There is a PR open at
http://www.freebsd.org/cgi/query-pr.cgi?pr=134491, but there hasn't been
any motion/thoughts posted on it since its creation almost one year ago.

I'm aware that on Solaris, hot spare replacement is handled by a few
Solaris-specific daemons, zfs-retire and zfs-diagnose, which both plug
into the Solaris FMA (Fault Management Architecture). Have there been
any thoughts on porting these over or getting something similar running
within FreeBSD? With all of the recent SATA/SAS CAM hotplug work now
committed, it would be nice to have automatic replacement of hot spares
with a future hot-replacement of the failed drive.

On the other side, I'd be interested in hearing if anyone has had
success in rolling their own scripted solution: i.e. something which
polls 'zpool status' looking for failed drives and performing hot-spare
replacements automatically.


You don't have to exactly poll it. See /etc/devd.conf:

# Sample ZFS problem reports handling.
notify 10 {
match system  ZFS;
match typezpool;
action logger -p kern.err 'ZFS: failed to load zpool $pool';
};

notify 10 {
match system  ZFS;
match typevdev;
action logger -p kern.err 'ZFS: vdev failure, zpool=$pool 
type=$type';

};

notify 10 {
match system  ZFS;
match typedata;
action logger -p kern.warn 'ZFS: zpool I/O failure, 
zpool=$pool error=$zio_err';

};

notify 10 {
match system  ZFS;
match typeio;
action logger -p kern.warn 'ZFS: vdev I/O failure, 
zpool=$pool path=$vdev_path offset=$zio_offset size=$zio_size 
error=$zio_err';

};

notify 10 {
match system  ZFS;
match typechecksum;
action logger -p kern.warn 'ZFS: checksum mismatch, 
zpool=$pool path=$vdev_path offset=$zio_offset size=$zio_size';

};

I don't really know if these notifications actually work since I don't 
have hot-plug test machines, but if they do, this looks like a decent 
starting point.




Thanks for the suggestions.  I received a similar one from someone 
else.  If I get time to build a ZFS lab machine then I will certainly 
try these out and provide feedback on how they work.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


ZFS hot spares

2010-03-08 Thread Steve Polyack
ZFS in FreeBSD lacks at least one major feature from the Solaris 
version: hot spares.   There is a PR open at 
http://www.freebsd.org/cgi/query-pr.cgi?pr=134491, but there hasn't been 
any motion/thoughts posted on it since its creation almost one year ago.


I'm aware that on Solaris, hot spare replacement is handled by a few 
Solaris-specific daemons, zfs-retire and zfs-diagnose, which both plug 
into the Solaris FMA (Fault Management Architecture).  Have there been 
any thoughts on porting these over or getting something similar running 
within FreeBSD?  With all of the recent SATA/SAS CAM hotplug work now 
committed, it would be nice to have automatic replacement of hot spares 
with a future hot-replacement of the failed drive.


On the other side, I'd be interested in hearing if anyone has had 
success in rolling their own scripted solution: i.e. something which 
polls 'zpool status' looking for failed drives and performing hot-spare 
replacements automatically.


Thanks,
Steve Polyack

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: hardware for home use large storage

2010-02-15 Thread Steve Polyack

On 02/15/10 12:14, Dan Langille wrote:

Dan Naumov wrote:

On Sun, Feb 14, 2010 at 11:38 PM, Dan Langille d...@langille.org wrote:

Dan Naumov wrote:

On Sun, 14 Feb 2010, Dan Langille wrote:

After creating three different system configurations (Athena,
Supermicro, and HP), my configuration of choice is this Supermicro
setup:

   1. Samsung SATA CD/DVD Burner $20 (+ $8 shipping)
   2. SuperMicro 5046A $750 (+$43 shipping)
   3. LSI SAS 3081E-R $235
   4. SATA cables $60
   5. Crucial 3×2G ECC DDR3-1333 $191 (+ $6 shipping)
   6. Xeon W3520 $310

You do realise how much of a massive overkill this is and how much you
are overspending?


I appreciate the comments and feedback.  I'd also appreciate 
alternative
suggestions in addition to what you have contributed so far.  Spec 
out the

box you would build.


==
Case: Fractal Design Define R2 - 89 euro:
http://www.fractal-design.com/?view=productprod=32

Mobo/CPU: Supermicro X7SPA-H / Atom D510 - 180-220 euro:
http://www.supermicro.com/products/motherboard/ATOM/ICH9/X7SPA.cfm?typ=H

PSU: Corsair 400CX 80+ - 59 euro:
http://www.corsair.com/products/cx/default.aspx

RAM: Corsair 2x2GB, DDR2 800MHz SO-DIMM, CL5 - 85 euro
==
Total: ~435 euro

The motherboard has 6 native AHCI-capable ports on ICH9R controller
and you have a PCI-E slot free if you want to add an additional
controller card. Feel free to blow the money you've saved on crazy
fast SATA disks and if your system workload is going to have a lot of
random reads, then spend 200 euro on a 80gb Intel X25-M for use as a
dedicated L2ARC device for your pool.


Based on the Fractal Design case mentioned above, I was told about 
Lian Lia cases, which I think are great.  As a result, I've gone with 
a tower  case without hot-swap.  The parts are listed at and 
reproduced below:


  http://dan.langille.org/2010/02/15/a-full-tower-case/

   1. LIAN LI PC-A71F Black Aluminum ATX Full Tower Computer Case $240 
(from mwave)

   2. Antec EarthWatts EA650 650W PSU $80
   3. Samsung SATA CD/DVD Burner $20 (+ $8 shipping)
   4. Intel S3200SHV LGA 775 Intel 3200 m/b $200
   5. Intel Core2 Quad Q9400 CPU $190
   6. SATA cables $22
   7. Supermicro LSI MegaRAID 8 Port SAS RAID Controller $118
   8. Kingston ValueRAM 4GB (2 x 2GB) 240-Pin DDR2 SDRAM ECC $97

Total cost is about $1020 with shipping.  Plus HDD.

No purchases yet, but the above is what appeals to me now.

Dan,
I'm not sure about that particular card, but we've never seen that great 
of performance out of the LSI MegaRAID cards that ship with Dell servers 
as the PERC.  The newest incarnations are better, but I would try to get 
an Areca.  The ones we have tested have displayed fantastic 
performance.  They are fairly expensive in comparison, though.  If 
you're using ZFS in place of the RAID on the LSI MegaRAID, I'd instead 
recommend other simpler SAS cards which are known to have good driver 
support.



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: hardware for home use large storage

2010-02-15 Thread Steve Polyack

On 2/15/2010 6:04 PM, Dan Langille wrote:

Steve Polyack wrote:

On 02/15/10 12:14, Dan Langille wrote:



   7. Supermicro LSI MegaRAID 8 Port SAS RAID Controller $118



Dan,
I'm not sure about that particular card, but we've never seen that 
great of performance out of the LSI MegaRAID cards that ship with 
Dell servers as the PERC.  The newest incarnations are better, but I 
would try to get an Areca.  The ones we have tested have displayed 
fantastic performance.  They are fairly expensive in comparison, 
though.  If you're using ZFS in place of the RAID on the LSI 
MegaRAID, I'd instead recommend other simpler SAS cards which are 
known to have good driver support.


Yes, the card will be used as a straight-through and not use for RAID. 
ZFS will be running raidz for me, possibly raidz2.  Given that, I'm 
not sure if you're suggesting the3 Areca or something else.


In addition, I'm not sure what makes a SAS card simpler and supported. 
Recommendation?


Other cards I have considered include:

LSI SAS3041E-R 4 port $120
http://www.google.com/products/catalog?q=lsi+sas+pciehl=encid=1824913543877548833sa=title#p 



SYBA SY-PEX40008 PCI Express SATA II 4 port $60
http://www.newegg.com/Product/Product.aspx?Item=N82E16816124027

LSISAS1064 chipset - SAS3042e
http://www.lsi.com/DistributionSystem/AssetDocument/PCIe_3GSAS_UG.pdf

SUPERMICRO AOC-SAT2-MV8 64-bit PCI-X133MHz SATA Controller Card $99
http://www.newegg.com/Product/Product.aspx?Item=N82E16815121009



All I meant by simpler was that if you're not going to use the RAID 
portion then you do not have to pay for it.   Same goes for SAS - if 
you're not going to use SAS disks, you only need a SATA controller.  The 
SYBA card you have listed works great with the siis driver (NCQ 
support).  If you don't need SAS support, I think there are Areca cards 
around $100 that just do SATA w/o RAID.


As far as supported, I know that the Areca driver is very good, as is 
the recently revamped ahci generic and siis drivers.  Like I said, those 
newer LSI cards may be fine, but I've had bad experiences with the PERC4 
and PERC5 (/LSI/ MegaRAID SAS 8408E), which are both re-branded LSI 
MegaRAID cards.  They were always very reliable and handled failed disks 
quite well, but performance was often ... just bad. YMMV.


The PERC6 (one of the newer generations) isn't half as bad... If you end 
up with one of the LSI cards, let us know how it performs ;)

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


ZFS ARC being limited below what is defined in /boot/loader.conf

2010-02-12 Thread Steve Polyack
Has anyone had an issue with the ZFS ARC max being limited below what 
has been defined in /boot/loader.conf?  I just upgraded the RAM in a 
ZFS-equipped system and attempted to devote 4GB to the ARC cache by 
placing the following in loader.conf:

  vfs.zfs.arc_max=4096M

However, after rebooting, querying the sysctl gives me this:
$ sysctl vfs.zfs.arc_max
vfs.zfs.arc_max: 1726489600

or about 1.7GB, an odd number that I can't find any references to.  For 
reference, I'm running 8-STABLE (as of Jan 19th) on an amd64 system with 
8GB of RAM.  The system was previously very stable with 4GB of RAM and a 
512MB arc_max.  I have not modified vm.kmem_size_max (defaults to ~330GB 
on amd64) or any other ZFS tunables.  I'd also like to avoid syncing up 
to the current 8-STABLE if at all possible.


Thanks,
Steve Polyack

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: ZFS ARC being limited below what is defined in /boot/loader.conf

2010-02-12 Thread Steve Polyack

On 02/12/10 13:47, Artem Belevich wrote:

On Fri, Feb 12, 2010 at 10:31 AM, Steve Polyackkor...@comcast.net  wrote:

   

Has anyone had an issue with the ZFS ARC max being limited below what has
been defined in /boot/loader.conf?  I just upgraded the RAM in a
ZFS-equipped system and attempted to devote 4GB to the ARC cache by placing
the following in loader.conf:
  vfs.zfs.arc_max=4096M

However, after rebooting, querying the sysctl gives me this:
$ sysctl vfs.zfs.arc_max
vfs.zfs.arc_max: 1726489600

or about 1.7GB, an odd number that I can't find any references to.  For
reference, I'm running 8-STABLE (as of Jan 19th) on an amd64 system with 8GB
of RAM.  The system was previously very stable with 4GB of RAM and a 512MB
arc_max.  I have not modified vm.kmem_size_max (defaults to ~330GB on amd64)
or any other ZFS tunables.  I'd also like to avoid syncing up to the current
8-STABLE if at all possible.

Thanks,
Steve Polyack

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

 


Check your vm.kmem_size. Default setting is way too low. Set it to at
least double of desired arc size.

--Artem
I mentioned it briefly, but vm.kmem_size_max was left at the default for 
amd64.  At 330GB it is way above and beyond what will ever be allocated 
to ARC:

$ sysctl vm.kmem_size_max
vm.kmem_size_max: 329853485875


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: ZFS ARC being limited below what is defined in /boot/loader.conf

2010-02-12 Thread Steve Polyack
This was indeed the issue.  I read your first response incorrectly.  
Thanks a bunch!


On 2/12/2010 4:10 PM, Artem Belevich wrote:

vm.kmem_size_max/vm.kmem_size_min define the range vm.kmem_size can be set to.
vm_kmem_size specifies the actual kmem size.

ARC size in turn limited by vm.kmem_size.

If you want to bump ARC size, you do need to bump vm.kmem_size.

--Artem



On Fri, Feb 12, 2010 at 11:36 AM, Steve Polyackkor...@comcast.net  wrote:
   

On 02/12/10 13:47, Artem Belevich wrote:
 

On Fri, Feb 12, 2010 at 10:31 AM, Steve Polyackkor...@comcast.net
  wrote:


   

Has anyone had an issue with the ZFS ARC max being limited below what has
been defined in /boot/loader.conf?  I just upgraded the RAM in a
ZFS-equipped system and attempted to devote 4GB to the ARC cache by
placing
the following in loader.conf:
  vfs.zfs.arc_max=4096M

However, after rebooting, querying the sysctl gives me this:
$ sysctl vfs.zfs.arc_max
vfs.zfs.arc_max: 1726489600

or about 1.7GB, an odd number that I can't find any references to.  For
reference, I'm running 8-STABLE (as of Jan 19th) on an amd64 system with
8GB
of RAM.  The system was previously very stable with 4GB of RAM and a
512MB
arc_max.  I have not modified vm.kmem_size_max (defaults to ~330GB on
amd64)
or any other ZFS tunables.  I'd also like to avoid syncing up to the
current
8-STABLE if at all possible.

Thanks,
Steve Polyack

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


 

Check your vm.kmem_size. Default setting is way too low. Set it to at
least double of desired arc size.

--Artem
   

I mentioned it briefly, but vm.kmem_size_max was left at the default for
amd64.  At 330GB it is way above and beyond what will ever be allocated to
ARC:
$ sysctl vm.kmem_size_max
vm.kmem_size_max: 329853485875



 
   


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: hardware for home use large storage

2010-02-10 Thread Steve Polyack

On 2/10/2010 12:02 AM, Dan Langille wrote:

Trying to make sense of stuff I don't know about...

Matthew Dillon wrote:


AHCI on-motherboard with equivalent capabilities do not appear to be
in wide distribution yet.  Most AHCI chips can do NCQ to a single
target (even a single target behind a PM), but not concurrently to
multiple targets behind a port multiplier.  Even though SATA 
bandwidth

constraints might seem to make this a reasonable alternative it
actually isn't because any seek heavy activity to multiple drives
will be serialized and perform EXTREMELY poorly.  Linear performance
will be fine.  Random performance will be horrible.


Don't use a port multiplier and this goes away.  I was hoping to avoid 
a PM and using something like the Syba PCI Express SATA II 4 x Ports 
RAID Controller seems to be the best solution so far.


http://www.amazon.com/Syba-Express-Ports-Controller-SY-PEX40008/dp/B002R0DZWQ/ref=sr_1_22?ie=UTF8s=electronicsqid=1258452902sr=1-22 



Dan, I can personally vouch for these cards under FreeBSD.  We have 3 of 
them in one system, with almost every port connected to a port 
multiplier (SiI5xxx PMs).  Using the siis(4) driver on 8.0-RELEASE 
provides very good performance, and supports both NCQ and FIS-based 
switching (an essential for decent port-multiplier performance).


One thing to consider, however, is that the card is only single-lane 
PCI-Express.  The bandwidth available is only 2.5Gb/s (~312MB/sec, 
slightly less than that of the SATA-2 link spec), so if you have 4 
high-performance drives connected, you may hit a bottleneck at the 
bus.   I'd be particularly interested if anyone can find any similar 
Silicon Image SATA controllers with a PCI-E 4x or 8x interface ;)






It should be noted that while hotswap is supported with silicon 
image
chipsets and port multiplier enclosures (which also use Sili 
chips in
the enclosure), the hot-swap capability is not anywhere near as 
robust

as you would find with a more costly commercial SAS setup.  SI chips
are very poorly made (this is the same company that went bust under
another name a few years back due to shoddy chipsets), and have a 
lot

of on-chip hardware bugs, but fortunately OSS driver writers (linux
guys) have been able to work around most of them.  So even though 
the

chipset is a bit shoddy actual operation is quite good.  However,
this does mean you generally want to idle all activity on the 
enclosure

to safely hot swap anything, not just the drive you are pulling out.
I've done a lot of testing and hot-swapping an idle disk while other
drives in the same enclosure are hot is not reliable (for a cheap 
port

multiplier enclosure using a Sili chip inside, which nearly all do).



I haven't had such bad experience as the above, but it is certainly a 
concern.  Using ZFS we simply 'offline' the device, pull, replace with a 
new one, glabel, and zfs replace.  It seems to work fine as long as 
nothing is accessing the device you are replacing (otherwise you will 
get a kernel panic a few minutes down the line).  m...@freebsd.org has 
also committed a large patch set to 9-CURRENT which implements proper 
SATA/AHCI hot-plug support and error-recovery through CAM.


-Steve Polyack
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Pack of CAM improvements

2010-01-22 Thread Steve Polyack

On 01/22/10 11:48, Freddie Cash wrote:

On Fri, Jan 22, 2010 at 2:23 AM, Harald Schmalzbauer
h.schmalzba...@omnilan.de  wrote:

   

Alexander Motin schrieb am 19.01.2010 17:12 (localtime):
...

  Patch can be found here:
 

http://people.freebsd.org/~mav/cam-ata.20100119.patch

Feedback as always welcome.

   

Again, thanks a lot for your ongoing great work!
The patch doesn't cleanly apply with vpo, but I don't use vpo so I didn't
care.
Otherwise I couldn't find any problems.
The system detects reinserted SATA drives on ICH9 fine.

This was tested on a zfs backup server which went to the backbone
yesterday, so I can't physically remove any devices any more for testing...

But I had some questions about zfs raidz states. I think that isn't a
matter of atacam but if I removed one disk, zpool status still showed me the
ada3 device online.
After reinserting (and proper detection/initialisazion with cam, ada3 was
present again) and zpool clean, it set the devicea as UNAVAIL sinve I/O
errors.
I coudn't get the device into the pool again, no matter what I tried.
Only rebooting the machine helped. Then I could clean and scrub.

What are the needed steps to provide a reinsterted hard disk to geom? With
the latest patches I don't need to issue any reset/rescan comman, right?
So it's a zfs problem, right? My mistake in understanding?

In my testing of pulling drives at random (using a 3Ware 9550SXU or 9650SE
 

controller), you have to zpool offlinepool  device while the drive is
unplugged, before you can re-insert the same disk or a different disk.
  Without doing that step, it's very hard to re-insert the same disk, or
replace it with a new one, without rebooting.

Took me a couple of reboots and drive replacements before I figured that one
out.  :)

   
I think you can do it without the 'zpool offline pool device' 
command;  I may be wrong, but I believe you can use 'zpool replace' to 
accomplish what you're trying to do.  i.e. if you have a bad drive ada3, 
and take it out, then replace it with a new disk, you can issue a 'zpool 
replace pool /dev/ada3 /dev/ada3' (yes, the same device is specified 
twice). ZFS should recognize that its a different disk and/or that it is 
lacking ZFS metadata and begin to resilver the pool onto the new 
device.  If you watch 'zfs status' in the process you'll see something like:


raidz1   DEGRADED 0 0 0
label/ada4ONLINE   0 0 0  12.4M resilvered
label/ada5ONLINE   0 0 0  12.4M resilvered
label/ada6ONLINE   0 0 0  12.3M resilvered
replacing  DEGRADED 0 0 0
  label/ada3/old  UNAVAIL  0   595 0  cannot open
  label/ada3  ONLINE   0 0 0  9.74G resilvered

Try it out and let me know if it works for you.



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Panic possibly related to glabel/geom and siis(4)

2009-11-24 Thread Steve Polyack
I have a system running 8.0-PRERELEASE with multiple drives and SATA 
port multipliers (siis controllers and PMPs).  All of the attached 
drives are labeled via glabel(8) and then included into a ZFS pool.  
During some testing to determine how the system would react to a dead 
drive (simulated by physically removing a drive during operation),  I 
was able to produce a panic.


Now, I know that the SATA PMP and siis(4) code to handle and recover 
from device errors is incomplete, but I believe the crash may be 
particular to using glabel'd drives.  Basically, after removing a drive 
while the zpool is in use and issues 'camcontrol reset' and 'rescan' on 
the appropriate bus, the physical device associated with the drive 
disappears.  In this case:

 (pass5:siisch7:0:15:0): lost device
 (pass5:siisch7:0:15:0): removing device entry
 (ada2:siisch7:0:0:0): lost device

and /dev/ada2 disappears.  However, the associated glabel 
/dev/label/bigdisk07 remains.  Since my ZFS pool is created based on the 
drive glabels, I believe this is why ZFS never notices the drives 
disappear either.


Do glabels typically go away after a physical device is lost?  Should 
this not be the case?



After some runtime with the physical device missing, a kernel panic is 
produced:


ada2:siisch7:0:0:0): Synchronize cache failed
(ada2:siisch7:0:0:0): removing device entry


Fatal trap 12: page fault while in kernel mode
cpuid = 2; apic id = 14
fault virtual address   = 0x48
fault code  = supervisor write data, page not present
instruction pointer = 0x20:0x8035f375
stack pointer   = 0x28:0xff86db60
frame pointer   = 0x28:0xff86db70
code segment= base 0x0, limit 0xf, type 0x1b
   = DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags= interrupt enabled, resume, IOPL = 0
current process = 2 (g_event)
[thread pid 2 tid 100014 ]
Stopped at  _mtx_lock_flags+0x15:   lock cmpxchgq   %rsi,0x18(%rdi)
db bt
Tracing pid 2 tid 100014 td 0xff00014d4ab0
_mtx_lock_flags() at _mtx_lock_flags+0x15
vdev_geom_release() at vdev_geom_release+0x33
vdev_geom_orphan() at vdev_geom_orphan+0x15c
g_run_events() at g_run_events+0x104
g_event_procbody() at g_event_procbody+0x55
fork_exit() at fork_exit+0x118
fork_trampoline() at fork_trampoline+0xe
--- trap 0, rip = 0, rsp = 0xff86dd30, rbp = 0 ---


I'm open to try patches and other suggestions.  Thanks.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Support for SAS/SATA non-RAID adapters

2009-11-18 Thread Steve Polyack

Freddie Cash wrote:

On Tue, Nov 17, 2009 at 11:43 PM, Rink Springer r...@freebsd.org wrote:
  

On Tue, Nov 17, 2009 at 08:38:21PM -0800, Freddie Cash wrote:


I've also found a couple of Areca cards (PCI-X, non-RAID/PCIe RAID),
and have heard good things about Areca support in FreeBSD.  Any
comments on their quality/performance/reliability?
  

I have got an Areca ARC-1110 4x SATA2 PCI-X card in my server, and I'm
quite impressed with the performance; these cards do very well in terms
I/O operations per second and the driver has been rock solid for me. The
only downside is that they are quite expensive (but well worth it, IMO)



Compared to a 3Ware 9550SXU controller, these are cheap.  The Areca is
only $500 (open-box) or $700 (new) on newegg.ca.  The 3Ware cards are
over $1000, with the PCIe versions being over $1200 (which is what
started me on this journey -- hardware budgets are getting smaller and
smaller each year).

  
We've also tried the Areca cards with FreeBSD - the Areca 
ARC-1680IX-12-2G PCIe x8 card to be precise.  It's a SAS/SATA RAID 
card.  The performance was very impressive.  MUCH better than the Dell 
PERC4/5/6s we were used to.  The drivers also seemed to be rock solid 
(FreeBSD is even listed as a supported OS on the company's website).  
The feature-set of the card itself is also very rich... The one we tried 
had its own OOB management via serial port or dedicated 100mbps ethernet 
jack.  It supports endless combinations of RAID arrays, volumes, and 
SMTP/SNMP alerts.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: FreeBSD and SATA Port Multipliers

2009-10-26 Thread Steve Polyack

Alexander Motin wrote:

Steve Polyack wrote:
  

Alexander Motin wrote:


You can try this patch against today's HEAD:
http://people.freebsd.org/~mav/cam-ata.20091022.patch
  
  
I tried the patch this morning against a fresh checkout of HEAD. 
Immediately after boot only one device per PM was detected (I have two

hooked up at the moment, 5 drives on one, 1 on the other).  However,
about two minutes later all of the drives showed up!

camcontrol also rescans the entire bus *very* quickly now, and discovers
all changes (new/missing disks and port multipliers).

Here's some verbose info from /var/log/messages immediately after boot:
Oct 22 10:52:03 lovepod kernel: siisch0: Timeout on slot 27
Oct 22 10:52:03 lovepod kernel: siisch0: siis_timeout is  ss
0800 rs 0800 es  sts 801b2000 serr 
Oct 22 10:52:03 lovepod kernel: siisch0: ready wait time=1ms
Oct 22 10:52:03 lovepod kernel: siisch0: ready wait time=1ms
Oct 22 10:52:03 lovepod kernel: siisch0: SIIS reset...
Oct 22 10:52:03 lovepod kernel: siisch0: ready wait time=1ms
Oct 22 10:52:03 lovepod kernel: siisch0: hardware reset ...
Oct 22 10:52:03 lovepod kernel: siisch0: SATA connect timeout
status=0001
Oct 22 10:52:03 lovepod kernel: siisch0: SIIS reset done: phy reset
found no device
Oct 22 10:52:03 lovepod kernel: (aprobe0:siisch0:0:1:0): Command timed out
Oct 22 10:52:03 lovepod kernel: (aprobe0:siisch0:0:1:0): error 5
Oct 22 10:52:03 lovepod kernel: (aprobe0:siisch0:0:1:0): Retries Exhausted
Oct 22 10:52:03 lovepod kernel: siisch0: DISCONNECT requested



What was before that? Can you send me full verbose boot messages?

  
Here's a full dmesg from today using the 10/22/2009 head and patch.  It 
actually failed to detect all but one of 6 drives this time.  I'm about 
to try today's head and patch.  I'll let you know how it goes.


Copyright (c) 1992-2009 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 9.0-CURRENT #2: Thu Oct 22 10:34:44 EDT 2009
Preloaded elf kernel /boot/kernel/kernel at 0x809e7000.
Preloaded /boot/zfs/zpool.cache /boot/zfs/zpool.cache at 0x809e7260.
Timecounter i8254 frequency 1193182 Hz quality 0
Calibrating TSC clock ... TSC clock: 2133421704 Hz
CPU: Intel(R) Xeon(R) CPU   E5506  @ 2.13GHz (2133.42-MHz K8-class CPU)
  Origin = GenuineIntel  Id = 0x106a5  Stepping = 5
  
Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  
Features2=0x9ce3bdSSE3,DTES64,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,DCA,SSE4.1,SSE4.2,POPCNT
  AMD Features=0x28100800SYSCALL,NX,RDTSCP,LM
  AMD Features2=0x1LAHF
  TSC: P-state invariant
real memory  = 4299161600 (4100 MB)
Physical memory chunk(s):
0x1000 - 0x00093fff, 602112 bytes (147 pages)
0x00a16000 - 0xb60f, 3043926016 bytes (743146 pages)
0xbf78e000 - 0xbf78, 8192 bytes (2 pages)
0x0001 - 0x00013ffe, 1073676288 bytes (262128 pages)
avail memory = 4095377408 (3905 MB)
ACPI APIC Table: 061909 APIC1420
INTR: Adding local APIC 18 as a target
INTR: Adding local APIC 20 as a target
INTR: Adding local APIC 22 as a target
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
FreeBSD/SMP: 1 package(s) x 4 core(s)
 cpu0 (BSP): APIC ID: 16
 cpu1 (AP): APIC ID: 18
 cpu2 (AP): APIC ID: 20
 cpu3 (AP): APIC ID: 22
APIC: CPU 0 has ACPI ID 1
APIC: CPU 1 has ACPI ID 2
APIC: CPU 2 has ACPI ID 3
APIC: CPU 3 has ACPI ID 4
ULE: setup cpu 0
ULE: setup cpu 1
ULE: setup cpu 2
ULE: setup cpu 3
ACPI: RSDP 0xfa710 00024 (v2 ACPIAM)
ACPI: XSDT 0xbf790100 0008C (v1 NEC 20090619 MSFT 0097)
ACPI: FACP 0xbf790290 000F4 (v4 061909 FACP1420 20090619 MSFT 0097)
ACPI: DSDT 0xbf790670 05B8F (v2  10006 10006000  INTL 20051117)
ACPI: FACS 0xbf79e000 00040
ACPI: APIC 0xbf790390 000D2 (v2 061909 APIC1420 20090619 MSFT 0097)
ACPI: MCFG 0xbf790470 0003C (v1 061909 OEMMCFG  20090619 MSFT 0097)
ACPI: SLIC 0xbf7904b0 00176 (v1 NEC 20090619 MSFT 0097)
ACPI: ERST 0xbf790630 00040 (v1 061909 WHEA1420 20090619 MSFT 0097)
ACPI: OEMB 0xbf79e040 0007B (v1 061909 OEMB1420 20090619 MSFT 0097)
ACPI: HPET 0xbf79a670 00038 (v1 061909 OEMHPET  20090619 MSFT 0097)
ACPI: DMAR 0xbf79e0c0 00120 (v1AMI  OEMDMAR 0001 MSFT 0097)
ACPI: SSDT 0xbf79efb0 00363 (v1 DpgPmmCpuPm 0012 INTL 20051117)
ACPI: EINJ 0xbf79a6b0 00130 (v1  AMIER AMI_EINJ 20090619 MSFT 0097)
ACPI: BERT 0xbf79a840 00030 (v1  AMIER AMI_BERT 20090619 MSFT 0097)
ACPI: ERST 0xbf79a870 001B0 (v1  AMIER AMI_ERST 20090619 MSFT 0097)
ACPI: HEST 0xbf79aa20 000A8 (v1  AMIER AMI_HEST 20090619 MSFT 0097)
MADT: Found IO APIC ID 1, Interrupt 0 at 0xfec0
ioapic0: Changing APIC ID to 1
ioapic0

Re: FreeBSD and SATA Port Multipliers

2009-10-26 Thread Steve Polyack

Alexander Motin wrote:

Steve Polyack wrote:
  

I'm about to try today's head and patch.  I'll let you know how it goes.



Ok. Just to be sure it is not cabling issue (I have seen such), try also
limit port speed to 1.5Gbps by adding to loader.conf:
hint.siisch.0.sata_rev=1
hint.siisch.1.sata_rev=1
hint.siisch.2.sata_rev=1
hint.siisch.3.sata_rev=1

  
Here's a dmesg from todays head and your patch from this morning.  It's 
still only sensing one disk over the two PMs.  Forcing SATA1/1.5Gbps 
operation didn't change anything.




Copyright (c) 1992-2009 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 9.0-CURRENT #4: Mon Oct 26 10:01:55 EDT 2009
Preloaded elf kernel /boot/kernel/kernel at 0x809e7000.
Preloaded /boot/zfs/zpool.cache /boot/zfs/zpool.cache at 0x809e7260.
Timecounter i8254 frequency 1193182 Hz quality 0
Calibrating TSC clock ... TSC clock: 2133419836 Hz
CPU: Intel(R) Xeon(R) CPU   E5506  @ 2.13GHz (2133.42-MHz K8-class CPU)
  Origin = GenuineIntel  Id = 0x106a5  Stepping = 5
  
Features=0xbfebfbffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  
Features2=0x9ce3bdSSE3,DTES64,MON,DS_CPL,VMX,EST,TM2,SSSE3,CX16,xTPR,PDCM,DCA,SSE4.1,SSE4.2,POPCNT
  AMD Features=0x28100800SYSCALL,NX,RDTSCP,LM
  AMD Features2=0x1LAHF
  TSC: P-state invariant
real memory  = 4299161600 (4100 MB)
Physical memory chunk(s):
0x1000 - 0x00093fff, 602112 bytes (147 pages)
0x00a16000 - 0xb60f, 3043926016 bytes (743146 pages)
0xbf78e000 - 0xbf78, 8192 bytes (2 pages)
0x0001 - 0x00013ffe, 1073676288 bytes (262128 pages)
avail memory = 4095377408 (3905 MB)
ACPI APIC Table: 061909 APIC1420
INTR: Adding local APIC 18 as a target
INTR: Adding local APIC 20 as a target
INTR: Adding local APIC 22 as a target
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
FreeBSD/SMP: 1 package(s) x 4 core(s)
 cpu0 (BSP): APIC ID: 16
 cpu1 (AP): APIC ID: 18
 cpu2 (AP): APIC ID: 20
 cpu3 (AP): APIC ID: 22
APIC: CPU 0 has ACPI ID 1
APIC: CPU 1 has ACPI ID 2
APIC: CPU 2 has ACPI ID 3
APIC: CPU 3 has ACPI ID 4
ULE: setup cpu 0
ULE: setup cpu 1
ULE: setup cpu 2
ULE: setup cpu 3
ACPI: RSDP 0xfa710 00024 (v2 ACPIAM)
ACPI: XSDT 0xbf790100 0008C (v1 NEC 20090619 MSFT 0097)
ACPI: FACP 0xbf790290 000F4 (v4 061909 FACP1420 20090619 MSFT 0097)
ACPI: DSDT 0xbf790670 05B8F (v2  10006 10006000  INTL 20051117)
ACPI: FACS 0xbf79e000 00040
ACPI: APIC 0xbf790390 000D2 (v2 061909 APIC1420 20090619 MSFT 0097)
ACPI: MCFG 0xbf790470 0003C (v1 061909 OEMMCFG  20090619 MSFT 0097)
ACPI: SLIC 0xbf7904b0 00176 (v1 NEC 20090619 MSFT 0097)
ACPI: ERST 0xbf790630 00040 (v1 061909 WHEA1420 20090619 MSFT 0097)
ACPI: OEMB 0xbf79e040 0007B (v1 061909 OEMB1420 20090619 MSFT 0097)
ACPI: HPET 0xbf79a670 00038 (v1 061909 OEMHPET  20090619 MSFT 0097)
ACPI: DMAR 0xbf79e0c0 00120 (v1AMI  OEMDMAR 0001 MSFT 0097)
ACPI: SSDT 0xbf79efb0 00363 (v1 DpgPmmCpuPm 0012 INTL 20051117)
ACPI: EINJ 0xbf79a6b0 00130 (v1  AMIER AMI_EINJ 20090619 MSFT 0097)
ACPI: BERT 0xbf79a840 00030 (v1  AMIER AMI_BERT 20090619 MSFT 0097)
ACPI: ERST 0xbf79a870 001B0 (v1  AMIER AMI_ERST 20090619 MSFT 0097)
ACPI: HEST 0xbf79aa20 000A8 (v1  AMIER AMI_HEST 20090619 MSFT 0097)
MADT: Found IO APIC ID 1, Interrupt 0 at 0xfec0
ioapic0: Changing APIC ID to 1
ioapic0: Routing external 8259A's - intpin 0
MADT: Interrupt override: source 0, irq 2
ioapic0: Routing IRQ 0 - intpin 2
MADT: Interrupt override: source 9, irq 9
ioapic0: intpin 9 trigger: level
lapic: Routing NMI - LINT1
lapic: LINT1 trigger: edge
lapic: LINT1 polarity: high
ioapic0 Version 2.0 irqs 0-23 on motherboard
cpu0 BSP:
 ID: 0x1000   VER: 0x00060015 LDR: 0x DFR: 0x
  lint0: 0x00010700 lint1: 0x0400 TPR: 0x SVR: 0x01ff
  timer: 0x000100ef therm: 0x0001 err: 0x0001000f pcm: 0x00010400
nfslock: pseudo-device
kbd: new array size 4
kbd1 at kbdmux0
mem: memory
null: null device, zero device
io: I/O
random: entropy source, Software, Yarrow
acpi0: NEC  on motherboard
PCIe: Memory Mapped configuration base @ 0xe000
ioapic0: routing intpin 9 (ISA IRQ 9) to lapic 16 vector 48
acpi0: [MPSAFE]
acpi0: [ITHREAD]
ACPI: Executed 1 blocks of module-level executable AML code
acpi0: Power Button (fixed)
AcpiOsDerivePciId: \\_SB_.PCI0.SBRG.IELK.RXA0 - bus 0 dev 0 func 0
AcpiOsDerivePciId: \\_SB_.PCI0.SBRG.PIX0 - bus 0 dev 31 func 0
ACPI timer: 1/1 1/0 1/0 1/2 0/459 1/2 1/0 1/0 1/0 1/0 - 9
Timecounter ACPI-safe frequency 3579545 Hz quality 850
acpi_timer0: 24-bit timer at 3.579545MHz port 0x808-0x80b on acpi0
pci_link0:Index  IRQ  Rtd  Ref  IRQs
  Initial Probe

Re: FreeBSD and SATA Port Multipliers

2009-10-26 Thread Steve Polyack

Steve Polyack wrote:

Alexander Motin wrote:

Steve Polyack wrote:
 
I'm about to try today's head and patch.  I'll let you know how it 
goes.



Ok. Just to be sure it is not cabling issue (I have seen such), try also
limit port speed to 1.5Gbps by adding to loader.conf:
hint.siisch.0.sata_rev=1
hint.siisch.1.sata_rev=1
hint.siisch.2.sata_rev=1
hint.siisch.3.sata_rev=1

  
Here's a dmesg from todays head and your patch from this morning.  
It's still only sensing one disk over the two PMs.  Forcing 
SATA1/1.5Gbps operation didn't change anything.


I should also note that if I do a 'camcontrol rescan', the first PM 
(with 5 physical drives attached, none detected) will vanish.  
Physically unplugging and reconnecting the device does the same.


COMMAND=/sbin/camcontrol rescan all
siisch0: Timeout on slot 29
siisch0: siis_timeout is  ss 2000 rs 2000 es  
sts 801f2000 serr 

siisch0: ready wait time=1ms
siisch0: ready wait time=1ms
siisch0: SIIS reset...
siisch0: device reset time=0ms
siisch0: SATA connect time=0ms status=0123
siisch0: ready wait time=0ms
siisch0: SIIS reset done: devices=0001
(aprobe0:siisch0:0:15:0): Command timed out
(aprobe0:siisch0:0:15:0): error 5
(aprobe0:siisch0:0:15:0): Retries Exhausted
(pass0:siisch0:0:15:0): lost device
(pass0:siisch0:0:15:0): removing device entry
(pmp0:siisch0:0:15:0): lost device
siisch0: Timeout on slot 30
siisch0: siis_timeout is  ss 4000 rs 4000 es  
sts 801f serr 

siisch0: ready wait time=1ms
siisch0: ready wait time=1ms
(pmp0:siisch0:0:15:0): Request Requeued
(pmp0:siisch0:0:15:0): Retrying Command
siisch0: SIIS reset...
siisch0: ready wait time=1ms
siisch0: device reset time=0ms
siisch0: SATA connect time=0ms status=0123
siisch0: ready wait time=0ms
siisch0: SIIS reset done: devices=0001
(aprobe0:siisch0:0:0:0): Command timed out
(aprobe0:siisch0:0:0:0): error 5
(aprobe0:siisch0:0:0:0): Retries Exhausted
(pmp0:siisch0:0:15:0): Request Requeued
(pmp0:siisch0:0:15:0): Retrying Command
(aprobe0:siisch2:0:15:0): SIGNATURE: 9669
PM Product ID: 37261095
PM Revision: 1706
PMP freeze: 0

PM ports: 5
PM RESET 0 skipping

PM RESET 1
PM RESET 2

PM RESET 3
PM RESET 4

PM reset done
PM connect done
PM status: 0 - 0123
PM status: 1 - 
PM status: 2 - 
PM status: 3 - 
PM status: 4 - 
PMP release: 0



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: FreeBSD and SATA Port Multipliers

2009-10-22 Thread Steve Polyack

Alexander Motin wrote:

If your conditions permit, you can try to upgrade to recent HEAD to look
how will it go with newer code. I am periodically merging my work there
from Perforce. Also I am going to generate new test patch for HEAD,
which includes reworked PMP support, tomorrow.



You can try this patch against today's HEAD:
http://people.freebsd.org/~mav/cam-ata.20091022.patch

  
I tried the patch this morning against a fresh checkout of HEAD.  
Immediately after boot only one device per PM was detected (I have two 
hooked up at the moment, 5 drives on one, 1 on the other).  However, 
about two minutes later all of the drives showed up!


camcontrol also rescans the entire bus *very* quickly now, and discovers 
all changes (new/missing disks and port multipliers).


Here's some verbose info from /var/log/messages immediately after boot:
Oct 22 10:52:03 lovepod kernel: siisch0: Timeout on slot 27
Oct 22 10:52:03 lovepod kernel: siisch0: siis_timeout is  ss 
0800 rs 0800 es  sts 801b2000 serr 

Oct 22 10:52:03 lovepod kernel: siisch0: ready wait time=1ms
Oct 22 10:52:03 lovepod kernel: siisch0: ready wait time=1ms
Oct 22 10:52:03 lovepod kernel: siisch0: SIIS reset...
Oct 22 10:52:03 lovepod kernel: siisch0: ready wait time=1ms
Oct 22 10:52:03 lovepod kernel: siisch0: hardware reset ...
Oct 22 10:52:03 lovepod kernel: siisch0: SATA connect timeout 
status=0001
Oct 22 10:52:03 lovepod kernel: siisch0: SIIS reset done: phy reset 
found no device

Oct 22 10:52:03 lovepod kernel: (aprobe0:siisch0:0:1:0): Command timed out
Oct 22 10:52:03 lovepod kernel: (aprobe0:siisch0:0:1:0): error 5
Oct 22 10:52:03 lovepod kernel: (aprobe0:siisch0:0:1:0): Retries Exhausted
Oct 22 10:52:03 lovepod kernel: siisch0: DISCONNECT requested
Oct 22 10:52:05 lovepod kernel: siisch0: CONNECT requested
Oct 22 10:52:05 lovepod kernel: siisch0: SIIS reset...
Oct 22 10:52:05 lovepod kernel: siisch0: ready wait time=1ms
Oct 22 10:52:05 lovepod kernel: siisch0: hardware reset ...
Oct 22 10:52:05 lovepod kernel: siisch0: SATA connect time=0ms 
status=0123

Oct 22 10:52:05 lovepod kernel: siisch0: ready wait time=0ms
Oct 22 10:52:05 lovepod kernel: siisch0: SIIS reset done: devices=0001
Oct 22 10:52:33 lovepod kernel: siisch0: Timeout on slot 28
Oct 22 10:52:33 lovepod kernel: siisch0: siis_timeout is  ss 
1000 rs 1000 es  sts 801c2000 serr 

Oct 22 10:52:33 lovepod kernel: siisch0: ready wait time=1ms
Oct 22 10:52:33 lovepod kernel: siisch0: ready wait time=1ms
Oct 22 10:52:33 lovepod kernel: siisch0: SIIS reset...
Oct 22 10:52:33 lovepod kernel: siisch0: ready wait time=1ms
Oct 22 10:52:33 lovepod kernel: siisch0: hardware reset ...
Oct 22 10:52:33 lovepod kernel: siisch0: SATA connect timeout 
status=0001
Oct 22 10:52:33 lovepod kernel: siisch0: SIIS reset done: phy reset 
found no device

Oct 22 10:52:33 lovepod kernel: (aprobe0:siisch0:0:2:0): Command timed out
Oct 22 10:52:33 lovepod kernel: (aprobe0:siisch0:0:2:0): error 5
Oct 22 10:52:33 lovepod kernel: (aprobe0:siisch0:0:2:0): Retries Exhausted
Oct 22 10:52:33 lovepod kernel: siisch0: DISCONNECT requested
Oct 22 10:52:35 lovepod kernel: siisch0: CONNECT requested
Oct 22 10:52:35 lovepod kernel: siisch0: SIIS reset...
Oct 22 10:52:35 lovepod kernel: siisch0: ready wait time=1ms
Oct 22 10:52:35 lovepod kernel: siisch0: hardware reset ...
Oct 22 10:52:35 lovepod kernel: siisch0: SATA connect time=0ms 
status=0123

Oct 22 10:52:35 lovepod kernel: siisch0: ready wait time=0ms
Oct 22 10:52:35 lovepod kernel: siisch0: SIIS reset done: devices=0001
Oct 22 10:53:03 lovepod kernel: siisch0: Timeout on slot 29
Oct 22 10:53:03 lovepod kernel: siisch0: siis_timeout is  ss 
2000 rs 2000 es  sts 801d2000 serr 

Oct 22 10:53:03 lovepod kernel: siisch0: ready wait time=1ms
Oct 22 10:53:03 lovepod kernel: siisch0: ready wait time=1ms
Oct 22 10:53:03 lovepod kernel: siisch0: SIIS reset...
Oct 22 10:53:03 lovepod kernel: siisch0: ready wait time=1ms
Oct 22 10:53:03 lovepod kernel: siisch0: hardware reset ...
Oct 22 10:53:03 lovepod kernel: siisch0: SATA connect timeout 
status=0001
Oct 22 10:53:03 lovepod kernel: siisch0: SIIS reset done: phy reset 
found no device

Oct 22 10:53:03 lovepod kernel: (aprobe0:siisch0:0:3:0): Command timed out
Oct 22 10:53:03 lovepod kernel: (aprobe0:siisch0:0:3:0): error 5
Oct 22 10:53:03 lovepod kernel: (aprobe0:siisch0:0:3:0): Retries Exhausted
Oct 22 10:53:03 lovepod kernel: siisch0: DISCONNECT requested
Oct 22 10:53:05 lovepod kernel: siisch0: CONNECT requested
Oct 22 10:53:05 lovepod kernel: siisch0: SIIS reset...
Oct 22 10:53:05 lovepod kernel: siisch0: ready wait time=1ms
Oct 22 10:53:05 lovepod kernel: siisch0: hardware reset ...
Oct 22 10:53:05 lovepod kernel: siisch0: SATA connect time=0ms 
status=0123

Oct 22 10:53:05 lovepod kernel: siisch0: ready wait time=0ms
Oct 22 10:53:05 lovepod kernel: siisch0: SIIS reset done: 

Re: FreeBSD and SATA Port Multipliers

2009-10-21 Thread Steve Polyack

Alexander Motin wrote:

 Have you tried new CAM-based ATA subsystem present in 8.x by siis(4)
 driver? Work is still in progress there, but it should work with port
 multipliers much better then previous implementation.


Indeed, the siis(4) driver works MUCH better.  However, it's still not 
always detecting every drive if I fill a port multiplier with 5/5 
drives.  Sometimes it boots up with 4, sometimes with 5.


Snippet from the dmesg in my other reply:
siisch4: Timeout on slot 25
siisch4: siis_timeout is  ss 0200 rs 0200 es  
sts 80192000 serr 

(aprobe0:siisch4:0:1:0): SIGNATURE: 
(aprobe0:siisch4:0:2:0): SIGNATURE: 
(aprobe0:siisch4:0:3:0): SIGNATURE: 
(aprobe0:siisch4:0:4:0): SIGNATURE: 
ada0 at siisch4 bus 0 target 1 lun 0
ada0: ST31500341AS CC1H ATA/ATAPI-8 SATA 2.x device
ada0: 300.000MB/s transfers
ada0: 1430799MB (2930277168 512 byte sectors: 16H 63S/T 16383C)
ada0: Native Command Queueing enabled

siisch4 bus 0 target 0 lun 0 is missing here, due to some kind of 
timeout.  The drives are fine and the missing device does not follow a 
specific physical drive.  Lastly, using camcontrol to rescan port 
multipliers and attached drives only leads to failure.  Rescanning 
everythign simply loses all but one drive attached to each port multiplier.



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


FreeBSD and SATA Port Multipliers

2009-10-20 Thread Steve Polyack
I have a system running FreeBSD 8.0-RC1 that contains 3x Sil3124 PCI-E 
SATA controller cards.  Each card has 3x Sil3726 Port Multipliers 
attached (5 slots per SATA PM).  The problem is that the disks are often 
not all detected by FreeBSD, even though the controller Option ROMs list 
all of the installed physical disks.  Which drive(s) are not detected 
seems to vary with each boot.  All of the port multipliers are detected 
every boot, it's just the drives which aren't getting probed.


I've tried the following patch, 
http://www.freebsd.org/cgi/query-pr.cgi?pr=kern/129784 , which may have 
helped - it picks up 6-8 out of 9 disks, but obviously that's not 
ideal.  The motherboard BIOS version is current, as is the firmware for 
the SATA controllers.


Does anyone have any ideas or other patches that I may try?


Thanks,
Steve Polyack

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: FreeBSD and SATA Port Multipliers

2009-10-20 Thread Steve Polyack

Steve Polyack wrote:
I have a system running FreeBSD 8.0-RC1 that contains 3x Sil3124 PCI-E 
SATA controller cards.  Each card has 3x Sil3726 Port Multipliers 
attached (5 slots per SATA PM).  The problem is that the disks are 
often not all detected by FreeBSD, even though the controller Option 
ROMs list all of the installed physical disks.  Which drive(s) are not 
detected seems to vary with each boot.  All of the port multipliers 
are detected every boot, it's just the drives which aren't getting 
probed.


This was mostly fixed by loading the siis(4) module at boot.  However, 
if I attach 5 drives to a port multiplier board and reboot, only 4 of 
the drives attached to that port multiplier are detected.


From dmesg:
siisch4: Timeout on slot 25
siisch4: siis_timeout is  ss 0200 rs 0200 es  
sts 80192000 serr 

(aprobe0:siisch4:0:1:0): SIGNATURE: 
(aprobe0:siisch4:0:2:0): SIGNATURE: 
(aprobe0:siisch4:0:3:0): SIGNATURE: 
(aprobe0:siisch4:0:4:0): SIGNATURE: 
ada0 at siisch4 bus 0 target 1 lun 0
ada0: ST31500341AS CC1H ATA/ATAPI-8 SATA 2.x device
ada0: 300.000MB/s transfers
ada0: 1430799MB (2930277168 512 byte sectors: 16H 63S/T 16383C)
ada0: Native Command Queueing enabled
ada1 at siisch4 bus 0 target 2 lun 0
ada1: ST31500341AS CC1H ATA/ATAPI-8 SATA 2.x device
ada1: 300.000MB/s transfers
ada1: 1430799MB (2930277168 512 byte sectors: 16H 63S/T 16383C)
ada1: Native Command Queueing enabled
ada2 at siisch4 bus 0 target 3 lun 0
ada2: ST31500341AS CC1H ATA/ATAPI-8 SATA 2.x device
ada2: 300.000MB/s transfers
ada2: 1430799MB (2930277168 512 byte sectors: 16H 63S/T 16383C)
ada2: Native Command Queueing enabled
ada3 at siisch4 bus 0 target 4 lun 0
ada3: ST31500341AS CC1H ATA/ATAPI-8 SATA 2.x device
ada3: 300.000MB/s transfers
ada3: 1430799MB (2930277168 512 byte sectors: 16H 63S/T 16383C)
ada3: Native Command Queueing enabled


The disk at siisch4 bus 0 target 0 lun 0 is what's missing - it looks 
like there's some kind of timeout during the detection. A 'camcontrol 
rescan all' does not help.  In fact, the camcontrol rescan appears to 
lose all of the SATA drives attached to port-multipliers with a similar 
timeout message:


siisch4: Timeout on slot 17
siisch4: siis_timeout is 0004 ss 0002 rs 0002 es  
sts 80112000 serr 

(ada0:siisch4:0:1:0): lost device
(ada0:siisch4:0:1:0): removing device entry
siisch4: Timeout on slot 19
siisch4: siis_timeout is 0004 ss 0008 rs 0008 es  
sts 80132000 serr 

siisch4: Timeout on slot 20
siisch4: siis_timeout is 0004 ss 0010 rs 0010 es  
sts 80142000 serr 

(ada1:siisch4:0:2:0): lost device
(ada1:siisch4:0:2:0): removing device entry
siisch4: Timeout on slot 21
siisch4: siis_timeout is 0004 ss 0020 rs 0020 es  
sts 80152000 serr 

siisch4: Timeout on slot 22
siisch4: siis_timeout is 0004 ss 0040 rs 0040 es  
sts 80162000 serr 

(ada2:siisch4:0:3:0): lost device
(ada2:siisch4:0:3:0): removing device entry

I can provide more info on request.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: What does mfi0: Copy out failed mean?

2009-08-13 Thread Steve Polyack

Václav Haisman wrote:

I am getting mfi0: Copy out failed message in logs, usually several times a
day. What does it mean? This is FreeBSD 7.2.

--
VH

  
I can't tell you exactly what the error message means, but out of 
curiosity what kind of hardware are you running? A PERC6 controller perhaps?


Steve Polyack

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: What does mfi0: Copy out failed mean?

2009-08-13 Thread Steve Polyack

Václav Haisman wrote:

Steve Polyack wrote:
  

Václav Haisman wrote:


I am getting mfi0: Copy out failed message in logs, usually several
times a
day. What does it mean? This is FreeBSD 7.2.
  

  I can't tell you exactly what the error message means, but out of
  

curiosity what kind of hardware are you running? A PERC6 controller
perhaps?


Yes, mfi0: Dell PERC 6.

  
You may want to contact Scott Long, sco...@freebsd.org, to see if he can 
help.  I believe he's the author/maintainer of the mfi(4) driver.  He 
may be interested in any problems you've seen.  I've heard of some other 
people having issues with PERC6 cards and FreeBSD as well.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Radeon 9250 DRM in 7.2-RELEASE

2009-05-04 Thread Steve Polyack
I upgraded my workstation (via source) to 7.2-RELEASE from 7.1-RELEASE 
about a day ago.  After the upgrade procedure, I'm having no luck with 
using DRM in X11.  It worked fine before and gave reasonable 
performance.  Now when I start up X with DRI enabled, both of my screens 
display garbage.  The mouse pointer is also not visible.


I'm using a PCI Radeon 9250.  I can get more details if necessary.  I 
can also try to revert the kernel DRM module code to 7.1.


From dmesg:
drm2: ATI Radeon RV280 9250 on vgapci2
vgapci2: child drm2 requested pci_enable_busmaster
info: [drm] Initialized radeon 1.29.0 20080528
info: [drm] Setting GART location based on new memory map
info: [drm] Loading R200 Microcode
info: [drm] writeback test succeeded in 2 usecs
drm2: [ITHREAD]

Any ideas or suggestions would be appreciated.  Thanks.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Radeon 9250 DRM in 7.2-RELEASE

2009-05-04 Thread Steve Polyack

Paul Schmehl wrote:


Read /usr/portsUPDATING wrt xorg.  There were major changes in the 
config file and peripheral detection between 7.1 and 7.2


You need to make sure that both dbus and hald are running.  Remove the 
mouse and keyboard sections from your xorg.conf file.  They are no 
longer needed. Make sure to remove any line that references RgbPath.


This may help to get the mouse working (ignore the DontZap line):

Section ServerFlags
   Option DontZap No
   Option AllowEmptyInput No
EndSection

This should get DRI working:

Section DRI
   Group 0
   Mode  0660
EndSection

I tackled the above when I upgraded Xorg to 7.4 months ago, 
independently of any FreeBSD release updates.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Radeon 9250 DRM in 7.2-RELEASE

2009-05-04 Thread Steve Polyack

Robert Noland wrote:


Why is this drm2?  Is this a multicard setup?  Multicard doesn't work
right now.  If you aren't using more than one card, can you disable
whatever else drm is attaching to?

robert.

  


This is something funny I had to deal with initially.  There is *no* 
drm0 or drm1 being probed.  Consequently, there is only one entry in 
/dev/dri/, card2.  As a result, Xorg doesn't even recognize card2 as a 
source for DRM (probably due to what you said).  In the past, I have 
symlinked /dev/dri/card2 to /dev/dri/card0 and Xorg would work 
correctly, utilizing DRM.


I have an onboard Intel VGA device, but as far as I know it is 
disabled.  I can double check, but as I've noted things have worked fine 
with the above tweak until upgrading to 7.2-RELEASE.


Thanks,

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Radeon 9250 DRM in 7.2-RELEASE

2009-05-04 Thread Steve Polyack

Robert Noland wrote:

On Mon, 2009-05-04 at 14:48 -0400, Steve Polyack wrote:
  
This is something funny I had to deal with initially.  There is *no* 
drm0 or drm1 being probed.  Consequently, there is only one entry in 
/dev/dri/, card2.  As a result, Xorg doesn't even recognize card2 as a 
source for DRM (probably due to what you said).  In the past, I have 
symlinked /dev/dri/card2 to /dev/dri/card0 and Xorg would work 
correctly, utilizing DRM.


I have an onboard Intel VGA device, but as far as I know it is 
disabled.  I can double check, but as I've noted things have worked fine 
with the above tweak until upgrading to 7.2-RELEASE.



Can you send me a pciconf -lvb

robert.

  


$ pciconf -lvb
hos...@pci0:0:0:0:class=0x06 card=0x01ad1028 chip=0x27708086 
rev=0x02 hdr=0x00

   vendor = 'Intel Corporation'
   device = '82945G/GZ/P/PL Host Bridge/DRAM Controller'
   class  = bridge
   subclass   = HOST-PCI
pc...@pci0:0:1:0:class=0x060400 card=0x8086 chip=0x27718086 
rev=0x02 hdr=0x01

   vendor = 'Intel Corporation'
   device = '82945G/GZ/P/PL Host to PCI Express Bridge'
   class  = bridge
   subclass   = PCI-PCI
vgap...@pci0:0:2:0:class=0x038000 card=0x01ad1028 chip=0x27728086 
rev=0x02 hdr=0x00

   vendor = 'Intel Corporation'
   device = '82945G Integrated Graphics Controller'
   class  = display
   bar   [10] = type Memory, range 32, base 0xfeb0, size 524288, 
enabled

   bar   [14] = type I/O Port, range 32, base 0xe898, size  8, enabled
   bar   [18] = type Prefetchable Memory, range 32, base 0xd000, 
size 268435456, enabled
   bar   [1c] = type Memory, range 32, base 0xfeac, size 262144, 
enabled
vgap...@pci0:0:2:1:class=0x038000 card=0x01ad1028 chip=0x27768086 
rev=0x02 hdr=0x00

   vendor = 'Intel Corporation'
   device = '82945G Integrated Graphics Controller'
   class  = display
   bar   [10] = type Memory, range 32, base 0xfeb8, size 524288, 
enabled
pc...@pci0:0:28:0:class=0x060400 card=0x chip=0x27d08086 
rev=0x01 hdr=0x01

   vendor = 'Intel Corporation'
   device = '82801G (ICH7 Family) PCIe Root Port'
   class  = bridge
   subclass   = PCI-PCI
pc...@pci0:0:28:1:class=0x060400 card=0x chip=0x27d28086 
rev=0x01 hdr=0x01

   vendor = 'Intel Corporation'
   device = '82801G (ICH7 Family) PCIe Root Port'
   class  = bridge
   subclass   = PCI-PCI
uh...@pci0:0:29:0:class=0x0c0300 card=0x01ad1028 chip=0x27c88086 
rev=0x01 hdr=0x00

   vendor = 'Intel Corporation'
   device = '82801G (ICH7 Family) USB Universal Host Controller'
   class  = serial bus
   subclass   = USB
   bar   [20] = type I/O Port, range 32, base 0xff80, size 32, enabled
uh...@pci0:0:29:1:class=0x0c0300 card=0x01ad1028 chip=0x27c98086 
rev=0x01 hdr=0x00

   vendor = 'Intel Corporation'
   device = '82801G (ICH7 Family) USB Universal Host Controller'
   class  = serial bus
   subclass   = USB
   bar   [20] = type I/O Port, range 32, base 0xff60, size 32, enabled
uh...@pci0:0:29:2:class=0x0c0300 card=0x01ad1028 chip=0x27ca8086 
rev=0x01 hdr=0x00

   vendor = 'Intel Corporation'
   device = '82801G (ICH7 Family) USB Universal Host Controller'
   class  = serial bus
   subclass   = USB
   bar   [20] = type I/O Port, range 32, base 0xff40, size 32, enabled
uh...@pci0:0:29:3:class=0x0c0300 card=0x01ad1028 chip=0x27cb8086 
rev=0x01 hdr=0x00

   vendor = 'Intel Corporation'
   device = '82801G (ICH7 Family) USB Universal Host Controller'
   class  = serial bus
   subclass   = USB
   bar   [20] = type I/O Port, range 32, base 0xff20, size 32, enabled
eh...@pci0:0:29:7:class=0x0c0320 card=0x01ad1028 chip=0x27cc8086 
rev=0x01 hdr=0x00

   vendor = 'Intel Corporation'
   device = '82801G (ICH7 Family) USB 2.0 Enhanced Host Controller'
   class  = serial bus
   subclass   = USB
   bar   [10] = type Memory, range 32, base 0xffa80800, size 1024, enabled
pc...@pci0:0:30:0:class=0x060401 card=0x chip=0x244e8086 
rev=0xe1 hdr=0x01

   vendor = 'Intel Corporation'
   device = '82801 Family (ICH2/3/4/4/5/5/6/7/8/9,63xxESB) Hub 
Interface to PCI Bridge'

   class  = bridge
   subclass   = PCI-PCI
p...@pci0:0:30:2:class=0x040100 card=0x01ad1028 chip=0x27de8086 
rev=0x01 hdr=0x00

   vendor = 'Intel Corporation'
   device = '82801GB I/O Controller Hub AC'97 Audio'
   class  = multimedia
   subclass   = audio
   bar   [10] = type I/O Port, range 32, base 0xec00, size 256, enabled
   bar   [14] = type I/O Port, range 32, base 0xe8c0, size 64, enabled
   bar   [18] = type Memory, range 32, base 0xfeabfa00, size 512, enabled
   bar   [1c] = type Memory, range 32, base 0xfeabf900, size 256, enabled
is...@pci0:0:31:0:class=0x060100 card=0x chip=0x27b88086 
rev=0x01 hdr=0x00

   vendor = 'Intel Corporation'
   device = '945GL Intel 82801GB/GR (ICH7 Family) LPC Interface

Re: Radeon 9250 DRM in 7.2-RELEASE

2009-05-04 Thread Steve Polyack

Robert Noland wrote:

Ok, they are at least partially disabled...  I wonder if a BIOS update
would help.

Anyway, the garbled screen issue is usually associated with the caching
method used on the PCI GART.  On IGP chips we force the GART to be
uncacheable.  On PCI chips they are supposed to be able to snoop the bus
and DTRT.  All of the fixes for memory caching should be in 7.

Please try the attached patch and see if that makes a difference.

robert.

  


Unfortunately, this didn't help.  I should also clarify what I'm seeing, 
as it's not complete garbage as I thought.  I have a dual monitor setup, 
side-by-side using xrandr.  When I start X (xfce4) with drm available, 
the left monitor (primary) is a sold light shade of blue, except for a 
black corner which is maybe 10pix square.  The right monitor displays my 
typical background image with another black box over part of it.  If it 
shift back to the console and then back into X, I can see the outlines 
of my startup windows, but they contain garbage and the outlines are 
fuzzed with green and magenta garbage.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Radeon 9250 DRM in 7.2-RELEASE

2009-05-04 Thread Steve Polyack

Robert Noland wrote:

ok, how about this one... If this doesn't do it, then we need to start
digging...

robert.

  


No change when using this patch either.  I've also updated to the latest 
BIOS and have ensured that the onboard Intel adapter is as disabled as 
it can get.  I can provide you some photos of the screen behavior if you 
think it may help.  Thanks again so far!


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: poweredge 1850 won't boot 7.1? maybe LSI-related : amr0: adapter is busy

2009-02-16 Thread Steve Polyack

Brian A. Seklecki wrote:

NOTE: You're using the 4e/Si, which we have as well.  We're experiencing
random crashes on the 1850/8th gen, as a result of a (believed) DMA bug
introduced into 7.x

  
Just to note, we are only seeing these issues in combination with megarc 
(/usr/ports/sysutils/megarc) monitoring.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: amr driver issues in 7.1-RELEASE

2009-01-26 Thread Steve Polyack

Steve Polyack wrote:

Scott Long wrote:
The fix for this that I was thinking of is already in 7.1.  There 
might still be a driver bug, but I'm leaning more towards the 
controller simply being busy.  Do you have a reproducible test case 
that I could

try?

Scott

So far, I have not been able to reliably reproduce this.  It pops up 
every now and then during our backups, which at the moment aren't that 
disk intensive.  I'll let you know if I come across anything else.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Sysinstall partition oddities (6.3/i386 - 7.x/amd64)

2009-01-22 Thread Steve Polyack

Steve Polyack wrote:
I've seen some oddities with the partition and bsdlabel editors in the 
sysinstall program on the 7.0 and 7.1 releases.  The partition editor 
seems to be reading or parsing the partition table incorrectly.  I had 
a 6.3-RELEASE system with the following layout:

/dev/amrd3s1a on / (ufs, local)
/dev/amrd3s1g on /opt (ufs, local, soft-updates)
/dev/amrd3s1f on /usr (ufs, local, soft-updates)
/dev/amrd3s1d on /var (ufs, local, soft-updates)
/dev/amrd3s1e on /var/log (ufs, local, soft-updates)

Upon booting into the 7.x install media and encountering the FDISK 
Partition Editor, the partition it's seeing is amrd3*a*s1, as opposed 
to amrd3s1.  Trying to continue with the partition table and bsd 
labels as is only led to the installer bailing out.  As soon as it 
would attempt to newfs the disk partitions, the installer would error 
and report that it can't find a device entry in /dev for amrd3*a*s1a.  
Since preserving the data on the disk was not critical, I was able to 
continue by deleting the original partition/slice and recreating 
them.  This worked fine.


However, I'm still curious as to what the cause of this is.  I have 
seen this before on two other systems while installing 7.x, quite 
possibly while upgrading from 6.3.  When this occurred, I was also 
moving from i386 to amd64;  Is there some kind of offset for partition 
tables which may change based on architecture?


Lastly, here's a screenshot of the partition editor: 
http://people.collaborativefusion.com/~spolyack/fbsd-install.jpg


Unfortunately, I do not have any screenshots of the errors during the 
newfs step.  If this comes up again, I'll be sure to take some.  Thanks.


This also occurs in VMWare.  I'm able to get the exact same behavior by 
installing a 7.1-RELEASE system (single slice, da0s1), then booting off 
the install media and start a new installation.  It picks up the 
partition as da0as1 instead of da0s1, making it impossible to use 
sysinstall while preserving existing partitions.


-Steve Polyack
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


amr driver issues in 7.1-RELEASE

2009-01-22 Thread Steve Polyack
We have multiple systems using LSILogic PERC4 cards (PERC4e/Si, 
PERC4/DC, amr driver).  After recently upgrading two of them from 6.3 to 
7.1, we have begun to see the following errors in our logs during heavy 
use in both systems:
amr0: Too many retries on command 0x80a4da58.  Controller is 
likely dead
amr0: Too many retries on command 0x80a4eaa8.  Controller is 
likely dead
amr0: Too many retries on command 0x80a497a0.  Controller is 
likely dead
amr0: Too many retries on command 0x80a4eaa8.  Controller is 
likely dead


However, the system continues working and the volumes remain 
accessible.  This happens on the PERC4/DC controllers in both systems.  
Firmware version is 352D.  MegaCLI reports no problems.


Has anyone else seen these messages?  A google search turns up nothing 
but results in the driver code.  Is this something we should be worried 
about?  We won't be moving any other systems to 7.1 until we can clear 
this up.


Thanks!

-Steve Polyack
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: amr driver issues in 7.1-RELEASE

2009-01-22 Thread Steve Polyack

Scott Long wrote:
The fix for this that I was thinking of is already in 7.1.  There 
might still be a driver bug, but I'm leaning more towards the 
controller simply being busy.  Do you have a reproducible test case 
that I could

try?

Scott

We saw this one while backups wrote from an array on the PERC4/DC to a 
tape drive (on a separate controller).
amr1: Too many retries on command 0x80a6d060.  Controller is 
likely dead


The other four which I noted came during writes to the array attached to 
the PERC4/DC (external Dell PowerVault).  I want to say they showed up 
while writing a 30G junkfile (/dev/random) to the array which we were 
using to test the tape access; either that, or while we wrote that file 
out to the tape drive.


If it matters, we also use ports/sysutils/linux-megacli2 to periodically 
check the status of our arrays.  It's possible that this happened during 
one of these long writes/reads.  I'm not having any luck reproducing at 
the moment, but if I come across a reproducible test, I will let you know.


Thanks!
Steve Polyack

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: amr driver changes in FreeBSD 7.1-RELEASE

2009-01-16 Thread Steve Polyack
In case anyone is interested, the situation has only gotten weirder.  
Booting 7.1-R with 'boot -v' mysteriously adds:

(probe32:amr1:1:6:0): error 22
(probe32:amr1:1:6:0): Unretryable Error

However, amr1 is not the one that is giving us issues.  It appears to 
work fine.  amr0 is the one on which 7.1-R fails to find any volumes.  
Even stranger, a 'camcontrol devlist' manages to display all off the 
individual SCSI drives attached to amr0.


Finally, this problem was solved whilst writing this e-mail.  The 
default for kern.cam.scsi_delay (5000ms) appears to be too aggressive in 
my circumstances.  Increasing it to 15000 solves my issue (and gets rid 
of amr0: adapter is busy messages).


Steve Polyack wrote:

Hello,

We have a Dell PowerEdge 1850 server.  It contains two PERC4 RAID 
controllers.  One is a PERC4e/Si, and the other is a PERC4/DC.  Right 
now we are running FreeBSD 6.3-RELEASE, with a 36GB RAID-1 on the 
PERC4e/Si (amr0), and both a 1TB RAID5 and a 136GB RAID1 on the 
PERC4/DC(amr1). Both adapters are running the latest firmware revision.


When we boot FreeBSD7.1 install media, the amr driver fails to detect 
any volumes (disks) attached to amr0, the PERC4e/Si.  However, it 
picks up the attached disks on the PERC4/DC just fine.  However, if I 
boot 7.0-RELEASE install media, it picks up all of the attached 
volumes, leading me to believe the issue is due to changes in the amr 
driver between 7.0 and 7.1.  During the 7.1 boot process, before 
probing disks, we see the message amr0: adapter is busy show up 
twice.  This also does not occur on the 6.3, 6.4, or 7.0 releases.


We also have another PE1850 with a very similar configuration, except 
the two PERCs get probed in a different order, and it detects all of 
the attached volumes without any issues.


Any suggestions? These are semi-critical systems, so we aren't always 
able to test things like this.  But, we can schedule downtime once or 
twice a week if necessary.


-Steve Polyack
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


amr driver changes in FreeBSD 7.1-RELEASE

2009-01-13 Thread Steve Polyack

Hello,

We have a Dell PowerEdge 1850 server.  It contains two PERC4 RAID 
controllers.  One is a PERC4e/Si, and the other is a PERC4/DC.  Right 
now we are running FreeBSD 6.3-RELEASE, with a 36GB RAID-1 on the 
PERC4e/Si (amr0), and both a 1TB RAID5 and a 136GB RAID1 on the 
PERC4/DC(amr1). Both adapters are running the latest firmware revision.


When we boot FreeBSD7.1 install media, the amr driver fails to detect 
any volumes (disks) attached to amr0, the PERC4e/Si.  However, it picks 
up the attached disks on the PERC4/DC just fine.  However, if I boot 
7.0-RELEASE install media, it picks up all of the attached volumes, 
leading me to believe the issue is due to changes in the amr driver 
between 7.0 and 7.1.  During the 7.1 boot process, before probing disks, 
we see the message amr0: adapter is busy show up twice.  This also 
does not occur on the 6.3, 6.4, or 7.0 releases.


We also have another PE1850 with a very similar configuration, except 
the two PERCs get probed in a different order, and it detects all of the 
attached volumes without any issues.


Any suggestions? These are semi-critical systems, so we aren't always 
able to test things like this.  But, we can schedule downtime once or 
twice a week if necessary.


-Steve Polyack
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Pending MFC of drm updates

2009-01-08 Thread Steve Polyack

Robert Noland wrote:

I am planning to merge most all of the drm from -CURRENT to releng_7
shortly.  The merge that I have staged includes the following.

Merged /head/sys:r182080,182467-182469,182883-182884,183573,183603-183605,
183828,183830-183834,184212-184213,184263,184373-184375

There are really too many updates/fixes to mention as the drm from 7 is
more than 2 years old now.  This has support for several newer Intel and
AMD/ATI chips, (no r6/7xx yet, but soon(tm)).

I have a patch available for testing at
http://people.freebsd.org/~rnoland/drm-update-7-010609.patch.bz2

robert.

  
I've pulled this down and patched my install of FreeBSD 7.1-RELEASE.  
Whilst using a Radeon 9250 (R280?), I have no problems.  There is also 
no apparent performance difference.


The only issue I have seen occurs with my dual-monitor setup.  
Occasionally a window on the second monitor will decide to render its 
drop-down menus or other (overlay-based?) graphics on the primary 
monitor instead of where it should be.  Restarting the application seems 
to clear this up.  I have not seen this previously before applying your 
patch.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Burning DVD with files4GB from console

2008-12-04 Thread Steve Polyack
Not too say you're .iso images can't be 2GB/4GB, but I'm pretty sure 
the ISO9660 standard is limited to a 2GB maximum file size (for files 
within the .iso).  You must use UDF to burn files of greater size.  
mkisofs(8) seems to support this, if only in alpha/hybrid stage:


  -udf   Include UDF support in the generated filesystem image.  
UDF sup-
 port is currently in alpha status and for this reason, it 
is not

 possible to create UDF only images.


Stephen Montgomery-Smith wrote:

David Kelly wrote:

On Thu, Dec 04, 2008 at 10:48:54AM +0100, Bartosz Stec wrote:
My backup script split filesystem dumps to files with size of 4,37 
GB (4 588 544 kB). It's just an optimal size to fill out DVDs. At 
this moment I have to burn them from windows via smb-link becuase I 
didn't manage to do this task from FreeBSD console due to 2GB/4GB 
filesize restrictions (growisofs).


Since when did FreeBSD (or growisofs) have a 2GB/4GB filesize limit? I
have burned 4.3GB DVDs several times.


I never had a problem writing huge files.  But I did have a problem 
subsequently reading it with Windows XP.  Maybe the file size 
restriction is a Windows thing.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]


Re: repeatable crash on RELENG7

2008-12-04 Thread Steve Polyack

Kevin Oberman wrote:

Date: Tue, 2 Dec 2008 19:33:00 +0300
From: Alexandr Pakhomov [EMAIL PROTECTED]
Sender: [EMAIL PROTECTED]

On Tue, Dec 2, 2008 at 5:12 PM, Mike Tancsa [EMAIL PROTECTED] wrote:



At 08:38 AM 12/2/2008, Kostik Belousov wrote:

  

mdconfig -a -t malloc -s 1800M
  

You cannot have ~ 2Gb of kernel memory allocated for md, at least not on
i386.



Thanks,  how do I find out what the limit is on a machine ? Is it
vm.kvm_size ?


   ---Mike
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

  

Try tune both vm.kmem_size and vm.kmem_size_max via /boot/loader.conf. They
are equal to 330M by default.



Mike,

Let me second Kris's suggestion. I can think of very few cases where
malloc backed is in any way better than swap backed. Many people assume
that swap-backed performance is better, but this is only the case if you
run out of memory (and that is full memory, not kvm).

md is still RAM based and very fast. The current mdconfig should be
defaulting to swap-backed devices, but I don't see any indication of
that in the man page, so I may be wrong.
  
I agree here.  Using mdconfig(8) in malloc-backed mode may lead to 
unpredictable results when attempting to use large amounts of memory.  
mdconfig(8) seems to allow you to reserve more kernel memory than may 
be appropriate, and as in Mike's experience, causes a kernel panic upon 
writing enough data to the malloc'd md(4) device.


It seems that the term swap-backed is misleading for some people.  It 
does NOT mean your md(4) device will be constantly swapping to disk (and 
the man page does an alright job of relaying this).  It simply means 
that generally available memory will be used, and so will swap iff 
available memory happens to drop low enough.


The bottom line in my experience with md(4) devices greater than ~100MB 
is that swap-backed is always reliable, while malloc'd md(4) devices 
will cause unpredictable kernel panics.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]