Re: /usr/bin/script eating 100% cpu with portupgrade and xargs

2011-09-19 Thread Chris Torek
On Sep 18, 2011 11:25 AM, Kostik Belousov kostik...@gmail.com wrote:
 Please note that interpreting the receiving of 0 bytes on the terminal
 as EOF is only a convention. If done absolutely properly, script shall
 not interpret zero-byte read as EOF. Might be, the reasonable thing to
 do would be to only look at the stdin once in a second after receiving
 zero-bytes, and switching it back to normal mode if something is read.

The other obvious alternative is to test for underlying file type, and
distinguish between EOF is forever (ordinary pipes and sockets, some
but not all block devices, etc) and other.  In fact, the EOF is
only temporary mode introduced in the original change might
reasonably be limited to character devices only, for the purposes to
which script is put.

The delay method has significant merit in terms of simplicity, of
course: it avoids special-casing.  To truly fix the problem, though,
you would need something different in terms of a kernel interface: a
message from the master side of the pty saying slave side has done a
read(n) (where n is some integer), so that the program running the
pty can issue a corresponding read(n) on a buffer, to forward the
results back through the pty later.  Without this, the script
program erroneously translates one EOF as signaled by read returning
0 into multiple EOF on pty signals.  Adding a delay simply changes
this from as many as script can stuff down the pty at maximum CPU
rate to one a second.  Thus, if I were fixing this, I would lean
towards EOF is just a temporary signal being applied only to
character devices, and/or perhaps having a command-line flag to force
one mode or the other (with the default being EOF is permanent on
everything except character devices).

Chris
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


busdma MFC broke ipfw fwd for RELENG_6

2011-09-19 Thread Andreas Longwitz
Eugene Grosbein wrote:

 Well, given that before busdma commit that hardware worked just fine
 with stock driver, it could be less overhead for me to rollback that
 one busdma small chunk :-)
 Who knows, which drivers got broken then in 2010 in 6.4-STABLE with
 busdma change besides re(4)...

Another example is de(4) as mentioned in kern/151941.

-- 
Dr. Andreas Longwitz

Data Service GmbH
Beethovenstr. 2A
23617 Stockelsdorf
Amtsgericht Lübeck, HRB 318 BS
Geschäftsführer: Wilfried Paepcke, Dr. Andreas Longwitz, Josef Flatau

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Usling vlan(4) without an actual lan behind it

2011-09-19 Thread Pete French
Can anyone see any problem is doing this ? i.e. creating a vlan interface
which doesnt correspond to any physical interface, just as a place to hang IP
addresses. I am trying to work around a problem with carp and ndp when
there are multiple IPv6 addresses bound to it.

cheers,

-pete.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Usling vlan(4) without an actual lan behind it

2011-09-19 Thread David Magda
On Mon, September 19, 2011 08:02, Pete French wrote:
 Can anyone see any problem is doing this ? i.e. creating a vlan interface
 which doesnt correspond to any physical interface, just as a place to hang
 IP addresses. I am trying to work around a problem with carp and ndp when
 there are multiple IPv6 addresses bound to it.

Does it specifically have to be a vlan(4), or can you perhaps add another
address to lo(4), or perhaps create a lo1 in addition to the lo0?


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Usling vlan(4) without an actual lan behind it

2011-09-19 Thread Pete French
 Does it specifically have to be a vlan(4), or can you perhaps add another
 address to lo(4), or perhaps create a lo1 in addition to the lo0?

It can be anything really - I was looking for a generic interface
I can configure with IP addresses. But adding real addresses to
loopback interfaces can cause problems can it not ?

The issue I am trying to address is that I have a whole bunch of IPv6
addresse on a /64, which are oly used as endpoints for a set of
websites - they don't exist on a real ethernet anywhere, and don't
need to. I just want them on an interface on a machine wwhen I can run
up a load balancer to listeon on those addresses and forward them to
the approrpiate actual machines serving the requests.

Currently I am using a range which is the same as the IPv6 - so
they do exist on a real ethernet - but the nework interface there
is CARP, and from the inside ndp can oly resolve the single initial
carp address (I believe this is a known bug). So I want some interface
on the firewall box, which is neither the outside, nor the inside, where I
can hang a set of addresses on a different subnet. That way both sides
will be able to see the addresses fine, and everything will work nicely.

-pete.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: busdma MFC broke ipfw fwd for RELENG_6

2011-09-19 Thread John Baldwin
On Monday, September 19, 2011 6:20:07 am Andreas Longwitz wrote:
 Eugene Grosbein wrote:
 
  Well, given that before busdma commit that hardware worked just fine
  with stock driver, it could be less overhead for me to rollback that
  one busdma small chunk :-)
  Who knows, which drivers got broken then in 2010 in 6.4-STABLE with
  busdma change besides re(4)...
 
 Another example is de(4) as mentioned in kern/151941.

Hmm, that PR isn't really about de(4).  For de(4) I think you probably want 
this change:

r197465 | yongari | 2009-09-24 13:53:00 -0400 (Thu, 24 Sep 2009) | 18 lines

Align Tx/Rx descriptors on 32 bytes boundary instead of PAGE_SIZE.
Also align setup descriptor on 32 bytes boundary. Tx buffer have no
alignment limitation so create dmamap without alignment
restriction[1]. Rx buffer still seems to require 4 bytes alignment
limitation but we can simply use MCLBYTES for size to map the
buffer instead of TULIP_DATA_PER_DESC as the buffer is allocated
with m_getcl(9).
de(4) supports up to TULIP_MAX_TXSEG segments for Tx buffers,
increase maximum dma segment size to TULIP_MAX_TXSEG * MCLBYTES.
While I'm here remove TULIP_DATA_PER_DESC as it is not used anymore.

This should fix de(4) breakage introduced after r176206.

Submitted by:   jhb [1]
Reported by:WATANABE Kazuhiro  CQG00620  nifty dot ne dot jp 
Tested by:  WATANABE Kazuhiro  CQG00620  nifty dot ne dot jp ,
Takahashi Yoshihiro  nyan  jp dot freebsd dot org 

And probably these as well:


r197464 | yongari | 2009-09-24 13:11:41 -0400 (Thu, 24 Sep 2009) | 2 lines

Destroy dmamap in dma cleanup.


r197463 | yongari | 2009-09-24 13:07:04 -0400 (Thu, 24 Sep 2009) | 2 lines

Consistently use bus_addr_t.


r197461 | yongari | 2009-09-24 12:57:35 -0400 (Thu, 24 Sep 2009) | 4 lines

Use __NO_STRICT_ALIGNMENT to determine whether de(4) have to apply
alignment fixup code for received frames on strict alignment
architectures.


r177937 | jhb | 2008-04-05 13:24:44 -0400 (Sat, 05 Apr 2008) | 9 lines

During attach on some de(4) adapters the driver sends out a test packet as
part of detecting the media.  Explicitly ensure that we don't send it to
bpf(4) as bpf(4) isn't setup yet.  This worked by accident before the bpf
interface stuff was reworked to avoid other races (bpf_peers_present, etc.)
but now it needs an explicit check to avoid a panic.

MFC after:  3 days
PR: kern/120915


r170389 | yongari | 2007-06-06 20:28:47 -0400 (Wed, 06 Jun 2007) | 9 lines

The maxsegsz of a dma tag created in de(4) is TULIP_DATA_PER_DESC bytes.
In Rx path it allocates a new mbuf with m_getcl(9) so the length of
the mbuf is MCLBYTES which is greater than a segment size specified by
the dma tag. This segment size mismatch caused a voluntary panic.
Fix the panic by settting the mbuf length to TULIP_DATA_PER_DESC.

Reported by:Arne H Juul arnej AT yahoo-inc DOT com
Tested by:  Arne H Juul arnej AT yahoo-inc DOT com



-- 
John Baldwin
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


mpt0 timeouts with an intel SASUC8I controller

2011-09-19 Thread Thomas Vogt
Hello

I've stability issue with my new intel SASUC8I [1] PCIe controller. It's a LSI 
1068e based controller. After a few minutes with disk io (csup or scrub by 
example) my FreeBSD 8-stable (64bit) is freezing for a couple of minutes and 
I see a lot of error messages like:

Sep 17 03:10:03 gw kernel: mpt0: request 0xff80002bc3b0:48367 timed out for 
ccb 0xff00050a8000 (req-ccb 0xff00050a8000)
Sep 17 03:10:03 gw kernel: mpt0: request 0xff80002bbb40:48368 timed out for 
ccb 0xff0004f81800 (req-ccb 0xff0004f81800)
Sep 17 03:10:03 gw kernel: mpt0: completing timedout/aborted req 
0xff80002bc3b0:48367
Sep 17 03:10:03 gw kernel: mpt0: completing timedout/aborted req 
0xff80002bbb40:48368
Sep 17 03:10:03 gw kernel: mpt0: Timedout requests already complete. Interrupts 
may not be functioning

It takes minutes to execute commands after the first timeout message. The whole 
OS is running on single raidz ZFS pool. I use 4x 2TB SpinPoint F4 EcoGreen 
HD204UI disks with fixed firmware [2]. The disks are connected with brand new 
adaptec cables. I already replaced a cable to disqualify a cable issue. 
smartctl looks ok. The system is working fine if I connect all 4 disk to the 
the onboard controller. 

Is this an known issue with this controller and the mpt driver? Is there an 
workaround? I hope someone can help me.

Regards,
Thomas


FreeBSD Information:
8.2-STABLE FreeBSD 8.2-STABLE #3: Sun Jul  3 19:39:11 UTC 2011  
r...@gw.lan:/usr/obj/usr/src/sys/GENERIC  amd64

Motherboard:
smbios.planar.maker=ASUSTeK Computer INC.
smbios.planar.product=P5N73-AM

Controller:
mptutil show adapter
mpt0 Adapter:
   Board Name: SASUC8I
   Board Assembly: L3-25071-00C
Chip Name: C1068E
Chip Revision: UNUSED
  RAID Levels: RAID0, RAID1, RAID1E
RAID0 Stripes: 64k
   RAID1E Stripes: 64k
 RAID0 Drives/Vol: 2-10
 RAID1 Drives/Vol: 2

dmesg about the controller:
mpt0: LSILogic SAS/SATA Adapter port 0xee00-0xeeff mem 
0xef7fc000-0xef7f,0xef7e-0xef7e irq 16 at device 0.0 on pci2
mpt0: [ITHREAD]
mpt0: MPI Version=1.5.20.0
mpt0: Capabilities: ( RAID-0 RAID-1E RAID-1 )
mpt0: 0 Active Volumes (2 Max)
mpt0: 0 Hidden Drive Members (14 Max)

zpool status:
  pool: tank
 state: ONLINE
 scan: scrub canceled on Sun Sep 18 10:44:46 2011
config:

NAMESTATE READ WRITE 
CKSUM
tankONLINE   0 0
 0
  raidz1-0  ONLINE   0 0
 0
gptid/7cd20811-2af6-11e0-9271-e5dbd4b6b481  ONLINE   0 0
 0
gptid/7d7af0fc-2af6-11e0-b0b1-85b4c14d926b  ONLINE   0 0
 0
gptid/7e24b41a-2af6-11e0-944f-fb3dae8bad6a  ONLINE   0 0
 0
gptid/dac06b61-972b-11e0-affc-1c6f65565b30  ONLINE   0 0
 0

/boot/loader.conf:
zfs_load=YES
vfs.root.mountfrom=zfs:tank/root
aio_load=yes
ahci_load=YES
ataahci_load=YES
ipfw_load=YES
vm.kmem_size=6G
vfs.zfs.arc_max=4G
vfs.zfs.prefetch_disable=1
vfs.zfs.txg.timeout=5

rc.conf:
ipv6_firewall_enable=YES
ipv6_firewall_type=OPEN
sendmail_enable=NO
sshd_enable=YES
zfs_enable=YES
firewall_enable=YES
firewall_type=OPEN

dmesg:
http://pastebin.com/sL15g1vt

smartctl:
http://pastebin.com/wYU2DXJ4

[1] 
http://www.intel.com/content/www/us/en/servers/raid/raid-controller-sasuc8i.html
[2] http://www.samsung.com/global/business/hdd/faqView.do?b2b_bbs_msg_id=386





___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: mpt0 timeouts with an intel SASUC8I controller

2011-09-19 Thread Thomas Vogt
Hello

On Sep 19, 2011, at 2:45 PM, Thomas Vogt wrote:
 I've stability issue with my new intel SASUC8I [1] PCIe controller. It's a 
 LSI 1068e based controller. After a few minutes with disk io (csup or scrub 
 by example) my FreeBSD 8-stable (64bit) is freezing for a couple of minutes 
 and I see a lot of error messages like:
 
 dmesg:
 http://pastebin.com/sL15g1vt
 
 smartctl:
 http://pastebin.com/wYU2DXJ4

Sorry: http://pastebin.com/0Na23R54

Regards,
Tom




___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: Usling vlan(4) without an actual lan behind it

2011-09-19 Thread David Magda
On Mon, September 19, 2011 08:45, Pete French wrote:
 Does it specifically have to be a vlan(4), or can you perhaps add
 another
 address to lo(4), or perhaps create a lo1 in addition to the lo0?

 It can be anything really - I was looking for a generic interface
 I can configure with IP addresses. But adding real addresses to
 loopback interfaces can cause problems can it not ?

No, you can create an lo0:1 that's a /32 and it shouldn't be a problem.

There are a bunch HOWTOs around on configuring anycast that instruct one
to put the service IP on an alias on lo0 and then run an OSPF process
that advertises that this IP is available on this host (which acts as a
router).

http://www.openfusion.net/linux/anycast_dns


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


change in io statistics since updating 8.2-STABLE

2011-09-19 Thread Robert Schulze

Hello,

like described in another thread, we recently updated a fileserver from 
8.2-STABLE@2011-05-08 to 8.2-STABLE@2011-09-08 (amd64 both). ZPool is 
left at v14, the filesystems at version 3. Nothing in /boot/loader.conf 
or /etc/sysctl.conf has been changed.


We now realize that r/w behaviour has changed on the whole pool.
The l2arc devices (2 SSDs) have decreased in writes from average 10M/sec 
(peaks at 20 M/sec) to 5M/sec.


On the other hand, reads on the disks (2x5 disks raidz2) have increased 
(nearly doubled on each disk).


I suspect, that the cache is somewhat faulty or has become slow, and so 
ZFS decides to read data from main storage instead of the cache. Is 
there another explaination? Are there some defaults of sysctls which 
have been changed from May to September?


with kind regards,
Robert Schulze
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: mpt0 timeouts with an intel SASUC8I controller

2011-09-19 Thread Thomas Vogt
Hi Marius

On 19.09.2011, at 21:06, Marius Strobl wrote:

 On Mon, Sep 19, 2011 at 02:45:04PM +0200, Thomas Vogt wrote:
 Hello
 
 I've stability issue with my new intel SASUC8I [1] PCIe controller. It's a 
 LSI 1068e based controller. After a few minutes with disk io (csup or scrub 
 by example) my FreeBSD 8-stable (64bit) is freezing for a couple of 
 minutes and I see a lot of error messages like:
 
 Sep 17 03:10:03 gw kernel: mpt0: request 0xff80002bc3b0:48367 timed out 
 for ccb 0xff00050a8000 (req-ccb 0xff00050a8000)
 Sep 17 03:10:03 gw kernel: mpt0: request 0xff80002bbb40:48368 timed out 
 for ccb 0xff0004f81800 (req-ccb 0xff0004f81800)
 Sep 17 03:10:03 gw kernel: mpt0: completing timedout/aborted req 
 0xff80002bc3b0:48367
 Sep 17 03:10:03 gw kernel: mpt0: completing timedout/aborted req 
 0xff80002bbb40:48368
 Sep 17 03:10:03 gw kernel: mpt0: Timedout requests already complete. 
 Interrupts may not be functioning
 
 
 If this really is an issue with interrupts not getting delivered you
 could try whether disabling MSI/MSI-X by setting hw.pci.enable_msi=0
 and hw.pci.enable_msix=0 either on the loader prompt or via loader.conf
 works around it.

I already tried this. It didn't change anything. Same timeouts.

I also tried vfs.zfs.vdev.min_pending=1 and vfs.zfs.vdev.max_pending=1 in 
loader.conf without any positive change.

Regards,
Tom




___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: mpt0 timeouts with an intel SASUC8I controller

2011-09-19 Thread Marius Strobl
On Mon, Sep 19, 2011 at 02:45:04PM +0200, Thomas Vogt wrote:
 Hello
 
 I've stability issue with my new intel SASUC8I [1] PCIe controller. It's a 
 LSI 1068e based controller. After a few minutes with disk io (csup or scrub 
 by example) my FreeBSD 8-stable (64bit) is freezing for a couple of minutes 
 and I see a lot of error messages like:
 
 Sep 17 03:10:03 gw kernel: mpt0: request 0xff80002bc3b0:48367 timed out 
 for ccb 0xff00050a8000 (req-ccb 0xff00050a8000)
 Sep 17 03:10:03 gw kernel: mpt0: request 0xff80002bbb40:48368 timed out 
 for ccb 0xff0004f81800 (req-ccb 0xff0004f81800)
 Sep 17 03:10:03 gw kernel: mpt0: completing timedout/aborted req 
 0xff80002bc3b0:48367
 Sep 17 03:10:03 gw kernel: mpt0: completing timedout/aborted req 
 0xff80002bbb40:48368
 Sep 17 03:10:03 gw kernel: mpt0: Timedout requests already complete. 
 Interrupts may not be functioning
 

If this really is an issue with interrupts not getting delivered you
could try whether disabling MSI/MSI-X by setting hw.pci.enable_msi=0
and hw.pci.enable_msix=0 either on the loader prompt or via loader.conf
works around it.

Marius

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: mpt0 timeouts with an intel SASUC8I controller

2011-09-19 Thread Marius Strobl
On Mon, Sep 19, 2011 at 09:13:53PM +0200, Thomas Vogt wrote:
 Hi Marius
 
 On 19.09.2011, at 21:06, Marius Strobl wrote:
 
  On Mon, Sep 19, 2011 at 02:45:04PM +0200, Thomas Vogt wrote:
  Hello
  
  I've stability issue with my new intel SASUC8I [1] PCIe controller. It's a 
  LSI 1068e based controller. After a few minutes with disk io (csup or 
  scrub by example) my FreeBSD 8-stable (64bit) is freezing for a couple 
  of minutes and I see a lot of error messages like:
  
  Sep 17 03:10:03 gw kernel: mpt0: request 0xff80002bc3b0:48367 timed 
  out for ccb 0xff00050a8000 (req-ccb 0xff00050a8000)
  Sep 17 03:10:03 gw kernel: mpt0: request 0xff80002bbb40:48368 timed 
  out for ccb 0xff0004f81800 (req-ccb 0xff0004f81800)
  Sep 17 03:10:03 gw kernel: mpt0: completing timedout/aborted req 
  0xff80002bc3b0:48367
  Sep 17 03:10:03 gw kernel: mpt0: completing timedout/aborted req 
  0xff80002bbb40:48368
  Sep 17 03:10:03 gw kernel: mpt0: Timedout requests already complete. 
  Interrupts may not be functioning
  
  
  If this really is an issue with interrupts not getting delivered you
  could try whether disabling MSI/MSI-X by setting hw.pci.enable_msi=0
  and hw.pci.enable_msix=0 either on the loader prompt or via loader.conf
  works around it.
 
 I already tried this. It didn't change anything. Same timeouts.
 
 I also tried vfs.zfs.vdev.min_pending=1 and vfs.zfs.vdev.max_pending=1 in 
 loader.conf without any positive change.
 

Okay, I was just worried as I recently changed mpt(4) to use MSI/MSI-X
by default for the SAS variants but couldn't test with all of them. I'm
not aware of any issues with the driver causing command timeouts though.
I'd make sure that both the controller and the disks have the latest
firmware; IIRC some time ago someone reported on current@ that updating
the firmware of the disks solved the timeouts he was seeing for him.

Marius

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org


Re: ZFS panic on a RELENG_8 NFS server

2011-09-19 Thread Hiroki Sato
Hiroki Sato h...@freebsd.org wrote
  in 20110911.054601.1424617155148336027@allbsd.org:

hr Hiroki Sato h...@freebsd.org wrote
hr   in 20110910.044841.232160047547388224@allbsd.org:
hr
hr hr Hiroki Sato h...@freebsd.org wrote
hr hr   in 20110907.094717.2272609566853905102@allbsd.org:
hr hr
hr hr hr  During this investigation an disk has to be replaced and 
resilvering
hr hr hr  it is now in progress.  A deadlock and a forced reboot after that
hr hr hr  make recovering of the zfs datasets take a long time (for 
committing
hr hr hr  logs, I think), so I will try to reproduce the deadlock and get a
hr hr hr  core dump after it finished.
hr hr
hr hr  I think I could reproduce the symptoms.  I have no idea about if
hr hr  these are exactly the same as occurred on my box before because the
hr hr  kernel was replaced with one with some debugging options, but these
hr hr  are reproducible at least.
hr hr
hr hr  There are two symptoms.  One is a panic.  A DDB output when the panic
hr hr  occurred is the following:
hr
hr  I am trying vfs.lookup_shared=0 and seeing how it goes.  It seems the
hr  box can endure a high load which quickly caused these symptoms.

 There was no difference by the knob.  The same panic or
 unresponsiveness still occurs in about 24-32 hours or so.

-- Hiroki


pgpIwsQ57ZO6Q.pgp
Description: PGP signature