from:"Paul Mather"

cardbus0: Resource not specified in CIS: id=14, size=400?????

2006-09-03 Thread Paul Mather

I have a Netgear FA511 Cardbus NIC that I am trying to use in a Dell
Inspiron 8600 laptop running 6.1-STABLE, but with limited success.  When
I insert it, I get the message cardbus0: Resource not specified in CIS:
id=14, size=400 appear as part of the cardbus probe.  The card
superficially works, but very unreliably (e.g., quite a few dc0:
watchdog timeout messages and no network traffic at times).

What does cardbus0: Resource not specified in CIS: id=14, size=400
mean?  Is this a hardware or a firmware or a driver issue?

When I insert the NIC into the Dell Inspiron 8600 laptop, the MAC
address that gets assigned to the interface is 00-00-00-00-00-00.  This
happens both under FreeBSD and Windows XP.  However, when I tried the
NIC in a friend's Acer laptop, the MAC address was correctly reported by
Windows XP (couldn't try FreeBSD), and corresponds with the one printed
on the underside of the NIC.  So, it seems that the Cardbus NIC itself
is okay, just not in the Dell laptop. :-(

Is this something that is fixable, e.g., with a suitable device.hints or
sysctl setting?

Here's what is output on the Dell Inspiron 8600 when I insert the
Netgear FA511 NIC (extra cardbus debugging enabled):

cbb0: card inserted: event=0x, state=3920
cbb0: cbb_power: 3V
TUPLE: LINKTARGET [3]: 43 49 53
Product version: 5.0
Product name: NETGEAR, Inc. | FA511 | CardBus Mobile Adapter | 1.00 | 
Manufacturer ID: 2d021a51
Functions: Network Adaptor, Multi-Functioned
Function Extension: 0102
Function Extension: 0280969800
Function Extension: 0200e1f505
Function Extension: 0301
cardbus0: Opening BAR: type=IO, bar=10, len=0100
TUPLE: Unknown(0x04) [7]: 03 01 00 00 00 00 ff
TUPLE: Unknown(0x05) [5]: 41 80 fb 00 ff
CIS reading done
cardbus0: Resource not specified in CIS: id=14, size=400
cardbus0: Non-prefetchable memory at f6001000-f60013ff
cardbus0: IO port at d000-d0ff
dc0: Netgear FA511 10/100BaseTX port 0xd000-0xd0ff mem 0xf6001000-0xf60013ff 
irq 11 at device 0.0 on cardbus0
miibus0: MII bus on dc0
ukphy0: Generic IEEE 802.3u media interface on miibus0
ukphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
dc0: link state changed to DOWN
dc0: link state changed to UP


Here is how the Cardbus adapter is probed during boot:

pcib2: ACPI PCI-PCI bridge at device 30.0 on pci0
pci_link1: BIOS IRQ 11 for 2.3.INTA is invalid
pci2: ACPI PCI bus on pcib2
cbb0: TI4510 PCI-CardBus Bridge at device 1.0 on pci2
cbb0: Found memory at f600
cbb0: Secondary bus is 0
cbb0: Setting primary bus to 2
cbb0: Secondary bus set to 3 subbus 4
cardbus0: CardBus bus on cbb0
pccard0: 16-bit PCCard bus on cbb0


Finally, here is the output of pciconf -vl for the laptop (the
PCI4510SDFSDFSD PC Card Controller SDFSDAFSADFSDAFSDAF description for
the Cardbus bridge looks highly suspicious to me):

[EMAIL PROTECTED]:0:0:class=0x06 card=0x016a1028 chip=0x33408086 
rev=0x03 hdr=0x00
vendor   = 'Intel Corporation'
device   = '82855PM Host-Hub Interface Bridge'
class= bridge
subclass = HOST-PCI
[EMAIL PROTECTED]:1:0: class=0x060400 card=0x chip=0x33418086 rev=0x03 
hdr=0x01
vendor   = 'Intel Corporation'
device   = '82855PM AGP Bridge'
class= bridge
subclass = PCI-PCI
[EMAIL PROTECTED]:29:0:class=0x0c0300 card=0x016a1028 chip=0x24c28086 
rev=0x01 hdr=0x00
vendor   = 'Intel Corporation'
device   = '82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) USB UHCI Controller'
class= serial bus
subclass = USB
[EMAIL PROTECTED]:29:1:class=0x0c0300 card=0x016a1028 chip=0x24c48086 
rev=0x01 hdr=0x00
vendor   = 'Intel Corporation'
device   = '82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) USB UHCI Controller'
class= serial bus
subclass = USB
[EMAIL PROTECTED]:29:2:class=0x0c0300 card=0x016a1028 chip=0x24c78086 
rev=0x01 hdr=0x00
vendor   = 'Intel Corporation'
device   = '82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) USB UHCI Controller'
class= serial bus
subclass = USB
[EMAIL PROTECTED]:29:7:class=0x0c0320 card=0x016a1028 chip=0x24cd8086 
rev=0x01 hdr=0x00
vendor   = 'Intel Corporation'
device   = '82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) USB 2.0 EHCI Controller'
class= serial bus
subclass = USB
[EMAIL PROTECTED]:30:0:class=0x060400 card=0x chip=0x24488086 
rev=0x81 hdr=0x01
vendor   = 'Intel Corporation'
device   = '82801BAM/CAM/DBM (ICH2-M/3-M/4-M) Hub Interface to PCI Bridge'
class= bridge
subclass = PCI-PCI
[EMAIL PROTECTED]:31:0:class=0x060100 card=0x chip=0x24cc8086 
rev=0x01 hdr=0x00
vendor   = 'Intel Corporation'
device   = '82801DBM (ICH4-M) LPC Interface Bridge'
class= bridge
subclass = PCI-ISA
[EMAIL PROTECTED]:31:1:  class=0x01018a card=0x016a1028 chip=0x24ca8086 
rev=0x01 hdr=0x00
vendor   = 'Intel Corporation'
device   = '82801DBM (ICH4-M) UltraATA/100 EIDE Controller'
class= mass storage
subclass = ATA
[EMAIL

Re: bge watchdog timeouts still happening

2006-09-15 Thread Paul Mather

On Fri, 2006-09-15 at 12:33 -0700, Kent Stewart wrote:
 On Friday 15 September 2006 09:28, Herve Boulouis wrote:
  Le 15/09/2006  18:05, Gleb Smirnoff a écrit:
   H bge0: Broadcom BCM5700 B2, ASIC rev. 0x7102 mem
   0xfeb0-0xfeb0 irq 17 at device 8.0 on pci1 H miibus0: MII
   bus on bge0
   H brgphy0: BCM5401 10/100/1000baseTX PHY on miibus0
   H brgphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX,
   1000baseTX, 1000baseTX-FDX, auto H bge0: Ethernet address:
   00:06:5b:1a:7f:4a
  
   Is it integrated or not? I've got exactly the same NIC and I can
   try to reproduce the problem if you describe the workload.
 
  Yes, it's the onboard bge. Workload is 10-25 Mbit/s of web hosting.
 
 It seems to be at the top of the tree somewhere because people are also 
 seeing the watchdog timeouts on em and I get them on the gigabit re's.
 
 I got them downloading the kde-3.5.4 distfiles on a 768kb DSL line. I 
 had setiathome running, which keeps the cpu useage close to 100%.

FWIW, I get repeated watchdog timeout errors on 6-STABLE with a
dc-based Cardus NIC (Netgear FA511).  The worst behaviour I observed was
under heavy NFS load, with the link being unavailable for extended
periods of time.  Mostly, though, the problem manifests itself when the
card is inserted and the interface is trying to be brought up via DHCP
using dhclient, as if the NIC is not being initialised properly,
perhaps.

I don't know if this is the same problem, but I thought I'd mention it.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: FreeBSD MySQL still WAY slower than Linux

2005-06-28 Thread Paul Mather

On Tue, 2005-06-28 at 11:21 +0200, Roman Neuhauser wrote:
 # [EMAIL PROTECTED] / 2005-06-21 16:51:10 +0200:
  For accurate measurements and comparisons, you have to make
  sure to use _exactly_ the same physical location on the
  disk.
 
 No you don't. You want to make a side-by-side comparison
 of two products, and if one of them underperforms, it just
 underperforms. You cannot use a poor location selection
 strategy in the driver as an excuse for poor operation.

The point people are making is that location can have a significant
effect on performance, and so should not be dismissed out of hand.

Here is what I get when I run diskinfo on one of the (somewhat elderly)
disks I use in my desktop system (this is a drive I use for data, and it
is idle):

zappa# diskinfo -tv /dev/ad4
/dev/ad4
512 # sectorsize
25590620160 # mediasize in bytes (24G)
49981680# mediasize in sectors
49585   # Cylinders according to firmware.
16  # Heads according to firmware.
63  # Sectors according to firmware.

Seek times:
Full stroke:  250 iter in   5.159189 sec =   20.637 msec
Half stroke:  250 iter in   4.206125 sec =   16.825 msec
Quarter stroke:   500 iter in   7.151951 sec =   14.304 msec
Short forward:400 iter in   2.794380 sec =6.986 msec
Short backward:   400 iter in   4.135579 sec =   10.339 msec
Seq outer:   2048 iter in   0.332711 sec =0.162 msec
Seq inner:   2048 iter in   0.363152 sec =0.177 msec
Transfer rates:
outside:   102400 kbytes in   7.677977 sec =13337 kbytes/sec
middle:102400 kbytes in   9.151475 sec =11189 kbytes/sec
inside:102400 kbytes in  14.345492 sec = 7138 kbytes/sec


Note how the transfer rate for the outside is almost twice that of the
inside.  Suppose I run tests on two different operating systems, one
of which resides in a partition on the inside portion and the other in
one on the outside portion.  (Note that however good or bad it may be,
the location selection strategy in the driver can only lay out data
within the confines of the partition.)  Now, I do a dd test and find
that the outside OS is almost twice as fast as the other.  Would it be
wise to conclude that the slower OS is woefully inefficient compared to
the faster one?  Suppose both tests turn out to take roughly the same
time.  Should I conclude that the OS residing on the inside is just as
efficient as the other OS?

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: FreeBSD MySQL still WAY slower than Linux

2005-06-28 Thread Paul Mather

On Tue, 2005-06-28 at 18:39 +0200, Roman Neuhauser wrote:
 # [EMAIL PROTECTED] / 2005-06-28 11:38:44 -0400:
  Note how the transfer rate for the outside is almost twice that of the
  inside.  Suppose I run tests on two different operating systems, one
  of which resides in a partition on the inside portion and the other in
  one on the outside portion.
 
 Which is not the case according to the OP...
 
  (Note that however good or bad it may be, the location selection
  strategy in the driver can only lay out data within the confines of
  the partition.)  Now, I do a dd test and find that the outside OS
  is almost twice as fast as the other.  Would it be wise to conclude
  that the slower OS is woefully inefficient compared to the faster one?
  Suppose both tests turn out to take roughly the same time.  Should I
  conclude that the OS residing on the inside is just as efficient as
  the other OS?
 
 ... rendering this completely irrelevant.

...especially when you've cut out the context of my reply. :-)

My apologies if it wasn't clear, but I was responding to your apparent
assertion that location does not matter in disk performance benchmarks.
If I had been responding to a specific aspect of the OP's benchmark (or
indeed anything the OP has said), I'm sure I would have quoted that to
make the context clear.

 I have seen people come to a freebsd list with completely flawed
 comparisons or benchmarks: OSs installed on different partitions
 side by side, not taking VM cache into account, whatever, and be
 told that their numbers are flawed.
 
 I have also seen people test a specific subsystem (dd), and be told
 that their numbers don't reflect real world.
 
 And I have seen people test real world performance (install FreeBSD,
 install MySQL, run a stress test, reformat, install Linux, install
 MySQL, run a stress test) and get responses that try to make up
 reasons why the bad results are the testers fault). Heck, if
 installing an operating system, a database, and running it isn't
 a real world test, I don't know what is. Even if the bug is FreeBSD
 puts /var/db/mysql in the wrong part of the disk (then it's still
 a problem in FreeBSD, not in the messenger).
 
 I just wish people here were less defensive, that's all.

What you see as being defensive I see as being rigorous.  If someone is
making a claim based upon a performance benchmark, people will quiz the
person conducting the benchmark to ascertain exactly how it has been
undertaken.  To put any stock in a benchmark result, it is important to
be able to convince yourself it is a meaningful result.  Well, at least
most people I've encountered believe that to be the case.

As it says in the BUGS section of the diskinfo man page: There are in
order of increasing severity: lies, damn lies, statistics, and computer
benchmarks. ;-)

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: FreeBSD MySQL still WAY slower than Linux

2005-06-28 Thread Paul Mather

On Tue, 2005-06-28 at 19:17 +0200, Roman Neuhauser wrote:
 # [EMAIL PROTECTED] / 2005-06-28 13:03:04 -0400:

  What you see as being defensive I see as being rigorous.  If someone is
  making a claim based upon a performance benchmark, people will quiz the
  person conducting the benchmark to ascertain exactly how it has been
  undertaken.  To put any stock in a benchmark result, it is important to
  be able to convince yourself it is a meaningful result.  Well, at least
  most people I've encountered believe that to be the case.
 
 Say I install FreeBSD (using default partitions), install MySQL from
 a package on the CD, run a stress test, collect numbers, then
 repeat the process with a Linux installed over the previous FreeBSD
 installation, and find out that FreeBSD allows the MySQL server
 process 1/3 queries less, what (if anything) will be wrong in my
 claim that MySQL/FreeBSD is slower than MySQL/Linux?

To make the specific claim above, it would be okay, at least in my book.
To make a more general claim, pedantically speaking, you should, e.g.,
replicate your benchmark using various different hardware combinations,
to rule out the possibility of a pathological case affecting one or
other OS (e.g., where one OS has much better driver support for some
specific hardware aspect than the other).

Also, you would need to be careful how you stated your claim.  For
example, it would be better to say something like untuned
MySQL/Particular-FreeBSD-Version on a default install is slower than
untuned MySQL/Particular-Linux-Distribution on a default install.  If
you test many Linux distributions and find they beat out all the
FreeBSDs, then a more general MySQL/FreeBSD and MySQL/Linux might be
appropriate.  Similarly, if no amount of tuning lets FreeBSD MySQL
compete with Linux, I'd say your original statement would be defensible.
However, if some combinations of MySQL and FreeBSD tuning perform better
than some well-tuned MySQL/Linux distributions, then it's not so
straightforward to claim MySQL/FreeBSD is slower than MySQL/Linux.  (It
may be that default tunings favour one or the other.  I'm sure the Linux
people would gripe that you're using filesystem X instead of filesystem
Y, which would give better performance.:)

If you just want to make a benchmark statement about untuned installs on
default distributions, that's fine, but I'm not sure how illuminating
that is for the real world given that production systems are normally
tuned for performance.

Although this might seem like splitting hairs to the extreme, I guess
the point I'm making is that benchmarks can be highly subjective.
Unless you learn the context and point of view in which it was
performed, you can't really tell if the results apply to you.  In fact,
even a very good benchmark may not yield the expected performance in the
real world when run in an environment containing other systems (e.g.,
Apache, Squid, Postfix, etc.) that interact and contend for resources
and affect performance in a way not measured when the systems are
benchmarked in isolation.

I guess the preferred colour for the consumers of benchmarks is black
and white, when in reality what you get are subtle shades of grey. :-)

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: dangerous situation with shutdown process

2005-07-16 Thread Paul Mather

On Sat, 2005-07-16 at 16:16 +0200, Matthias Buelow wrote:
 David Taylor wrote:
 
 No.  I'm just asking if you know of ANY ata drives that will wait for the
 cache to be flushed before claiming the disable cache command has
 succeeded.  I don't, but I haven't looked.
 
 I don't know either. I assume that they do. Does it matter?
 I mean, I'm not suggesting a frivolous new theory that is highly
 speculative and warrants a lengthy debate on its purported merits.
 What I described is common practice on Windows, Linux and probably
 a few other systems and I would think that they're not doing this
 for nothing. And, frankly, I'm a bit astonished that the FreeBSD
 (community) seems to be so ignorant of well-known measures for
 improving data safety on consumer-grade desktop hardware. Does that
 mean that FreeBSD is deemed generally unsuited for desktop and
 laptop use and should be reserved for servers with the appropriate
 (expensive) hardware? I hope not.

I recall reading on freebsd-current that Scott Long is working on adding
journalling support to FFS.  Perhaps you'd like to direct your input to
him instead of rehashing it repeatedly on here, which is the wrong
outlet for such discussion anyway: by definition, CURRENT, not STABLE
will get a new feature like journalling, and so discussing the ins and
outs of it on freebsd-current would seem more apropos.  (Plus, I'd hate
to see him implement something only for you to declare it to be the
wrong way to do things.  Better to get your 2p in now.:)

Reagrding your question of does that mean that FreeBSD is deemed
generally unsuited for desktop and laptop use, I can speak only from
experience.  I run 6-CURRENT on an ATA system using softupdates on my
desktop, and have been doing so for quite some time.  I've seen through
quite a few periods of extreme growing pains for CURRENT, with seemingly
random panics and mystery crashes at times.  (Who can forget the dark
days of the ULE + PREEMPTION instability?)  Add to the mix that my
neighbourhood has pretty flaky power, with a tendency for short
interruptions at the first whiff of bad weather.  This all adds up to a
good smattering of reboots at inopportune times (like when doing
buildworlds or large portupgrade sessions:).

Despite that, I have never EVER had a problem with data consistency on
my file systems.  (The only problem I have had is when I added an ATA
controller card one time and forgot to disable its RAID BIOS, which
promptly spammed over my geom_mirror metadata.:)  If softupdates were as
unsafe as you often hint, I'm surprised that I haven't lost a file
system by now.  (I would also expect to hear from the field a lot more
clamour about how unsafe it is, and that, in fact, the sky was indeed
falling.)  I guess I must be amazingly lucky and should start playing
the lottery right now. :-)

The main inconvenience I have with panics or outages is the fsck times
on reboot.  (Actually, what I find to be more inconvenient is the
resynchronisation time needed for my geom_mirror, which takes a lot
longer than a fsck.)  I understand that fsck delays for large file
systems is the major impetus behind the journalling work, not as a fix
for a perceived data consistency problem.

Cheers,

Paul.

PS: I also use softupdates on my NetBSD systems, again without problems.
I've also used LFS on NetBSD at times, but have always ended up
abandoning it due to performance and severe data reliability problems.
(To be clear, though, I'm not sure LFS was deemed to be for production
use, at least not the times I tried it.)
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: dangerous situation with shutdown process

2005-07-18 Thread Paul Mather

On Mon, 2005-07-18 at 16:35 +0200, Matthias Buelow wrote:
 Oliver Fromme [EMAIL PROTECTED] writes:
 
 buffers to disk.  While it is doing that, it displays the
 number of remaining buffers, with increasing time intervals
 between them.  If there are still buffers left after a
 certain number of intervals without change, the kernel
 gives up.
 
 Why is it doing this? Can't it just enumerate the buffers and write
 them, one by one?

Why would that necessarily be more successful?  If the outstanding
buffers count is not reducing between time intervals, it is most likely
because there is some underlying hardware problem (e.g., a bad block).
If the count still persists in staying put, it likely means whatever the
hardware is doing to try and fix things (e.g., write reallocation) isn't
working, and so the kernel may as well give up.

You can enumerate the buffers and *try* to write them, but that doesn't
guarantee they will be written successfully any more than observing the
relative number left outstanding.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: dangerous situation with shutdown process

2005-07-18 Thread Paul Mather

On Mon, 2005-07-18 at 17:14 +0200, Matthias Buelow wrote:
 Paul Mather [EMAIL PROTECTED] writes:
 
 Why would that necessarily be more successful?  If the outstanding
 buffers count is not reducing between time intervals, it is most likely
 because there is some underlying hardware problem (e.g., a bad block).
 If the count still persists in staying put, it likely means whatever the
 hardware is doing to try and fix things (e.g., write reallocation) isn't
 working, and so the kernel may as well give up.
 
 So the kernel is relying on guesswork whether the buffers are flushed
 or not...

I don't know if you are just deliberately trying to be contentious, but
that is a serious misrepresentation of what is happening.  Quite
obviously the kernel knows whether a buffer has successfully been
flushed, otherwise a count of outstanding buffers would be meaningless.
(Surely you're not saying the kernel simply guesses if a buffer has been
flushed in maintaining its count of outstanding buffers?  What would be
the point of that?)

If you calm down and think about it for a little, you'll realise what
you suggest to do and what is actually done amount to the same thing in
practical terms.

It's all very easy to say to write each buffer synchronously (and wait
for the disk's response), but what do you do when the buffer *does* get
stuck and won't complete (e.g., because someone removed the floppy or
USB disk, or your remote ggate server disappeared, or your hard disk is
going bad, etc.)?  Do you just bail immediately at that point?  Or do
you keep retrying in the hope it will complete?  In the end, it comes
down to waiting a certain amount of time for drivers to do their best
and then giving up.  The only real question is how long you wait, and
maybe whether syncer is not waiting long enough (and hence how to extend
the amount of time it is willing to wait until it gives up on buffers
being unflushable).  I'm not sure why that is fundamentally madness.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: READ_DMA, WRITE_DMA errors

2005-07-21 Thread Paul Mather

On Wed, 2005-07-20 at 23:54 -0500, Steve wrote:
 I've found tons of emails, news messages, listserv messages, and even 
 some bug reports of this seemingly common error.
 
 So, I had been running 5.2 on a server, and, updated to 5.3. Got the 
 READ_DMA and WRITE_DMA error and retries. So, figuring it might be a bad 
 update, took a new drive. put it in, loaded 5.4 for grins, and, same 
 issue, lots of these errors, eventually destroying the FS. Played around 
 with various settings, no avail. So, took it back, got different box, 
 everything new. Same problem, new install of 5.4
 
 So, took it back, got another with another MB (different model), but, 
 same maker (ASUS). Didn't have endless time to spend on production 
 machine. Sure enough, same problem. It's an ASUS A7V880. Controller is 
 SATA VT8237. Played around with tons of settings, eventually, after 
 reading various messages out there, discovered one that resolved the 
 problem. Had to set hw.ata.ata_dma=0. Of course, there is the obvious 
 downside to that! Speed!
 
 But it stinks to have decent hardware, yet, have to cripple the 
 machine. The place I got the equipment at runs ASUS only and has 
 thousands of them running under other OSes. Wished I had stayed with the 
 old FreeBSD version and old hardware now. I have not seen anyone that 
 has ever said the problem was being (or had been) solved though. I see 
 the bug reports, I take it no one has actually pinpointed the problem 
 though. BUT, I do hope it is understood that this is fairly widespread, 
 for me, the likelihood of 3 pcs, 2 different MB models, and, *complete* 
 new hardware for each of the 3 pcs kind of rules out hardware being 
 broken, might be badly designed, but, certainly not defective hardware.
 
 I do hope someone can eventually figure this out, seems to be extremely 
 common, and, definitely a problem for a stable release named 5.4.

I was one of the people who suffered from and reported this seemingly
common error.  On the systems that encountered problems, none had
particularly obscure or cutting-edge hardware (e.g., Intel PIIX4 ATA
controller on the motherboard).  One common thread in my case is that
all ran some kind of software RAID (gvinum or gmirror), though not all
of my software RAIDed machines exhibited the DMA problems leading me to
think perhaps it was a hardware/load/disk combination problem.  Quite
obviously, not all PIIX4 controller users were having this happen, and
so the it doesn't happen to me factor might have contributed to the
general notion that this was probably operator error or something like
that, and dismissed.

Anyway, as well as 5-STABLE, I also run a 6-CURRENT system that suffered
the problem.  Happily, after the ATA Mk.III merge, the situation
improved a LOT.  I occasionally still get the error reported, but it is
not fatal, unlike before (where the drive would be detached, breaking my
geom_mirror, necessitating a lengthy background rebuild).  So, I
consider the ATA Mk. III rewrite to have fixed the problem I had.  It
may be, then, that those upgrading to the upcoming 6.0-RELEASE (when it
appears) might also find their ATA DMA problems solved, too.

As for 5.x, I track -STABLE, and have noticed slight improvements
regarding the DMA TIMEOUT problem.  If you only run -RELEASE, you might
miss these ongoing improvements that crop up from time to time.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Quality of FreeBSD

2005-07-21 Thread Paul Mather

On Thu, 2005-07-21 at 14:26 -0500, Karl Denninger wrote:

 Ok, Robert, but then here's the question
 
 How come the ATA code which was very stable in 4.x was screwed with in a
 production release, breaking it, with no path backwards to the working
 code?

Not to mention that this happened during the 5.x release cycle.  It's
one thing to have a regression creep in when moving from one major
release to another (e.g., oh, that's the fallout from introducing Big
Feature XYZ or a big architectural revamp may have broken some
things), but it's another thing entirely to have it happen between
minor releases, which are supposed to be evolution, not revolution.

(Although the whole Early Adopter status for early 5.x releases might
mean all that is muddied when it comes to the 5.x series.)

My main disappointment with the ATA DMA TIMEOUT bug is not that it crept
in (these things happen), but that it did not seem to be taken seriously
when it had done so.  (Though, as Robert said, if the developers can't
reproduce the problem, it's hard for them to work on and fix it.)

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Create 2.5TB file system on 5.4S?

2005-08-16 Thread Paul Mather

On Tue, 2005-08-16 at 16:21 -0700, Brandon Fosdick wrote:
 David Magda wrote:
  There's the Bonnie and Bonnie++ file system testing programs. I believe 
  one or both are in the Ports.
 
 Thanks. I'll take a look.

There's also raidtest in benchmarks/raidtest in the ports tree that can
be used to generate synthetic I/O load and measure performance.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Sysinstall automatic filesystem size generation.

2005-08-29 Thread Paul Mather

On Tue, 2005-08-30 at 03:11 +0200, Matthias Buelow wrote:
 BTW., when have you last seen a broken NTFS? While
 I don't do Windows much, I have had quite a few crashes on Windows
 (2000, XP) over the years on various machines, and I always asked
 myself how it could be that the system is up almost immediately
 (probably due to log replay) with no discernible filesystem damage.
 Windows (NT) has been doing the write barrier flush tricks (disabling-/
 reenabling the cache for flushing it) for longer than Linux and I
 would think that this contributes to the fault resilience of NTFS.
 Not that I would imply that NTFS can't be corrupted, of course.

Funny you should mention it, but the last time I saw a broken NTFS was
back in July.  It was a friend's Windows 2000 system.  The net effect
was that the system would not boot fully; was not responsive to the
repair option; and wouldn't allow the recovery console to start.  In
the end, a wipe and reinstall was necessary.  Oddly enough, trying to
mount the NTFS file system via a Knoppix CD before resorting to that
yielded complaints about the journal being corrupted.

I guess I must be lucky because I've never yet had a corrupted file
system with softupdates enabled due to power loss or panic under FreeBSD
(though I've experienced plenty of power losses due to the flaky power
here and panics due to tracking CURRENT on my desktop system:).

By reading your regular dire warnings on the subject, my experience must
differ greatly from yours. ;-)

BTW, if you consider softupdates fundamentally broken wrt data
integrity, why don't you post your concerns to -current or -hackers,
say?  Surely the developers to address the problem are more likely to be
found reading there?

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: tx underrun ? (add entry into xl manpage)

2005-11-30 Thread Paul Mather

On Wed, 2005-11-30 at 04:37 -0800, Rob wrote:
 Vincent Blondel wrote:
  Hello all,
  
  When having a look at log files on my web servers, I
 regulary see next output on the 3COM ethernet
 interfaces :
  
  
  xl1: transmission error: 90
  xl1: tx underrun, increasing tx start threshold
 to 120 bytes
  xl1: transmission error: 90
  xl1: tx underrun, increasing tx start threshold
 to 180 bytes
  xl1: promiscuous mode enabled
  xl1: promiscuous mode disabled
  
  Can somebody explain me what it is and if this
 situation is normal ?
 
 Rumours are that these messages are harmless, but if
 someone
 can explain these messages properly, it would be nice
 to
 add an entry to the  DIAGNOSTICS  section of the xl
 manpage
 (hence, this mail also goes to doc mailinglist)
 
 I see these messages too on my router/gateway:
 
 xl1: transmission error: 90
 xl1: tx underrun, increasing tx start threshold to 120
 bytes
 xl1: transmission error: 90
 xl1: tx underrun, increasing tx start threshold to 180
 bytes
 xl0: transmission error: 90
 xl0: tx underrun, increasing tx start threshold to 120
 bytes
 xl1: transmission error: 90
 xl1: tx underrun, increasing tx start threshold to 240
 bytes

This is from the dc(4) man page:

 dc%d: TX underrun -- increasing TX threshold  The device generated a
 transmit underrun error while attempting to DMA and transmit a packet.
 This happens if the host is not able to DMA the packet data into the
 NIC's FIFO fast enough.  The driver will dynamically increase the trans-
 mit start threshold so that more data must be DMAed into the FIFO before
 the NIC will start transmitting it onto the wire.

I'm assuming the explanation also holds true for other drivers (xl,
etc.) that issue this warning.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: RELENG_5, snapshots and disk lock time

2005-03-07 Thread Paul Mather

On Mon, 2005-03-07 at 15:21 +0300, Dmitry Morozovsky wrote:
 Dear colleagues,
 
 dumping the snapshot of 140G ufs2 fyle system under contemporary RELENG_5 I 
 found that during mksnap_ffs file system is unresponsible even for reading 
 for 
 more than 3 minutes (it's on modern SATA disk with 50+ MBps linear transfer). 
 Is it normal?

Oddly enough, this happened to me last night on a RELENG_5 system.  In
my case, things were so bad that mksnap_ffs appeared to wedge
everything, meaning I'll have to make a trek in to where the machine is
located and press the ol' reset button to get things going again. :-(

The machine in question makes and mounts snapshots of all its
filesystems for backup each night via Tivoli TSM.  This has worked
flawlessly for many months.  Last night, I had many BitTorrent sessions
active on the filesystem that wedged.  I guess the activity broke the
snapshot mechanism. :-(  The odd thing is that it survived the night
before, when there were also BitTorrent sessions active.

I wonder how much activity mksnap_ffs can take?

Cheers,

Paul.

PS: The problematic file system was not low on space, which could be an
issue for snapshot creation.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: DANGER WILL ROBINSON! SERIOUS problem with current 5.4-PRERELEASE

2005-03-31 Thread Paul Mather

On Wed, 2005-03-30 at 23:30 -0600, Karl Denninger wrote:

 BTW its NOT your hardware at fault here - the same hardware that returns 
 these complaints for me on 5.x works perfectly with 4.11.  There have been 
 changes made to the ATA code that apparently interact VERY badly with 
 some controllers - particularly some very common SATA (SII chipset, used 
 on Adaptec and Bustek boards, among others) ones.  

It's not just a SATA problem.  I get the problem (though more
infrequently than it seems you do) on an Intel PIIX4 UDMA33 controller.
The problem occurs on two different systems (one Gateway, one Dell), and
only started happening some way through the 5.x life cycle, indicating
to me that a serious regression was introduced (in 5.2, I believe).  The
problem does not afflict 4.x.

 I don't know if GEOM/GMIRROR is truly involved here although that's the
 easiest way for me to provoke it - I suspect not - its just that
 GEOM/GMIRROR produces an I/O load pattern that is conducive to the 
 breakage showing up.  Specifically, a DD from one or more disks does NOT
 fail - a mix of reads and writes and fairly significant load appears 
 necessary to cause trouble.  Of course installation produces a very nice
 load of that type

On both systems that experience the problem, I am using some kind of
software mirroring.  On one I'm using geom_mirror, and on the other I'm
using geom_vinum.  Both suffer from the WRITE_DMA disconnect problem.
The Dell, using geom_mirror, is now running HEAD.  The Gateway running
RELENG_5 is annoying because when a drive becomes disconnected, the only
way right now to rebuild the plexes on the geom_vinum drive that is down
is to reboot the system.  (I've used setstate to flag the drive as up,
but then gvinum start of any down plex causes an immediate
panic/reboot.)

Ian Dowse posted a patch to the freebsd-current mailing list for the
WRITE_DMA issue
(http://lists.freebsd.org/mailman/htdig/freebsd-current/2005-February/046773.html).
  According to Dowse, the patch attempts to clean up the handling of timeouts 
in the ATA code by using the new callout_init_mtx() function.  It was 
successful for me.  I still got the WRITE_DMA timeouts, but not the 
disconnects.  I don't know if RELENG_5 has the new callout_init_mtx() 
function.  If it does, this patch might help there, too.

 I opened a PR on this quite some time ago - IMHO this sort of breakage
 should be considered a critical fault sufficient to stop a release until 
 its completely resolved.  A workaround that stops the system from blowing up
 but leaves the pauses and errors isn't really a fix - I doubt anyone
 will consider that acceptable as a means of truly addressing the problem 
 (at least I hope not!)

I agree that it wouldn't be ideal, but having something that fixed just
the disconnects in the tree would be better than nothing at all.  It's a
pain to have to track third-party patches.

 I got surprised by this (in a bad way) and have been fighting 
 workarounds since 5.3 was deemed production quality.  Going back to 
 4.x is possible for me, but highly undesireable for a number of reasons, not
 the least of which is the official FreeBSD posture on where work is and will
 be done on the OS down the road.

It's disappointing the way this problem appears to have been silently
ignored (except by those whom it afflicts), because it is a regression
that occurred during the 5.x lifecycle.  It's one thing to know that
your hardware won't work properly going from 4.x to 5.x, but another
thing to have it stop working going from one 5.x release to another.
(Or maybe it isn't, given the strange Early Adopter status of the
start of the 5.x release cycle.)

Anyway, I'm glad you are trying to keep this problem in the spotlight,
because an unreliable ATA subsystem is a miserable thing to have to
suffer. :-(

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: vinum or gvinum on FreeBSD 5.4

2005-04-21 Thread Paul Mather

On Thu, 2005-04-21 at 10:39 +0200, Claus Guttesen wrote:
 Hi.
 
 I may implement (g)vinum to stripe two 2 TB raid 5 volumes into a
 single 4 TB vinum-volume. The volumes reside on a atabeast doing the
 raid 5. The server is a 5.4 RC2 doing nfs on i386.
 
 According to some threads gvinum may not be completely stable (in 5.3,
 http://unix.derkeiler.com/Mailing-Lists/FreeBSD/current/2004-11/0537.html)
 when doing raid 5 but works fine when striping.
 
 Should I go vinum or gvinum? Will newfs have problems with a 4 TB
 volume? Are there any performance-degradation doing striping?

In addition to vinum and gvinum, have you considered geom_stripe?  See
gstripe(8) for details.  I've been using it for a while on 5-STABLE to
join two 65 GB slices on two different drives into a single 130 GB
device (/dev/stripe/data in my case).  I have had no stability problems,
which is why I mention this as a possible solution for you if you're
worried about the stability of vinum or gvinum in 5.x.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: vinum or gvinum on FreeBSD 5.4

2005-04-21 Thread Paul Mather

 On Thu, 2005-04-21 at 10:39 +0200, Claus Guttesen wrote:
  Hi.
  
  I may implement (g)vinum to stripe two 2 TB raid 5 volumes into a
  single 4 TB vinum-volume. The volumes reside on a atabeast doing the
  raid 5. The server is a 5.4 RC2 doing nfs on i386.
  
  According to some threads gvinum may not be completely stable (in 5.3,
  http://unix.derkeiler.com/Mailing-Lists/FreeBSD/current/2004-11/0537.html)
  when doing raid 5 but works fine when striping.
  
  Should I go vinum or gvinum? Will newfs have problems with a 4 TB
  volume? Are there any performance-degradation doing striping?

I forgot to mention, but you should consult
http://www.freebsd.org/projects/bigdisk/ to get an idea of the current
state of play and issues regarding large disk handling in FreeBSD.
According to the table of userland tools, it should be okay to newfs a
file system  2 TB, but you might not be able to fsck or dump/restore
it. :-(  Also, I think that page holds for -CURRENT, and may not apply
to 5.x.

But, Pawel Jakub Dawidek has worked on large disk support, and so
there's a good chance that his geom_stripe feature is able to handle  2
TB sizes.  You could test it via gstripe on large swap-backed md
providers.

I also dimly recall you may run into problems running out of RAM trying
to fsck very large filesystems.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: vinum or gvinum on FreeBSD 5.4

2005-04-21 Thread Paul Mather

On Fri, 2005-04-22 at 04:48 +0200, Vladimir Botka wrote:
 Hi,
 vinum is not stable under 5.4. After some research I found gmirror 
 (RAID1) is ok. There are some notes on:
 http://www.freebsd.org/releases/5.3R/errata.html
 
 Cheers,
 
 Vladimir.
 
 On Thu, 21 Apr 2005, Paul Mather wrote:
 
  On Thu, 2005-04-21 at 10:39 +0200, Claus Guttesen wrote:
  Hi.
 
  I may implement (g)vinum to stripe two 2 TB raid 5 volumes into a
  single 4 TB vinum-volume.

But geom_mirror (gmirror) would not do what the OP wants (i.e., stripe,
not mirror).

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: gmirror

2005-05-14 Thread Paul Mather

On Sat, 2005-05-14 at 11:30 +0100, Dominic Marks wrote:

 The seek times are way down, which is great, and makes sense using a
 round-robin strategy on the mirror, but my peak transfer rate has been
 almost halved too.
 
 I don't mind this too much as in my application low seek times are worth
 more than high transfer rates, but it is still puzzling to me to see such a
 remarkable drop in throughput.

For large sequential reads on a round-robin mirror, you might be forcing
each drive to do a lot more small seeks than if you just accessed the
data from a single drive, and those small seeks may be slowing down the
overall transfer significantly.  I would have thought that large,
whole-track caches prevalent on modern hard disks would ameliorate that
problem in an otherwise quiescent environment, but, dependent upon the
drive's caching policy, you never know...

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: gmirror

2005-05-14 Thread Paul Mather

On Sat, 2005-05-14 at 18:00 +0400, Vladimir Dzhivsanoff wrote:

 three parallel tasks of dd  is not good model for random reads ?

Probably not, especially if you start the parallel tasks going at the
same time.  That way, the second and third tasks are almost certainly
hitting data in the hard drive's cache, rather than the actual disk
platters, and so are more likely to be testing the interface transfer
speed than the hard drive's sustained performance.  For a better model
(of parallel large sequential transfers), you should at the very least
stagger the start times of each task, to minimise cache effects.

The better question to ask yourself is this: are large sequential
transfers a good model of my workload.  That is what you are testing
with your dd's.  Seek times are the dominant cost of a disk transfer.
Large sequential transfers are a best-case scenario for I/O measurements
because they involve minimal seek overheads.  However, best-case and
real-world are not usually the same thing.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: atacontrol raid1 vs. gmirror

2005-06-11 Thread Paul Mather

On Sat, 2005-06-11 at 14:42 -0400, Mike Jakubik wrote:
 Can someone explain tome the difference between a RAID1 setup done via
 atacontrol and gmirror? I have a VIA 6420 SATA150 controller, which also
 has raid, but is not supported by -stable.

Here are the main differences, as I see them:

atacontrol RAID:

- Only for ATA;
- Compatible with quite a few ATA RAID card BIOS metadata formats, hence
you can create the RAID using the RAID controller's BIOS menu;
- Supports only two-way mirroring (IIRC);
- Supports spares.

gmirror:

- Works with any GEOM provider (ATA, SCSI, ggate, etc.);
- Uses the last sector of each RAID component to store its own metadata;
- Supports N-way mirroring;
- Does not support spares (though gmirror activate/deactivate can be
used to associate a component with a mirror somewhat akin to having a
spare).


I found array rebuilding to be troublesome on atacontrol RAID---so much
so I abandoned it and used vinum and then gmirror instead for my RAID 1.
(I have a bootable geom_mirror setup, now.)  Mind you, that was in the
pre-ATA mk.III days, and I hear the RAID support underwent a big revamp
in the mk.III rewrite.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: atacontrol Raid, cannot re-add member to array

2004-07-04 Thread Paul Mather

On Sun, 2004-07-04 at 18:10, Simon L. Nielsen wrote:
 On 2004.07.04 23:33:28 +0200, Harald Schmalzbauer wrote:
 
  I've never tried ataraid with non-raid controllers but I doubt that 
  detach/attach would work.
 
 It does work, you just have to hotswap the disk [1].  I tried it
 some time ago, and I successfully did a rebuild, though I did kill one
 of the disks shortly after since I tried to plug in the power cable
 the wrong way during another test (yes that's stupid, I know :-) ).

So does this mean ATA RAID doesn't work on non-raid controllers that
have non-hot swappable drives attached?  (E.g., a drive get hard errors,
is marked as failed and the RAID as DEGRADED, and you have to shut down
the machine to remove and replace it---but ATA RAID won't recognise
it/rebuild onto it when you reboot with the replacement drive.)

If that is the case, the man page really should note that serious
limitation.

I have tried the detach/attach on a non-raid controller to simulate
failure and have *never* managed to get rebuild to work.  I've also
tried the shutdown/remove/replace/reboot method but, again, *never*
managed to get rebuild to work on a non-raid controller. :-(

I've never had any hotswap-capable drives to test the hot-swap
replacement/rebuild method. :-(

 Well, I'm using it just fine on my main mailserver (running 5.2.1) but
 that's a RAID1 with standard a ATA controller.

I wish it would work for me. :-)  I have a non-raid controller 5.2.1
system that I tried it on (with non-hot swap drives) that I eventually
had to bail and use Vinum on.  Alas, Vinum seems to be rotting at a fast
pace in CURRENT, so I'm worried about the prospect of being able to
upgrade that system when 5.3 rolls around... :-(

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa

___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Vinum problems in 5.3-BETA7?

2004-10-08 Thread Paul Mather

On Fri, 2004-10-08 at 07:52, Patrick M. Hausen wrote:

 We have a production system that runs on a vinum system drive
 configured like this:

[[Configuration omitted.]]

 It's currently running fine with FreeBSD 5.2.1-RELEASE-p10.
 
 After upgrading to 5.3-BETA7, buildworld, buildkernel, installkernel
 and reboot the system stops:
 
 vinum: loaded
 vinum: no drives found
 
 That's it. Of course it complains that it can't mount /dev/vinum/root.
 
 The list of detected GEOM devices at the mountroot  prompt
 includes ad0s1h, ad1s1h, ad0s1, ad1s1, ad0, ad1 and some more
 partitions.
 
 
 Where do I go from here? Is this expected behaviour due to the ongoing
 GEOM changes or should I go read Greg's how to debug vinum problems
 document? I will do that, no problem. Just want to know if it makes
 sense at all, because now everyone might tell me vinum is known broken in
 5.3 or similar.

Vinum is known broken in 5.3. :-)  You should be using geom_vinum
instead.  It will largely be a drop-in replacement for your above Vinum
configuration.  (I am using it on a similar root-on-vinum setup.)  The
main changes are these:

1) Load geom_vinum in /boot/loader.conf, e.g., add
'geom_vinum_load=YES' to /boot/loader.conf.  This will auto-detect
your Vinum on-disk configuration during boot.  You don't need any
rc.conf glue with geom_vinum.

2) Change vinum to gvinum in /etc/fstab.  E.g., use
/dev/gvinum/root instead of /dev/vinum/root

3) The userland utility is gvinum instead of vinum.

I am using geom_vinum on a root-on-vinum configuration under 6-CURRENT
since before RELENG_5 was branched, and I believe the same holds true
for RELENG_5 and HEAD as far as the above three points are concerned.

I don't know if there are plans to replace vinum entirely with gvinum
(and drop the g prefix) for 5.3-RELEASE.  Lukas Ertl would know.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa

___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: gvinum again?

2004-11-14 Thread Paul Mather

On Mon, 2004-11-15 at 00:36 -0500, Zaphod Beeblebrox wrote:
 I'm looking at gstat while fsck'ing a gvinum partition.  The trouble
 I'm seeing is that I don't see activity on the second disk.  Now...
 I'm using fsck -n ... just checking things ... so there's no writes,
 but does gvinum not (yet) load balance on reads?

Your observation is correct: it doesn't (yet) load balance across plexes
of mirrored volumes; geom_mirror (gmirror) does, though (and offers
various load-balancing options).

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: gvinum again?

2004-11-15 Thread Paul Mather

On Mon, 2004-11-15 at 02:31 -0500, Zaphod Beeblebrox wrote:
 On Mon, 15 Nov 2004 01:26:54 -0500, Paul Mather [EMAIL PROTECTED] wrote:
 
  Your observation is correct: it doesn't (yet) load balance across plexes
  of mirrored volumes; geom_mirror (gmirror) does, though (and offers
  various load-balancing options).
 
 Is this something planned for the near future?  I'm migrating legacy
 units here, so I don't really have a choice.

I don't know about the *near* future, but I believe it is on
Lukas' ([EMAIL PROTECTED]) list of things to do.  You'd probably have to
ask him how high it is on the list, though.

 ... also ... I suspect, whatever form it eventually takes, that volume
 management and geom and/or vinum is essential.

I my case I had a legacy root-on-vinum all-mirrored setup, and so was
interested in mirroring more than LVM.  (Like many, I'd used vinum just
as a way of accomplishing a software RAID 1 across two drives to provide
extra tolerance to drive failures.)  So, migrating from geom_vinum to
geom_mirror was not such a burden (or a hurdle).

I don't know if growfs is 100% robust enough yet to provide the other
important ingredient to a true LVM storage management system a la the
logical volume manager on AIX or AdvFS on Tru64, say.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: gvinum again?

2004-11-15 Thread Paul Mather

On Mon, 2004-11-15 at 19:33 +0100, Matthias Schuendehuette wrote:
 Hi,
 
 Am Montag, 15. November 2004 18:25 schrieb Paul Mather:
  [...]
  I don't know if growfs is 100% robust enough yet to provide the other
  important ingredient to a true LVM storage management system a la the
  logical volume manager on AIX or AdvFS on Tru64, say.
 
 Yes, it is. I use (g)vinum primarily as a LVM with concat plexes on 
 ProLiant Servers with SmartRAID-Controllers, so mirroring and/or RAID5 
 is done in hardware.
 
 The extension of filesystems on concat plexes works (simply) as 
 advertised... :-) At least *I* had no problems so far.

That's encouraging to hear!  Can you shrink volumes and filesystems like
you can on, say, AdvFS under Tru64?  That would be really nice.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: vmstat regression (RELENG_4 - RELENG_5)

2004-11-17 Thread Paul Mather

On Wed, 2004-11-17 at 09:51 +0200, Dmitry Pryanishnikov wrote:
 Hello!
 
While playing with fresh installation of 5.3-RELEASE (GENERIC kernel) I've
 noticed that I can only see my HDD in the left lower corner of the systat's
 vmstat screen. I also have working floppy and CD-RW drives, both are detected
 and work, but I can't see data transfer stats for them. In 4.10-RELEASE, I see
 ad0,acd0,fd0 here, but in 5.3-RELEASE I see only ad0.
After reading manual page systat(8) I see the root reason of this 
 behaviour:
 kernel devstat(9) subsystem only sees my ad0. Under 4.10-RELEASE, sysctl
 kern.devstat.numdevs gives 3, but under 5.3-RELEASE only 1. How can I enable
 disk statistics gathering for disks other than HDD in 5.3-RELEASE?

The gstat(8) command will display statistics for all GEOM disks,
including floppy and CD-{ROM,R,RW}.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Mount pending error?

2004-11-23 Thread Paul Mather

Last night, this appeared in my logs (and on my console):

Nov 23 03:05:36 zappa kernel: /data: mount pending error: blocks -1696 files 0


What is a mount pending error?  I have heard this in conjunction with
unclean shutdowns and fsck, but my system was not shut down recently,
nor has /data (on a geom_stripe) been unmounted (cleanly or otherwise).

The only thing I have done that could be related is to make a snapshot
(later removed), but it was not of the filesystem being complained
about.  (Also, I create and mount snapshots of all my filesystems as
part of my nightly TSM backup.)

I guess I should shutdown and fsck just to make certain everything is
okay.  I am puzzled why this happened, though.  (I've been running the
backups for a while, and have not had any complaints.)

BTW, I'm running RELENG_5, last rebuilt Nov. 19th 2004.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: graid3 - requirements or manpage wrong?

2004-11-26 Thread Paul Mather

Brian Szymanski wrote:
As for the swap: why would you want to do that? It was my understanding
that the kernel load balanced swap requests across drives?
 

You'd want to do it not for load-balancing but for fault tolerance.  
With a RAID 1/3/5 setup you could have a drive fail and still have 
swapping (and hence the system) continue to work.  That's not the same 
as (or true of) having multiple swap partitions with the system 
balancing load over all of them.

Cheers,
Paul.
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: bios disk numbers and device names

2004-11-29 Thread Paul Mather

On Mon, 2004-11-29 at 15:06 +0100, Andrea Campi wrote:

 The manpage explains it all, and that's all I know as well. glabel
 specifies a transient label, i.e. it's not saved on the disk, so you
 loose it on reboot or if the disk goes away.

Glabel can create both transient and permanent labels: glabel create
label provider is transient; glabel label label provider is
permanent.  In the latter case, the label metadata is stored in the last
sector of provider.  If geom_label is built into the kernel, or loaded
as a kernel module at boot (e.g., via /boot/loader.conf) then labelled
providers will be automagically discovered and /dev/label/... entries
created during boot.  In this way, devices can be given logical labels
that will stick with them when they move around.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: drive failure during rebuild causes page fault

2004-12-13 Thread Paul Mather

On Mon, 2004-12-13 at 10:28 -0800, Doug White wrote:
 On Sun, 12 Dec 2004, Joe Rhett wrote:
 
  On Sun, Dec 12, 2004 at 09:59:16PM -0800, Doug White wrote:
   Thats a nice shotgun you have there.
 
  Yessir.  And that's what testing is designed to uncover.  The question is
  why this works, and how do we prevent it?
 
 I'm sure Soren appreciates you donating your feet to the cause :)
 
 Why it works: the system assumes the administrator is competent enough to
 not yank a disk that is being rebuilt to.

That's not quite fair.  He was obviously testing to see how resilient
ATA RAID is to drive failures during rebuilding, as part of a series of
tests.  (Obviously, it is not.)  If you look at his original message, he
did not even yank the disk.  He detached it in a somewhat orderly
fashion using atacontrol detach.  (One can argue that physically
yanking it might have been a more accurate, if more severe failure
test.)  This makes the ensuing panic even more sad.  (Would the same
panic result if the disk being rebuilt fell victim to one of those
TIMEOUT - WRITE_DMA errors that are in vogue nowadays and was detached
by the system?  I get those errors occasionally [never used to under 5.1
on the exact same hardware] but my geom_mirror has coped with it so far,
thankfully.)

It's reasonable to conduct simulated failure testing of ATA RAID (or
others such as geom_mirror and geom_vinum) prior to adopting it on your
system.  I know I did in the case of ATA RAID and abandoned it precisely
because it turned out for me to be too flaky when it came to error
recovery.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: FreeBSD 5.3 and vinum upgrade #2

2004-12-18 Thread Paul Mather

On Sat, 2004-12-18 at 20:52 +0100, Nikolaj Hansen wrote:

 While some uncommon configurations, such as multiple vinum drives on a 
 disk, are not supported, it is generally backward compatible. Note that 
 for the geom(4)-aware vinum, its new userland control program, gvinum, 
 should be used, and it is not yet feature-complete.
 
 I think I have to disagree calling muliple drives on a disk uncommon.
 In fact, I think I remember that being the way it was demonstrated in an 
 old version of the handbook. Here is my current setup after rolling back 
 to FreeBSD 5.2.1:

I think you are misunderstanding things a little, here.  It's not that
multiple vinum volumes per disk can't be handled, but instead it's
multiple vinum configurations per disk that are problematic.  In other
words, I believe it's not supported to have, say, a /dev/da0s1g vinum
partition (containing vinum volumes, plexes, and subdisks) and also,
say, a /dev/da0s1h vinum partition (again, containing vinum volumes,
plexes, and subdisks).  Such a setup was okay under the old vinum, but
is not okay under geom_vinum (AFAIK). 

 As far as I can tell, the new 5.3 release makes this disk configuration 
 invalid?

The vinum configuration you listed appears fine for geom_vinum.  I
transitioned by old root-on-vinum all-mirrored setup over to geom_vinum
without any problems.  (Yours looks the same, except that you also have
a third drive with a single concat plex volume on it.)

 If not I have a major problem here :-(

The biggest problem you'll have is if your system suffers the ATA
TIMEOUT - WRITE_DMA woe that bedevils some of us under 5.3.  When that
happens, your mirror will be knocked into a degraded state (half of your
mirrored plexes will be marked down) even though the drive is okay.
Unfortunately, without setstate being implemented in gvinum to mark
the drive as up, thereby allowing you to issue gvinum starts for the
downed plexes, there's little more you can do to get the failed
drive recognised as being in the up state other than to reboot.  (You
might be able to use atacontrol to stop/start or otherwise reset the
drive; in my particular system I can't use atacontrol detach/attach
because they're both on the same channel.)  At any rate, every so often,
when this happens, you'll have to resynchronise the failed plexes,
which *really* sucks the I/O life out of the system because there's no
way to throttle back reconstruction, unlike with geom_mirror (which has
two sysctls to govern the load imposed by resynchronisation).

But, it looks like you're lucky, because your mirrored drives are SCSI.
I don't know about your ATA concat plex volume, though...

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: FreeBSD 5.3 and vinum upgrade #2

2004-12-18 Thread Paul Mather

On Sun, 2004-12-19 at 00:15 +0100, Matthias Schuendehuette wrote:
 Am Samstag, 18. Dezember 2004 23:35 schrieb Paul Mather:
 
  The biggest problem you'll have is if your system suffers the ATA
  TIMEOUT - WRITE_DMA woe that bedevils some of us under 5.3.  When
  that happens, your mirror will be knocked into a degraded state (half
  of your mirrored plexes will be marked down) even though the drive is
  okay. Unfortunately, without setstate being implemented in gvinum
  to mark the drive as up, thereby allowing you to issue gvinum
  starts for the downed plexes, there's little more you can do to
  get the failed drive recognised as being in the up state other
  than to reboot. [...]
 
 'gvinum setstate' was MFCed from -current together with 'gvinum 
 checkpatity' and 'gvinum rebuildparity' a week ago or so...
 
 So that should make it easier to handle these ATA-Problems...

That's great to hear!  (I'd been hoping for an MFC5.)  It's handy, too,
as I just recently got another one of those TIMEOUT - WRITE DMA
induced failures, which was getting to be a real drag. :-(

I'll update my 5.3-STABLE system right away...

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: FreeBSD 5.3 and vinum upgrade #2

2004-12-21 Thread Paul Mather

On Tue, 2004-12-21 at 19:00 +0100, Nikolaj Hansen wrote:

 I also do not think it belongs in the stable branch just yet :-D Any hope
 of you fixing the old vinum in the 5.3 branch or is it a wait for the 5.4?

AFAIK, the old vinum will *never* be fixed in the 5.3 branch or any
subsequent branch, so don't hold your breath for 5.4.  The fact that it
wasn't being fully GEOM-ified (and hence was dying from neglect in 5.x)
was the whole reason for Lukas to embark on geom_vinum.

Geom_vinum *is* vinum for all intents and purposes, from this point on.
The alternative to not having geom_vinum in the stable branch would be
not to have vinum support at all.

Cheers,

Paul.

PS: If you're unhappy with geom_vinum stability and performance, and
you're using it mainly to do software RAID 1 (as opposed to using it for
its LVM features), you might consider using geom_mirror instead.  I
converted one of my root-on-geom_vinum mirrored systems over to a
bootable geom_mirror.  (I did it in-place, too.)  Right now, geom_mirror
will load-balance reads across disks, whereas geom_vinum does not, which
is something in geom_mirror's favour.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: how to remote update 4.10 - 5.3?

2004-12-29 Thread Paul Mather

Palle Girgensohn wrote:
I've tried the UPDATING instructions, both locally and remotely (the 
latter failed ;-). But really, everything has to be reinstalled, ports 
and the lot, so a new install is probably the best way...

That's not so bad.  A reinstall means you can newfs your partitions as 
UFS2 filesystems, which wouldn't be the case if you upgraded 4.10 in-place.

Cheers,
Paul.
--
e-mail: [EMAIL PROTECTED]
Without music to decorate it, time is just a bunch of boring production
deadlines or dates by which bills must be paid.
   --- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: how to remote update 4.10 - 5.3?

2004-12-30 Thread Paul Mather

Igor Pokrovsky wrote:
Are there any other advantages of UFS2 over UFS except maximum disk size?
 

There's FFS snapshots capability in UFS2, for starters...
Cheers,
Paul.
--
e-mail: [EMAIL PROTECTED]
Without music to decorate it, time is just a bunch of boring production
deadlines or dates by which bills must be paid.
   --- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: CFLAGS=-O2 -pipe problems in RELENG_5 ?

2005-01-20 Thread Paul Mather

On Thu, 2005-01-20 at 19:17 +, Chris wrote:
 what does -fno-strict-alias do? I cannot find it in man gcc

It is a truncation of -fno-strict-aliasing.  The flag does not appear
to be described in the gcc man page, but is documented in the gcc info
page (search for -fstrict-aliasing).  To quote the info page,
-fstrict-aliasing allows the compiler to assume the strictest aliasing
rules applicable to the language being compiled. ...

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: CFLAGS=-O2 -pipe problems in RELENG_5 ?

2005-01-20 Thread Paul Mather

On Thu, 2005-01-20 at 16:44 -0500, Mike Jakubik wrote:
 Paul Mather said:
 
  On Thu, 2005-01-20 at 19:17 +, Chris wrote:
  what does -fno-strict-alias do? I cannot find it in man gcc
 
  It is a truncation of -fno-strict-aliasing.  The flag does not appear
  to be described in the gcc man page, but is documented in the gcc info
  page (search for -fstrict-aliasing).  To quote the info page,
  -fstrict-aliasing allows the compiler to assume the strictest aliasing
  rules applicable to the language being compiled. ...
 
 Wouldn't we want strict aliasing then?

Yes, but unfortunately some ports circumvent the strict aliasing rules,
which is why those ports break when building with -O2 (which causes
-fstrict-aliasing to be enabled).

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: Starting gvinum/vinum with RAID 5 and Compaq 39160 - 5.3-STABLE

2005-01-24 Thread Paul Mather

On Mon, 2005-01-24 at 14:29 -0500, Michael L. Squires wrote:
 I've been playing with vinum and gvinum with a RAID-1 boot setup (now 
 abandoned, hardware RAID was $29 - SM P4DC6+ with Adaptec 2005S) and with 
 a RAID5 array using a Compaq version of the Adaptec 39160 card.
 
 I've found that only geom_vinum_load=YES works in order to get the RAID 
 5 array mounted at boot time; vinum_load with or without 
 vinum.autostart doesn't work.  (Doesn't work means that the boot fails 
 when /dev/vinum/raid can't be mounted as per /etc/fstab, and I have to 
 execute vinum before I can mount it while running single user after the 
 crash).
 
 I've seen one other message that vinum_load didn't work for one other 
 5.3-R user, but I don't know if we both made the same mistake or if 
 something else is wrong.

For 5.3-STABLE you should be using geom_vinum.  That means having
'geom_vinum_load=YES' in /boot/loader.conf; /dev/gvinum/raid...
in /etc/fstab (in your case); and using gvinum for userland control, not
vinum.  The vinum /etc/rc.conf knobs are not applicable to geom_vinum.

Mixing vinum and gvinum is not recommended.  If you just want a RAID-1
boot setup, you might consider geom_mirror instead.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: opinion on which software RAID to use

2006-03-02 Thread Paul Mather

On Thu, 2006-03-02 at 14:11 -0500, Vivek Khera wrote:

 Anyhow, I see at least three ways to set up a mirror of these drives  
 and/or partitions:
 
   gvinum
   gmirror
   atacontrol
 
 Any opinions on which has both qualities: easy to configure/manage  
 (ie, recover after failure) and performance?  The handbook RAID page  
 doesn't even mention gmirror.  The atacontrol seems very simple to  
 use, at least.
 
 Let me know what you think.  Thanks!

For basic mirroring, I'd say go for gmirror.  I've been using it
successfully for a long time.  I migrated to it from vinum/gvinum
because it offered more flexible read load-balancing (you can choose
between various policies) and easier recovery.  It can even do N-way
mirroring.

I tried atacontrol in the past, but could never get it to reconstruct
after a simulated failure.  With gmirror, you have the option of
automatic or manual reconstruction for failed/stale providers.

You might want to consider gvinum if you think you might want to use
RAID 5 or LVM-type aggregations in the future, but if you are just
planning on mirroring I'd strongly suggest going with gmirror.

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: portsnap mirror servers

2006-04-21 Thread Paul Mather

On Fri, 2006-04-21 at 14:40 +0200, Benjamin Lutz wrote:
 On Wednesday 19 April 2006 00:44, Colin Percival wrote:
  I have a list of people who have offered mirrors, but so far I haven't
  seen any need for additional mirrors -- the two which already exist are
  showing no signs of slowing down.
 
 Hm, but I see a quite noticeable speed difference between portsnap1 and 
 portsnap2. The second one is quite a bit faster.

I notice that on 4.x portsnap never finds any mirrors because the grep
of the output returned by host -t srv ... is not appropriate for 4.x's
version of /usr/bin/host, which produces output different to that of 5.x
onwards (a BIND8 vs BIND9 issue, I guess).  So, maybe because of this,
all of the portsnaps running on 4.x machines are hitting the same server
each time instead of randomly choosing a mirror, thereby causing that
mirror to be a bit more loaded?

Cheers,

Paul.
-- 
e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

Re: dumping large partition to USB drive fails

2007-06-28 Thread Paul Mather


On 28 Jun 2007, at 9:06 PM, Roland Smith wrote:


On Thu, Jun 28, 2007 at 12:24:15PM -0700, Jeremy Chadwick wrote:

On Thu, Jun 28, 2007 at 08:55:46PM +0200, Roland Smith wrote:
A tip from Paul Mather pointed to problems with USB/firewire  
chipset,
the PL-3507, which was what I found in my enclosure. Most likely  
this is

the culprit. I'm going to buy a new enclosure.


Can you provide these details (forward what Paul sent you, etc.)?  It
would be nice to have this info somewhere in the archives in case
someone mails about similar...


Here it is;


I don't have your original posting, so I don't know what model of
chipset you have, though I do dimly recall it being a Prolific,  
which is

why I thought I'd e-mail you.  Given that the PL3507 had an early
history of problems when writing to it using large transfers (and  
that
GELI uses large transfers, IIRC), and you say that enclosure is  
old, you

might want to see if chipset bugs are the source of your problem.

Here is a link I had bookmarked when I was looking for firmware  
updates for

the Prolific PL3507:
http://missig.org/julian/blog/2004/06/10/prolific-pl3507-firewire- 
device/


Here is a link to the corruption issue that plagued various chipsets,
including the Prolific PL3507:
http://www.bustrace.com/delayedwrite/index.htm


My apologies for not doing a reply all when responding to Roland  
initially.  As a bit of extra context to the above, a while ago I  
bought a cheap external FireWire/USB DVD writer for my iBook G4.  I  
was disappointed to discover upon using it that its read performance  
when using the FireWire interface was much poorer than when using its  
USB connector.  (It annoyed me especially because I had bought  
something that supported FireWire because I wanted to daisy-chain it  
with my Western Digital MyBook external hard drive to avoid using up  
one of the two USB ports on my iBook.)


Because the FireWire and USB performance of my MyBook external hard  
drive is good, I doubted the problem lay with the iBook, and  
suspected the enclosure chipset instead.  When I began searching the  
Web for information about the Prolific PL3507 chipset, used in the  
enclosure, I quickly began discovering horror stories (see above  
links for a taste).  I realised that, if I had my time over and had  
known this DVD writer's enclosure used that chipset, I would not have  
bought it and would have looked for something whose enclosure used a  
different one (e.g., Oxford).  I also realised that not all  
enclosures are created equal, and it's definitely worthwhile to know  
what chipset it uses beforehand and make sure you avoid the bad ones  
(like the Prolific). :-)


Somewhat fortunately, revisions B and C of the Prolific PL3507 can  
have their firmware upgraded via USB.  (Unfortunately, Prolific only  
appear to make a Windows flash writer application available.)  It  
seems that the latest firmware at least appears to correct the worst  
excesses of the chipset, though people do still seem to harbour  
mistrust over the hardware and complain of performance problems.


Although I have no plans to buy another enclosure for this DVD writer  
(because it works well enough for the task in hand), I think I  
would were it housing a hard drive, if nothing else because of the  
poor performance relative to the WD MyBook.


Cheers,

Paul.

e-mail: [EMAIL PROTECTED]

Without music to decorate it, time is just a bunch of boring production
 deadlines or dates by which bills must be paid.
--- Frank Vincent Zappa


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to [EMAIL PROTECTED]

ZFS pool upgrade to v14 broke ZFS booting

2010-01-27 Thread Paul Mather

I have a FreeBSD guest running under VirtualBox 3.1.2 on Mac OS X.  It's 
running a recent 8-STABLE and is a ZFS-only install booting via gptzfsboot.  I 
use this VirtualBox guest as a test install.

A day or so ago I noticed zpool status report that my pool could be upgraded 
from v13 to v14.  I did this, via zfs upgrade -a.

Today, when attempting to fire up this FreeBSD guest in VirtualBox I get this 
on the console:

=
ZFS: unsupported ZFS version 14 (should be 13)
No ZFS pools located, can't boot
_
=

and the boot halts at that point.  I don't see the boot menu I normally see 
that lists the opportunity to boot single-user; disable ACPI; and so on.

Has anyone else experienced this?  Is this a mismatch between gptzfsboot and my 
current pool version?  (Gptzfsboot includes the message I'm seeing.)  Am I 
supposed to rebuild and replace gptzfsboot every time the pool version is 
updated?  (There was no advisory in /usr/src/UPDATING concerning this, nor do I 
remember seeing it elsewhere.)

Now I have to figure out how to dig out from this.  Well, I guess that's what 
test installations are for... :-)

Cheers,

Paul.___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: make world fails in usr.sbin/config?

2010-05-24 Thread Paul Mather

On May 24, 2010, at 8:29 AM, Jeremy Chadwick wrote:

 For added posterity, it looks like usr.sbin/config has been mostly
 untouched for quite some time, sans mkoptions.c and mkmakefile.c:
 
 http://www.freebsd.org/cgi/cvsweb.cgi/src/usr.sbin/config/

Having said that, there is this entry in /usr/src/UPDATING dating from early 
May:

20100502:
The config(8) command has been updated to maintain compatibility
with config files from 8.0-RELEASE.  You will need a new version
of config to build kernels (this version can be used from 8.0-RELEASE
forward).  The buildworld target will generate it, so following
the instructions in this file for updating will work glitch-free.
Merely doing a make buildkernel without first doing a make buildworld
(or kernel-toolchain), or attempting to build a kernel using
traidtional methods will generate a config version warning, indicating
you should update.


Stefan's kernel looks to have last been built on 20th February 2010.  It isn't 
explicit in the first posting of this thread how Stefan is doing his build, so 
there is a possibility that he's being affected by the above UPDATING entry.

Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: make world fails in usr.sbin/config?

2010-05-24 Thread Paul Mather

On May 24, 2010, at 9:27 AM, Stefan Bethke wrote:

 I've just moved from a root on UFS plus data on ZFS setup, to root on ZFS; 
 that's the only real difference I can think of.  Although I don't see how 
 that would affect building world, especially since I've had src and obj on 
 ZFS before.

FWIW, I successfully completed buildworld + buildkernel for 8-STABLE on a 
root-on-ZFS system yesterday.

Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: Make ZFS auto-destroy snapshots when the out of space?

2010-05-31 Thread Paul Mather

On May 30, 2010, at 10:35 PM, David Magda wrote:

 An event framework would certainly be helpful in a general sense (Linux has 
 event(3) AFAIK), and that could certainly be useful for purging snapshots 
 during resource constrained situations. But even if we don't have it, I doubt 
 a fork(2) from cron(8) and a statfs(2) would be onerous on a system. :)

Devd already receives several ZFS-based events (failed vdev, I/O error, 
checksum mismatch, etc.), so perhaps it would be useful to add another, e.g., 
space which is set to be triggered when a pool attains a certain percentage 
full.  This could default to 100%, but be capable of being set lower by an 
associated kernel sysctl.  You could then have any auto-pruning/snapshot 
management script triggered from devd.  (You'd probably also have to figure out 
some kind of throttling mechanism for this devd event, too.)

Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Problems with ATA_CAM support in RELENG_8

2010-06-30 Thread Paul Mather

I am running FreeBSD/amd64 RELENG_8 on a Dell Optiplex 745.  The hard drive in 
the system is SATA and I have Normal, not Legacy SATA support enabled in 
the BIOS.  (BIOS is V2.6.4.)  I am assuming this will enable native AHCI mode 
for the drive.

I built a kernel with ATA_CAM support, but for some reason the SATA drive is 
probing at slow, UDMA2, speeds:

ada0 at ata2 bus 0 scbus0 target 0 lun 0
ada0: ST3160812AS 3.ADJ ATA-7 SATA 2.x device
ada0: 33.300MB/s transfers (UDMA2, PIO 8192bytes)
ada0: 152587MB (31250 512 byte sectors: 16H 63S/T 16383C)

When I run camcontrol identify on the drive, it states the drive is capable 
of UDMA6 speeds:

backup# camcontrol identify ada0
pass0: ST3160812AS 3.ADJ ATA-7 SATA 2.x device
pass0: 33.300MB/s transfers (UDMA2, PIO 8192bytes)

protocol  ATA/ATAPI-7 SATA 2.x
device model  ST3160812AS
firmware revision 3.ADJ
serial number 5LS8PPDD
cylinders 16383
heads 16
sectors/track 63
sector size   logical 512, physical 512, offset 0
LBA supported 268435455 sectors
LBA48 supported   31250 sectors
PIO supported PIO4
DMA supported WDMA2 UDMA6 

Feature  Support  EnableValue   Vendor
read ahead yes  yes
write cacheyes  yes
flush cacheyes  yes
overlapno
Tagged Command Queuing (TCQ)   no   no
Native Command Queuing (NCQ)   yes  32 tags
SMART  yes  yes
microcode download yes  yes
security   yes  no
power management   yes  yes
advanced power management  no   no  65278/0xFEFE
automatic acoustic management  yes  yes 254/0xFE208/0xD0
media status notification  no   no
power-up in Standbyno   no
write-read-verify  no   no  0/0x0
unload no   no
free-fall  no   no
data set management (TRIM) no


So, why the slower speed?  Also, camcontrol identify states the drive 
supports NCQ with up to 32 tags supported, yet camcontrol tags ada0 reports 
only 1 device opening, not 32 as I would expect:

backup# camcontrol tags ada0
(pass0:ata2:0:0:0): device openings: 1

To enable ATA_CAM AHCI support, I included this in my kernel config file:

# ATA and ATAPI devices
options ATA_CAM
device  ahci
device  atacore
device  atapci
options ATA_STATIC_ID   # Static device numbering

Is this the correct way to enable ATA_CAM AHCI support?  I tried initially 
including just options ATA_CAM and device ahci but the resultant kernel 
would not probe my disk drive as ada0.

Does my problem lie with my kernel config or is the Dell Optiplex 745 BIOS 
brain dead when it comes to AHCI native support?  The only option it appears to 
have in the BIOS is Normal and Legacy when it comes to the SATA controller 
mode.

I've attached my full kernel config and dmesg at the end of this e-mail.

Cheers,

Paul.

Kernel config:

#
# GENERIC -- Generic kernel configuration file for FreeBSD/amd64
#
# For more information on this file, please read the config(5) manual page,
# and/or the handbook section on Kernel Configuration Files:
#
#
http://www.FreeBSD.org/doc/en_US.ISO8859-1/books/handbook/kernelconfig-config.html
#
# The handbook is also available locally in /usr/share/doc/handbook
# if you've installed the doc distribution, otherwise always see the
# FreeBSD World Wide Web server (http://www.FreeBSD.org/) for the
# latest information.
#
# An exhaustive list of options and more detailed explanations of the
# device lines is also present in the ../../conf/NOTES and NOTES files.
# If you are in doubt as to the purpose or necessity of a line, check first
# in NOTES.
#
# $FreeBSD: src/sys/amd64/conf/GENERIC,v 1.531.2.13 2010/05/02 06:24:17 imp Exp 
$

cpu HAMMER
ident   VPN

# To statically compile in device wiring instead of /boot/device.hints
#hints  GENERIC.hints # Default places to look for devices.

# Use the following to compile in values accessible to the kernel
# through getenv() (or kenv(1) in userland). The format of the file
# is 'variable=value', see kenv(1)
#
# env   GENERIC.env

#makeoptionsDEBUG=-g# Build kernel with gdb(1) debug symbols

options SCHED_ULE   # ULE scheduler
options PREEMPTION  # Enable kernel thread preemption
options INET# InterNETworking
options INET6   # IPv6 communications protocols
options SCTP# Stream Control Transmission Protocol
options FFS # Berkeley Fast Filesystem
options SOFTUPDATES # Enable FFS soft updates support
options UFS_ACL #

Re: Problems with ATA_CAM support in RELENG_8

2010-06-30 Thread Paul Mather

On Jun 30, 2010, at 1:38 PM, Alexander Motin wrote:

 To enable ATA_CAM AHCI support, I included this in my kernel config file:
 
 # ATA and ATAPI devices
 options ATA_CAM
 device  ahci
 device  atacore
 device  atapci
 options ATA_STATIC_ID   # Static device numbering
 
 Is this the correct way to enable ATA_CAM AHCI support?  I tried
 initially including just options ATA_CAM and device ahci but the
 resultant kernel would not probe my disk drive as ada0.
 
 Your controller seems to not report AHCI support. In such case legacy
 mode driver attaches to it. But as soon as you have no `device ataintel`
 line, only generic driver was there, limited by UDMA2 mode.
 
 PS: ATA_STATIC_ID is useless when ATA_CAM option enabled.

Thank you (and Jeremy Chadwick) for the help and information.  The kernel 
configuration options I used above were taken from a VirtualBox FreeBSD/amd64 
install I have that I converted over to ATA_CAM when the code first went into 
RELENG_8 and it wasn't exactly clear at the time what options were absolutely 
required.  (I'm not even sure that options ATA_CAM is needed any more, given 
device ahci implies it.)

 Does my problem lie with my kernel config or is the Dell Optiplex 745
 BIOS brain dead when it comes to AHCI native support?  The only option
 it appears to have in the BIOS is Normal and Legacy when it comes
 to the SATA controller mode.
 
 Your controller is not identified as AHCI:
 
 atapci0: Intel ATA controller port
 0xfe00-0xfe07,0xfe10-0xfe13,0xfe20-0xfe27,0xfe30-0xfe33,0xfec0-0xfecf,0xecc0-0xeccf
 irq 20 at device 31.2 on pci0
 atapci0: [ITHREAD]
 ata2: ATA channel 0 on atapci0
 ata2: [ITHREAD]
 ata3: ATA channel 1 on atapci0
 ata3: [ITHREAD]
 atapci1: Intel ATA controller port
 0xfe40-0xfe47,0xfe50-0xfe53,0xfe60-0xfe67,0xfe70-0xfe73,0xfed0-0xfedf,0xecd0-0xecdf
 irq 20 at device 31.5 on pci0
 atapci1: [ITHREAD]
 ata4: ATA channel 0 on atapci1
 ata4: [ITHREAD]
 ata5: ATA channel 1 on atapci1
 ata5: [ITHREAD]
 
 You may check `pciconf -lvcb` output. For ICH8 with AHCI you should see
 there something like:
 ah...@pci0:0:31:2:  class=0x010601 card=0xa00c14ff chip=0x28298086
 rev=0x03 hdr=0x00
vendor = 'Intel Corporation'
device = 'Mobile SATA AHCI Controller'
class  = mass storage
subclass   = SATA
bar   [10] = type I/O Port, range 32, base 0xe880, size  8, enabled
bar   [14] = type I/O Port, range 32, base 0xe800, size  4, enabled
bar   [18] = type I/O Port, range 32, base 0xe480, size  8, enabled
bar   [1c] = type I/O Port, range 32, base 0xe400, size  4, enabled
bar   [20] = type I/O Port, range 32, base 0xe080, size 32, enabled
bar   [24] = type Memory, range 32, base 0xfeaff800, size 2048, enabled
cap 05[80] = MSI supports 4 messages enabled with 4 messages
cap 01[70] = powerspec 3  supports D0 D3  current D0
cap 12[a8] = SATA Index-Data Pair
 
 Pay attention to subclass = SATA.

Then that's where my problem lies:

atap...@pci0:0:31:2:class=0x01018f card=0x01da1028 chip=0x28208086 rev=0x02 
hdr=0x00
vendor = 'Intel Corporation'
device = 'SATA IDE Controller:4 port (82801HB/HR/HH/HO)'
class  = mass storage
subclass   = ATA
bar   [10] = type I/O Port, range 32, base 0xfe00, size  8, enabled
bar   [14] = type I/O Port, range 32, base 0xfe10, size  4, enabled
bar   [18] = type I/O Port, range 32, base 0xfe20, size  8, enabled
bar   [1c] = type I/O Port, range 32, base 0xfe30, size  4, enabled
bar   [20] = type I/O Port, range 32, base 0xfec0, size 16, enabled
bar   [24] = type I/O Port, range 32, base 0xecc0, size 16, enabled
cap 01[70] = powerspec 3  supports D0 D3  current D0

atap...@pci0:0:31:5:class=0x010185 card=0x01da1028 chip=0x28258086 rev=0x02 
hdr=0x00
vendor = 'Intel Corporation'
device = '82801H (ICH8 Family) 2 port SATA Controller'
class  = mass storage
subclass   = ATA
bar   [10] = type I/O Port, range 32, base 0xfe40, size  8, enabled
bar   [14] = type I/O Port, range 32, base 0xfe50, size  4, enabled
bar   [18] = type I/O Port, range 32, base 0xfe60, size  8, enabled
bar   [1c] = type I/O Port, range 32, base 0xfe70, size  4, enabled
bar   [20] = type I/O Port, range 32, base 0xfed0, size 16, enabled
bar   [24] = type I/O Port, range 32, base 0xecd0, size 16, enabled
cap 01[70] = powerspec 3  supports D0 D3  current D0


I thought ICH8 supported AHCI, but maybe it's only ICH8R that does?  I'm 
assuming that subclass = ATA means the controller can't operate in AHCI mode. 
 The BIOS setting is also confusing.  It has two options, Normal and 
Legacy.  Normal mode says, The hard drive controller is configured for 
native mode.  This mode provides the highest drive performance and most 
flexibility.  I guess I misinterpreted native mode to be AHCI mode.

Thanks again for the help and for clearing things up.

Cheers,

Re: Problems with ATA_CAM support in RELENG_8

2010-07-01 Thread Paul Mather

On Jun 30, 2010, at 5:18 PM, Alexander Motin wrote:

 Paul Mather wrote:
 PS: ATA_STATIC_ID is useless when ATA_CAM option enabled.
 
 Thank you (and Jeremy Chadwick) for the help and information.  The kernel 
 configuration options I used above were taken from a VirtualBox 
 FreeBSD/amd64 install I have that I converted over to ATA_CAM when the code 
 first went into RELENG_8 and it wasn't exactly clear at the time what 
 options were absolutely required.  (I'm not even sure that options ATA_CAM 
 is needed any more, given device ahci implies it.)
 
 `options ATA_CAM` enables CAM wrapper for legacy drivers, which gave you
 adaX devices instead of adX. It doesn't give major benefits, just
 unifies behavior.

So, does that mean if you omit options ATA_CAM and have device ahci you 
will get adX devices, not adaX devices?  In other words, if you have device 
ahci (or device siis or device mvs) will you will always get adaX devices, 
whether or not you have options ATA_CAM in your kernel config file?

Does options ATA_CAM work with device ata or the modular ATA subsystem?  Is 
that the intended use of options ATA_CAM: to provide adaX devices and a CAM 
interface for accessing ATA devices?

Cheers,

Paul.___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: 8.x grudges

2010-07-08 Thread Paul Mather

On Jul 7, 2010, at 5:39 PM, Garrett Cooper wrote:

 On Wed, Jul 7, 2010 at 1:17 PM, Mikhail T. mi+t...@aldan.algebra.com wrote:
 07.07.2010 14:59, Jeremy Chadwick ???(??):
 
  FREEBSD_COMPAT7 kernel option is, apparently, a requirement (and
  thus not an option) -- the kernel-config files, that worked with
  7.x, break without this option in them (in addition to all the
  nuisance, that's documented in UPDATING -- which, somehow, makes
  the breakage acceptable). config(8) would not warn about this, but
  kernel build fails.
 
 
 We don't use this option (meaning it's removed from our kernels).  It's
 definitely not required.  All it does is ensure your kernel can
 comprehend executables/binaries built on 7.x.
 
 
 Attached is the kernel config-file (i386), that worked fine under 7.x. The
 kernel-compile will break (some *freebsd7* structs undefined), without the
 COMPAT_FREEBSD7 option. Try it for yourself...
 
 options   SYSVSHM # SYSV-style shared memory
 options   SYSVMSG # SYSV-style message queues
 options   SYSVSEM # SYSV-style semaphores
 
 Those require COMPAT_FREEBSD7. 

I have an FreeBSD/amd64 8.1-PRERELEASE system with all COMPAT_FREEBSDx options 
except COMPAT_FREEBSD32 commented out in my kernel config file and it builds 
fine.  So, unless I am misunderstanding you, I don't think options 
SYSV{SHM,MSG,SEM} requires COMPAT_FREEBSD7.

Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: Using GTP and glabel for ZFS arrays

2010-07-22 Thread Paul Mather

On Jul 21, 2010, at 11:05 PM, Dan Langille wrote:

 I hope my terminology is correct
 
 I have a ZFS array which uses raw devices.  I'd rather it use glabel and 
 supply the GEOM devices to ZFS instead.  In addition, I'll also partition the 
 HDD to avoid using the entire HDD: leave a little bit of space at the start 
 and end.
 
 Why use glabel?
 
 * So ZFS can find and use the correct HDD should the HDD device ever
   get renumbered for whatever reason.  e.g. /dev/da0 becomes /dev/da6
   when you move it to another controller.

I have created ZFS pools using this strategy.  However, about a year ago I 
still fell foul of the drive shuffling problem, when GEOM labels appeared not 
to be detected properly:

http://lists.freebsd.org/pipermail/freebsd-geom/2009-July/003654.html

This was using RELENG_7, and the problem was provoked by external USB drives.

The same issue might not occur with FreeBSD 8.x, but I thought I'd point out my 
experience as a possible warning about using glabel.

Nowadays, I use GPT labels (gpart ... -l somelabel, referenced via 
/dev/gpt/somelabel).

 Why use partitions?
 
 * Primarily: two HDD of a given size, say 2TB, do not always provide
   the same amount of available space.  If you use a slightly smaller
   partition instead of the entire physical HDD, you're much more
   likely to have a happier experience when it comes time to replace an
   HDD.
 
 * There seems to be a consensus amongst some that leaving the start and
   and of your HDD empty.  Give the rest to ZFS.

You should also try and accommodate 4K sector size drives these days.  
Apparently, the performance boosts from hitting 4K-aligned sectors can be very 
good.

Cheers,

Paul.___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: MFC of ZFSv15

2010-09-17 Thread Paul Mather

On Sep 16, 2010, at 1:44 AM, Martin Matuska wrote:

 I have fixed the missing bits in r212688.
 
 Thanks for the notice.
 
 Dňa 15. 9. 2010 21:12, Xin LI  wrote / napísal(a):
 On 2010/09/15 11:30, Mike Tancsa wrote:
 At 02:07 PM 9/15/2010, Pascal Stumpf wrote:
 First of all, a great thanks to mm@ and pjd@ for the excellent work on
 ZFS in FreeBSD. :) And especially for the MFC of v15 a few hours ago.
 
 [...]
 here too.  RELENG_8 AMD64.  The tinderboxes havent hit that branch yet
 (http://tinderbox.freebsd.org/http://tinderbox.freebsd.org/), so it
 will be a few hrs before they get to test RELENG_8
 [...]
 -lsbuf  -lm -lnvpair -luutil -lutil
 /usr/obj/usr/src/tmp/usr/lib/libzfs.so: undefined reference to `getmntent'
 *** Error code 1
 
 Sorry for that, it seems to be caused by a partial merge
 (cddl/compat/opensolaris/misc/mnttab.c).  mm@ is going to fix that ASAP.
 
 Cheers,

I am getting a build failure on 8.1-STABLE:

=
[[...]]
cc -c -O2 -pipe -fno-strict-aliasing -march=pentium4 -std=c99  -Wall 
-Wredundant-decls -Wnested-externs -Wstrict-prototypes  -Wmissing-prototypes 
-Wpointer-arith -Winline -Wcast-qual  -Wundef -Wno-pointer-sign 
-fformat-extensions -nostdinc  -I. -I/usr/src/sys -I/usr/src/sys/contrib/altq 
-D_KERNEL -DHAVE_KERNEL_OPTION_HEADERS -include opt_global.h -fno-common 
-finline-limit=8000 --param inline-unit-growth=100 --param 
large-function-growth=1000  -mno-align-long-strings 
-mpreferred-stack-boundary=2  -mno-mmx -mno-3dnow -mno-sse -mno-sse2 -mno-sse3 
-ffreestanding -fstack-protector -Werror  /usr/src/sys/kern/p1003_1b.c
cc -c -O2 -pipe -fno-strict-aliasing -march=pentium4 -std=c99  -Wall 
-Wredundant-decls -Wnested-externs -Wstrict-prototypes  -Wmissing-prototypes 
-Wpointer-arith -Winline -Wcast-qual  -Wundef -Wno-pointer-sign 
-fformat-extensions -nostdinc  -I. -I/usr/src/sys -I/usr/src/sys/contrib/altq 
-D_KERNEL -DHAVE_KERNEL_OPTION_HEADERS -include opt_global.h -fno-common 
-finline-limit=8000 --param inline-unit-growth=100 --param 
large-function-growth=1000  -mno-align-long-strings 
-mpreferred-stack-boundary=2  -mno-mmx -mno-3dnow -mno-sse -mno-sse2 -mno-sse3 
-ffreestanding -fstack-protector -Werror  /usr/src/sys/kern/posix4_mib.c
cc -c -O2 -pipe -fno-strict-aliasing -march=pentium4 -std=c99  -Wall 
-Wredundant-decls -Wnested-externs -Wstrict-prototypes  -Wmissing-prototypes 
-Wpointer-arith -Winline -Wcast-qual  -Wundef -Wno-pointer-sign 
-fformat-extensions -nostdinc  -I. -I/usr/src/sys -I/usr/src/sys/contrib/altq 
-D_KERNEL -DHAVE_KERNEL_OPTION_HEADERS -include opt_global.h -fno-common 
-finline-limit=8000 --param inline-unit-growth=100 --param 
large-function-growth=1000  -mno-align-long-strings 
-mpreferred-stack-boundary=2  -mno-mmx -mno-3dnow -mno-sse -mno-sse2 -mno-sse3 
-ffreestanding -fstack-protector -Werror  /usr/src/sys/kern/sched_ule.c
cc1: warnings being treated as errors
/usr/src/sys/kern/sched_ule.c: In function 'sched_switch':
/usr/src/sys/kern/sched_ule.c:1807: warning: implicit declaration of function 
'sched_pickcpu'
/usr/src/sys/kern/sched_ule.c:1807: warning: nested extern declaration of 
'sched_pickcpu'
*** Error code 1

Stop in /usr/obj/usr/src/sys/BACKUP.
*** Error code 1

Stop in /usr/src.
*** Error code 1

Stop in /usr/src.
=

Unfortunately (for me, I guess), GENERIC will build successfully on this 
system.  It's only my custom kernel config file that is failing make 
buildkernel.  The custom kernel config is GENERIC with various inapplicable 
drivers removed, plus some non-GENERIC things added in (such as ALTQ support 
options).  I am building via the standard make buildworld followed by make 
buildkernel method.  Can anyone spot anything obviously awry?  I've included 
my config file at the end.

Cheers,

Paul.

#
# GENERIC -- Generic kernel configuration file for FreeBSD/i386
#
# For more information on this file, please read the handbook section on
# Kernel Configuration Files:
#
#
http://www.FreeBSD.org/doc/en_US.ISO8859-1/books/handbook/kernelconfig-config.html
#
# The handbook is also available locally in /usr/share/doc/handbook
# if you've installed the doc distribution, otherwise always see the
# FreeBSD World Wide Web server (http://www.FreeBSD.org/) for the
# latest information.
#
# An exhaustive list of options and more detailed explanations of the
# device lines is also present in the ../../conf/NOTES and NOTES files.
# If you are in doubt as to the purpose or necessity of a line, check first
# in NOTES.
#
# $FreeBSD: src/sys/i386/conf/GENERIC,v 1.474.2.19 2009/07/15 08:32:19 ed Exp $

cpu I586_CPU
cpu I686_CPU
ident   BACKUP

# To statically compile in device wiring instead of /boot/device.hints
#hints  GENERIC.hints # Default places to look for devices.

#makeoptionsDEBUG=-g# Build kernel with gdb(1) debug symbols

#optionsSCHED_4BSD  # 4BSD scheduler
options SCHED_ULE   # ULE scheduler
options

Re: MFC of ZFSv15

2010-09-18 Thread Paul Mather

On Sep 17, 2010, at 8:37 AM, Paul Mather wrote:

 On Sep 16, 2010, at 1:44 AM, Martin Matuska wrote:
 
 I have fixed the missing bits in r212688.
 
 Thanks for the notice.
 
 Dňa 15. 9. 2010 21:12, Xin LI  wrote / napísal(a):
 On 2010/09/15 11:30, Mike Tancsa wrote:
 At 02:07 PM 9/15/2010, Pascal Stumpf wrote:
 First of all, a great thanks to mm@ and pjd@ for the excellent work on
 ZFS in FreeBSD. :) And especially for the MFC of v15 a few hours ago.
 
 [...]
 here too.  RELENG_8 AMD64.  The tinderboxes havent hit that branch yet
 (http://tinderbox.freebsd.org/http://tinderbox.freebsd.org/), so it
 will be a few hrs before they get to test RELENG_8
 [...]
 -lsbuf  -lm -lnvpair -luutil -lutil
 /usr/obj/usr/src/tmp/usr/lib/libzfs.so: undefined reference to `getmntent'
 *** Error code 1
 
 Sorry for that, it seems to be caused by a partial merge
 (cddl/compat/opensolaris/misc/mnttab.c).  mm@ is going to fix that ASAP.
 
 Cheers,
 
 I am getting a build failure on 8.1-STABLE:
 
 =
 [[...]]
 cc -c -O2 -pipe -fno-strict-aliasing -march=pentium4 -std=c99  -Wall 
 -Wredundant-decls -Wnested-externs -Wstrict-prototypes  -Wmissing-prototypes 
 -Wpointer-arith -Winline -Wcast-qual  -Wundef -Wno-pointer-sign 
 -fformat-extensions -nostdinc  -I. -I/usr/src/sys -I/usr/src/sys/contrib/altq 
 -D_KERNEL -DHAVE_KERNEL_OPTION_HEADERS -include opt_global.h -fno-common 
 -finline-limit=8000 --param inline-unit-growth=100 --param 
 large-function-growth=1000  -mno-align-long-strings 
 -mpreferred-stack-boundary=2  -mno-mmx -mno-3dnow -mno-sse -mno-sse2 
 -mno-sse3 -ffreestanding -fstack-protector -Werror  
 /usr/src/sys/kern/p1003_1b.c
 cc -c -O2 -pipe -fno-strict-aliasing -march=pentium4 -std=c99  -Wall 
 -Wredundant-decls -Wnested-externs -Wstrict-prototypes  -Wmissing-prototypes 
 -Wpointer-arith -Winline -Wcast-qual  -Wundef -Wno-pointer-sign 
 -fformat-extensions -nostdinc  -I. -I/usr/src/sys -I/usr/src/sys/contrib/altq 
 -D_KERNEL -DHAVE_KERNEL_OPTION_HEADERS -include opt_global.h -fno-common 
 -finline-limit=8000 --param inline-unit-growth=100 --param 
 large-function-growth=1000  -mno-align-long-strings 
 -mpreferred-stack-boundary=2  -mno-mmx -mno-3dnow -mno-sse -mno-sse2 
 -mno-sse3 -ffreestanding -fstack-protector -Werror  
 /usr/src/sys/kern/posix4_mib.c
 cc -c -O2 -pipe -fno-strict-aliasing -march=pentium4 -std=c99  -Wall 
 -Wredundant-decls -Wnested-externs -Wstrict-prototypes  -Wmissing-prototypes 
 -Wpointer-arith -Winline -Wcast-qual  -Wundef -Wno-pointer-sign 
 -fformat-extensions -nostdinc  -I. -I/usr/src/sys -I/usr/src/sys/contrib/altq 
 -D_KERNEL -DHAVE_KERNEL_OPTION_HEADERS -include opt_global.h -fno-common 
 -finline-limit=8000 --param inline-unit-growth=100 --param 
 large-function-growth=1000  -mno-align-long-strings 
 -mpreferred-stack-boundary=2  -mno-mmx -mno-3dnow -mno-sse -mno-sse2 
 -mno-sse3 -ffreestanding -fstack-protector -Werror  
 /usr/src/sys/kern/sched_ule.c
 cc1: warnings being treated as errors
 /usr/src/sys/kern/sched_ule.c: In function 'sched_switch':
 /usr/src/sys/kern/sched_ule.c:1807: warning: implicit declaration of function 
 'sched_pickcpu'
 /usr/src/sys/kern/sched_ule.c:1807: warning: nested extern declaration of 
 'sched_pickcpu'
 *** Error code 1
 
 Stop in /usr/obj/usr/src/sys/BACKUP.
 *** Error code 1
 
 Stop in /usr/src.
 *** Error code 1
 
 Stop in /usr/src.
 =
 
 Unfortunately (for me, I guess), GENERIC will build successfully on this 
 system.  It's only my custom kernel config file that is failing make 
 buildkernel.  The custom kernel config is GENERIC with various inapplicable 
 drivers removed, plus some non-GENERIC things added in (such as ALTQ support 
 options).  I am building via the standard make buildworld followed by make 
 buildkernel method.  Can anyone spot anything obviously awry?  I've included 
 my config file at the end.

Just to follow up myself, here: I added options SMP to my custom kernel 
config file and that allowed me successfully to finish make buildkernel ... 
without error.

So, is options SMP mandatory now on FreeBSD/i386 (even on uniprocessor 
systems), and, if so, shouldn't it be included in DEFAULTS?

Cheers,

Paul.

Re: mysqld_safe holding open a pty/tty on FreeBSD (7.x and 8.x)

2010-09-30 Thread Paul Mather

On Sep 30, 2010, at 3:56 AM, Jeremy Chadwick wrote:

 The diff is pretty obvious/simple (2 line change), so the other
 databases/mysqlXX-server ports can be upgraded in the same manner.
 
 --- files/mysql-server.sh.in.orig 2010-03-27 03:24:53.0 -0700
 +++ files/mysql-server.sh.in  2010-09-30 00:45:38.0 -0700
 @@ -35,8 +35,8 @@
 mysql_user=mysql
 mysql_limits_args=-e -U ${mysql_user}
 pidfile=${mysql_dbdir}/`/bin/hostname`.pid
 -command=%%PREFIX%%/bin/mysqld_safe
 -command_args=--defaults-extra-file=${mysql_dbdir}/my.cnf 
 --user=${mysql_user} --datadir=${mysql_dbdir} --pid-file=${pidfile} 
 ${mysql_args}  /dev/null 21 
 +command=/usr/sbin/daemon
 +command_args=-c -f /usr/local/bin/mysqld_safe 
 --defaults-extra-file=${mysql_dbdir}/my.cnf --user=${mysql_user} 
 --datadir=${mysql_dbdir} --pid-file=${pidfile} ${mysql_args}

Shouldn't this be -c -f %%PREFIX%%/bin/mysqld_safe ... rather than 
hard-coding /usr/local?

 procname=%%PREFIX%%/libexec/mysqld
 start_precmd=${name}_prestart
 start_postcmd=${name}_poststart
 
 -- 
 | Jeremy Chadwick   j...@parodius.com |
 | Parodius Networking   http://www.parodius.com/ |
 | UNIX Systems Administrator  Mountain View, CA, USA |
 | Making life hard for others since 1977.  PGP: 4BD6C0CB |

Cheers,

Paul.___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Support for Areca ARC-1300-4X on 8-STABLE?

2010-11-23 Thread Paul Mather

Does anyone know whether the Areca ARC-1300-4X external SAS HBA is currently 
supported under FreeBSD/amd64 8-STABLE?  I just fired up a system that has one 
of these cards that is running 8.0-RELEASE and the HBA is not being detected 
properly by FreeBSD.  (It is being misidentified by the arcmsr driver.)  I'm 
hoping that once I upgrade from 8.0-RELEASE to a contemporary 8-STABLE that 
it'll be supported.  I found a posting on the Web that indicated a driver would 
be available in the first quarter of 2010 
(http://lists.freebsd.org/pipermail/freebsd-hardware/2009-December/006115.html).

I'm really hoping I can use this card, though I'm somewhat discouraged by the 
fact that the Areca Web site lists only drivers for Windows, Linux, and Mac OS 
X for this particular model. :-(

Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: Support for Areca ARC-1300-4X on 8-STABLE?

2010-11-24 Thread Paul Mather

On Nov 24, 2010, at 12:32 AM, Jeremy Chadwick wrote:

 On Tue, Nov 23, 2010 at 09:41:43PM -0500, Paul Mather wrote:
 Does anyone know whether the Areca ARC-1300-4X external SAS HBA is
 currently supported under FreeBSD/amd64 8-STABLE?  I just fired up a
 system that has one of these cards that is running 8.0-RELEASE and the
 HBA is not being detected properly by FreeBSD.  (It is being
 misidentified by the arcmsr driver.)  I'm hoping that once I upgrade
 from 8.0-RELEASE to a contemporary 8-STABLE that it'll be supported.
 I found a posting on the Web that indicated a driver would be
 available in the first quarter of 2010
 (http://lists.freebsd.org/pipermail/freebsd-hardware/2009-December/006115.html).
 
 I'm really hoping I can use this card, though I'm somewhat discouraged
 by the fact that the Areca Web site lists only drivers for Windows,
 Linux, and Mac OS X for this particular model. :-(
 
 Have you contacted Areca about this?  Their Technical Support folks
 would be able to tell you why this is, and if there is work being done
 to provide a driver for FreeBSD.  A hardware vendor that is known to
 support FreeBSD is more authoritative about their drivers than the
 FreeBSD Project.  :-)

I'll concede the latter point, but I'd also hope that such a vendor would be 
contributing its drivers directly into the source tree rather than having a 
separate set of patches or downloads for end-users to fiddle about with.  I 
prefer my updates to come via csup. :-)

The only Areca port I can see is the one for the CLI for their RAID cards 
(sysutils/areca-cli).

I have contacted their support e-mail address asking what the situation is, 
from the horse's mouth, so to speak.  I hope the news is good.

 Also, please see this thread (which is very recent):
 
 http://lists.freebsd.org/pipermail/freebsd-stable/2010-November/059985.html

Thanks for the link!  I'm not sure whether Fixed arcmsr driver prevent arcsas 
support for Areca SAS HBA ARC13x0 in the Bug fixes list is good news or bad 
news for me.  It suggests that the arcmsr driver has been fixed to prevent it 
attaching to the ARC-13X0 cards, implying that it is not supported.  This entry 
about OS support on the Areca official page for the ARC-1300 product 
(http://www.areca.us/products/sasnoneraid.htm) is also not inspiring: 
BSD/FreeBSD (will be available with 6Gb/s Host Adapter).  Does this mean only 
their 6 Gb/s cards will be supported under FreeBSD, or that support for the 3 
Gb/s cards will appear alongside the 6 Gb/s cards, whenever they are released?

I have to say all this has left a sour taste in my mouth.  I chose Areca 
because of their solid FreeBSD support. :-(

Cheers,

Paul.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: Support for Areca ARC-1300-4X on 8-STABLE?

2010-11-24 Thread Paul Mather

On Nov 24, 2010, at 7:06 PM, Xin LI wrote:

 On Wed, Nov 24, 2010 at 7:01 AM, Paul Mather p...@gromit.dlib.vt.edu wrote:
 Thanks for the link!  I'm not sure whether Fixed arcmsr driver prevent 
 arcsas support for Areca SAS HBA ARC13x0 in the Bug fixes list is good 
 news or bad news for me.  It suggests that the arcmsr driver has been fixed 
 to prevent it attaching to the ARC-13X0 cards, implying that it is not 
 supported.  This entry about OS support on the Areca official page for the 
 ARC-1300 product (http://www.areca.us/products/sasnoneraid.htm) is also not 
 inspiring: BSD/FreeBSD (will be available with 6Gb/s Host Adapter).  Does 
 this mean only their 6 Gb/s cards will be supported under FreeBSD, or that 
 support for the 3 Gb/s cards will appear alongside the 6 Gb/s cards, 
 whenever they are released?
 
 The 6Gb/s cards are not released yet.  The updated driver resolved a
 conflict with the newer ARC13x0 cards, without it one will have to
 update arcmsr(4) before installing the SAS card driver.

So, to clarify, does this mean the 3 Gb/s ARC-1300 cards are not currently 
supported under 8-STABLE?  Or, are they supported by a different driver to 
arcmsr (if so, which one?), but arcmsr must be updated so it doesn't probe and 
claim the card instead?

Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: FreeBSD Security Advisory FreeBSD-SA-10:10.openssl

2010-12-03 Thread Paul Mather

On Dec 2, 2010, at 4:54 AM, Gareth de Vaux wrote:

 On Thu 2010-12-02 (00:05), jhell wrote:
 Try that with a ( make includes ) in that same directory and if it works
 then the advisory will have to be revised.
 
 Ah awesome, that works thanx.
 
 (I don't see why though, since it was only half complaining about
 a missing definition, even when I manually included dtls1.h. Also,
 I tried on a different system and the advisory instructions worked
 as is).

The advisory instructions worked as-is for me on a 7.3-RELEASE-p3 system but 
failed on my 8.1-STABLE systems.  However, a make buildworld followed by a 
make install in the appropriate directory work successfully on the 8.1-STABLE 
machines.

Cheers,

Paul.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: Supported SAS controllers (single port)

2010-12-08 Thread Paul Mather

On Dec 7, 2010, at 8:36 PM, Daniel O'Connor wrote:

 
 On 08/12/2010, at 3:51, Mike Andrews wrote:
 On 12/7/2010 8:00 AM, Daniel O'Connor wrote:
 Does anyone have a recommendation for one?
 
 I am looking to connect a LTO tape drive to a FreeBSD 7 or 8 box and I've 
 only ever used Adaptec 19160 and similar cards and LVDS SCSI.
 
 Our supplier has an LSI SAS3081E-R which is not outrageously expensive, has 
 anyone used one?
 
 Yes, I've got one, it works fine :)
 
 Ahh, good news, thanks :)
 
 Have you used it with a tape drive?
 
 You may want to flash the non-raid firmware from LSI onto it.
 
 OK thanks.
 
 I just realised it doesn't have any external connectors so I'll have to find 
 another candidate.
 
 However I do see the LSI spec sheet for it lists the chip which is listed in 
 mpt(4) so I should be able to find something.

I have a LSI SAS3801E (note lack of -R suffix) in a FreeBSD 8 box that is 
hooked up to a Quantum SuperLoader LTO-4 tape drive.  It's not in production 
yet, and I haven't done extensive testing, but so far it probes the tape drive 
and autochanger correctly.

The LSI SAS3801E itself attaches as a mpt device.  It has two external 
connectors but no internal ones.

I had intended to use an Areca ARC-1300-4X external SAS controller to drive 
this tape unit but, alas, I discovered that there is as yet no FreeBSD driver 
for that card. :-(

Cheers,

Paul.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: umass: AutoSense failed

2010-12-10 Thread Paul Mather

On Dec 9, 2010, at 5:11 PM, Jeremy Chadwick wrote:

 On Thu, Dec 09, 2010 at 10:35:56PM +0100, Harald Weis wrote:
 What could be the reason for the following failure?
 
 ugen2.2: Sony at usbus2
 umass0: Sony Sony DSC, class 0/0, rev 1.10/4.50, addr 2 on usbus2
 umass0:  RBC over CBI; quirks = 0x
 umass0:1:0:-1: Attached to scbus1
 (probe0:umass-sim0:0:0:0): AutoSense failed
 
 This occurs on 4 different boxes, all on 8.1-RELEASE.
 Never happened on previous releases.
 The camera works on an Ubuntu system.
 
 Please try a 8.1-STABLE or 8.2-PRERELEASE snapshot (you can boot the
 livefs CD or USB memstick image to test) and see if things are different
 (improved) there.
 
 ftp://ftp4.freebsd.org/pub/FreeBSD/snapshots/201011/
 
 8.x uses a newer/different USB stack than 7.x.


I get something similar to this happening on 8.2-PRERELEASE.  In my case, it's 
not during boot probing or device attachment.  Instead, it happens occasionally 
after boot.  The devices concerned are Maxtor OneTouch external USB hard 
drives.  Every now and then, I will get something akin to the following crop up 
in the console log:

(da1:umass-sim1:1:0:0): AutoSense failed

I have three of these Maxtor OneTouch drives attached to the system as part of 
a ZFS pool.  When I get an AutoSense failed message, it is usually 
accompanied by the ZFS pool being marked as faulted.

The Maxtor OneTouch drives are wont to spin down and go into a deep sleep after 
a period of inactivity and appear very slow to wake up again when I/O occurs.  
I have always assumed that the AutoSense failed is associated with 
this---that there is some kind of timeout in the FreeBSD stack that this device 
is exceeding.  In fact, sometimes the devices fail to probe properly during 
boot when they are asleep.

This is what the OneTouch normally probes as:

da0 at umass-sim0 bus 0 scbus0 target 0 lun 0
da0: Maxtor OneTouch 0121 Fixed Direct Access SCSI-4 device 
da0: 40.000MB/s transfers
da0: 953869MB (1953525168 512 byte sectors: 255H 63S/T 121601C)


Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: OS X Lion time machine = (afpd|iSCSI) = ZFS question

2011-07-25 Thread Paul Mather

On Jul 25, 2011, at 2:51 PM, Bakul Shah wrote:

 On Mon, 25 Jul 2011 14:15:06 EDT Gary Palmer gpal...@freebsd.org  wrote:
 On Thu, Jul 21, 2011 at 02:56:17PM -0700, Bakul Shah wrote:
 I am in no hurry to upgrade my MBP to OS X Lion but given Lion
 time machine and netatalk issues, I got wondering if iSCSI on
 FreeBSD is stable enough for time machine use. How much duct
 tape and baling wire are needed to make it work?!
 
 Just a word of caution - I read somewhere that using iSCSI for Time
 Machine means that you cannot use TM during an OS reinstall to
 restore your files from the Time Machine archive as iSCSI isn't
 available at that point.  It seems you have to use a locally 
 attached disk or AFP for the support to be there during 
 reinstall.  Not sure if thats still the case in Lion or not, I
 would tend to suspect they still don't consider iSCSI a top tier
 transport.
 
 I read you can create a boot dvd from one of the files in the
 upgrade.


The problem is that the Boot DVD or Recovery Partition does not have an iSCSI 
initiator.  In fact, the Mac does not ship with an iSCSI initiator, and the 
common solution to obtaining one is to install the free globalSAN iSCSI 
Initiator for OS X.

Without an installed iSCSI initiator, you won't be able to mount the LUNs upon 
which is located the Time Machine volume from which you are trying to restore 
files.

Cheers,

Paul.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: usbus is seen as network interface - Fwd: sjakie.klop.ws daily run output

2011-09-20 Thread Paul Mather

On Sep 20, 2011, at 6:11 AM, Ronald Klop wrote:

 Hello,
 
 Why is usbus seen as a network interface since some time?
 I'm running 9-CURRENT on amd64.

I've been wondering this, too.  It also started happening sometime in the 
lifetime of 8-STABLE some months ago, with netstat -i showing usbus network 
interfaces despite no ethernet NIC being attached.  It appears that ifconfig 
is smart enough to omit these in output, as was netstat in former times on 
RELENG_8 (and still is on RELENG_7), so why include them now?  Just wondering...

Cheers,

Paul.___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: fsck_ufs out of swapspace

2011-12-17 Thread Paul Mather

On Dec 17, 2011, at 3:36 PM, Michiel Boland wrote:

 FreeBSD 9.0-PRERELEASE locked up while into some heavy I/O and failed to shut 
 down properly, so I had to power-cycle. After it came back up it said
 
 Starting file system checks:
 ** SU+J Recovering /dev/ada0a
 ** Reading 33554432 byte journal from inode 4.
 swap_pager: out of swap space
 swap_pager_getswapspace(16): failed
 pid 67 (fsck_ufs), uid 0, was killed: out of swap space
 fsck: /dev/ada0a: Killed: 9
 Script /etc/rc.d/fsck running
 Unknown error; help!
 ERROR: ABORTING BOOT (sending SIGTERM to parent)!
 
 The only way to continue was to do a full fsck (with no journal)
 
 This is a Sun Blade 100 (sparc64) with 768M of RAM.
 So the fsck is taking up all of this? That can't be right.
 
 What can I do to troubleshoot this further?


FWIW, I had this happen to me several weeks ago on FreeBSD/powerpc64 9-CURRENT. 
 I had to get the machine up and running so I simply abandoned use of SU+J and 
went back to using just UFS+SU.  (Not very helpful, I know, but there you go.)  
I figure it is likely to be some kind of endianness problem in the SU+J code, 
given the lack of complaints on FreeBSD/i386 and FreeBSD/amd64.

Cheers,

Paul.

PS: The system I was using is an Apple Xserve G5 with 4 GB of RAM and 5 GB of 
swap space.  As you say, surely fsck can't be using that much 
memory...___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

ZFS + nullfs + Linuxulator = panic?

2012-02-14 Thread Paul Mather

I have a problem with RELENG_8 (FreeBSD/amd64 running a GENERIC kernel, last 
built 2012-02-08).  It will panic during the daily periodic scripts that run at 
3am.  Here is the most recent panic message:

Fatal trap 9: general protection fault while in kernel mode
cpuid = 0; apic id = 00
instruction pointer = 0x20:0x8069d266
stack pointer   = 0x28:0xff8094b90390
frame pointer   = 0x28:0xff8094b903a0
code segment= base 0x0, limit 0xf, type 0x1b
= DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags= resume, IOPL = 0
current process = 72566 (ps)
trap number = 9
panic: general protection fault
cpuid = 0
KDB: stack backtrace:
#0 0x8062cf8e at kdb_backtrace+0x5e
#1 0x805facd3 at panic+0x183
#2 0x808e6c20 at trap_fatal+0x290
#3 0x808e715a at trap+0x10a
#4 0x808cec64 at calltrap+0x8
#5 0x805ee034 at fill_kinfo_thread+0x54
#6 0x805eee76 at fill_kinfo_proc+0x586
#7 0x805f22b8 at sysctl_out_proc+0x48
#8 0x805f26c8 at sysctl_kern_proc+0x278
#9 0x8060473f at sysctl_root+0x14f
#10 0x80604a2a at userland_sysctl+0x14a
#11 0x80604f1a at __sysctl+0xaa
#12 0x808e62d4 at amd64_syscall+0x1f4
#13 0x808cef5c at Xfast_syscall+0xfc
Uptime: 3d19h6m0s
Dumping 1308 out of 2028 MB:..2%..12%..21%..31%..41%..51%..62%..71%..81%..91%
Dump complete
Automatic reboot in 15 seconds - press a key on the console to abort
Rebooting...


The reason for the subject line is that I have another RELENG_8 system that 
uses ZFS + nullfs but doesn't panic, leading me to believe that ZFS + nullfs is 
not the problem.  I am wondering if it is the combination of the three that is 
deadly, here.

Both RELENG_8 systems are root-on-ZFS installs.  Each night there is a separate 
backup script that runs and completes before the regular periodic daily run.  
This script takes a recursive snapshot of the ZFS pool and then mounts these 
snapshots via mount_nullfs to provide a coherent view of the filesystem under 
/backup.  The only difference between the two RELENG_8 systems is that one uses 
rsync to back up /backup to another machine and the other uses the Linux Tivoli 
TSM client to back up /backup to a TSM server.  After the backup is completed, 
a script runs that unmounts the nullfs file systems and then destroys the ZFS 
snapshot.

The first (rsync backup) RELENG_8 system does not panic.  It has been running 
the ZFS + nullfs rsync backup job without incident for weeks now.  The second 
(Tivoli TSM) RELENG_8 will reliably panic when the subsequent periodic daily 
job runs.  (It is using the 32-bit TSM 6.2.4 Linux client running dsmc 
schedule via the linux_base-f10-10_4 package.)  The actual ZFS + nullfs Tivoli 
TSM backup job appears to run successfully, making me wonder if perhaps it has 
some memory leak or other subtle corruption that sets up the ensuing panic when 
the periodic daily job later gives the system a workout.

If I can provide more information about the panic, please let me know.  Despite 
the message about dumping in the panic output above, when the system reboots I 
get a No core dumps found message during boot.  (I have dumpdev=AUTO set in 
/etc/rc.conf.)  My swap device is on separate partitions but is mirrored using 
geom_mirror as /dev/mirror/swap.  Do crash dumps to gmirror devices work on 
RELENG_8?

Does anyone have any idea what is to blame for the panic, or how I can fix or 
work around it?

Cheers,

Paul.

PS: The uptime of three days in the panic message is because I disabled the 
Tivoli TSM backup job on Friday so it would not run over the weekend.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: ZFS + nullfs + Linuxulator = panic?

2012-02-15 Thread Paul Mather

On Feb 14, 2012, at 7:23 PM, Jeremy Chadwick wrote:

 On Tue, Feb 14, 2012 at 09:38:18AM -0500, Paul Mather wrote:
 I have a problem with RELENG_8 (FreeBSD/amd64 running a GENERIC kernel, last 
 built 2012-02-08).  It will panic during the daily periodic scripts that run 
 at 3am.  Here is the most recent panic message:
 
 Fatal trap 9: general protection fault while in kernel mode
 cpuid = 0; apic id = 00
 instruction pointer = 0x20:0x8069d266
 stack pointer   = 0x28:0xff8094b90390
 frame pointer   = 0x28:0xff8094b903a0
 code segment= base 0x0, limit 0xf, type 0x1b
= DPL 0, pres 1, long 1, def32 0, gran 1
 processor eflags= resume, IOPL = 0
 current process = 72566 (ps)
 trap number = 9
 panic: general protection fault
 cpuid = 0
 KDB: stack backtrace:
 #0 0x8062cf8e at kdb_backtrace+0x5e
 #1 0x805facd3 at panic+0x183
 #2 0x808e6c20 at trap_fatal+0x290
 #3 0x808e715a at trap+0x10a
 #4 0x808cec64 at calltrap+0x8
 #5 0x805ee034 at fill_kinfo_thread+0x54
 #6 0x805eee76 at fill_kinfo_proc+0x586
 #7 0x805f22b8 at sysctl_out_proc+0x48
 #8 0x805f26c8 at sysctl_kern_proc+0x278
 #9 0x8060473f at sysctl_root+0x14f
 #10 0x80604a2a at userland_sysctl+0x14a
 #11 0x80604f1a at __sysctl+0xaa
 #12 0x808e62d4 at amd64_syscall+0x1f4
 #13 0x808cef5c at Xfast_syscall+0xfc
 Uptime: 3d19h6m0s
 Dumping 1308 out of 2028 MB:..2%..12%..21%..31%..41%..51%..62%..71%..81%..91%
 Dump complete
 Automatic reboot in 15 seconds - press a key on the console to abort
 Rebooting...
 
 
 The reason for the subject line is that I have another RELENG_8 system that 
 uses ZFS + nullfs but doesn't panic, leading me to believe that ZFS + nullfs 
 is not the problem.  I am wondering if it is the combination of the three 
 that is deadly, here.
 
 Both RELENG_8 systems are root-on-ZFS installs.  Each night there is a 
 separate backup script that runs and completes before the regular periodic 
 daily run.  This script takes a recursive snapshot of the ZFS pool and then 
 mounts these snapshots via mount_nullfs to provide a coherent view of the 
 filesystem under /backup.  The only difference between the two RELENG_8 
 systems is that one uses rsync to back up /backup to another machine and the 
 other uses the Linux Tivoli TSM client to back up /backup to a TSM server.  
 After the backup is completed, a script runs that unmounts the nullfs file 
 systems and then destroys the ZFS snapshot.
 
 The first (rsync backup) RELENG_8 system does not panic.  It has been 
 running the ZFS + nullfs rsync backup job without incident for weeks now.  
 The second (Tivoli TSM) RELENG_8 will reliably panic when the subsequent 
 periodic daily job runs.  (It is using the 32-bit TSM 6.2.4 Linux client 
 running dsmc schedule via the linux_base-f10-10_4 package.)  The actual 
 ZFS + nullfs Tivoli TSM backup job appears to run successfully, making me 
 wonder if perhaps it has some memory leak or other subtle corruption that 
 sets up the ensuing panic when the periodic daily job later gives the 
 system a workout.
 
 If I can provide more information about the panic, please let me know.  
 Despite the message about dumping in the panic output above, when the system 
 reboots I get a No core dumps found message during boot.  (I have 
 dumpdev=AUTO set in /etc/rc.conf.)  My swap device is on separate 
 partitions but is mirrored using geom_mirror as /dev/mirror/swap.  Do crash 
 dumps to gmirror devices work on RELENG_8?
 
 See gmirror(8) man page, section NOTES.  Read the full thing.


Thanks!  I've changed the balance algorithm to prefer, so hopefully I'll get 
saved crash dumps to examine from now on.


 Does anyone have any idea what is to blame for the panic, or how I can fix 
 or work around it?
 
 Does the panic always happen when ps is run?  That's what's shown in
 the above panic message.  Quoting:
 
 current process = 72566 (ps)
 
 And I'm inclined to think it does, based on the backtrace:
 
 #5 0x805ee034 at fill_kinfo_thread+0x54
 #6 0x805eee76 at fill_kinfo_proc+0x586
 #7 0x805f22b8 at sysctl_out_proc+0x48
 #8 0x805f26c8 at sysctl_kern_proc+0x278
 
 But if you can go through the previous panics and confirm that, it would
 be helpful to developers in tracking down the problem.


Just going by memory, at least one other time it did a panic during df.  But, 
most of the time I remember the panic occurring during ps.

Cheers,

Paul.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: ZFS + nullfs + Linuxulator = panic?

2012-02-16 Thread Paul Mather

On Feb 14, 2012, at 7:47 PM, Konstantin Belousov wrote:

 On Tue, Feb 14, 2012 at 09:38:18AM -0500, Paul Mather wrote:
 I have a problem with RELENG_8 (FreeBSD/amd64 running a GENERIC kernel, last 
 built 2012-02-08).  It will panic during the daily periodic scripts that run 
 at 3am.  Here is the most recent panic message:
 
 Fatal trap 9: general protection fault while in kernel mode
 cpuid = 0; apic id = 00
 instruction pointer = 0x20:0x8069d266
 stack pointer   = 0x28:0xff8094b90390
 frame pointer   = 0x28:0xff8094b903a0
 code segment= base 0x0, limit 0xf, type 0x1b
= DPL 0, pres 1, long 1, def32 0, gran 1
 processor eflags= resume, IOPL = 0
 current process = 72566 (ps)
 trap number = 9
 panic: general protection fault
 cpuid = 0
 KDB: stack backtrace:
 #0 0x8062cf8e at kdb_backtrace+0x5e
 #1 0x805facd3 at panic+0x183
 #2 0x808e6c20 at trap_fatal+0x290
 #3 0x808e715a at trap+0x10a
 #4 0x808cec64 at calltrap+0x8
 #5 0x805ee034 at fill_kinfo_thread+0x54
 #6 0x805eee76 at fill_kinfo_proc+0x586
 #7 0x805f22b8 at sysctl_out_proc+0x48
 #8 0x805f26c8 at sysctl_kern_proc+0x278
 #9 0x8060473f at sysctl_root+0x14f
 #10 0x80604a2a at userland_sysctl+0x14a
 #11 0x80604f1a at __sysctl+0xaa
 #12 0x808e62d4 at amd64_syscall+0x1f4
 #13 0x808cef5c at Xfast_syscall+0xfc
 
 Please look up the line number for the fill_kinfo_thread+0x54.


Is there a way for me to do this from the above information?  As I said in the 
original message, I failed to get a crash dump after reboot (because, it turns 
out, I hadn't set up my gmirror swap device properly).  Alas, with the latest 
panic, it appears to have hung[1] during the Dumping phase, so it looks like 
I won't get a saved crash dump this time, either. :-(

Speaking of the latest panic, here it is:

Fatal trap 12: page fault while in kernel mode
cpuid = 0; apic id = 00
fault virtual address   = 0x308
fault code  = supervisor read data, page not present
instruction pointer = 0x20:0x806026ef
stack pointer   = 0x28:0xff8094acd2d0
frame pointer   = 0x28:0xff8094acd350
code segment= base 0x0, limit 0xf, type 0x1b
= DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags= interrupt enabled, resume, IOPL = 0
current process = 20992 (df)
trap number = 12
panic: page fault
cpuid = 0
KDB: stack backtrace:
#0 0x8062cf8e at kdb_backtrace+0x5e
#1 0x805facd3 at panic+0x183
#2 0x808e6c20 at trap_fatal+0x290
#3 0x808e6f71 at trap_pfault+0x201
#4 0x808e742f at trap+0x3df
#5 0x808cec64 at calltrap+0x8
#6 0x80602e1e at _sx_xlock+0x4e
#7 0x80f9ca35 at rrw_enter+0xa5
#8 0x80f9ce86 at zfs_statfs+0x46
#9 0x80681258 at __vfs_statfs+0x28
#10 0x81476521 at nullfs_statfs+0x51
#11 0x80681258 at __vfs_statfs+0x28
#12 0x80690b22 at kern_statfs+0x1b2
#13 0x80690c77 at statfs+0x37
#14 0x808e62d4 at amd64_syscall+0x1f4
#15 0x808cef5c at Xfast_syscall+0xfc


Cheers,

Paul.

[1] Not quite hung solid: when I press the power button I get this appearing on 
the console:

acpi0: suspend request ignored (not ready yet)
acpi0: request to enter state S5 failed (err 6)

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: ZFS + nullfs + Linuxulator = panic?

2012-02-16 Thread Paul Mather

On Feb 16, 2012, at 10:49 AM, Konstantin Belousov wrote:

 On Thu, Feb 16, 2012 at 10:09:27AM -0500, Paul Mather wrote:
 On Feb 14, 2012, at 7:47 PM, Konstantin Belousov wrote:
 
 On Tue, Feb 14, 2012 at 09:38:18AM -0500, Paul Mather wrote:
 I have a problem with RELENG_8 (FreeBSD/amd64 running a GENERIC kernel, 
 last built 2012-02-08).  It will panic during the daily periodic scripts 
 that run at 3am.  Here is the most recent panic message:
 
 Fatal trap 9: general protection fault while in kernel mode
 cpuid = 0; apic id = 00
 instruction pointer = 0x20:0x8069d266
 stack pointer   = 0x28:0xff8094b90390
 frame pointer   = 0x28:0xff8094b903a0
 code segment= base 0x0, limit 0xf, type 0x1b
   = DPL 0, pres 1, long 1, def32 0, gran 1
 processor eflags= resume, IOPL = 0
 current process = 72566 (ps)
 trap number = 9
 panic: general protection fault
 cpuid = 0
 KDB: stack backtrace:
 #0 0x8062cf8e at kdb_backtrace+0x5e
 #1 0x805facd3 at panic+0x183
 #2 0x808e6c20 at trap_fatal+0x290
 #3 0x808e715a at trap+0x10a
 #4 0x808cec64 at calltrap+0x8
 #5 0x805ee034 at fill_kinfo_thread+0x54
 #6 0x805eee76 at fill_kinfo_proc+0x586
 #7 0x805f22b8 at sysctl_out_proc+0x48
 #8 0x805f26c8 at sysctl_kern_proc+0x278
 #9 0x8060473f at sysctl_root+0x14f
 #10 0x80604a2a at userland_sysctl+0x14a
 #11 0x80604f1a at __sysctl+0xaa
 #12 0x808e62d4 at amd64_syscall+0x1f4
 #13 0x808cef5c at Xfast_syscall+0xfc
 
 Please look up the line number for the fill_kinfo_thread+0x54.
 
 
 Is there a way for me to do this from the above information? As
 I said in the original message, I failed to get a crash dump after
 reboot (because, it turns out, I hadn't set up my gmirror swap device
 properly). Alas, with the latest panic, it appears to have hung[1]
 during the Dumping phase, so it looks like I won't get a saved crash
 dump this time, either. :-(
 
 Load the kernel.debug into kgdb, and from there do
 list *fill_kinfo_thread+0x54.


gromit# kgdb /usr/obj/usr/src/sys/GENERIC/kernel.debug
GNU gdb 6.1.1 [FreeBSD]
Copyright 2004 Free Software Foundation, Inc.
GDB is free software, covered by the GNU General Public License, and you are
welcome to change it and/or distribute copies of it under certain conditions.
Type show copying to see the conditions.
There is absolutely no warranty for GDB.  Type show warranty for details.
This GDB was configured as amd64-marcel-freebsd...
(kgdb) list *fill_kinfo_thread+0x54
0x805ee034 is in fill_kinfo_thread (/usr/src/sys/kern/kern_proc.c:854).
849 thread_lock(td);
850 if (td-td_wmesg != NULL)
851 strlcpy(kp-ki_wmesg, td-td_wmesg, 
sizeof(kp-ki_wmesg));
852 else
853 bzero(kp-ki_wmesg, sizeof(kp-ki_wmesg));
854 strlcpy(kp-ki_ocomm, td-td_name, sizeof(kp-ki_ocomm));
855 if (TD_ON_LOCK(td)) {
856 kp-ki_kiflag |= KI_LOCKBLOCK;
857 strlcpy(kp-ki_lockname, td-td_lockname,
858 sizeof(kp-ki_lockname));
(kgdb) 


Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: ZFS + nullfs + Linuxulator = panic?

2012-02-20 Thread Paul Mather

On Feb 17, 2012, at 8:27 PM, Konstantin Belousov wrote:

 On Thu, Feb 16, 2012 at 12:07:46PM -0500, Paul Mather wrote:
 On Feb 16, 2012, at 10:49 AM, Konstantin Belousov wrote:
 
 On Thu, Feb 16, 2012 at 10:09:27AM -0500, Paul Mather wrote:
 On Feb 14, 2012, at 7:47 PM, Konstantin Belousov wrote:
 
 On Tue, Feb 14, 2012 at 09:38:18AM -0500, Paul Mather wrote:
 I have a problem with RELENG_8 (FreeBSD/amd64 running a GENERIC kernel, 
 last built 2012-02-08).  It will panic during the daily periodic scripts 
 that run at 3am.  Here is the most recent panic message:
 
 Fatal trap 9: general protection fault while in kernel mode
 cpuid = 0; apic id = 00
 instruction pointer = 0x20:0x8069d266
 stack pointer   = 0x28:0xff8094b90390
 frame pointer   = 0x28:0xff8094b903a0
 code segment= base 0x0, limit 0xf, type 0x1b
  = DPL 0, pres 1, long 1, def32 0, gran 1
 processor eflags= resume, IOPL = 0
 current process = 72566 (ps)
 trap number = 9
 panic: general protection fault
 cpuid = 0
 KDB: stack backtrace:
 #0 0x8062cf8e at kdb_backtrace+0x5e
 #1 0x805facd3 at panic+0x183
 #2 0x808e6c20 at trap_fatal+0x290
 #3 0x808e715a at trap+0x10a
 #4 0x808cec64 at calltrap+0x8
 #5 0x805ee034 at fill_kinfo_thread+0x54
 #6 0x805eee76 at fill_kinfo_proc+0x586
 #7 0x805f22b8 at sysctl_out_proc+0x48
 #8 0x805f26c8 at sysctl_kern_proc+0x278
 #9 0x8060473f at sysctl_root+0x14f
 #10 0x80604a2a at userland_sysctl+0x14a
 #11 0x80604f1a at __sysctl+0xaa
 #12 0x808e62d4 at amd64_syscall+0x1f4
 #13 0x808cef5c at Xfast_syscall+0xfc
 
 Please look up the line number for the fill_kinfo_thread+0x54.
 
 
 Is there a way for me to do this from the above information? As
 I said in the original message, I failed to get a crash dump after
 reboot (because, it turns out, I hadn't set up my gmirror swap device
 properly). Alas, with the latest panic, it appears to have hung[1]
 during the Dumping phase, so it looks like I won't get a saved crash
 dump this time, either. :-(
 
 Load the kernel.debug into kgdb, and from there do
 list *fill_kinfo_thread+0x54.
 
 
 gromit# kgdb /usr/obj/usr/src/sys/GENERIC/kernel.debug
 GNU gdb 6.1.1 [FreeBSD]
 Copyright 2004 Free Software Foundation, Inc.
 GDB is free software, covered by the GNU General Public License, and you are
 welcome to change it and/or distribute copies of it under certain conditions.
 Type show copying to see the conditions.
 There is absolutely no warranty for GDB.  Type show warranty for details.
 This GDB was configured as amd64-marcel-freebsd...
 (kgdb) list *fill_kinfo_thread+0x54
 0x805ee034 is in fill_kinfo_thread 
 (/usr/src/sys/kern/kern_proc.c:854).
 849 thread_lock(td);
 850 if (td-td_wmesg != NULL)
 851 strlcpy(kp-ki_wmesg, td-td_wmesg, 
 sizeof(kp-ki_wmesg));
 852 else
 853 bzero(kp-ki_wmesg, sizeof(kp-ki_wmesg));
 854 strlcpy(kp-ki_ocomm, td-td_name, sizeof(kp-ki_ocomm));
 855 if (TD_ON_LOCK(td)) {
 856 kp-ki_kiflag |= KI_LOCKBLOCK;
 857 strlcpy(kp-ki_lockname, td-td_lockname,
 858 sizeof(kp-ki_lockname));
 (kgdb) 
 
 This is indeed strange. It can only occur if td pointer is damaged.
 
 Please, try to get a core and at least print the content of *td in this case.


Hopefully, I will be able to get a crash dump tonight.  I disabled the Tivoli 
dsmc schedule job over the weekend, because I don't have ready physical 
access to the machine and prefer it not to be down for very extended periods of 
time.  (As I reported previously, for some reason the system seems to get stuck 
saving the crash dump.  If this persists, maybe it might be better to get the 
system to drop into the debugger on panic instead of hoping forlornly for a 
successful crash dump.)

Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: ZFS + nullfs + Linuxulator = panic?

2012-02-21 Thread Paul Mather

On Feb 17, 2012, at 8:27 PM, Konstantin Belousov wrote:

 On Thu, Feb 16, 2012 at 12:07:46PM -0500, Paul Mather wrote:
 On Feb 16, 2012, at 10:49 AM, Konstantin Belousov wrote:
 
 On Thu, Feb 16, 2012 at 10:09:27AM -0500, Paul Mather wrote:
 On Feb 14, 2012, at 7:47 PM, Konstantin Belousov wrote:
 
 On Tue, Feb 14, 2012 at 09:38:18AM -0500, Paul Mather wrote:
 I have a problem with RELENG_8 (FreeBSD/amd64 running a GENERIC kernel, 
 last built 2012-02-08).  It will panic during the daily periodic scripts 
 that run at 3am.  Here is the most recent panic message:
 
 Fatal trap 9: general protection fault while in kernel mode
 cpuid = 0; apic id = 00
 instruction pointer = 0x20:0x8069d266
 stack pointer   = 0x28:0xff8094b90390
 frame pointer   = 0x28:0xff8094b903a0
 code segment= base 0x0, limit 0xf, type 0x1b
  = DPL 0, pres 1, long 1, def32 0, gran 1
 processor eflags= resume, IOPL = 0
 current process = 72566 (ps)
 trap number = 9
 panic: general protection fault
 cpuid = 0
 KDB: stack backtrace:
 #0 0x8062cf8e at kdb_backtrace+0x5e
 #1 0x805facd3 at panic+0x183
 #2 0x808e6c20 at trap_fatal+0x290
 #3 0x808e715a at trap+0x10a
 #4 0x808cec64 at calltrap+0x8
 #5 0x805ee034 at fill_kinfo_thread+0x54
 #6 0x805eee76 at fill_kinfo_proc+0x586
 #7 0x805f22b8 at sysctl_out_proc+0x48
 #8 0x805f26c8 at sysctl_kern_proc+0x278
 #9 0x8060473f at sysctl_root+0x14f
 #10 0x80604a2a at userland_sysctl+0x14a
 #11 0x80604f1a at __sysctl+0xaa
 #12 0x808e62d4 at amd64_syscall+0x1f4
 #13 0x808cef5c at Xfast_syscall+0xfc
 
 Please look up the line number for the fill_kinfo_thread+0x54.
 
 
 Is there a way for me to do this from the above information? As
 I said in the original message, I failed to get a crash dump after
 reboot (because, it turns out, I hadn't set up my gmirror swap device
 properly). Alas, with the latest panic, it appears to have hung[1]
 during the Dumping phase, so it looks like I won't get a saved crash
 dump this time, either. :-(
 
 Load the kernel.debug into kgdb, and from there do
 list *fill_kinfo_thread+0x54.
 
 
 gromit# kgdb /usr/obj/usr/src/sys/GENERIC/kernel.debug
 GNU gdb 6.1.1 [FreeBSD]
 Copyright 2004 Free Software Foundation, Inc.
 GDB is free software, covered by the GNU General Public License, and you are
 welcome to change it and/or distribute copies of it under certain conditions.
 Type show copying to see the conditions.
 There is absolutely no warranty for GDB.  Type show warranty for details.
 This GDB was configured as amd64-marcel-freebsd...
 (kgdb) list *fill_kinfo_thread+0x54
 0x805ee034 is in fill_kinfo_thread 
 (/usr/src/sys/kern/kern_proc.c:854).
 849 thread_lock(td);
 850 if (td-td_wmesg != NULL)
 851 strlcpy(kp-ki_wmesg, td-td_wmesg, 
 sizeof(kp-ki_wmesg));
 852 else
 853 bzero(kp-ki_wmesg, sizeof(kp-ki_wmesg));
 854 strlcpy(kp-ki_ocomm, td-td_name, sizeof(kp-ki_ocomm));
 855 if (TD_ON_LOCK(td)) {
 856 kp-ki_kiflag |= KI_LOCKBLOCK;
 857 strlcpy(kp-ki_lockname, td-td_lockname,
 858 sizeof(kp-ki_lockname));
 (kgdb) 
 
 This is indeed strange. It can only occur if td pointer is damaged.
 
 Please, try to get a core and at least print the content of *td in this case.


Alas, I was unable to obtain a crash dump (or even a panic) last night, but I 
have learned more about the circumstances that lead to this panic.

In attempting to find a workaround for this panic (because having nightly 
backups instead of panics is a good thing:) I discovered two successful 
approaches.  In the original situation that causes a reliable panic I have a 
daemonised Tivoli dsmc schedule job running.  This communicates with the 
Tivoli TSM server to determine when it should run its scheduled backup.  When 
the backup is run, there is defined  in a Tivoli client config file (in 
/compat/linux/opt/tivoli/tsm/client/ba/bin/dsm.sys) a preschedulecmd and a 
postschedulecmd, which are /usr/local/bin/make_zfs_backup_snapshot and 
/usr/local/bin/remove_zfs_backup_snapshot respectively.  The preschedulecmd 
script (run prior to the actual backup) basically makes ZFS snapshots of all 
filesets and nullfs-mounts them under /backup.  It then creates 
/compat/linux/etc/mtab to list these nullfs filesystems as ext2 file systems, 
so that the Tivoli backup client can know about them to back them up.  The 
postschedulecmd (run after the actual backup) unmounts all the nullfs-mounted 
filesystems corresponding to the ZFS backup snapshots and then destroys all the 
backup snapshots.

The first workaround that doesn't lead to a panic is this: Do not run dsmc 
schedule.  Instead, via cron, run this simple script:

#!/bin/sh
/usr/local/bin

Re: ZFS + nullfs + Linuxulator = panic?

2012-02-22 Thread Paul Mather

On Feb 17, 2012, at 8:27 PM, Konstantin Belousov wrote:

 On Thu, Feb 16, 2012 at 12:07:46PM -0500, Paul Mather wrote:
 On Feb 16, 2012, at 10:49 AM, Konstantin Belousov wrote:
 
 On Thu, Feb 16, 2012 at 10:09:27AM -0500, Paul Mather wrote:
 On Feb 14, 2012, at 7:47 PM, Konstantin Belousov wrote:
 
 On Tue, Feb 14, 2012 at 09:38:18AM -0500, Paul Mather wrote:
 I have a problem with RELENG_8 (FreeBSD/amd64 running a GENERIC kernel, 
 last built 2012-02-08).  It will panic during the daily periodic scripts 
 that run at 3am.  Here is the most recent panic message:
 
 Fatal trap 9: general protection fault while in kernel mode
 cpuid = 0; apic id = 00
 instruction pointer = 0x20:0x8069d266
 stack pointer   = 0x28:0xff8094b90390
 frame pointer   = 0x28:0xff8094b903a0
 code segment= base 0x0, limit 0xf, type 0x1b
  = DPL 0, pres 1, long 1, def32 0, gran 1
 processor eflags= resume, IOPL = 0
 current process = 72566 (ps)
 trap number = 9
 panic: general protection fault
 cpuid = 0
 KDB: stack backtrace:
 #0 0x8062cf8e at kdb_backtrace+0x5e
 #1 0x805facd3 at panic+0x183
 #2 0x808e6c20 at trap_fatal+0x290
 #3 0x808e715a at trap+0x10a
 #4 0x808cec64 at calltrap+0x8
 #5 0x805ee034 at fill_kinfo_thread+0x54
 #6 0x805eee76 at fill_kinfo_proc+0x586
 #7 0x805f22b8 at sysctl_out_proc+0x48
 #8 0x805f26c8 at sysctl_kern_proc+0x278
 #9 0x8060473f at sysctl_root+0x14f
 #10 0x80604a2a at userland_sysctl+0x14a
 #11 0x80604f1a at __sysctl+0xaa
 #12 0x808e62d4 at amd64_syscall+0x1f4
 #13 0x808cef5c at Xfast_syscall+0xfc
 
 Please look up the line number for the fill_kinfo_thread+0x54.
 
 
 Is there a way for me to do this from the above information? As
 I said in the original message, I failed to get a crash dump after
 reboot (because, it turns out, I hadn't set up my gmirror swap device
 properly). Alas, with the latest panic, it appears to have hung[1]
 during the Dumping phase, so it looks like I won't get a saved crash
 dump this time, either. :-(
 
 Load the kernel.debug into kgdb, and from there do
 list *fill_kinfo_thread+0x54.
 
 
 gromit# kgdb /usr/obj/usr/src/sys/GENERIC/kernel.debug
 GNU gdb 6.1.1 [FreeBSD]
 Copyright 2004 Free Software Foundation, Inc.
 GDB is free software, covered by the GNU General Public License, and you are
 welcome to change it and/or distribute copies of it under certain conditions.
 Type show copying to see the conditions.
 There is absolutely no warranty for GDB.  Type show warranty for details.
 This GDB was configured as amd64-marcel-freebsd...
 (kgdb) list *fill_kinfo_thread+0x54
 0x805ee034 is in fill_kinfo_thread 
 (/usr/src/sys/kern/kern_proc.c:854).
 849 thread_lock(td);
 850 if (td-td_wmesg != NULL)
 851 strlcpy(kp-ki_wmesg, td-td_wmesg, 
 sizeof(kp-ki_wmesg));
 852 else
 853 bzero(kp-ki_wmesg, sizeof(kp-ki_wmesg));
 854 strlcpy(kp-ki_ocomm, td-td_name, sizeof(kp-ki_ocomm));
 855 if (TD_ON_LOCK(td)) {
 856 kp-ki_kiflag |= KI_LOCKBLOCK;
 857 strlcpy(kp-ki_lockname, td-td_lockname,
 858 sizeof(kp-ki_lockname));
 (kgdb) 
 
 This is indeed strange. It can only occur if td pointer is damaged.
 
 Please, try to get a core and at least print the content of *td in this case.



Another panic last night, after reverting dsmc schedule scripts to use 
/bin/sh (actually /compat/linux/bin/sh):

Fatal trap 12: page fault while in kernel mode
cpuid = 1; apic id = 01
fault virtual address   = 0x308
fault code  = supervisor read data, page not present
instruction pointer = 0x20:0x806026ef
stack pointer   = 0x28:0xff8094af02d0
frame pointer   = 0x28:0xff8094af0350
code segment= base 0x0, limit 0xf, type 0x1b
= DPL 0, pres 1, long 1, def32 0, gran 1
processor eflags= interrupt enabled, resume, IOPL = 0
current process = 90872 (df)
trap number = 12
panic: page fault
cpuid = 1
KDB: stack backtrace:
#0 0x8062cf8e at kdb_backtrace+0x5e
#1 0x805facd3 at panic+0x183
#2 0x808e6c20 at trap_fatal+0x290
#3 0x808e6f71 at trap_pfault+0x201
#4 0x808e742f at trap+0x3df
#5 0x808cec64 at calltrap+0x8
#6 0x80602e1e at _sx_xlock+0x4e
#7 0x80f9ca35 at rrw_enter+0xa5
#8 0x80f9ce86 at zfs_statfs+0x46
#9 0x80681258 at __vfs_statfs+0x28
#10 0x81476521 at nullfs_statfs+0x51
#11 0x80681258 at __vfs_statfs+0x28
#12 0x80690b22 at kern_statfs+0x1b2
#13 0x80690c77 at statfs+0x37
#14 0x808e62d4 at amd64_syscall+0x1f4
#15 0x808cef5c at Xfast_syscall+0xfc


Alas, the system became hung here: there is no further

Re: zfs, 1 gig of RAM and periodic weekly

2012-02-28 Thread Paul Mather

On Feb 27, 2012, at 11:10 PM, Eugene M. Zheganin wrote:

 Hi.
 
 On 28.02.2012 01:02, Nenhum_de_Nos wrote:
 regardless of the pool size ?
 
 I was planning on making an atom board a file server for my home, and I have 
 two options: soekris
 net6501 2GB RAM and intel board powered by the 330 atom (says 2GB limited as 
 well). My plans are
 to use from 4 up to 8 disks, and they should be 2TB at least.
 
 As its for home use, some p2p software and mostly music listening and 
 sometimes movie streaming.
 
 should 2GB be that bad, that I should drop it and use UFS instead ?
 
 I may run any version of FreeBSD on it, was planning on 9-STABLE or 9.1.
 
 In the same time I have a couple of hosts successfully running zfs on 768 
 Megs and on 1 Gig of RAM. Both i386.
 And they aren't affected by the periodic weekly for some reason. And they are 
 used only as fileservers.


Basically the same story here: I am using a FreeBSD/i386 system with 768 MB of 
RAM running RELENG_8 with 4 x 1 TB drives arranged as a RAIDZ1 vdev.  It is 
used as a Bacula server, backing up to the ZFS pool (with ZFS compression 
enabled).  It has been rock solid, and I've had no problems with any of the 
periodic jobs.

Here are the ZFS-related tunings I have in /boot/loader.conf:

vm.kmem_size=640M
vm.kmem_size_max=640M
vfs.zfs.arc_max=512M
vfs.zfs.txg.timeout=5


If you are planning on using P2P a lot, I had heard that large files fetched 
via Bittorrent can become very fragmented under ZFS (due to the COW nature of 
ZFS and the way Bittorrent fetches files), especially if the pool is very full, 
and so ZFS might not be the best thing to use if you are also planning on 
streaming these files, especially on modest hardware.  UFS might be preferable 
in these circumstances.

Cheers,

Paul.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: Why Are You NOT Using FreeBSD?

2012-06-02 Thread Paul Mather

On Jun 2, 2012, at 1:31 PM, Chris Rees wrote:

 On Jun 2, 2012 3:19 PM, O. Hartmann ohart...@zedat.fu-berlin.de wrote:
 
 On 06/02/12 14:47, Daniel Kalchev wrote:
 
 
 On 02.06.12 15:32, Erich wrote:
 I know that the ports tree is a moving target. But it stops moving
 during the release period. This could be used to give a fall back
 solution.
 
 Or do I see this really too simple?
 
 The ports tree is a moving target during release periods still, although
 there are efforts to make movements smaller. This is why, after a
 release it suddenly moves more :)
 
 Daniel
 
 Even IF the ports tree IS a moving target, updating of UPDATING, for
 instance, follows most times AFTER the critical ports has been
 changed/updated and folks started updating their ports without realizing
 that they have shot themselfs into the foot!
 
 
 Not reading UPDATING until there are problems is not the fault of the ports
 tree; it should be checked every time you update.
 
 Of course, many of us forget, but that still doesn't make it anyone else's
 problem when we do!


The point he made was actually not a matter of people not reading UPDATING but 
that UPDATING is oftentimes not updated until after the disruptive/potentially 
dangerous change has already hit the ports tree.  So, even though people check 
UPDATING, it won't help them because there will be nothing apropos there until 
maybe days later when someone has decided an UPDATING entry was merited in 
retrospect.

I'm not sure what the solution is for the end user.  I know I get somewhat 
leery of updating my ports if I see a large number of changes coming via 
portsnap (like the 4000+ that accompanied the recent libpng upgrade) and there 
is nothing new in UPDATING (which, happily wasn't the case with the libpng 
upgrade).  Usually, I wait a while for the dust to clear and an UPDATING entry 
potentially to appear.

Maybe the solution is to track the freebsd-ports mailing list get get advanced 
warning of large changes, but that would mean following another high-volume 
list. :-(

Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: Why Are You NOT Using FreeBSD?

2012-06-03 Thread Paul Mather

On Jun 2, 2012, at 2:56 PM, Chris Nehren wrote:

 On Sat, Jun 02, 2012 at 14:11:06 -0400 , Paul Mather wrote:
 I'm not sure what the solution is for the end user.  I know I get
 somewhat leery of updating my ports if I see a large number of changes
 coming via portsnap (like the 4000+ that accompanied the recent libpng
 upgrade) and there is nothing new in UPDATING (which, happily wasn't
 the case with the libpng upgrade).  Usually, I wait a while for the
 dust to clear and an UPDATING entry potentially to appear.
 
 If you're concerned about things breaking, don't follow the bleeding
 edge. This seems to be common sense.

Unfortunately, unlike the base operating system, which has -CURRENT, -STABLE, 
and -RELEASE, there is no well-defined bleeding edge in the case of ports.  
(Although there is a strong case to be made that it is analogous to -CURRENT.)  
So, as I said above, you have to fall back on heuristics to determine when it 
is best to update (with the caveat that waiting too long to update can also be 
as troublesome as updating too quickly).  Certainly, it's far from a case of 
read UPDATING and you'll be okay, as someone in this thread was seeming to 
imply.

NetBSD's pkgsrc has a nice feature: the quarterly package branches.  These 
follow a quarterly release cycle and receive only security updates.  It makes 
pkgsrc more akin to -CURRENT and -STABLE (or -RELEASE) instead of just -CURRENT.

 Maybe the solution is to track the freebsd-ports mailing list get get
 advanced warning of large changes, but that would mean following
 another high-volume list. :-(
 
 And any decent mailer setup can filter those messages for you, leaving
 only the messages relevant to ports you're interested in. There are also
 systems like gmane which provide an NNTP feed for mailing lists.
 Combined with a newsreader with good killfile / scoring features, it
 shouldn't be hard to keep up.


Probably not, but then again you're still relying on it breaking for someone 
else (and thereby being reported) to avoid it breaking for you. :-)

I'm not saying these are insurmountable problems, and, in my experience, most 
of the time ports updates go smoothly.  But, it can present more of a challenge 
for those that are running an individual FreeBSD system (as their 
desktop/laptop system, say), and especially if they are using non-default port 
options in the ports they install, as these don't get the benefit of widespread 
testing.

Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: PF to Preventing SMTP Brute Force Attacks

2012-06-15 Thread Paul Mather

On Jun 15, 2012, at 12:55 PM, Shiv. Nath wrote:

 # START
 table bruteforce persist
 block in log quick from bruteforce
 
 pass in on $ext_if proto tcp \
 from any to $ext_if port $trusted_tcp_ports \
 flags S/SA keep state \
 (max-src-conn-rate 3/300, overload bruteforce flush global)
 
 # END
 
 AND CRON:
 */12 * * * *  /sbin/pfctl -t ssh-bruteforce -T expire 604800 /dev/null
 21
 
 What is the function expire 604800 are they entries in the table?
 should it be -t bruteforce or -t ssh-bruteforce


It refers to entries in the table specified by the -t option and instructs pf 
to expire (remove from the table) all entries older than the specified time (in 
seconds).  Basically, the value 604800 will expire entries older than 1 week.

For the above pf rules, the cron entry should be -t bruteforce (although in 
the pf rules you should be using bruteforce).

Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

uhci0 excessive interrupts---how can I disable or reset specific USB port?

2012-08-15 Thread Paul Mather

I am running FreeBSD/amd64 9-STABLE (built Mon Jul 23 10:45:51 EDT 2012) on a 
Dell Optiplex 760 and, today, noticed I had almost 30% system CPU load in top 
even when the system was idle.  A perusal of vmstat revealed the cause to be 
excessive interrupts on uhci0, even though nothing was plugged into that USB 
port:

# vmstat -i
interrupt  total   rate
irq4: uart0   22  0
irq16: uhci0617002282738 310969
irq23: uhci3 ehci183  0
irq256: hpet0:t0   135818421 68
irq257: hpet0:t1  659301   1120
irq264: em0 29529304 14
irq265: ahci0   11132506  5
Total   619401422375 312178


Because I am only using the front USB ports on that hardware, I thought I would 
disable the other (rear) USB ports in the BIOS.  I rebooted and duly disabled 
them.  However, when FreeBSD booted, it appeared to ignore the BIOS setting: 
all the USB ports were probed as usual.  (The high interrupts had vanished, 
though that might have been due to FreeBSD correctly shutting down the 
controllers at shutdown, or just the act of rebooting itself.)

I added hint.uhci.0.disabled=1 to /boot/loader.conf (hoping it would disable 
uhci0), but, again all the USB ports appeared in the boot probes.  (However, it 
*appears* as if uhci0 has been disabled because the irq16: uhci0 line no 
longer appears in vmstat -i.  However, all the same ugen devices appear in 
/dev.)

Is there a way of disabling a specific USB controller that you don't want to 
use?  If so, how?  I looked at usbconfig, but that appears to me to be more 
about controlling devices plugged in to a USB port rather than the port itself. 
 The reset command of usbconfig appears to be about resetting USB devices, 
not ports.

If I can't disable a specific USB port, is there a way to reset it without 
rebooting?  If I ever get another crazy interrupt storm like I noticed today it 
would be nice to be able to stop it without having to do a reboot.

Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: virtio for 9.1-R

2012-11-27 Thread Paul Mather

On Nov 27, 2012, at 3:49 PM, Joe Holden li...@rewt.org.uk wrote:

 On 27/11/2012 19:25, Sergey Kandaurov wrote:
 On 27 November 2012 22:12, Joe Holden li...@rewt.org.uk wrote:
 Hi guys,
 
 I can't see virtio in releng/9.1, is there any particular reason why it
 isn't going to be included given that it works reasonable well (and is
 optional anyway, so not likely to be detrimental)?
 
 virtio appeared in stable/9 a bit after 9.1 cut off,
 and it is too late now regardless of virtio shape.
 Anyway you can installed it from ports.
 
 Ah I see, doesn't really help all the people who can't install it in KVM and 
 such though unfortunately, seems silly making them wait even
 longer and having to use Linux :)


FWIW, I installed FreeBSD 9-STABLE (pre-virtio in src) in KVM, initially using 
the emulated devices and then, post-install, installed the 
emulators/virtio-kmod port and switched over to vtbd/vtnet devices without 
problem.  When virtio appeared in src, I ditched the virtio-kmod port, again, 
without issues.

I found making the transition to virtio devices no harder than the ad - ada 
device transition.

Cheers,

Paul.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Upgrade of RELENG_8 ZFS boot pool leads to unbootable system

2013-01-02 Thread Paul Mather

Yesterday, I updated my RELENG_8 ZFS-only system that has worked like a champ 
for ages.  After a successful install{kernel,world} and reboot, I noticed the 
20121130 entry in /usr/src/UPDATING and upgraded my ZFS pool via zfs upgrade 
-a.  I also upgraded my boot blocks as requested, and as per the ZFS notes 
section of /usr/src/UPDATING.

Unfortunately rebooting with the upgraded pool failed.  The windmill boot 
spinner spins for a tiny amount of time and then stops dead. :-(  I don't get 
to the boot loader menu at all.

I downloaded a very recent RELENG_8 snapshot 
(FreeBSD-8.3-RELENG_8-r244923-JPSNAP-amd64-amd64-memstick.img) from 
pub.allbsd.org and was able to boot successfully from USB using that.  I 
entered Fixit Mode and tried to write the boot blocks on the memstick image 
onto my hard drives but the resultant system still wouldn't boot.  The commands 
I used (from Fixit Mode) are these:

gpart bootcode -b /dist/boot/pmbr -p /dist/boot/gptzfsboot -i 1 ad4
gpart bootcode -b /dist/boot/pmbr -p /dist/boot/gptzfsboot -i 1 ad6

(ad4 and ad6 are my two hard drives.)

If I load zfs before booting the USB memstick then I can see my old pool 
listed when I do zfs import.  I haven't tried importing the pool because I'm 
not sure if that would make the problem worse.

Does anyone have any advice in restoring this system to bootability?  I 
followed the standard root on ZFS recipe using a two drive mirror when 
installing the system initially.  Each drive uses GPT with three partitions: 
freebsd-boot, freebsd-swap, and freebsd-zfs in that order.  Like I said at the 
start, all this worked for a long time until just now when I upgraded the pool 
to enable feature flags support. :-(

Any help is appreciated.

Cheers,

Paul.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: Upgrade of RELENG_8 ZFS boot pool leads to unbootable system

2013-01-02 Thread Paul Mather

On Jan 2, 2013, at 2:10 PM, Matthew Seaman m.sea...@infracaninophile.co.uk 
wrote:

 On 02/01/2013 17:49, Paul Mather wrote:
 Yesterday, I updated my RELENG_8 ZFS-only system that has worked like a 
 champ for ages.  After a successful install{kernel,world} and reboot, I 
 noticed the 20121130 entry in /usr/src/UPDATING and upgraded my ZFS pool via 
 zfs upgrade -a.  I also upgraded my boot blocks as requested, and as per 
 the ZFS notes section of /usr/src/UPDATING.
 
 Unfortunately rebooting with the upgraded pool failed.  The windmill boot 
 spinner spins for a tiny amount of time and then stops dead. :-(  I don't 
 get to the boot loader menu at all.
 
 I downloaded a very recent RELENG_8 snapshot 
 (FreeBSD-8.3-RELENG_8-r244923-JPSNAP-amd64-amd64-memstick.img) from 
 pub.allbsd.org and was able to boot successfully from USB using that.  I 
 entered Fixit Mode and tried to write the boot blocks on the memstick image 
 onto my hard drives but the resultant system still wouldn't boot.  The 
 commands I used (from Fixit Mode) are these:
 
  gpart bootcode -b /dist/boot/pmbr -p /dist/boot/gptzfsboot -i 1 ad4
  gpart bootcode -b /dist/boot/pmbr -p /dist/boot/gptzfsboot -i 1 ad6
 
 (ad4 and ad6 are my two hard drives.)
 
 If I load zfs before booting the USB memstick then I can see my old pool 
 listed when I do zfs import.  I haven't tried importing the pool because 
 I'm not sure if that would make the problem worse.
 
 Does anyone have any advice in restoring this system to bootability?  I 
 followed the standard root on ZFS recipe using a two drive mirror when 
 installing the system initially.  Each drive uses GPT with three partitions: 
 freebsd-boot, freebsd-swap, and freebsd-zfs in that order.  Like I said at 
 the start, all this worked for a long time until just now when I upgraded 
 the pool to enable feature flags support. :-(
 
 Any help is appreciated.
 
 I think you may be running into problems with zpool.cache.  This has
 been fixed in current, which now has the ability to find the root zpool
 without a valid zpool.cache, but that I suspect is faint comfort for you.
 
 To recover from a toasted zpool.cache, you need to boot from alternate
 media and then import your root zpool.  It's easiest to do that to a
 temporary directory.  The important bit is to copy the zpool.cache onto
 your actual zroot device:
 
 -- Boot from install media to 'Live CD' and log in as root (no password)

Given the above, does this need to be a -CURRENT Live CD?  I've been using the 
RELENG_8 snapshot memstick.img mentioned in my original message above.


 
 # kldload zfs   -- should load opensolaris.ko automatically
 # cd /tmp   -- this should be a writable MFS; you'll
   need to arrange something similar if
   not.
 # zpool import -o cachefile=/tmp/zpool.cache -R /tmp/zroot zroot
-- this should create a zpool.cache file


I tried this and it complained about the pool being in use by another 
system---the original system that won't boot any more.  I expected this, and 
added -f to force an import.

 # cp zpool.cache /tmp/zroot/boot/zfs/


This part also failed for me.  My zroot fileset has a mountpoint property set 
to legacy.  I had to mount this manually, via mount -t zfs zroot /tmp/zroot 
to make the /tmp/zroot/boot/zfs directory accessible.


 # zfs umount -a
 # shutdown -r
 
 Eject the install media, and the system should boot up from your root spool.


Unfortunately, it didn't boot from the root pool.  I get the same thing 
happening: the windmill spins for a very short time and then stops dead.  I 
don't even make it to the BTX Loader output, let alone the boot loader menu 
options. :-(

Thank you for the suggestions.

Cheers,

Paul.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Solved?: Re: Upgrade of RELENG_8 ZFS boot pool leads to unbootable system

2013-01-03 Thread Paul Mather

On Jan 2, 2013, at 2:10 PM, Matthew Seaman m.sea...@infracaninophile.co.uk 
wrote:

 On 02/01/2013 17:49, Paul Mather wrote:
 Yesterday, I updated my RELENG_8 ZFS-only system that has worked like a 
 champ for ages.  After a successful install{kernel,world} and reboot, I 
 noticed the 20121130 entry in /usr/src/UPDATING and upgraded my ZFS pool via 
 zfs upgrade -a.  I also upgraded my boot blocks as requested, and as per 
 the ZFS notes section of /usr/src/UPDATING.
 
 Unfortunately rebooting with the upgraded pool failed.  The windmill boot 
 spinner spins for a tiny amount of time and then stops dead. :-(  I don't 
 get to the boot loader menu at all.
 
 I downloaded a very recent RELENG_8 snapshot 
 (FreeBSD-8.3-RELENG_8-r244923-JPSNAP-amd64-amd64-memstick.img) from 
 pub.allbsd.org and was able to boot successfully from USB using that.  I 
 entered Fixit Mode and tried to write the boot blocks on the memstick image 
 onto my hard drives but the resultant system still wouldn't boot.  The 
 commands I used (from Fixit Mode) are these:
 
  gpart bootcode -b /dist/boot/pmbr -p /dist/boot/gptzfsboot -i 1 ad4
  gpart bootcode -b /dist/boot/pmbr -p /dist/boot/gptzfsboot -i 1 ad6
 
 (ad4 and ad6 are my two hard drives.)
 
 If I load zfs before booting the USB memstick then I can see my old pool 
 listed when I do zfs import.  I haven't tried importing the pool because 
 I'm not sure if that would make the problem worse.
 
 Does anyone have any advice in restoring this system to bootability?  I 
 followed the standard root on ZFS recipe using a two drive mirror when 
 installing the system initially.  Each drive uses GPT with three partitions: 
 freebsd-boot, freebsd-swap, and freebsd-zfs in that order.  Like I said at 
 the start, all this worked for a long time until just now when I upgraded 
 the pool to enable feature flags support. :-(
 
 Any help is appreciated.
 
 I think you may be running into problems with zpool.cache.  This has
 been fixed in current, which now has the ability to find the root zpool
 without a valid zpool.cache, but that I suspect is faint comfort for you.


It turns out it was my /boot.config that was preventing booting.  The system is 
usually always headless, so I have -S115200 -Dh as the sole line in 
/boot.config to enable a 115200 baud serial console.  This has been working 
fine for me up until I did a {build,install} {kernel,world} on 1st January 
2013.  I was pretty sure my woes began after I did the zpool upgrade -a and 
subsequently rebooted again, but now I can't be sure whether I successfully 
rebooted at all after the make installworld and mergemaster step.

Does anyone know a sure-fire way of getting a dual console setup (high-speed 
serial + VGA).  The recipe I had been using had worked well for a long time.  I 
had -S115200 -Dh in /boot.config and the following entries in 
/boot/loader.conf:

boot_multicons=YES
comconsole_speed=115200
console=comconsole,vidconsole

Now, though, if I have -S115200 -Dh then the system locks up at boot.  
Removing /boot.config gets me dual console, but only at 9600 baud. :-(

Cheers,

Paul.

PS: Is the BOOT_COMCONSOLE_SPEED entry in /etc/make.conf needed?  I was under 
the impression it has been obsolete for a while and took it out of my 
/etc/make.conf file.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: Solved?: Re: Upgrade of RELENG_8 ZFS boot pool leads to unbootable system

2013-01-04 Thread Paul Mather

On Jan 4, 2013, at 5:39 AM, Vincent Hoffman vi...@unsane.co.uk wrote:

 On 03/01/2013 21:18, Paul Mather wrote:
 
 
 It turns out it was my /boot.config that was preventing booting.  The system 
 is usually always headless, so I have -S115200 -Dh as the sole line in 
 /boot.config to enable a 115200 baud serial console.  This has been working 
 fine for me up until I did a {build,install} {kernel,world} on 1st January 
 2013.  I was pretty sure my woes began after I did the zpool upgrade -a 
 and subsequently rebooted again, but now I can't be sure whether I 
 successfully rebooted at all after the make installworld and mergemaster 
 step.
 
 Does anyone know a sure-fire way of getting a dual console setup (high-speed 
 serial + VGA).  The recipe I had been using had worked well for a long time. 
  I had -S115200 -Dh in /boot.config and the following entries in 
 /boot/loader.conf:
 
  boot_multicons=YES
  comconsole_speed=115200
  console=comconsole,vidconsole
 
 Now, though, if I have -S115200 -Dh then the system locks up at boot.  
 Removing /boot.config gets me dual console, but only at 9600 baud. :-(
 I have
 root@copia:/root # more /boot.config
 -Dh
 
 and
 root@copia:/root # grep 'cons' /boot/loader.conf
 console=comconsole,vidconsole
 comconsole_speed=19200
 boot_multicons=yes
 
 Which works fine for ipmi based serial over lan console in a generic
 9.1-RELEASE.
 Not sure thats that helpful but 19200 is better than 9600 ;)


That's weird.  I have the same basic /boot/loader.conf entries (except for a 
speed of 115200) and even just putting -Dh into /boot.config leads to the 
same unbootable system behaviour. :-(

Maybe something broke recently in the RELENG_8 boot loader?  Like I said 
originally, that /boot.config entry had been working for me without problems 
for a long time.

Cheers,

Paul.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: pkgng and updated packages

2013-01-28 Thread Paul Mather

On Jan 28, 2013, at 1:59 PM, Rainer Duffner rai...@ultra-secure.de wrote:

   
 Am 28.01.2013 um 18:31 schrieb Glen Barber g...@freebsd.org:
 
 On Mon, Jan 28, 2013 at 06:28:02PM +0100, Rainer Duffner wrote:
 I go from PERL 5.10 to PERL 5.16, for example and it complains that
 perl5.16 conflicts with perl5.10...
 
 This I needed, too:
 
 pkg set -o long/perl5.10:lang/perl5.16
 pkg remove perl 
 pkg set -o devel/pkg-config:devel/pkgconf
 pkg remove -f pkg-config
 
 
 Hmm, you should not have needed to remove perl or pkg-config.  They
 should have been upgraded as any other package.
 
 
 
 I tried it without and it said it conflicted. It wanted to install perl5.16, 
 without doing anything to 5.10.


The lang/perl5.10 and lang/perl5.16 ports are separate ports that are marked in 
their respective Makefiles as conflicting (because they install files into 
common places).  Perl 5.16 is not an upgrade of Perl 5.10 in the standard 
ports sense.  That is why both the Makefile and pkgng will complain if you try 
and install both (e.g., installing Perl 5.16 when Perl 5.10 is still installed).

If you want to switch to lang/perl5.16 from lang/perl5.10 you could follow a 
procedure like that outlined in the 20120630 entry of /usr/ports/UPDATING.

Cheers,

Paul.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: some issues with /usr/sbin/service

2013-02-16 Thread Paul Mather

On Feb 16, 2013, at 4:21 AM, Jeremy Chadwick wrote:

 On Sat, Feb 16, 2013 at 12:23:33PM +0400, Boris Samorodov wrote:
 16.02.2013 01:32, Jeremy Chadwick ??:
 
 Follow up -- I read Alfred's most recent mail.  Lo and behold, I find
 this in /var/log/messages (but such did not come to my terminal):
 
 Feb 15 13:26:20 icarus jdc: /usr/sbin/service: WARNING: $svnserve_enable is 
 not set properly - see rc.conf(5).
 Feb 15 13:26:20 icarus jdc: /usr/sbin/service: WARNING: $smartd_enable is 
 not set properly - see rc.conf(5).
 Feb 15 13:26:20 icarus jdc: /usr/sbin/service: WARNING: $rsyncd_enable is 
 not set properly - see rc.conf(5).
 Feb 15 13:26:20 icarus jdc: /usr/sbin/service: WARNING: 
 $htcacheclean_enable is not set properly - see rc.conf(5).
 Feb 15 13:26:20 icarus jdc: /usr/sbin/service: WARNING: $fetchmail_enable 
 is not set properly - see rc.conf(5).
 
 Cute.  Agreed -- this is unacceptable on two levels (as I see it):
 
 1) These messages should be going to stdout or stderr in some way, so
 honestly logger(8) should be called with the -s flag (IMO).
 
 Fully agreed here.
 
 It turns out logger -s has no effect, just like how the echo 12
 statements in warn() and err() have no effect either (these should be
 outputting the warnings in question to stderr) -- see rc.subr's source
 for what I'm referring to.
 
 Gary and I have been discussing this off-list and the reason has been
 found: service(8) has this code in it:
 
   checkyesno $rcvar 2/dev/null  echo $file
 
 This explains why there's no warn() or err() output on the terminal --
 it's being redirected to /dev/null prior.
 
 I do not know who maintains the rc(8) and rc.subr(8) framework, but
 they've got their work cut out for them.
 
 (Note: the echo statements in warn() and err() could be replaced with
 logger -s as I said; this would allow the echo 12 to be removed)
 
 2) These messages should not be displayed at all (i.e. lack of an
 xxx_enable variable should imply xxx_enable=no).
 
 I see this message as one more level of supervision.
 
 If undefined at /etc/make.conf the value of xxx_enable is no from the
 system's POV (i.e. the service is not strarted). From the
 admininstrators's POV the port was installed BUT is not used. It's up
 to admininstrator whether it's OK or not -- just let him remind.
 
 I believe the point you're trying to make is that the warning in
 question should 'act as a reminder to the administrator that they need
 to set xxx_enable=yes in rc.conf'.
 
 If not: please explain if you could what you mean, because I don't
 understand.
 
 If so: I strongly disagree with this method of approach, as what you've
 proposed is a borderline straw man argument.
 
 Reminding the admin to set xxx_enable is presently done inside most
 ports' pkg-message.  IMO, this should really be done inside bsd.port.mk
 when USE_RC_SUBR is used, emitting a message during install that says
 something like:
 
 To enable the xxx service, please add the following to /etc/rc.conf:
 xxx_enable=yes
 
 Of course, I don't know if this would work for packages.
 
 The current message for missing xxx_enable in rc.conf is this:
 
 WARNING: $xxx_enable is not set properly - see rc.conf(5).
 
 The message is entirely misleading for this specific situation; it isn't
 reminding an administrator -- if anything it's confusing them (thread
 is case in point).  If we're going to cater to ignorance, then the
 message should reflect the situation.
 
 Thus IMO, this is what ***should*** happen:
 
 Definition in rc.confBehaviour/result
 ---  ---
 myprog_enable=yes  emit no warnings, service should run
 myprog_enable=no   emit no warnings, service should not run
 myprog_enable=abc123   emit a warning,   service should not run
 no definition  emit no warnings, service should not run


I think case 4 (no definition) is a case where a warning should be emitted 
because it is arguably not immediately apparent what will actually happen if no 
definition is present.  In the case of services in the base OS it is 
well-defined: every service should have an explicit default in 
/etc/defaults/rc.conf that you can easily consult to know definitively what 
will happen with that service.  (If it doesn't, that is a bug, IMHO.)

For ports, the case is not so clear.  There is a general trend for the port 
rc.d script to default its respective xxx_enable explicitly to NO.  But it is 
not a universal rule that no definition = default to NO.  The net/avahi-app 
port, for example, doesn't default to NO if xxx_enable is not set: it 
defaults to whatever the gnome_enable setting is defined to be.

For the base OS, I agree with your case 4; for the land of ports, I don't think 
the answer is so cut and dried.  That is why I think the warning is useful: it 
reminds admins to look at the service in question and make a firm decision of 
their own as to whether it should explicitly be turned on or off.  It

Re: ask about stable

2013-02-20 Thread Paul Mather

On Feb 20, 2013, at 4:29 AM, Fleuriot Damien m...@my.gd wrote:

 Here, I think this is what you're looking for:
 
 http://www.freebsd.org/doc/handbook/updating-upgrading-freebsdupdate.html
 
 
 You should be able to move from 9.1-RELEASE to 9-STABLE with freebsd-update :)


Except that the freebsd-update(8) man page includes the following:

DESCRIPTION

 The freebsd-update tool is used to fetch, install, and rollback binary
 updates to the FreeBSD base system.  Note that updates are only available
 if they are being built for the FreeBSD release and architecture being
 used; in particular, the FreeBSD Security Team only builds updates for
 releases shipped in binary form by the FreeBSD Release Engineering Team,
 e.g., FreeBSD 7.3-RELEASE and FreeBSD 8.0-RELEASE, but not FreeBSD
 6.3-STABLE or FreeBSD 9.0-CURRENT.


In other words, you can't use it to track -STABLE or -CURRENT.

Cheers,

Paul.


 
 
 
 On Feb 20, 2013, at 9:57 AM, Nurhermansyah eka ekanoti...@gmail.com wrote:
 
 hi,
 oh ya .. true .. I'm running 9.1-RELEASE and I want to update to 9-STABLE
 
 it's how to update to 9-STABLE?
 
 thank you
 
 sorry if my english is bad, i from indonesia.. 
 
 
 2013/2/20 Damien Fleuriot m...@my.gd
 
 On 20 Feb 2013, at 06:03, Nurhermansyah eka ekanoti...@gmail.com wrote:
 
 hi all..
 
 if I could make a stable version for freebsd 9.1-release ??
 
 thanks
 
 
 Hi,
 
 Did you mean I'm running 9.1-RELEASE and I want to update it to 9-STABLE ?
 
 If not, kindly clarify...
 
 
 ___
 freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org
 

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: IPMI serial console

2013-02-22 Thread Paul Mather

On Feb 21, 2013, at 11:29 PM, Jeremy Chadwick j...@koitsu.org wrote:

 On Fri, Feb 22, 2013 at 02:22:52PM +1030, Daniel O'Connor wrote:
 On 22/02/2013, at 12:02, Jeremy Chadwick j...@koitsu.org wrote:
 Hmm I tried putting '-S 115200' in /boot.config and it broke - the boot 
 process didn't run the loader (or kernel).
 
 I'll talk a bit about this -- again, sorry for the verbosity.  I'll
 explain what I've historically used/done, then speculate a bit about
 your IPMI stuff:
 
 For me, on systems without IPMI, all I had to do was this (and nothing
 else):
 
 * Put the following in /boot.config:
 
 -S115200 -Dh
 
 This breaks the boot for me, boot.config has to contain more than just
 flags it seems. In any case I believe setting boot_multicons and
 boot_serial is the same as -Dh. Not sure about the baud rate though.
 
 Then someone broke something (parser or something else).  This has
 always, *always* worked (just flags).  The last time I verified it was
 with the release of 9.0-RELEASE.  I do have a system I could test this
 on, but I'd need to find a null modem cable first.
 
 I have seen some MFCs that touch those bits in the bootloader, but from
 my memory it didn't touch anything other than supporting /boot/config as
 an alternate location to the classic /boot.config file.  I would be very
 surprised if this broke it.
 
 I can assure you that those were the only flags that were needed, and in
 exactly that syntax.  Even the Handbook has this in it, as well as
 boot(8).
 
 I believe your explanation of boot_multicons and boot_serial are correct
 and do correlate with -D and -h.  I could look at the bootstrap code to
 verify.  The options are described in loader(8) but not loader.conf(5).


I think something did break back at the start of the year that caused 
/boot.config contents to render the system completely unbootable.  At least 
that is what happened to me on RELENG_8: 
http://lists.freebsd.org/pipermail/freebsd-stable/2013-January/071579.html

Bruce A. Mah reports later in the same thread that his happened to him on 
8.3-RELEASE.  I don't know if this was fixed.

Cheers,

Paul.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: IPMI serial console

2013-02-22 Thread Paul Mather

On Feb 21, 2013, at 7:14 PM, Daniel O'Connor docon...@gsoft.com.au wrote:

 
 On 22/02/2013, at 10:40, Jeremy Chadwick j...@koitsu.org wrote:
 X9SIL-F BIOS version 1.1 (05/27/10)
 IPMI firmware is 2.01.
 
 I can't find this motherboard listed on Supermicro's site.
 
 kenv | grep smbios output please?
 
 
 Sorry, brainfart, it's an X8SIL-F 
 http://www.supermicro.com/xeon_3400/Motherboard/X8SIL.cfm?IPMI=Y


I am also using that motherboard and currently IPMI is working for me under 
9.1-STABLE (r246366).  My system has BIOS version 1.2 and IPMI firmware version 
2.66.  I have both the VGA KVM console and SOL console working (though the VGA 
console seems sensitive to the version of Java you have installed). I am using 
sysutils/ipmitool to access the system via IPMI SOL console.

Here is what my IPMI SOL setup looks like in the BIOS setup screen:

  Advanced  

* Advanced - Remote Access Configuration  * Select Remote Access   *
* *** * type.  *
* Remote Access  [Enabled]**
* **
* Serial Port Number [COM3 *] **
*  Base Address, IRQ [3E8h, 5]**
* Serial Port Mode   [115200 8,n,1]   **
* Flow Control   [None]   **
* Redirection After BIOS POST[Always] **
* Terminal Type  [ANSI]   **
* VT-UTF8 Combo Key Support  [Enabled]**
* Sredir Memory Display Delay[No Delay]   **
* *   :Move*
* *   Enter:Select *
* *   +/-/:Value   *
* *   F10:Save *
* *   ESC:Exit *
* *   F1:General Help  *
* *   F8:Fail-Safe Defaults*
* *   F9:Optimized Defaults*



Here is what I have in /boot/loader.conf regarding a serial console:

=
# Enable IPMI serial console
#console=comconsole
# Give preference to VGA console
console=vidconsole,comconsole
# Uncomment below and comment above to give serial console preference
#console=comconsole,vidconsole
comconsole_speed=115200
boot_multicons=YES
hint.uart.0.flags=0x0
hint.uart.2.at=isa
hint.uart.2.port=0x3E8
#hint.uart.2.disabled=0
hint.uart.2.flags=0x30
=

I don't know whether all of that is needed, but, as you can see from the 
various commented-out lines, it was a configuration I arrived at that works. 
:-)

I don't have a /boot.config on this system.

Usually, I have the VGA console take precedence.  In that case, I get messages 
on both the VGA console and the IPMI SOL console during the BIOS screen, 
loader, and kernel boot messages but then only messages on the VGA console 
during the rc.d boot phase.  I have a serial console enabled in /etc/ttys, so I 
get output (e.g., getty login) on the IPMI SOL when that is eventually spawned. 
 If I want to have the serial console take precedence, I usually escape to the 
loader prompt (via ESC at the loader menu) and issue a set 
console=comconsole,vidconsole command.  Then, the rc.d boot output goes to 
the serial console and not the VGA console.  (It would be nice to have rc.d 
init scripts output go to both consoles, but I don't know whether that is 
possible.)

Cheers,

Paul.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Virtio and GEOM labels

2013-03-22 Thread Paul Mather

I'm running FreeBSD 9-STABLE as a guest under RHEL 6.4 KVM virtualisation.  I 
have networking and storage in the FreeBSD guest using the Virtio drivers (with 
the virtual disk set to Virtio in the definition on the host).  Everything is 
working nicely: I have a vtnet network adapter and see vtbd devices for my 
virtual disks in FreeBSD.  Performance is much better compared with an emulated 
IDE device.

The odd thing is that I don't see GEOM labels reflected in /dev.  For example, 
I have GPT labels defined in the guest, but I don't see them show up under 
/dev/gpt.  Similarly, my UFS labels don't show up under /dev/ufs.  I *do* see a 
/dev/gptid.  That appears to be the only label that shows up.

Is there something special I need to do to get GPT and UFS labels to appear 
when using Virtio?  It seems to me that Virtio block devices appear to be 
somewhat unusual.  Unlike regular ATA and SCSI devices, my vtbd devices don't 
appear in the boot dmesg (although a vtblk device does), and camcontrol 
devlist does not list them.  It's not clear to me how I am supposed to 
interact with them other than via basic device I/O through /dev/vtbdX.  I 
thought that the virtio_scsi module might make them appear as da devices and 
able to interacted with via camcontrol, but this doesn't seem to be the case.

I've included my system dmesg at the end of this message.

Cheers,

Paul.

=
Copyright (c) 1992-2013 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
The Regents of the University of California. All rights reserved.
FreeBSD is a registered trademark of The FreeBSD Foundation.
FreeBSD 9.1-STABLE #0 r248465: Mon Mar 18 11:19:23 EDT 2013
pmat...@freebsd9.virtual.lib.vt.edu:/usr/obj/usr/src/sys/KVMTEST amd64
FreeBSD clang version 3.2 (tags/RELEASE_32/final 170710) 20121221
CPU: QEMU Virtual CPU version (cpu64-rhel6) (2100.15-MHz K8-class CPU)
  Origin = AuthenticAMD  Id = 0x6d3  Family = 0x6  Model = 0xd  Stepping = 3
  
Features=0x783f3ffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE,SSE2
  Features2=0x80002001SSE3,CX16,HV
  AMD Features=0x22500800SYSCALL,NX,MMX+,FFXSR,LM
  AMD Features2=0x73LAHF,CMP,CR8,ABM,SSE4A
real memory  = 8589934592 (8192 MB)
avail memory = 8255410176 (7872 MB)
Event timer LAPIC quality 400
ACPI APIC Table: BOCHS  BXPCAPIC
FreeBSD/SMP: Multiprocessor System Detected: 2 CPUs
FreeBSD/SMP: 2 package(s) x 1 core(s)
 cpu0 (BSP): APIC ID:  0
 cpu1 (AP): APIC ID:  1
ioapic0 Version 1.1 irqs 0-23 on motherboard
kbd1 at kbdmux0
acpi0: BOCHS BXPCRSDT on motherboard
acpi0: Power Button (fixed)
cpu0: ACPI CPU on acpi0
cpu1: ACPI CPU on acpi0
atrtc0: AT realtime clock port 0x70-0x71,0x72-0x77 irq 8 on acpi0
Event timer RTC frequency 32768 Hz quality 0
Timecounter ACPI-fast frequency 3579545 Hz quality 900
acpi_timer0: 24-bit timer at 3.579545MHz port 0xb008-0xb00b on acpi0
pcib0: ACPI Host-PCI bridge port 0xcf8-0xcff on acpi0
pci0: ACPI PCI bus on pcib0
pci_link4: Unable to route IRQs: AE_NOT_FOUND
isab0: PCI-ISA bridge at device 1.0 on pci0
isa0: ISA bus on isab0
atapci0: Intel PIIX3 WDMA2 controller port 
0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xc000-0xc00f at device 1.1 on pci0
ata0: ATA channel at channel 0 on atapci0
ata1: ATA channel at channel 1 on atapci0
uhci0: Intel 82371SB (PIIX3) USB controller port 0xc020-0xc03f irq 11 at 
device 1.2 on pci0
usbus0: controller did not stop
usbus0 on uhci0
pci0: bridge at device 1.3 (no driver attached)
vgapci0: VGA-compatible display mem 
0xf000-0xf1ff,0xf200-0xf2000fff at device 2.0 on pci0
virtio_pci0: VirtIO PCI Network adapter port 0xc040-0xc05f mem 
0xf202-0xf2020fff irq 11 at device 3.0 on pci0
vtnet0: VirtIO Networking Adapter on virtio_pci0
virtio_pci0: host features: 0x711fffe3 
EventIdx,RingIndirect,NotifyOnEmpty,RxModeExtra,VLanFilter,RxMode,ControlVq,Status,MrgRxBuf,TxUFO,TxTSOECN,TxTSOv6,TxTSOv4,RxUFO,RxECN,RxTSOv6,RxTSOv4,TxAllGSO,MacAddress,RxChecksum,TxChecksum
virtio_pci0: negotiated features: 0x110fbba3 
RingIndirect,NotifyOnEmpty,VLanFilter,RxMode,ControlVq,Status,MrgRxBuf,TxTSOECN,TxTSOv6,TxTSOv4,RxECN,RxTSOv6,RxTSOv4,MacAddress,RxChecksum,TxChecksum
vtnet0: Ethernet address: 52:54:00:3e:48:94
virtio_pci1: VirtIO PCI Balloon adapter port 0xc060-0xc07f irq 10 at device 
5.0 on pci0
vtballoon0: VirtIO Balloon Adapter on virtio_pci1
virtio_pci1: host features: 0x7102 
EventIdx,RingIndirect,NotifyOnEmpty,StatsVq
virtio_pci1: negotiated features: 0x0
virtio_pci2: VirtIO PCI Block adapter port 0xc080-0xc0bf mem 
0xf204-0xf2040fff irq 10 at device 6.0 on pci0
vtblk0: VirtIO Block Adapter on virtio_pci2
virtio_pci2: host features: 0x71000654 
EventIdx,RingIndirect,NotifyOnEmpty,Topology,FlushCmd,BlockSize,DiskGeometry,MaxNumSegs
virtio_pci2: negotiated features: 0x1254 
RingIndirect,FlushCmd,BlockSize,DiskGeometry,MaxNumSegs
vtblk0: 8192MB (16777216 512 byte sectors)
atkbdc0: Keyboard controller (i8042) port 0x60,0x64 irq 1 on

Re: Virtio and GEOM labels

2013-03-25 Thread Paul Mather

On Mar 25, 2013, at 1:46 PM, John Nielsen li...@jnielsen.net wrote:

 On Mar 22, 2013, at 8:14 AM, Paul Mather p...@gromit.dlib.vt.edu wrote:
 
 I'm running FreeBSD 9-STABLE as a guest under RHEL 6.4 KVM virtualisation.  
 I have networking and storage in the FreeBSD guest using the Virtio drivers 
 (with the virtual disk set to Virtio in the definition on the host).  
 Everything is working nicely: I have a vtnet network adapter and see vtbd 
 devices for my virtual disks in FreeBSD.  Performance is much better 
 compared with an emulated IDE device.
 
 I've had the same experience.
 
 The odd thing is that I don't see GEOM labels reflected in /dev.  For 
 example, I have GPT labels defined in the guest, but I don't see them show 
 up under /dev/gpt.  Similarly, my UFS labels don't show up under /dev/ufs.  
 I *do* see a /dev/gptid.  That appears to be the only label that shows up.
 
 I have not encountered this issue. I use virtio block devices and GPT labels 
 exclusively in multiple FreeBSD 9.1 guests and all mount/function without 
 issue. How are you referring to your filesystems in /etc/fstab? IIRC GEOM 
 makes not-in-use labels disappear when a device is in use (e.g. mounted). If 
 you take a new device, put a labeled GPT partition on it and a labeled UFS 
 partition on that but don't mount anything, what happens?


Thanks for the reply.

My apologies: this is a case of pilot error on my part.  I was mounting the 
devices as /dev/vtbd...  I hadn't realised that the present-but-unused labels 
were being suppressed when the device was mounted.  Has this always been the 
case?  For some reason I had a distinct recollection stuck in my mind that all 
labels showed up in /dev.

Anyway, many thanks for pointing the way to the solution.  I now have GPT- and 
UFS-labelled devices mounted in my FreeBSD guest system.


 
 Is there something special I need to do to get GPT and UFS labels to appear 
 when using Virtio?  It seems to me that Virtio block devices appear to be 
 somewhat unusual.  Unlike regular ATA and SCSI devices, my vtbd devices 
 don't appear in the boot dmesg (although a vtblk device does), and 
 camcontrol devlist does not list them.  It's not clear to me how I am 
 supposed to interact with them other than via basic device I/O through 
 /dev/vtbdX.  I thought that the virtio_scsi module might make them appear as 
 da devices and able to interacted with via camcontrol, but this doesn't 
 seem to be the case.
 
 Virtio block devices and virtio SCSI devices are not the same. If you want to 
 use the virtio_scsi module in FreeBSD you should expose a virtio SCSI device 
 from the host.


Thank you for the explanation.  I've been using virt-manager up to now for 
setting up KVM guests and it seems that virtio-scsi isn't exposed through that 
interface---only through the command line.  I'll have to investigate...

Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: ZFS Panic after freebsd-update

2013-07-01 Thread Paul Mather

On Jul 1, 2013, at 11:49 AM, Jeremy Chadwick j...@koitsu.org wrote:

 On Mon, Jul 01, 2013 at 11:35:30AM -0400, Scott Sipe wrote:
 *** Sorry for partial first message! (gmail sent after multiple returns
 apparently?) ***
 
 Hello,
 
 I have not had much time to research this problem yet, so please let me
 know what further information I might be able to provide.
 [[...]]
 Any thoughts?
 
 Thoughts:
 
 [[..]]
 Of course when I see lines like this:
 
  Trying to mount root from zfs:zroot
 
  ...this greatly diminishes any chances of live debugging on the
  system.  It amazes me how often I see this come up on the lists -- people
  who have ZFS problems but use ZFS for their root/var/tmp/usr.  I wish
  that behaviour would stop, as it makes debugging ZFS a serious PITA.
  This comes up on the list almost constantly, sad panda.


I'm not sure why it amazes you that people are making widespread use of ZFS.  
You could make the same argument that people shouldn't use UFS2 journaling on 
their file systems because bugs in the implementation might make debugging 
journaled UFS2 file systems a serious PITA.  The point is that there are VERY 
compelling reasons why people might want to use ZFS for root/var/tmp/usr/etc. 
(pooled storage; easy snapshots; etc.) and there should come a time when a 
given file system is generally regarded as safe.  I'd say the time for ZFS 
came when they removed the big disclaimer from the boot messages.  If ZFS is 
dangerous, they should reinstate the not ready for production warning.  Until 
they do, I think it's unfair to castigate people for using ZFS universally.

Isn't it a recurring theme on freebsd-current and freebsd-stable that more 
people need to use features so they can be debugged in realistic environments?  
If you're telling them, don't use that because it makes debugging harder, how 
are they supposed to get debugged and hence improved? :-)

Cheers,

Paul.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Enabling pf in 9-STABLE guest on KVM triggers abrt crash report

2013-08-07 Thread Paul Mather

I have been using 9-STABLE as a guest under KVM on RHEL 6 for several months 
now without incident.  I am using the virtio drivers and using bridged 
networking on the host to attach my guests.

Recently, I enabled pf in one of my 9-STABLE (r253579) guests and subsequently 
started to receive intermittent crash reports from abrt on the KVM host.  Has 
anyone else encountered problems using pf under KVM virtualisation?

A typical crash report from the host goes like this:

=
abrt_version:   2.0.8
cmdline:ro root=/dev/mapper/chumby-root rd_LVM_LV=chumby/root 
rd_NO_LUKS LANG=en_US.UTF-8 rd_LVM_LV=chumby/swap SYSFONT=latarcyrheb-sun16 
crashkernel=137M@0M rd_MD_UUID=b7338ac5:b08fdc1b:34d0fcf1:cf28da17  
KEYBOARDTYPE=pc KEYTABLE=us rd_NO_DM rhgb quiet console=tty0 
console=ttyS1,115200
kernel: 2.6.32-358.14.1.el6.x86_64
not-reportable: A kernel problem occurred, but your kernel has been tainted 
(flags:GW  ). Kernel maintainers are unable to diagnose tainted reports.
time:   Wed 07 Aug 2013 11:41:22 AM EDT

sosreport.tar.xz: Binary file, 2114408 bytes

backtrace:
:WARNING: at net/core/dev.c:1759 skb_gso_segment+0x1df/0x2b0() (Tainted: G  
  W  --- )
:Hardware name: AX1204-819-R700UB
:igb: caps=(0x12114bb3, 0x0) len=2084 data_len=0 ip_summed=0
:Modules linked in: iptable_nat nf_nat nf_conntrack_ipv4 nf_defrag_ipv4 
iptable_filter ip_tables ebtable_nat ebtables xt_CHECKSUM cpufreq_ondemand 
powernow_k8 freq_table mperf bridge stp llc ipt_REJECT ip6t_REJECT 
nf_conntrack_ipv6 nf_defrag_ipv6 xt_state nf_conntrack ip6table_filter 
ip6_tables ipv6 ext2 vhost_net macvtap macvlan tun kvm_amd kvm igb dca ptp 
pps_core microcode sg serio_raw fam15h_power k10temp amd64_edac_mod edac_core 
edac_mce_amd i2c_piix4 i2c_core shpchp ext4 mbcache jbd2 raid1 sr_mod cdrom 
sd_mod crc_t10dif pata_acpi ata_generic pata_atiixp ahci dm_mirror 
dm_region_hash dm_log dm_mod [last unloaded: nf_defrag_ipv4]
:Pid: 3262, comm: vhost-3242 Tainted: GW  ---
2.6.32-358.14.1.el6.x86_64 #1
:Call Trace:
:IRQ  [8106e307] ? warn_slowpath_common+0x87/0xc0
:[8106e3f6] ? warn_slowpath_fmt+0x46/0x50
:[a01b7d62] ? igb_get_drvinfo+0x82/0xe0 [igb]
:[81448c2f] ? skb_gso_segment+0x1df/0x2b0
:[81449010] ? dev_hard_start_xmit+0x1b0/0x530
:[814674ea] ? sch_direct_xmit+0x15a/0x1c0
:[8144ce70] ? dev_queue_xmit+0x3b0/0x550
:[a02fd64c] ? br_dev_queue_push_xmit+0x6c/0xa0 [bridge]
:[a02fd6d8] ? br_forward_finish+0x58/0x60 [bridge]
:[a02fd78a] ? __br_forward+0xaa/0xd0 [bridge]
:[81474ce4] ? nf_hook_slow+0x74/0x110
:[a02fd80d] ? br_forward+0x5d/0x70 [bridge]
:[a02fe5e9] ? br_handle_frame_finish+0x179/0x2a0 [bridge]
:[81063536] ? rebalance_domains+0x1a6/0x5a0
:[a02fe8ba] ? br_handle_frame+0x1aa/0x250 [bridge]
:[814486d9] ? __netif_receive_skb+0x529/0x750
:[8144899a] ? process_backlog+0x9a/0x100
:[8144d203] ? net_rx_action+0x103/0x2f0
:[81076fd1] ? __do_softirq+0xc1/0x1e0
:[8100c1cc] ? call_softirq+0x1c/0x30
:[8100c1cc] ? call_softirq+0x1c/0x30
:EOI  [8100de05] ? do_softirq+0x65/0xa0
:[8144d688] ? netif_rx_ni+0x28/0x30
:[a0079739] ? tun_sendmsg+0x229/0x4ec [tun]
:[a024acf5] ? handle_tx+0x275/0x5e0 [vhost_net]
:[a024b095] ? handle_tx_kick+0x15/0x20 [vhost_net]
:[a024855c] ? vhost_worker+0xbc/0x140 [vhost_net]
:[a02484a0] ? vhost_worker+0x0/0x140 [vhost_net]
:[81096956] ? kthread+0x96/0xa0
:[8100c0ca] ? child_rip+0xa/0x20
:[810968c0] ? kthread+0x0/0xa0
:[8100c0c0] ? child_rip+0x0/0x20
=

I get these crash reports even with a simple firewall rule set like this:

=
#   $FreeBSD: stable/9/share/examples/pf/pf.conf 218854 2011-02-19 
14:57:00Z brucec $
#   $OpenBSD: pf.conf,v 1.34 2007/02/24 19:30:59 millert Exp $
#
# See pf.conf(5) and /usr/share/examples/pf for syntax and examples.
# Remember to set net.inet.ip.forwarding=1 and/or net.inet6.ip6.forwarding=1
# in /etc/sysctl.conf if packets are to be forwarded between interfaces.

ext_if=vtnet0

set skip on lo

scrub in

block in
pass out

pass in on $ext_if proto tcp to ($ext_if) port ssh
pass in on $ext_if inet proto icmp from any to ($ext_if) icmp-type { unreach, 
redir, timex }
=

Does anyone know of any problems using pf with the virtio vtnet driver, or 
indeed in using pf at all under KVM virtualisation?  For now, I've turned off 
pf, but I would like to be able to enable it in future to do firewalling on the 
virtual guest.  I have no problems using iptables for firewalling on my Linux 
KVM guests.

Cheers,

Paul.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: protecting some processes from out-of-swap killer

2015-04-28 Thread Paul Mather

On Apr 28, 2015, at 5:51 AM, Ronald Klop ronald-li...@klop.ws wrote:

 On Sat, 25 Apr 2015 13:15:32 +0200, Dmitry Morozovsky ma...@rinet.ru wrote:
 
 On Sat, 25 Apr 2015, Baptiste Daroussin wrote:
 
  However, sometimes postgres processes got killed by 'out of swap space'.
  I suppose the source of problem could be that VSZ size of postgres 
  processes
  (8-9 G) is bigger than swap congigured (4G).
 
  Is there any way to prevent this, besides reallocating space for swap?
 
 protect(1) ?
 
 Of course.  I really do not understand how google hides the man page from me.
 
 Thanks, and sorry fot the noise.
 
 
 
 The OS trying to kill a process is probably not what you want. So when you 
 protect(1) postgres the OS will kill another process, which I hope is not 
 running without reason.
 My advice would be to
 - or increase your swap space
 - or tune postgresql to use less memory
 - or limit tmpfs (tmpfs uses swap if RAM is short)
 - or tune zfs to use less memory


That is good advice, although I do think protect has its place for preventing 
unforeseen accidents as mentioned above.  I believe it's good to be able to 
designate certain processes as more valuable than others if you know that to be 
the case.

A case in point: We have an Omeka instance on a fairly low-resource system.  
Omeka uses ImageMagick to generate thumbnails for items added to collections.  
In one case, we had a very large TIFF file added, who, when having a thumbnail 
generated, caused its thumbnail-generating process to consume large amounts of 
memory and swap.  When swap was exhausted, the OS decided it needed to kill off 
something and settled upon Apache (under which Omeka runs), causing Omeka to 
stop running.  In other times, the out-of-swap killer has chosen to kill the 
SSH daemon, making it harder subsequently to access the box and fix problems.

Being able to protect httpd---or even just sshd---served as a handy safety belt 
after that happened. :-)

I guess the proper way to address this would be to set limits on Apache or the 
thumbnail generation so it doesn't go hog wild in the future...

Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: zfs, cam sticking on failed disk

2015-05-07 Thread Paul Mather

On May 7, 2015, at 8:00 AM, Steven Hartland kill...@multiplay.co.uk wrote:

 On 07/05/2015 11:46, Slawa Olhovchenkov wrote:
 On Thu, May 07, 2015 at 11:38:46AM +0100, Steven Hartland wrote:
 
 How I can cancel this 24 requst?
 Why this requests don't timeout (3 hours already)?
 How I can forced detach this disk? (I am lready try `camcontrol reset`, 
 `camconrol rescan`).
 Why ZFS (or geom) don't timeout on request and don't rerouted to da18?
 
 If they are in mirrors, in theory you can just pull the disk, isci will
 report to cam and cam will report to ZFS which should all recover.
 Yes, zmirror with da18.
 I am surprise that ZFS don't use da18. All zpool fully stuck.
 A single low level request can only be handled by one device, if that
 device returns an error then ZFS will use the other device, but not until.
 Why next requests don't routed to da18?
 Current request stuck on da19 (unlikely, but understund), but why
 stuck all pool?
 
 Its still waiting for the request from the failed device to complete. As far 
 as ZFS currently knows there is nothing wrong with the device as its had no 
 failures.


Maybe related to this, but if the drive stalls indefinitely, is it what leads 
to the panic: I/O to pool 'poolname' appears to be hung on vdev guid GUID-ID 
at '/dev/somedevice'.?

I have a 6-disk RAIDZ2 pool that is used for nightly rsync backups from various 
systems.  I believe one of the drives is a bit temperamental.  Very 
occasionally, I discover the backup has failed and the machine actually paniced 
because of this drive, with a panic message like the above.  The panic 
backtrace includes references to vdev_deadman, which sounds like some sort of 
dead man's switch/watchdog.

It's a bit counter-intuitive that a single drive wandering off into la-la land 
can not only cause an entire ZFS pool to wedge, but, worse still, panic the 
whole machine.

If I'm understanding this thread correctly, part of the problem is that an I/O 
never completing is not the same as a failure to ZFS, and hence ZFS can't call 
upon various resources in the pool and mechanisms at its disposal to correct 
for that.  Is that accurate?

I would think that never-ending I/O requests would be a type of failure that 
ZFS could sustain.  It seems from the hung on vdev panic that it does detect 
this situation, though the resolution (panic) is not ideal. :-)

Cheers,

Paul.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: 4TB Western Digital My Book 1230 USB hard drive not working on 10.2

2015-08-03 Thread Paul Mather

On Aug 3, 2015, at 11:09 AM, Alban Hertroys haram...@gmail.com wrote:

 On 3 August 2015 at 15:01, Paul Mather freebsd-li...@gromit.dlib.vt.edu 
 wrote:
 On Aug 2, 2015, at 9:34 PM, Adam McDougall mcdou...@egr.msu.edu wrote:
 
 On 08/02/2015 21:22, Paul Mather wrote:
 I have a 4TB external USB drive (Western Digital My Book 1230) that I am 
 trying to use under FreeBSD/amd64 10.2 (10.2-PRERELEASE #0 r286052: Wed 
 Jul 29 20:59:39 EDT 2015).  This system has a MSI 760GMA-P34 (FX) 
 motherboard.
 
 The drive probes unreliably when plugged in to a USB 3 port.  It reliably 
 probes when plugged into a USB 2 port.  However, it works in neither 
 cases.  Attempting to dd from the drive results in a dd: /dev/da0: 
 Invalid argument.
 
 FYI: I had been experiencing the same with a 2 TB WD MyBook on OS X
 (Mavericks). The drive was being used for Time Machine backups, but it
 would at irregular intervals it would just 'disconnect' itself. My
 guess is that there's a firmware problem in the USB3 chip in those
 drives.
 
 After trying to get the issue fixed for almost half a year I returned
 it and replaced it by a Seagate, which has been working flawlessly
 ever since.
 
 I'm just saying, perhaps the problem isn't with FreeBSD but with the drive.


I'd love just to get to the randomly disconnects stage under FreeBSD at this 
point. :-)

Up to now, I have been unable to read ANY data off the drive under FreeBSD, 
either via USB 2 or USB 3. :-(

However, you have given me something to think about: does anyone know of a 4 TB 
USB 2 or 3 drive that will work reliably under FreeBSD 10-STABLE right now?

Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: pkg clean question

2015-08-03 Thread Paul Mather

On Aug 3, 2015, at 11:39 AM, Zoran Kolic zko...@sbb.rs wrote:

 Amd64, 9.3. Updated few times, packages also. At first I
 found on laptop that /var directory was at 102%. Using
 pkg clean i removed about 400mb of cache packages from
 /var/cache/pkg. Tried to clean desktop cache and the out-
 put of the command was nothing to do. I see about 400
 mb of files in the cache directory. Is it safe to remove
 them by the hand?


Use pkg clean -ay to delete all cached packages, not just the ones that are 
no longer current or provided by the upstream repository.

Cheers,

Paul.

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: 4TB Western Digital My Book 1230 USB hard drive not working on 10.2

2015-08-03 Thread Paul Mather

On Aug 3, 2015, at 12:17 PM, Jack Vogel jfvo...@gmail.com wrote:
 
 Have you tried it on HEAD, and have you tried it on the same system with some
 recent flavor of Linux?

No, and no.  It's a good idea, though.  I can try it this weekend, if I can run 
each OS via a USB memstick.

With this being such a commonplace drive, I'd hoped someone might pipe up and 
say something along the lines of, oh, you have to add USB quirk XYZ to get 
that model working under FreeBSD.  No such luck, it seems.

Cheers,

Paul.

  
 
 Jack
 
 
 On Mon, Aug 3, 2015 at 8:35 AM, Paul Mather freebsd-li...@gromit.dlib.vt.edu 
 mailto:freebsd-li...@gromit.dlib.vt.edu wrote:
 On Aug 3, 2015, at 11:09 AM, Alban Hertroys haram...@gmail.com 
 mailto:haram...@gmail.com wrote:
 
  On 3 August 2015 at 15:01, Paul Mather freebsd-li...@gromit.dlib.vt.edu 
  mailto:freebsd-li...@gromit.dlib.vt.edu wrote:
  On Aug 2, 2015, at 9:34 PM, Adam McDougall mcdou...@egr.msu.edu 
  mailto:mcdou...@egr.msu.edu wrote:
 
  On 08/02/2015 21:22, Paul Mather wrote:
  I have a 4TB external USB drive (Western Digital My Book 1230) that I am 
  trying to use under FreeBSD/amd64 10.2 (10.2-PRERELEASE #0 r286052: Wed 
  Jul 29 20:59:39 EDT 2015).  This system has a MSI 760GMA-P34 (FX) 
  motherboard.
 
  The drive probes unreliably when plugged in to a USB 3 port.  It 
  reliably probes when plugged into a USB 2 port.  However, it works in 
  neither cases.  Attempting to dd from the drive results in a dd: 
  /dev/da0: Invalid argument.
 
  FYI: I had been experiencing the same with a 2 TB WD MyBook on OS X
  (Mavericks). The drive was being used for Time Machine backups, but it
  would at irregular intervals it would just 'disconnect' itself. My
  guess is that there's a firmware problem in the USB3 chip in those
  drives.
 
  After trying to get the issue fixed for almost half a year I returned
  it and replaced it by a Seagate, which has been working flawlessly
  ever since.
 
  I'm just saying, perhaps the problem isn't with FreeBSD but with the drive.
 
 
 I'd love just to get to the randomly disconnects stage under FreeBSD at 
 this point. :-)
 
 Up to now, I have been unable to read ANY data off the drive under FreeBSD, 
 either via USB 2 or USB 3. :-(
 
 However, you have given me something to think about: does anyone know of a 4 
 TB USB 2 or 3 drive that will work reliably under FreeBSD 10-STABLE right now?
 
 Cheers,
 
 Paul.
 
 ___
 freebsd-stable@freebsd.org mailto:freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable 
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org 
 mailto:freebsd-stable-unsubscr...@freebsd.org
 

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: 4TB Western Digital My Book 1230 USB hard drive not working on 10.2

2015-08-03 Thread Paul Mather

On Aug 2, 2015, at 9:34 PM, Adam McDougall mcdou...@egr.msu.edu wrote:

 On 08/02/2015 21:22, Paul Mather wrote:
 I have a 4TB external USB drive (Western Digital My Book 1230) that I am 
 trying to use under FreeBSD/amd64 10.2 (10.2-PRERELEASE #0 r286052: Wed Jul 
 29 20:59:39 EDT 2015).  This system has a MSI 760GMA-P34 (FX) motherboard.
 
 The drive probes unreliably when plugged in to a USB 3 port.  It reliably 
 probes when plugged into a USB 2 port.  However, it works in neither cases.  
 Attempting to dd from the drive results in a dd: /dev/da0: Invalid 
 argument.
 
 When plugged in to a USB 2 port, this is how the drive is probed:
 
 ugen6.2: Western Digital at usbus6
 umass0: Western Digital My Book 1230, class 0/0, rev 2.10/10.65, addr 2 on 
 usbus6
 umass0:  SCSI over Bulk-Only; quirks = 0xc001
 umass0:9:0:-1: Attached to scbus9
 da0 at umass-sim0 bus 0 scbus9 target 0 lun 0
 da0: WD My Book 1230 1065 Fixed Direct Access SPC-4 SCSI device
 da0: Serial Number 57434334453056594A4C4A4A
 da0: 40.000MB/s transfers
 da0: 3815415MB (976746240 4096 byte sectors: 255H 63S/T 60799C)
 da0: quirks=0x2NO_6_BYTE
 ses0 at umass-sim0 bus 0 scbus9 target 0 lun 1
 ses0: WD SES Device 1065 Fixed Enclosure Services SPC-4 SCSI device
 ses0: Serial Number 57434334453056594A4C4A4A
 ses0: 40.000MB/s transfers
 ses0: SCSI-3 ENC Device
 
 When booting with it connected to a USB 3 port, this is what is output:
 
 xhci0: XHCI (generic) USB 3.0 controller mem 0xfeafe000-0xfeaf irq 18 
 at device 0.0 on pci3
 xhci0: 64 bytes context size, 64-bit DMA
 usbus0 on xhci0
 [[...]]
 ohci0: AMD SB7x0/SB8x0/SB9x0 USB controller mem 0xfe7fe000-0xfe7fefff irq 
 16 at device 18.0 on pci0
 usbus1 on ohci0
 ohci1: AMD SB7x0/SB8x0/SB9x0 USB controller mem 0xfe7fd000-0xfe7fdfff irq 
 16 at device 18.1 on pci0
 usbus2 on ohci1
 ehci0: AMD SB7x0/SB8x0/SB9x0 USB 2.0 controller mem 0xfe7ff800-0xfe7ff8ff 
 irq 17 at device 18.2 on pci0
 usbus3: EHCI version 1.0
 usbus3 on ehci0
 ohci2: AMD SB7x0/SB8x0/SB9x0 USB controller mem 0xfe7fc000-0xfe7fcfff irq 
 18 at device 19.0 on pci0
 usbus4 on ohci2
 ohci3: AMD SB7x0/SB8x0/SB9x0 USB controller mem 0xfe7f7000-0xfe7f7fff irq 
 18 at device 19.1 on pci0
 usbus5 on ohci3
 ehci1: AMD SB7x0/SB8x0/SB9x0 USB 2.0 controller mem 0xfe7ff400-0xfe7ff4ff 
 irq 19 at device 19.2 on pci0
 usbus6: EHCI version 1.0
 usbus6 on ehci1
 [[...]]
 ohci4: AMD SB7x0/SB8x0/SB9x0 USB controller mem 0xfe7f6000-0xfe7f6fff irq 
 18 at device 20.5 on pci0
 usbus7 on ohci4
 [[...]]
 usbus0: 5.0Gbps Super Speed USB v3.0
 usbus1: 12Mbps Full Speed USB v1.0
 usbus2: 12Mbps Full Speed USB v1.0
 usbus3: 480Mbps High Speed USB v2.0
 usbus4: 12Mbps Full Speed USB v1.0
 usbus5: 12Mbps Full Speed USB v1.0
 usbus6: 480Mbps High Speed USB v2.0
 usbus7: 12Mbps Full Speed USB v1.0
 ugen7.1: ATI at usbus7
 uhub0: ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1 on usbus7
 ugen6.1: ATI at usbus6
 uhub1: ATI EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1 on usbus6
 ugen5.1: ATI at usbus5
 uhub2: ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1 on usbus5
 ugen4.1: ATI at usbus4
 uhub3: ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1 on usbus4
 ugen3.1: ATI at usbus3
 uhub4: ATI EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1 on usbus3
 ugen2.1: ATI at usbus2
 uhub5: ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1 on usbus2
 ugen1.1: ATI at usbus1
 uhub6: ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1 on usbus1
 ugen0.1: 0x1912 at usbus0
 uhub7: 0x1912 XHCI root HUB, class 9/0, rev 3.00/1.00, addr 1 on usbus0
 uhub0: 2 ports with 2 removable, self powered
 uhub2: 3 ports with 3 removable, self powered
 uhub3: 3 ports with 3 removable, self powered
 uhub5: 3 ports with 3 removable, self powered
 uhub6: 3 ports with 3 removable, self powered
 uhub7: 8 ports with 8 removable, self powered
 [[...]]
 Root mount waiting for: usbus6 usbus3 usbus0
 Root mount waiting for: usbus6 usbus3 usbus0
 uhub1: 6 ports with 6 removable, self powered
 uhub4: 6 ports with 6 removable, self powered
 ugen0.2: vendor 0x1058 at usbus0
 umass0: vendor 0x1058 product 0x1230, class 0/0, rev 3.00/10.65, addr 1 on 
 usbus0
 umass0:  SCSI over Bulk-Only; quirks = 0x8000
 Root mount waiting for: usbus0
 [[...]]
 Root mount waiting for: usbus0
 Root mount waiting for: usbus0
 umass0: Get Max Lun not supported (USB_ERR_TIMEOUT)
 umass0:9:0:-1: Attached to scbus9
 [[...]]
 da0 at umass-sim0 bus 0 scbus9 target 0 lun 0
 da0:Fixed Direct Access SCSI device
 da0: Serial Number WCC4E0VYJLJJ
 da0: 400.000MB/s transfers
 da0: 3815415MB (976746240 4096 byte sectors: 255H 63S/T 60799C)
 da0: quirks=0x2NO_6_BYTE
 
 
 This external USB drive works fine under OSX Yosemite and also when plugged 
 in to my Raspberry Pi 2 running OSMC.
 
 Is there anyone using this model of USB drive under FreeBSD/amd64 10.2?  Is 
 it a matter of finding the correct quirk to get it working?
 
 Cheers,
 
 Paul.
 
 The trouble detecting is probably related to
 https://bugs.freebsd.org

4TB Western Digital My Book 1230 USB hard drive not working on 10.2

2015-08-02 Thread Paul Mather

I have a 4TB external USB drive (Western Digital My Book 1230) that I am trying 
to use under FreeBSD/amd64 10.2 (10.2-PRERELEASE #0 r286052: Wed Jul 29 
20:59:39 EDT 2015).  This system has a MSI 760GMA-P34 (FX) motherboard.

The drive probes unreliably when plugged in to a USB 3 port.  It reliably 
probes when plugged into a USB 2 port.  However, it works in neither cases.  
Attempting to dd from the drive results in a dd: /dev/da0: Invalid argument.

When plugged in to a USB 2 port, this is how the drive is probed:

ugen6.2: Western Digital at usbus6
umass0: Western Digital My Book 1230, class 0/0, rev 2.10/10.65, addr 2 on 
usbus6
umass0:  SCSI over Bulk-Only; quirks = 0xc001
umass0:9:0:-1: Attached to scbus9
da0 at umass-sim0 bus 0 scbus9 target 0 lun 0
da0: WD My Book 1230 1065 Fixed Direct Access SPC-4 SCSI device
da0: Serial Number 57434334453056594A4C4A4A
da0: 40.000MB/s transfers
da0: 3815415MB (976746240 4096 byte sectors: 255H 63S/T 60799C)
da0: quirks=0x2NO_6_BYTE
ses0 at umass-sim0 bus 0 scbus9 target 0 lun 1
ses0: WD SES Device 1065 Fixed Enclosure Services SPC-4 SCSI device
ses0: Serial Number 57434334453056594A4C4A4A
ses0: 40.000MB/s transfers
ses0: SCSI-3 ENC Device

When booting with it connected to a USB 3 port, this is what is output:

xhci0: XHCI (generic) USB 3.0 controller mem 0xfeafe000-0xfeaf irq 18 at 
device 0.0 on pci3
xhci0: 64 bytes context size, 64-bit DMA
usbus0 on xhci0
[[...]]
ohci0: AMD SB7x0/SB8x0/SB9x0 USB controller mem 0xfe7fe000-0xfe7fefff irq 16 
at device 18.0 on pci0
usbus1 on ohci0
ohci1: AMD SB7x0/SB8x0/SB9x0 USB controller mem 0xfe7fd000-0xfe7fdfff irq 16 
at device 18.1 on pci0
usbus2 on ohci1
ehci0: AMD SB7x0/SB8x0/SB9x0 USB 2.0 controller mem 0xfe7ff800-0xfe7ff8ff irq 
17 at device 18.2 on pci0
usbus3: EHCI version 1.0
usbus3 on ehci0
ohci2: AMD SB7x0/SB8x0/SB9x0 USB controller mem 0xfe7fc000-0xfe7fcfff irq 18 
at device 19.0 on pci0
usbus4 on ohci2
ohci3: AMD SB7x0/SB8x0/SB9x0 USB controller mem 0xfe7f7000-0xfe7f7fff irq 18 
at device 19.1 on pci0
usbus5 on ohci3
ehci1: AMD SB7x0/SB8x0/SB9x0 USB 2.0 controller mem 0xfe7ff400-0xfe7ff4ff irq 
19 at device 19.2 on pci0
usbus6: EHCI version 1.0
usbus6 on ehci1
[[...]]
ohci4: AMD SB7x0/SB8x0/SB9x0 USB controller mem 0xfe7f6000-0xfe7f6fff irq 18 
at device 20.5 on pci0
usbus7 on ohci4
[[...]]
usbus0: 5.0Gbps Super Speed USB v3.0
usbus1: 12Mbps Full Speed USB v1.0
usbus2: 12Mbps Full Speed USB v1.0
usbus3: 480Mbps High Speed USB v2.0
usbus4: 12Mbps Full Speed USB v1.0
usbus5: 12Mbps Full Speed USB v1.0
usbus6: 480Mbps High Speed USB v2.0
usbus7: 12Mbps Full Speed USB v1.0
ugen7.1: ATI at usbus7
uhub0: ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1 on usbus7
ugen6.1: ATI at usbus6
uhub1: ATI EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1 on usbus6
ugen5.1: ATI at usbus5
uhub2: ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1 on usbus5
ugen4.1: ATI at usbus4
uhub3: ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1 on usbus4
ugen3.1: ATI at usbus3
uhub4: ATI EHCI root HUB, class 9/0, rev 2.00/1.00, addr 1 on usbus3
ugen2.1: ATI at usbus2
uhub5: ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1 on usbus2
ugen1.1: ATI at usbus1
uhub6: ATI OHCI root HUB, class 9/0, rev 1.00/1.00, addr 1 on usbus1
ugen0.1: 0x1912 at usbus0
uhub7: 0x1912 XHCI root HUB, class 9/0, rev 3.00/1.00, addr 1 on usbus0
uhub0: 2 ports with 2 removable, self powered
uhub2: 3 ports with 3 removable, self powered
uhub3: 3 ports with 3 removable, self powered
uhub5: 3 ports with 3 removable, self powered
uhub6: 3 ports with 3 removable, self powered
uhub7: 8 ports with 8 removable, self powered
[[...]]
Root mount waiting for: usbus6 usbus3 usbus0
Root mount waiting for: usbus6 usbus3 usbus0
uhub1: 6 ports with 6 removable, self powered
uhub4: 6 ports with 6 removable, self powered
ugen0.2: vendor 0x1058 at usbus0
umass0: vendor 0x1058 product 0x1230, class 0/0, rev 3.00/10.65, addr 1 on 
usbus0
umass0:  SCSI over Bulk-Only; quirks = 0x8000
Root mount waiting for: usbus0
[[...]]
Root mount waiting for: usbus0
Root mount waiting for: usbus0
umass0: Get Max Lun not supported (USB_ERR_TIMEOUT)
umass0:9:0:-1: Attached to scbus9
[[...]]
da0 at umass-sim0 bus 0 scbus9 target 0 lun 0
da0:Fixed Direct Access SCSI device
da0: Serial Number WCC4E0VYJLJJ
da0: 400.000MB/s transfers
da0: 3815415MB (976746240 4096 byte sectors: 255H 63S/T 60799C)
da0: quirks=0x2NO_6_BYTE


This external USB drive works fine under OSX Yosemite and also when plugged in 
to my Raspberry Pi 2 running OSMC.

Is there anyone using this model of USB drive under FreeBSD/amd64 10.2?  Is it 
a matter of finding the correct quirk to get it working?

Cheers,

Paul.



___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: 4TB Western Digital My Book 1230 USB hard drive not working on 10.2

2015-08-08 Thread Paul Mather

On Aug 3, 2015, at 2:15 PM, Paul Mather freebsd-li...@gromit.dlib.vt.edu 
wrote:

 On Aug 3, 2015, at 12:17 PM, Jack Vogel jfvo...@gmail.com wrote:
 
 Have you tried it on HEAD, and have you tried it on the same system with some
 recent flavor of Linux?
 
 No, and no.  It's a good idea, though.  I can try it this weekend, if I can 
 run each OS via a USB memstick.


Just to follow up on myself, I tried this external USB drive on a 
20150804-r286285 snapshot of FreeBSD/amd64 11-CURRENT on the same system, as 
well as under Ubuntu 15.04.

The 4 TB Western Digital My Book 1230 works under both those operating systems.

The drive is detected and works reliably when plugged in to a USB 2 port on 
either 11-CURRENT or Ubuntu 15.04.  It also works reliably when plugged in to a 
USB 3 port on either OS, however, it isn't always detected reliably under 
FreeBSD 11-CURRENT when unplugged and plugged in again (2 times out of 3 
attempts), but was detected every time without fail under Ubuntu 15.04.

So, it appears the hardware is okay; it seems the problem may lie with FreeBSD 
10.2.

Cheers,

Paul.


 
 With this being such a commonplace drive, I'd hoped someone might pipe up and 
 say something along the lines of, oh, you have to add USB quirk XYZ to get 
 that model working under FreeBSD.  No such luck, it seems.
 
 Cheers,
 
 Paul.
 
 
 
 Jack
 
 
 On Mon, Aug 3, 2015 at 8:35 AM, Paul Mather 
 freebsd-li...@gromit.dlib.vt.edu mailto:freebsd-li...@gromit.dlib.vt.edu 
 wrote:
 On Aug 3, 2015, at 11:09 AM, Alban Hertroys haram...@gmail.com 
 mailto:haram...@gmail.com wrote:
 
 On 3 August 2015 at 15:01, Paul Mather freebsd-li...@gromit.dlib.vt.edu 
 mailto:freebsd-li...@gromit.dlib.vt.edu wrote:
 On Aug 2, 2015, at 9:34 PM, Adam McDougall mcdou...@egr.msu.edu 
 mailto:mcdou...@egr.msu.edu wrote:
 
 On 08/02/2015 21:22, Paul Mather wrote:
 I have a 4TB external USB drive (Western Digital My Book 1230) that I am 
 trying to use under FreeBSD/amd64 10.2 (10.2-PRERELEASE #0 r286052: Wed 
 Jul 29 20:59:39 EDT 2015).  This system has a MSI 760GMA-P34 (FX) 
 motherboard.
 
 The drive probes unreliably when plugged in to a USB 3 port.  It 
 reliably probes when plugged into a USB 2 port.  However, it works in 
 neither cases.  Attempting to dd from the drive results in a dd: 
 /dev/da0: Invalid argument.
 
 FYI: I had been experiencing the same with a 2 TB WD MyBook on OS X
 (Mavericks). The drive was being used for Time Machine backups, but it
 would at irregular intervals it would just 'disconnect' itself. My
 guess is that there's a firmware problem in the USB3 chip in those
 drives.
 
 After trying to get the issue fixed for almost half a year I returned
 it and replaced it by a Seagate, which has been working flawlessly
 ever since.
 
 I'm just saying, perhaps the problem isn't with FreeBSD but with the drive.
 
 
 I'd love just to get to the randomly disconnects stage under FreeBSD at 
 this point. :-)
 
 Up to now, I have been unable to read ANY data off the drive under FreeBSD, 
 either via USB 2 or USB 3. :-(
 
 However, you have given me something to think about: does anyone know of a 4 
 TB USB 2 or 3 drive that will work reliably under FreeBSD 10-STABLE right 
 now?
 
 Cheers,
 
 Paul.
 
 ___
 freebsd-stable@freebsd.org mailto:freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable 
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org 
 mailto:freebsd-stable-unsubscr...@freebsd.org
 
 
 ___
 freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org
 

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

10.2-RC1/STABLE VIMAGE memory leak progress?

2015-07-27 Thread Paul Mather

I'm running 10-STABLE (which currently identifies itself as 10.2-PRERELEASE) 
and lately I've started experimenting with using iocage for Jail management.  
(I really like it so far.)  In particular, I've been trying out iocage's 
support for VNET networking that relies on VIMAGE and uses bridge and epair for 
the network.

I find that when I stop a VNET Jail using iocage that I get something akin to 
the following in my console logs:

=
Jul 27 10:59:47 chumby kernel: vnet0:1: link state changed to DOWN
Jul 27 10:59:47 chumby kernel: vnet0: link state changed to DOWN
Jul 27 10:59:47 chumby kernel: re0: link state changed to DOWN
Jul 27 10:59:47 chumby kernel: bridge0: link state changed to DOWN
Jul 27 10:59:47 chumby kernel: vnet1:1: link state changed to DOWN
Jul 27 10:59:47 chumby kernel: vnet1: link state changed to DOWN
Jul 27 10:59:47 chumby kernel: bridge1: link state changed to DOWN
Jul 27 10:59:47 chumby kernel: ifa_del_loopback_route: deletion failed: 48
Jul 27 10:59:47 chumby kernel: Freed UMA keg (udp_inpcb) was not empty (120 
items).  Lost 12 pages of memory.
Jul 27 10:59:47 chumby kernel: Freed UMA keg (udpcb) was not empty (1169 
items).  Lost 7 pages of memory.
Jul 27 10:59:47 chumby kernel: hhook_vnet_uninit: hhook_head type=1, id=1 
cleanup required
Jul 27 10:59:47 chumby kernel: hhook_vnet_uninit: hhook_head type=1, id=0 
cleanup required
=

I found a PR (kern/164763) dating back to 2012-02-04 describing the problem.  
The last update on that PR was on 2014-10-17, which made reference to some 
unmerged fixes that would be good to merge into HEAD.

Does anyone know whether this was done?

One of the entries in that PR states, Our current rule of thumb is 'if you 
need to shutdown one vimage jail then you need to reboot the host.'  It then 
goes on to observe, Of course this is far from optimal.

Is it safe to say, then, that dynamic VNET Jails are not recommended still 
under FreeBSD?

Judging by that PR, this is not likely to be fixed for 10.2-RELEASE?

Cheers,

Paul.


___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Re: Circular dependency between local_unbound and ntpd?

2015-07-14 Thread Paul Mather

On Jul 14, 2015, at 10:33 AM, krad kra...@gmail.com wrote:
 
 As
 
 $ grep REQUIRE /etc/rc.d/ntpd
 # REQUIRE: DAEMON ntpdate FILESYSTEMS devfs
 
 
 You could set something similar to the following in the rc.conf
 
 ntpdate_hosts=a.b.c.d w.x.y.z
 ntpdate_enable=yes

Thanks for that suggestion.  I assume the a.b.c.d w.x.y.z are IP addresses, 
not hostnames, otherwise we'd have the same problem.

The /etc/rc.d/ntpdate startup script has a REQUIRE: NETWORKING ... and 
/etc/rc.d/local_unbound has a BEFORE: NETWORKING in it, meaning it will be 
running before ntpdate runs.  That means DNS resolution will require an 
accurate clock and, I assume, mean that ntpdate will require IP addresses, too?

So, it still comes down to this: do I need to know the IP address of an NTP 
server to be able to use local_unbound safely with NTP?

Cheers,

Paul.


 
 
 
 
 On 14 July 2015 at 14:43, Paul Mather p...@gromit.dlib.vt.edu 
 mailto:p...@gromit.dlib.vt.edu wrote:
 I believe I ran afoul of a circular dependency between local_unbound and ntpd 
 on my 10.2-PRERELEASE system.  I use a stock /etc/ntp.conf and use 
 ntpd_sync_on_start=YES.
 
 Last night, a BIOS settings reset cause my CMOS clock to go WAY out of synch 
 for the first time.  No problem, I thought: NTP will correct it at boot.
 
 Wrong!
 
 When my system booted, the time was not corrected.  Also, DNS resolution was 
 not working.  I figured out it was because local_unbound relies on an 
 accurately set clock, but the clock could not be set accurately because my 
 stock ntp.conf requires working DNS resolution to reach the NTP servers.
 
 That sounds like a potential circular dependency to me.
 
 My workaround at the time was to look up 0.freebsd.pool.ntp.org 
 http://0.freebsd.pool.ntp.org/ on another system; stop ntpd; then do a 
 ntpdate using the IP addresses to set the clock. Once the clock was set 
 accurately, things were all hunky dory.
 
 Does anyone have any suggestion for an automatic way around this?  I guess 
 one way would be to put the IP address of an NTP server into my ntp.conf 
 file, so at least one would be reachable without needing a working DNS?
 
 My main concern is for those systems like my Raspberry Pi and Beaglebone 
 Black that don't have a battery-backed clock.  I currently don't use 
 local_unbound on those, but it seems like I'd encounter this problem 
 routinely if I did.
 
 Cheers,
 
 Paul.
 ___
 freebsd-stable@freebsd.org mailto:freebsd-stable@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable 
 http://lists.freebsd.org/mailman/listinfo/freebsd-stable
 To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org 
 mailto:freebsd-stable-unsubscr...@freebsd.org
 

___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

Circular dependency between local_unbound and ntpd?

2015-07-14 Thread Paul Mather

I believe I ran afoul of a circular dependency between local_unbound and ntpd 
on my 10.2-PRERELEASE system.  I use a stock /etc/ntp.conf and use 
ntpd_sync_on_start=YES.

Last night, a BIOS settings reset cause my CMOS clock to go WAY out of synch 
for the first time.  No problem, I thought: NTP will correct it at boot.

Wrong!

When my system booted, the time was not corrected.  Also, DNS resolution was 
not working.  I figured out it was because local_unbound relies on an 
accurately set clock, but the clock could not be set accurately because my 
stock ntp.conf requires working DNS resolution to reach the NTP servers.

That sounds like a potential circular dependency to me.

My workaround at the time was to look up 0.freebsd.pool.ntp.org on another 
system; stop ntpd; then do a ntpdate using the IP addresses to set the clock. 
Once the clock was set accurately, things were all hunky dory.

Does anyone have any suggestion for an automatic way around this?  I guess one 
way would be to put the IP address of an NTP server into my ntp.conf file, so 
at least one would be reachable without needing a working DNS?

My main concern is for those systems like my Raspberry Pi and Beaglebone Black 
that don't have a battery-backed clock.  I currently don't use local_unbound on 
those, but it seems like I'd encounter this problem routinely if I did.

Cheers,

Paul.
___
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to freebsd-stable-unsubscr...@freebsd.org

1 2 >

1 - 100 of 137 matches

Mail list logo