Re: [gentoo-user] Re: RAID problems - Is udev at fault here?

2010-05-24 Thread Mark Knecht
On Sun, May 16, 2010 at 5:17 PM, Mark Knecht markkne...@gmail.com wrote:
 On Sun, May 16, 2010 at 5:07 PM, walt w41...@gmail.com wrote:
 SNIP

 Hm.  Is this your motherboard?:

 http://usa.asus.com/product.aspx?P_ID=W7i5W4Pw4fH22Mih

 Being a geek of a certain age, I find that products with names that invoke
 mega-dose anabolic steroids usually don't fit my lifestyle very well.

 I do better with product names that contain more sedate character strings
 like VSOP or MOM.

 By grepping through /usr/src/linux*/MAINTAINERS I turned up quite a few
 email addresses at intel.com, none of which seem relevant to RAID or its
 device drivers, but a polite email asking for a link to the appropriate
 dev might bring a polite and useful reply.  That's how I connected with
 the appropriate dev at Broadcom, who eventually fixed my ethernet driver.


 Yes, that's the motherboard. I don't care much about the names of
 things myself. I had limited options for the new i7-980x processor at
 the time I was ordering the hardware, and I'd never done overclocking
 before (and technically still haven't) so I got it because it was an
 Asus board which I've generally had very good luck with.

 To be clear, the RAID I'm doing is mdadm Linux software RAID and
 nothing having to do with the on-board RAID controller. The machine
 uses the standard Linux SATA drivers, or so I think.

 I like the VSOP idea. :-)

 - Mark


I continue to see this problem. I booted multiple times this morning
and cannot get the system to show /dev/sde:

c2stable ~ # ls -al /dev/sd*
brw-rw 1 root disk 8,  0 May 24  2010 /dev/sda
brw-rw 1 root disk 8,  1 May 24  2010 /dev/sda1
brw-rw 1 root disk 8,  2 May 24  2010 /dev/sda2
brw-rw 1 root disk 8,  3 May 24  2010 /dev/sda3
brw-rw 1 root disk 8,  4 May 24  2010 /dev/sda4
brw-rw 1 root disk 8,  5 May 24  2010 /dev/sda5
brw-rw 1 root disk 8,  6 May 24  2010 /dev/sda6
brw-rw 1 root disk 8, 16 May 24  2010 /dev/sdb
brw-rw 1 root disk 8, 17 May 24  2010 /dev/sdb1
brw-rw 1 root disk 8, 18 May 24  2010 /dev/sdb2
brw-rw 1 root disk 8, 19 May 24  2010 /dev/sdb3
brw-rw 1 root disk 8, 20 May 24  2010 /dev/sdb4
brw-rw 1 root disk 8, 21 May 24  2010 /dev/sdb5
brw-rw 1 root disk 8, 22 May 24  2010 /dev/sdb6
brw-rw 1 root disk 8, 32 May 24  2010 /dev/sdc
brw-rw 1 root disk 8, 33 May 24  2010 /dev/sdc1
brw-rw 1 root disk 8, 34 May 24  2010 /dev/sdc2
brw-rw 1 root disk 8, 35 May 24  2010 /dev/sdc3
brw-rw 1 root disk 8, 36 May 24  2010 /dev/sdc4
brw-rw 1 root disk 8, 37 May 24  2010 /dev/sdc5
brw-rw 1 root disk 8, 38 May 24  2010 /dev/sdc6
brw-rw 1 root disk 8, 48 May 24  2010 /dev/sdd
brw-rw 1 root disk 8, 49 May 24  2010 /dev/sdd1
c2stable ~ #

sda, sdb  sdc are RAID1 partition drives. sdd  sde are RAID0. As sde
is not seen the RAID0 cannot be started.

I have rebooted mutiple times. BIOS says the drives are there and are
functional, at least as far as SMART data is concerned.

This is vanilla-sources as a few weeks ago anyway there wasn't yet the
right gentoo-sources to support my video card. That has probably
changed by now so I'll try that.

c2stable ~ # uname -a
Linux c2stable 2.6.34-rc5 #1 SMP PREEMPT Mon Apr 26 12:04:14 PDT 2010
x86_64 Intel(R) Core(TM) i7 CPU X 980 @ 3.33GHz GenuineIntel GNU/Linux
c2stable ~ #

Is there a known reason to try a newer udev?

c2stable ~ # emerge -pv udev

These are the packages that would be merged, in order:

Calculating dependencies... done!
[ebuild   R   ] sys-fs/udev-149  USE=devfs-compat extras (-selinux) -test 0 kB

Total: 1 package (1 reinstall), Size of downloads: 0 kB
c2stable ~ # eix -I udev
[I] sys-fs/udev
 Available versions:  114 115-r1 119 124-r1 124-r2 141 ~141-r1
~145!t ~145-r1!t ~145-r2!t ~145-r3!t ~146!t 146-r1!t ~146-r2!t
~146-r3!t ~147-r1!t 149 ~150-r1!t ~151-r1 ~151-r2 ~151-r3 ~151-r4 ~154
** {(+)devfs-compat (-)extras (+)old-hd-rules selinux test}
 Installed versions:  149(10:28:59 05/05/10)(devfs-compat extras
-selinux -test)
 Homepage:
http://www.kernel.org/pub/linux/utils/kernel/hotplug/udev.html
 Description: Linux dynamic and persistent device naming
support (aka userspace devfs)

c2stable ~ #

I note in the info at the very end there are daemons for device-mapper
and udev-mount that are not running. Would they be involuved in this
problem?

Anyone who can give some guidance, please do.

Thanks,
Mark



c2stable ~ # lspci
00:00.0 Host bridge: Intel Corporation X58 I/O Hub to ESI Port (rev 13)
00:01.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI
Express Root Port 1 (rev 13)
00:03.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI
Express Root Port 3 (rev 13)
00:07.0 PCI bridge: Intel Corporation 5520/5500/X58 I/O Hub PCI
Express Root Port 7 (rev 13)
00:14.0 PIC: Intel Corporation 5520/5500/X58 I/O Hub System Management
Registers (rev 13)
00:14.1 PIC: Intel Corporation 5520/5500/X58 I/O Hub GPIO and Scratch
Pad Registers (rev 13)

Re: [gentoo-user] Re: RAID problems - Is udev at fault here?

2010-05-16 Thread Mark Knecht
On Sun, May 16, 2010 at 1:32 PM, walt w41...@gmail.com wrote:
 On 05/16/2010 10:56 AM, Mark Knecht wrote:

 I have a newish high-end machine here that's causing me some problems
 with RAID, but looking at log files and dmesg I don't think the
 problem is actually RAID and more likely udev. I'm looking for some
 ideas on how to debug this.

 The hardware:
 Asus Rampage II Extreme
 Intel Core i7-980x
 12GB DRAM
 5 WD5002ABYS RAID Edition 500GB drives

 I had an asus mobo that turned out to be great in the long run, but a few
 of its newer hardware gadgets took months to be well-supported by linux.

 I'm thinking (completely guessing :) it sounds like a driver that's not
 setting some bit properly in a hardware register during boot.

 That turned out to be a problem with the network chip on my asus, which
 randomly didn't work after reboots.  Finally the driver got fixed after
 I whined a thousand times to the driver maintainer at Broadcom :)


It very well could be something like that. I had a Compaq laptop a few
years ago which had an ATI chipset in it and which took a long time to
get DMA working on the hard drive controller to it was very slow for
the first few months.

The thing about this is that it's a single 6 port SATA controller in
an Intel chipset, albeit because it's the newer chipsets with the
newest processor (6 cores, 12 threads) it likely hasn't been seen by
too many people yet.

Let's assume you're right? I've been trying to determine how udev goes
about finding the actual hard drives and assigning them device names.
Is there a way that I can get udev to log what it's doing? Any sort of
debug messages I can get it to print in a log file somewhere?

It is a flaky problem and strangely it doesn't always miss every
partition on a given drive. For instance /dev/md3, md5 and md11
3-drive RAID1 arrays. You'd think if it was the controller failing it
would fail for all the partitions on a given drive, but it doesn't. It
might find sda3 for md3 but miss sda5 for md5. Strange.

Thanks for the ideas.

- Mark



Re: [gentoo-user] Re: RAID problems - Is udev at fault here?

2010-05-16 Thread Mark Knecht
On Sun, May 16, 2010 at 5:07 PM, walt w41...@gmail.com wrote:
SNIP

 Hm.  Is this your motherboard?:

 http://usa.asus.com/product.aspx?P_ID=W7i5W4Pw4fH22Mih

 Being a geek of a certain age, I find that products with names that invoke
 mega-dose anabolic steroids usually don't fit my lifestyle very well.

 I do better with product names that contain more sedate character strings
 like VSOP or MOM.

 By grepping through /usr/src/linux*/MAINTAINERS I turned up quite a few
 email addresses at intel.com, none of which seem relevant to RAID or its
 device drivers, but a polite email asking for a link to the appropriate
 dev might bring a polite and useful reply.  That's how I connected with
 the appropriate dev at Broadcom, who eventually fixed my ethernet driver.


Yes, that's the motherboard. I don't care much about the names of
things myself. I had limited options for the new i7-980x processor at
the time I was ordering the hardware, and I'd never done overclocking
before (and technically still haven't) so I got it because it was an
Asus board which I've generally had very good luck with.

To be clear, the RAID I'm doing is mdadm Linux software RAID and
nothing having to do with the on-board RAID controller. The machine
uses the standard Linux SATA drivers, or so I think.

I like the VSOP idea. :-)

- Mark