Re: Rescuing mangled RAID5 array

2006-11-23 Thread Greg Eden

Hello again,

On 14 Nov 2006, at 18:30, Greg Eden wrote:

I'm trying to recover data from a broken RAID5 array (drive removed  
whilst array was rebuilding!).


[snip]



I'm trying

dd if=/dev/da1s1d of=/raid2/rescueimage


I'm trying not to equate lack of response to lack of chance of  
receovery! However...


After 4 days dd eventually successfully created a 1.5TB image file of  
the trashed partition, so i can work on it without causing further  
damage. I successfully attached it with mdconfig. Running fsck_ufs on  
the resultant /dev/md0 causes fsck_ufs to eventually crash out with:


UNKNOWN FILE TYPE I=42151497
UNEXPECTED SOFT UPDATE INCONSISTENCY

CLEAR? yes

fsck_ufs: bad inode number 42158080 to nextinode

during Phase 1. Is there anyway to get around this so fsck can continue?

Thanks in advance.

Greg.

___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


Rescuing mangled RAID5 array

2006-11-14 Thread Greg Eden

Hello

I'm trying to recover data from a broken RAID5 array (drive removed  
whilst array was rebuilding!).


What's the best way to get an image of the corrupted partition and  
how would I run fsck -y on the image. The corrupted partition is  
1.3TB, /dev/da1s1d, usually mounted as /raid. I have another empty  
2TB partition handy, /dev/da2s1d currently mounted as /raid2.


Following this thread

http://lists.freebsd.org/pipermail/freebsd-questions/2006-November/ 
134984.html


I'm trying

dd if=/dev/da1s1d of=/raid2/rescueimage

but it's proceeding very slowly (2MB/sec) and I'm not entirely sure  
I'm going to be able to mount and fix the resulting image with fsck  
at the end of it.


So is there a faster way to clone a partition (unfortunately the  
drive array itself doesn't offer this functionality) and can you fsck  
an md mounted image. assuming i should use md to mount the image?


There's nothing wrong with the drives physically, just mangled  
metadata I think. I can actually mount the partition readonly but  
many directories are missing or listed as 'Bad file descriptor'. I've  
copied about 20% of the data off but the most important 50% doesn't  
even have it's directory listed any more :(


Thanks in advance for any help

Greg.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to [EMAIL PROTECTED]


FreeBSD 5.3R SMP Kernel not detecting 2nd CPU in a HP DL360

2005-03-25 Thread Greg Eden
Hello All,
I have a co-located HP DL360G3 server to which I do not have physical  
access. It has 2 x 2.8GHz Xeon CPUs with HTT. One of the ISP's  
engineers has confirmed that both CPUs are detected when the BIOS  
POSTs.

I have cvsup'd to the latest 5.3R security release and successfully  
built 'world', something I have done successfully many times with 4.x  
releases, using the instructions from the handbook. I used a custom  
kernel configuration which included options SMP and device apic. On  
reboot dmesg confirmed that the custom kernel was used, however SMP was  
not active and the additional CPUs were not launched.

Thinking that perhaps my custom kernel conf was at fault, I compiled a  
GENERIC smp kernel with:

# cd /usr/obj
# chflags -R noschg *
# rm -rf *
/usr/src/# make buildkernel KERNCONF=SMP
/usr/src/# make installkernel KERNCONF=SMP
After reboot the dmesg output is as follows:
FreeBSD 5.3-RELEASE-p5 #0: Thu Mar 24 14:37:29 GMT 2005
[EMAIL PROTECTED]:/usr/obj/usr/src/sys/SMP
Timecounter i8254 frequency 1193182 Hz quality 0
CPU: Intel(R) Xeon(TM) CPU 2.80GHz (2799.22-MHz 686-class CPU)
  Origin = GenuineIntel  Id = 0xf29  Stepping = 9
   
Features=0xbfebf9ffFPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,SEP,MTRR,PGE,MCA, 
CMOV,PAT,PSE36,CLFLUSH,DTS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,PBE
  Hyperthreading: 2 logical CPUs
real memory  = 2147459072 (2047 MB)
avail memory = 2095996928 (1998 MB)
npx0: [FAST]
npx0: math processor on motherboard
npx0: INT 16 interface
acpi0: COMPAQ P31 on motherboard
acpi0: Power Button (fixed)
Timecounter ACPI-safe frequency 3579545 Hz quality 1000
acpi_timer0: 32-bit timer at 3.579545MHz port 0x920-0x923 on acpi0
cpu0: ACPI CPU on acpi0
acpi_tz0: Thermal Zone on acpi0
pcib0: ACPI Host-PCI bridge on acpi0
ACPI link \\_SB_.IN31 has invalid initial irq 3, ignoring
pci0: ACPI PCI bus on pcib0
pci0: display, VGA at device 3.0 (no driver attached)
ciss0: Compaq Smart Array 5i port 0x2800-0x28ff mem  
0xf5df-0xf5df3fff,0xf5f8-0xf5fb irq 11 at device 4.0 on  
pci0
ciss0: [GIANT-LOCKED]
pci0: base peripheral at device 5.0 (no driver attached)
pci0: base peripheral at device 5.2 (no driver attached)
isab0: PCI-ISA bridge at device 15.0 on pci0
isa0: ISA bus on isab0
atapci0: ServerWorks CSB5 UDMA100 controller port  
0x2000-0x200f,0x376,0x170-0x177,0x3f6,0x1f0-0x1f7 at device 15.1 on  
pci0
ata0: channel #0 on atapci0
ata1: channel #1 on atapci0
ohci0: OHCI (generic) USB controller mem 0xf5e7-0xf5e70fff irq 10  
at device 15.2 on pci0
ohci0: [GIANT-LOCKED]
usb0: OHCI version 1.0, legacy support
usb0: SMM does not respond, resetting
usb0: OHCI (generic) USB controller on ohci0
usb0: USB revision 1.0
uhub0: (0x1166) OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 4 ports with 4 removable, self powered
pcib1: ACPI Host-PCI bridge on acpi0
pci1: ACPI PCI bus on pcib1
bge0: Broadcom BCM5703 Gigabit Ethernet, ASIC rev. 0x1002 mem  
0xf7ef-0xf7ef irq 11 at device 2.0 on pci1
miibus0: MII bus on bge0
brgphy0: BCM5703 10/100/1000baseTX PHY on miibus0
brgphy0:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX,  
1000baseTX-FDX, auto
bge0: Ethernet address: 00:11:85:6b:07:d8
pcib2: ACPI Host-PCI bridge on acpi0
pci4: ACPI PCI bus on pcib2
bge1: Broadcom BCM5703 Gigabit Ethernet, ASIC rev. 0x1002 mem  
0xf7ff-0xf7ff irq 15 at device 2.0 on pci4
miibus1: MII bus on bge1
brgphy1: BCM5703 10/100/1000baseTX PHY on miibus1
brgphy1:  10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX,  
1000baseTX-FDX, auto
bge1: Ethernet address: 00:11:85:6b:07:db
atkbdc0: Keyboard controller (i8042) port 0x64,0x60 irq 1 on acpi0
atkbd0: AT Keyboard irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
sio0: Standard PC COM port port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A
fdc0: floppy drive controller (FDE) port 0x3f2-0x3f5 irq 6 drq 2 on  
acpi0
fdc0: [FAST]
fd0: 1440-KB 3.5 drive on fdc0 drive 0
orm0: ISA Option ROMs at iomem  
0xee000-0xe,0xcc000-0xcd7ff,0xc8000-0xcbfff,0xc-0xc7fff on isa0
pmtimer0 on isa0
ppc0: parallel port not found.
sc0: System console at flags 0x100 on isa0
sc0: VGA 16 virtual consoles, flags=0x300
sio1: configured irq 3 not in bitmap of probed irqs 0
sio1: port may not be enabled
vga0: Generic ISA VGA at port 0x3c0-0x3df iomem 0xa-0xb on  
isa0
Timecounter TSC frequency 2799224924 Hz quality 800
Timecounters tick every 10.000 msec
acd0: CDROM CRN-8245B/2.19 at ata0-master PIO4
da0 at ciss0 bus 0 target 0 lun 0
da0: COMPAQ RAID 1  VOLUME OK Fixed Direct Access SCSI-0 device
da0: 135.168MB/s transfers
da0: 69459MB (142253280 512 byte sectors: 255H 32S/T 17433C)
Mounting root from ufs:/dev/da0s1a

and
~ sysctl kern.smp.active
kern.smp.active: 0
I just don't understand why the kernel isn't running SMP. To my mind  
it's either something in my method (most likely), a hardware problem,  
or something odd in the way 5.3R detects the CPUs on the hardware.

I have successfully compiled SMP kernels on