Hallo everybody! Short info on architecture in use:
I have a setup of two HP DL380 with Smart Array 5i Controller for internal Disks (RAID 1 for rootdisks, on mashine wit additional RAID 5 for local database). Both Machines are attached to a HP MSA 500 Storage device via Smart Array 532 Controller. The machines form a high availability cluster for an Oracle database. Kernel in Use is 2.6.10-gentoo-r6. This construct is suggestet by HP for use in HA clusters. The device has one singel lun wich is used as LVM2 device via device-mapper. FS ist ext2. For a few days now write access to the MSA 500 stalls. Afterwards every access to that device stalls too. The machine refises to sync and will not reboot without pressing tho power button. Please help! _any_ hint is welcome! Including input on working environments with similiar setup or similiar problems. Any hint on possible problems with the used kernel or host bus adapters? Kind regards, Matthias Witschel PS: Here are the relevant messages from /var/log/messages: (earlier tests included EXT2 error messages ahead of the timeout, this didn't happen after fsck on the device) Feb 9 16:56:16 telkas1 cciss: cmd f7d80000 timedout Feb 9 16:56:16 telkas1 Buffer I/O error on device dm-6, logical block 60998 Feb 9 16:56:16 telkas1 lost page write due to I/O error on dm-6 Feb 9 16:56:16 telkas1 cciss: cmd f7d80248 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d80490 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d806d8 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d80920 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d80b68 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d80db0 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d80ff8 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d81240 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d81488 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d816d0 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d81918 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d81b60 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d81da8 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d81ff0 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d82238 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d82480 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d826c8 timedout Feb 9 16:56:16 telkas1 Buffer I/O error on device dm-6, logical block 62023 Feb 9 16:56:16 telkas1 lost page write due to I/O error on dm-6 Feb 9 16:56:16 telkas1 cciss: cmd f7d82910 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d82b58 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d82da0 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d82fe8 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d83230 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d83478 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d836c0 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d83908 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d83b50 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d83d98 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d83fe0 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d84228 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d84470 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d846b8 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d84900 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d84b48 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d84d90 timedout Feb 9 16:56:16 telkas1 Buffer I/O error on device dm-6, logical block 63048 Feb 9 16:56:16 telkas1 lost page write due to I/O error on dm-6 Feb 9 16:56:16 telkas1 cciss: cmd f7d84fd8 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d85220 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d85468 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d856b0 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d858f8 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d85b40 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d85d88 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d85fd0 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d86218 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d86460 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d866a8 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d868f0 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d86b38 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d86d80 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d86fc8 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d87210 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d87458 timedout Feb 9 16:56:16 telkas1 Buffer I/O error on device dm-6, logical block 64073 Feb 9 16:56:16 telkas1 lost page write due to I/O error on dm-6 Feb 9 16:56:16 telkas1 cciss: cmd f7d876a0 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d878e8 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d87b30 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d87d78 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d87fc0 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d88208 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d88450 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d88698 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d888e0 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d88b28 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d88d70 timedout Feb 9 16:56:16 telkas1 cciss: cmd f7d88fb8 timedout
