Re: [Linux-HA] Here again with my problem with iscsi resource agent

Dejan Muhamedagic Thu, 13 Jan 2011 08:32:52 -0800

On Thu, Jan 13, 2011 at 03:23:09PM +0100, RaSca wrote:
> Il giorno Gio 13 Gen 2011 13:57:28 CET, Dejan Muhamedagic ha scritto:
> > Hi,
> [...]
> >> The patch applies, but the resource fails to startup with:
> >> Failed actions:
> >>      www_db-iscsi_monitor_0 (node=debian-squeeze-nodo1, call=39,
> >> rc=4, status=complete): insufficient privileges
> > +/usr/lib/ocf/resource.d//heartbeat/iscsi: Permission denied
> > Try chmod +x /usr/lib/ocf/resource.d//heartbeat/iscsi :)
> 
> Shame on me. I'm an idiot :)
> 
> Now seems to work. On the log I can see some message like this:
> 
> Jan 13 15:56:09 debian-squeeze-nodo1 iscsid: connect to 10.0.0.100:3260 
> failed (No route to host)
> 
> until the db resource comes up, then the iscsi resource comes up correctly.
> But now there's another problem with the resource next to this one: the 
> first time the filesystem comes up, it fails, with this error:
> 
> Jan 13 16:13:20 debian-squeeze-nodo1 Filesystem[8207]: [8250]: INFO: 
> Running start for 
> /dev/disk/by-path/ip-10.0.0.100:3260-iscsi-iqn.2010-12.local.rascanet:db.rascanet.iscsi-lun-1-part1
>  
> on /
> db
> Jan 13 16:13:20 debian-squeeze-nodo1 lrmd: [7397]: info: RA output: 
> (www_db-fs:start:stderr) FATAL: Module scsi_hostadapter not found.
> Jan 13 16:13:20 debian-squeeze-nodo1 kernel: [19980.770010] sd 19:0:0:1: 
> [sdc] Unhandled error code
> Jan 13 16:13:20 debian-squeeze-nodo1 kernel: [19980.770056] sd 19:0:0:1: 
> [sdc] Result: hostbyte=DID_TRANSPORT_FAILFAST driverbyte=DRIVER_OK
> Jan 13 16:13:20 debian-squeeze-nodo1 kernel: [19980.770123] sd 19:0:0:1: 
> [sdc] CDB: Read(10): 28 00 00 00 00 3e 00 00 02 00
> Jan 13 16:13:20 debian-squeeze-nodo1 kernel: [19980.770361] end_request: 
> I/O error, dev sdc, sector 62
> Jan 13 16:13:20 debian-squeeze-nodo1 kernel: [19980.771595] EXT3-fs: 
> unable to read superblock
> Jan 13 16:13:20 debian-squeeze-nodo1 lrmd: [7397]: info: RA output: 
> (www_db-fs:start:stderr) mount: wrong fs type, bad option, bad 
> superblock on /dev/sdc1,#012       missing codepage or helpe
> r program, or other error


Bad filesystem? That's what it says anyway.

> Jan 13 16:13:20 debian-squeeze-nodo1 lrmd: [7397]: info: RA output: 
> (www_db-fs:start:stderr)
> Jan 13 16:13:20 debian-squeeze-nodo1 lrmd: [7397]: info: RA output: 
> (www_db-fs:start:stderr)        In some cases useful info is found in 
> syslog - try#012       dmesg | tail  or so
> Jan 13 16:13:20 debian-squeeze-nodo1 lrmd: [7397]: info: RA output: 
> (www_db-fs:start:stderr)
> Jan 13 16:13:20 debian-squeeze-nodo1 Filesystem[8207]: [8266]: ERROR: 
> Couldn't mount filesystem 
> /dev/disk/by-path/ip-10.0.0.100:3260-iscsi-iqn.2010-12.local.rascanet:db.rascanet.iscsi-lun-1-p
> art1 on /db
> Jan 13 16:13:20 debian-squeeze-nodo1 crmd: [7400]: info: 
> process_lrm_event: LRM operation www_db-fs_start_0 (call=32, rc=1, 
> cib-update=57, confirmed=true) unknown error
> 
> But, even if the system says that "FATAL: Module scsi_hostadapter not 
> found.", if I do a cleanup of the resource it comes up without other 
> problems:

Oh. So, only on first start this happens?

> Jan 13 16:20:26 debian-squeeze-nodo1 Filesystem[11389]: [11444]: INFO: 
> Running start for 
> /dev/disk/by-path/ip-10.0.0.100:3260-iscsi-iqn.2010-12.local.rascanet:db.rascanet.iscsi-lun-1-part1
>  
> on /db
> Jan 13 16:20:26 debian-squeeze-nodo1 lrmd: [7397]: info: RA output: 
> (www_db-fs:start:stderr) FATAL: Module scsi_hostadapter not found.
> Jan 13 16:20:27 debian-squeeze-nodo1 kernel: [20407.470761] kjournald 
> starting.  Commit interval 5 seconds
> Jan 13 16:20:27 debian-squeeze-nodo1 kernel: [20407.478976] EXT3 FS on 
> sdc1, internal journal
> Jan 13 16:20:27 debian-squeeze-nodo1 kernel: [20407.479074] EXT3-fs: 
> mounted filesystem with ordered data mode.
> 
> So everything is ok. The filesystem resource is declared in this way:
> 
> primitive www_db-fs ocf:heartbeat:Filesystem \
>       params 
> device="/dev/disk/by-path/ip-10.0.0.100:3260-iscsi-iqn.2010-12.local.rascanet:db.rascanet.iscsi-lun-1-part1"
>  
> directory="/db" fstype="ext3" \
>       op monitor interval="20s" timeout="40s" \
>       op start interval="0" timeout="60s" \
>       op stop interval="0" timeout="60s"
> 
> What should be the problem?

No idea. Test by hand to see what's going on?

Thanks,

Dejan
_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Re: [Linux-HA] Here again with my problem with iscsi resource agent

Reply via email to