I am looking for some help with the external/riloe stonith plug in.  I have 
been working with the one that ships in SLES 10 SP1 heartbeat 2.0.8-0.19.  I 
have used the following XML to create the clone resource. 

<clone id="CL_stonithset_node1"> 
   <instance_attributes id="CL_stonithset_node1"> 
    <attributes> 
      <nvpair id="CL_stonithset_node1_clone_node_max" name="clone_node_max" 
value="1"/> 
    </attributes> 
  </instance_attributes> 
  <primitive id="CL_stonith_node1" class="stonith" type="external/riloe" 
provider="heartbeat"> 
    <operations> 
      <op name="monitor" interval="30s" timeout="20s" 
id="CL_stonith_node1_monitor"/> 
      <op name="start" timeout="60s" id="CL_stonith_node1_start"/> 
    </operations> 
    <instance_attributes id="CL_stonith_node1"> 
      <attributes> 
        <nvpair id="CL_stonith_node1_hostlist" name="hostlist" value="node1"/> 
        <nvpair id="CL_stonith_node1_RI_HOSTRI" name="RI_HOSTRI" 
value="il-node1"/> 
        <nvpair id="CL_stonith_node1_RI_LOGIN" name="RI_LOGIN" 
value="Administrator"/> 
        <nvpair id="CL_stonith_node1_RI_PASSWORD" name="RI_PASSWORD" 
value="password"/> 
      </attributes> 
    </instance_attributes> 
  </primitive> 
</clone> 

Sample errors in the messages log. 
Jun 27 11:30:41 node1 haclient: on_event:evt:cib_changed 
Jun 27 11:30:41 node1 stonithd: [5318]: info: Cannot get parameter hostname 
from StonithNVpair 
Jun 27 11:30:41 node1 stonithd: [5318]: ERROR: Invalid config info for 
external/riloe device. 
Jun 27 11:30:41 node1 lrmd: [12035]: ERROR: sending stonithRA op to stonithd 
failed. 
Jun 27 11:30:41 node1 cib: [12048]: info: write_cib_contents: Wrote version 
0.46.2095 of the CIB to disk 

This error shows up a couple of times in a row also. 
Jun 27 11:30:41 node1 crmd: [5320]: ERROR: parse_xml: Error parsing token: 
couldnt find attr_name 
Jun 27 11:30:41 node1 crmd: [5320]: ERROR: parse_xml: Error at or before: 
="ilo_hostname" uniq 
Jun 27 11:30:41 node1 crmd: [5320]: ERROR: parse_xml: Error parsing token: 
error parsing child 
Jun 27 11:30:41 node1 crmd: [5320]: ERROR: parse_xml: Error at or before:    
<longdesc lang=en 
Jun 27 11:30:41 node1 crmd: [5320]: ERROR: parse_xml: Error parsing token: 
error parsing child 
Jun 27 11:30:41 node1 crmd: [5320]: ERROR: parse_xml: Error at or before: >  
<parameter name=" 
Jun 27 11:30:41 node1 crmd: [5320]: ERROR: parse_xml: Error parsing token: 
error parsing child 
Jun 27 11:30:41 node1 crmd: [5320]: ERROR: parse_xml: Error at or before: c> 
<parameters>  <pa 
Jun 27 11:30:41 node1 crmd: [5320]: ERROR: crm_abort: find_xml_node: Triggered 
non-fatal assert at xml.c:75 : root != NULL 

The resource is created OK but I can't start the resource.  It gives an error 
that it can't run anywhere.  I also see errors about not being able to fin 
hostname.  So I did some digging in the riloe file and it shows the RI_ entries 
as legacy.  lower it in the file it shows some ilo_ values.  So I tried 
creating the same file above with the new ilo equivalents. 



<clone id="CL_stonithset_node1"> 
   <instance_attributes id="CL_stonithset_node1"> 
    <attributes> 
      <nvpair id="CL_stonithset_node1_clone_node_max" name="clone_node_max" 
value="1"/> 
    </attributes> 
  </instance_attributes> 
  <primitive id="CL_stonith_node1" class="stonith" type="external/riloe" 
provider="heartbeat"> 
    <operations> 
      <op name="monitor" interval="30s" timeout="20s" 
id="CL_stonith_node1_monitor"/> 
      <op name="start" timeout="60s" id="CL_stonith_node1_start"/> 
    </operations> 
    <instance_attributes id="CL_stonith_node1"> 
      <attributes> 
        <nvpair id="CL_stonith_node1_hostlist" name="hostlist" value="node1"/> 
        <nvpair id="CL_stonith_node1_ilo_hostname" name="ilo_hostname" 
value="il-node1"/> 
        <nvpair id="CL_stonith_node1_ilo_user" name="ilo_user" 
value="Administrator"/> 
        <nvpair id="CL_stonith_node1_ilo_password" name="ilo_password" 
value="password"/> 
        <nvpair id="CL_stonith_node1_ilo_protocol" name="ilo_protocol" 
value="1.2"/> 
      </attributes> 
    </instance_attributes> 
  </primitive> 
</clone> 

Same results resource is created but doesn't start.  I can ping the hostname 
and the ilo hostname of node1 and il-node1 from all boxes.  I am able to ssh 
and https to the ilo card and login with the admin account.  I have attached 
the riloe plug in that I am trying to use. 

The hardware is a dl350 running ilo firmware 1.22. 

Does anyone know what type of connection the plug in makes to the ilo card? 

Do I need to have the ilo2 device at a certain firmware version? 

Do I need a driver loaded for the ilo card to work or does it communicate to it 
through ssh or https? 

What can I do to trouble shoot this problem? 


TIA 
Edward 

Attachment: riloe
Description: Binary data

_______________________________________________
Linux-HA mailing list
[email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha
See also: http://linux-ha.org/ReportingProblems

Reply via email to