On Mar 30, 2007, at 5:29 AM, Alan Robertson wrote:

Doug Knight wrote:
Alan,
did you get a chance to look at the cibadmin.txt and the comments below?
The crm_master spinning is the last thing standing in my way to
completing the OCF script testing.

Thanks,
Doug

On Wed, 2007-03-28 at 11:27 -0400, Doug Knight wrote:
Hi Alan,
I tend to agree with your analysis below. I placed some debug output
in the crm_attribute.c module, and found that it went into its loop
within the call to update_attr. I had trouble trying to get debug
output I placed in the function itself (is cib_attrs.c the correct
module?). Then on Friday I ran out of time to continue persuing it.




On Sun, 2007-03-25 at 16:45 -0600, Alan Robertson wrote:
Doug Knight wrote:
Got it. The attached file contains the strace from the second attempt by heartbeat to start the resource up as master, right up until it was killed. The resource already showed failed on the gui. I zipped it up
using gzip.

By the way, from the system call perspective, what it's doing is
mallocing again and again and again...

I presume it's in this function (from the top level)
   rc = update_attr(the_cib, cib_opts, type, dest_node, set_name,
               attr_id, attr_name, attr_value);


And I further presume (with somewhat more risk) that it's in this
function from the next level down:

        rc = the_cib->cmds->modify(the_cib, section, xml_top, NULL,
call_options| cib_quorum_override);

        cib_client_modify(CIB_OP_MODIFY...)

        cib_native_perform_op()

Which sends the request over to the CIB, where it should do this...

        cib_process_modify()

        update_xml_child(obj_root, input)

However, from cib_process_modify on, all the work takes place in the CIB, not in the crm_master command. So, I presume that it doesn't get
that far.  [Other theories are also possible, of course ;-)]

Here is my initial conclusion:
        1)  No one else has reported this problem
        2)  The code in question is common and is used for many things
        3)  Therefore it's more likely that something is amiss with your
                CIB and causing the CIB code to loop looking for the
                subtree to modify.  If this theory is correct, there are
                two problems one with your CIB, and one in the code.

So, could you please send the current output from cibadmin -Q to the
list as an attachment?

I've attached the output from the "cibadmin -Q" command.

Could you also please run crm_verify on your CIB and see if it complains about anything. If it does, please fix its complaints, and try again.

"crm_verify -L" did not complain on any issues. However, "crm_verify
-x /var/lib/heartbeat/crm/cib.xml" had the following to say:

[dknight]# crm_verify -V -x /var/lib/heartbeat/crm/cib.xml
element cib: validity error : Element cib content does not follow the
DTD, expecting (configuration , status), got (configuration )

crm_verify[27448]: 2007/03/28_11:11:31 ERROR: validate_with_dtd: CIB
does not validate against /usr/lib64/heartbeat/crm.dtd

crm_verify[27448]: 2007/03/28_11:11:31 ERROR: main: CIB did not pass
DTD validation
Errors found during check: config not valid

its complaining there's no status section... which isn't actually important as we'll add it in
i'll try and mask it

was that the only error?


And, could you also please tell us how you installed the system. If you didn't install a package, then did you make the required user ID and
group ID?

I pulled down the 2.0.8 tarball from the linux-ha web site. Used
ConfigureMe to build, with some minor changes (for my Red Hat distro
of EL5 Beta, I added DFLAGS="--with-group-id=60 --with-ccmuser- id=17" in the appropriate place to get it to build). I created the hacluster
user ID and group ID to match

OK.

Thanks.

Andrew is somewhat back now, and he would be more effective looking at
it than I, and I'm traveling this week...

--
    Alan Robertson <[EMAIL PROTECTED]>

"Openness is the foundation and preservative of friendship...  Let me
claim from you at all times your undisguised opinions." - William
Wilberforce

_______________________________________________________
Linux-HA-Dev: [email protected]
http://lists.linux-ha.org/mailman/listinfo/linux-ha-dev
Home Page: http://linux-ha.org/

Reply via email to