Some further clarification:
To partition a system from the sysplex, XCF must isolate or "fence" the
target system from the channel subsystem, in order to ensure that it is no
longer capable of initiating I/O and thus can no longer affect resources (CF
structures, data bases, etc.) shared by other members of the sysplex. XCF
issues IXC102A whenever it cannot automatically isolate the target system
from the channel subsystem.
When removing a system because it is "status update missing" (that is,
the system has failed to update its heartbeat information in the sysplex couple
data set for a period greater than the failure detection interval), XCF's
processing is governed by the contents of the SFM policy. If SFM is in use on
all systems in the sysplex and there is an active SFM policy and it specifies
ISOLATETIME, DEACTTIME, or RESETTIME (i.e., anything other than PROMPT)
for the affected system, XCF will attempt to take the specified automatic
action. In the case of ISOLATETIME, there must be a CF that is connected to
both the affected system and some other system in the sysplex from which
isolation can be initiated. If the requirements are met and isolation
succeeds,
you will not see IXC102A. If isolation cannot be initiated or fails for any
reason, XCF will issue IXC102A to prompt for manual system reset.
When removing a system in response to a VARY XCF,sysname,OFFLINE
command, XCF will attempt automatic isolation of the outgoing system as long
as there is an active SFM policy, regardless of what it specifies for the
target
system. If there is no SFM policy, or if isolation fails for any reason (no
connected CF, or whatever), XCF issues IXC102A.
If XCF issues IXC102A, it is an absolute, sure-enough, hey-no-kidding
requirement that you reset the target system BEFORE replying DOWN to the
message. The moment you reply DOWN, XCF will mark the target system
inactive and the remaining systems in the plex will begin reclaiming its
resources. If that system has not in fact been reset, and is still trying to
access shared resources for which it no longer holds serialization, merry hell
will break loose in the form of data base corruption or any other scenario you
can conceive that can begin from a state where two entities are
simultaneously manipulating the same critical resource.
Bill Neiman
XCF Development
On Mon, 28 Apr 2008 07:37:56 +0200, Barbara Nitz <[EMAIL PROTECTED]>
wrote:
>>I'm looking to get clarification on whether or not the z/OS console
>>IXC102A message requesting that the operator confirms that a member of a
>>sysplex bas been reset always appears when a member is removed from the
>>sysplex.
>
>No. In order for it NOT to appear,
>
>1. You must have a parallel sysplex with at least one CF.
>2. You must have an SFM policy active.
>
>Then all but the last system can be system-reset automagically through the
CF (initiated by SFM, and you don't see it). The last system in the sysplex has
to be system-reset manually, as there is no one else around in the sysplex to
do it. But then you wouldn't see that message, either.
>
>As the message states, you may see IXC102A when both 1 and 2 above are
fulfilled. The only time I ever saw it I followed up on it and it turned out
that
there was a bug somewhere in coexistence code between releases, and I
should not have seen the message at all.
>
>>I have been told that some installations never see it, and I suspect it
>>it would not be replied to by automation without somehow managing to
>>cause a system reset to occur on the member being removed.
>
>Well, if you reply to it via automation and that automation is not capable of
actually system-resetting the lpar before replying, then you may be in deep
do-do, and it's all your fault. :-)
>
>Regards, Barbara Nitz
>--
----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [EMAIL PROTECTED] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html