Re: Early IPL problems

Mark Zelden Sun, 29 Apr 2012 08:33:32 -0700

On Sat, 28 Apr 2012 09:47:29 +0200, R.S. <[email protected]> wrote:


>On 2012-04-27 23:03, John McDowell wrote:
>> I'm trying to get a feel for problems that occur in the early stages of z/OS 
>> system start up (e.g. IPL/NIP).  Generally problems in these stages result 
>> in a non-restartable wait state, for example wait state x'0B1' (e.g. LOADxx 
>> or IODF problem).
>>
>> Questions:
>> 1.  FREQUENCY: How often do they occur ?
>> 2.  DURATION: How long does it take to resolve them (e.g. minutes, hours, 
>> etc.) ?
>> 3.  IMPACT: What are the consequences (e.g. missed SLAs, etc.) ?
>> 4.  CAUSE: What are the underlying sources (e.g. hardware, software, etc.) ?
>> 5.  RECOVERY: How do you recover from them ?
>
>1. Rarely. IPLs are performed rarely. In my case I haven't seen such a
>problem *on production systems* for years. Such problems do happen
>during tests: a new system, PTFs applied (and IPLTEXT not refreshed),
>a new CPC, a new LPAR, a DR test, etc.
>BTW: I *hate* reading the last 3 digits, then the previous digits... ;-)
>Since the numbers are available on the HMC, it would be nice to have an
>"Explain" button which could (under the covers) open the book and perform
>the analysis for me.
>
>2. The time depends on two or three elements:
>a) Time to open the book. It can be a few seconds when I'm on my PC (HMC
>accessed remotely); it can be minutes when I'm at the real HMC and
>have to use another PC for documentation access.
>b) Time to write down the digits and extract the wait state code and
>reason code.
>c) (Optional) Sometimes I need to check whether the description is accurate,
>or fix something (like the LOAD member). I usually log on to TSO on
>another system and view/modify things. That can take 5 minutes.
>
>3. Lost time, some stress. From a business point of view it doesn't affect
>my SLA.
>
>4. IODF in multiple extents, OS config with a bad offline/online device
>set (i.e. the IODF device is described as OFFLINE YES), mistakes in LOADxx,
>IPLTEXT not refreshed (after PTF APPLY), a typo in the LOAD window on the HMC.
>
>5. See 2.
>


I think R.S.'s response is typical for most of us.  Although I can't remember
the last time I had a wait due to the nucleus or IODF being in multiple extents,
occasionally I have typo'd a loadparm when IPLing one of my sandbox LPARs.
Loadparms never change in production except for a brief period when migrating
to a new OS release; the sysprog does that initial IPL/change, and the HMC
remembers it, of course.  Load addresses change often, but no one ever seems
to type those in wrong.

I really like the idea of the HMC being able to quickly open a reference to 
the correct wait state code.  
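
As a rough illustration of the first step such an "Explain" helper would have
to perform, here is a minimal Python sketch that pulls the wait state code and
the raw reason digits out of a disabled-wait PSW as copied from the HMC.  It
assumes the low-order word follows the general "xrrrrwww" pattern (last three
hex digits are the wait state code, the digits before them carry the reason
code); the exact position of the reason digits differs between wait states, so
the raw field is returned for manual lookup.  The function name, the lookup
table, and the sample PSW value are hypothetical; the only table entry is the
x'0B1' example mentioned in this thread.

```python
import re

# Hypothetical lookup table. The single entry comes from this thread;
# real descriptions belong to the MVS System Codes book.
KNOWN_WAIT_STATES = {
    "0B1": "LOADxx or IODF problem",
}


def split_wait_psw(psw: str) -> dict:
    """Split a disabled-wait PSW string into wait state code and reason digits.

    Assumption: the last 8 hex digits look like 'xrrrrwww', where 'www' is
    the wait state code and 'rrrr' holds the reason code. How the reason
    digits are laid out inside 'rrrr' varies by wait state.
    """
    # Drop blanks and any other separators the operator copied along.
    digits = re.sub(r"[^0-9A-Fa-f]", "", psw).upper()
    if len(digits) < 8:
        raise ValueError("expected at least 8 hex digits in the PSW")

    low_word = digits[-8:]          # x r r r r w w w
    wait_code = low_word[-3:]       # last 3 digits
    reason_field = low_word[1:5]    # the 4 digits before the wait code

    description = KNOWN_WAIT_STATES.get(
        wait_code, "not in local table; look it up in MVS System Codes"
    )
    return {
        "wait_code": wait_code,
        "reason_field": reason_field,
        "description": description,
    }


if __name__ == "__main__":
    # Made-up PSW value, for illustration only.
    info = split_wait_psw("00020000 80000000 00000000 000010B1")
    print(info["wait_code"], info["reason_field"], info["description"])
```

The sketch only does the digit extraction R.S. describes doing by hand;
interpreting the reason code still means opening the book for that wait state.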

Mark
--
Mark Zelden - Zelden Consulting Services - z/OS, OS/390 and MVS       
mailto:[email protected]                                        
Mark's MVS Utilities: http://www.mzelden.com/mvsutil.html 
Systems Programming expert at http://expertanswercenter.techtarget.com/

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [email protected] with the message: INFO IBM-MAIN
