And you ask why I hate OMVS?

Barbara Nitz Tue, 23 Jun 2009 02:23:24 -0700

This past weekend I had the dubious honour of shutting down and IPLing 5 
systems, two of them with USS work. The shutting down part was really bad 
(now I know why our operators keep complaining).


One lpar has my favourite hate-application running (called WBIFN, for all you 
European SWIFT customers). Around seven minutes into the shutdown 
(another lpar with similar workload but not this appliaction was already down 
after 7 minutes) D A,L revealed that there were some DB2s plus WBIFN still 
running plus the necessary system infrastructure. And one thing with the 
jobname of a TSO user, but OMVSEX in the step info, so the userid belonged 
to some USS process? thread? application? And they seemend to multiply while 
I was looking at them. Canceling any of them didn't really help, never mind 
that the duplicate jobname requires using the asid, which requires a list 
first. 
By the time I get around to killing the pid, it's already gone.

Then I saw that *something* still had open DB2 threads, for which automation 
has made provisions and forces things out. So I thought that this must be 
related to this user. Given that I couldn't stop it from multiplying, much less 
get out of the system (f bpxoinit,shutdown=forkinit was replied with 'shutdown 
delayed'), I shut down the fork service. 

That stopped the multiplication, but a few of those 'user asids with a number' 
were still around. And it was a VERY bad idea to shutdown the fork service, as 
that effectively prevented WBIFN from terminating eventually (it never 
terminates in a timely manner, anyway). I ended up canceling things, which 
generated tons of coredumps which filled the directory, which eventually 
prevented the startup of this application. (And no, these useless coredumps 
cannot be prevented, believe me, I've tried.)

The good news was that after 20 minutes I had WBIFN and that userid shut 
down, and then automation did the rest (in the case of our operators, they 
never get automation to do 'the rest').

So how are other installations handling system shutdown when there are 
active USS users (or at least their leftover processes)? For a 'pure' MVS, I 
can 
shutdown TSO and the Initiators, cancel any running batch jobs, and I am 
done. But how do I stop the USS things from multiplying? 

And this Tuesday, that users leftover processes are back. I tried killing the 
top 
one (right under ppid=1), but that only resulted in another process under 
ppid=1 (that killed process was just dropped). superkill didn't help, either. 
Isn't 
there any surefire way to get the whole tree stopped in one fell swoop? (and 
no, I won't kill pid 1).

(An OMVS ignoramus is asking this, so please be gentle with me)

Best regards, Barbara

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to [email protected] with the message: GET IBM-MAIN INFO
Search the archives at http://bama.ua.edu/archives/ibm-main.html

And you ask why I hate OMVS?

Reply via email to