On Fri, 8 Nov 2019 at 11:09, van Sleeuwen, Berry <[email protected]> wrote:
> Hi Rob,
>
> Indeed I have rexx logic after the PIPE to restart when required (IF
> rc=313 then CP IPL CMS PARM AUTOCR). The PIPELINE has two parts. The
> first part is STARMSG to send commands to the machine. The second part
> is the STARMON processing. It selects domains/records based on a
> parameter file (such as 04 0003 User activity records) and then writes
> them to disk. The output stream for STARMON is never severed except
> when the disk is full.
>
> Looking at the performance data I can see the LPAR was at 100% CPU at
> this time so probably the machine didn't get enough CPU to process data
> in time. Indeed I did get the HCPMOV6274I message but I didn't copy
> that line. In fact, that's the reason I have coded the restart when the
> PIPE ends with 313 in the past. As mentioned, I have seen a couple of
> times in the past that this would have stopped the entire PIPE but now
> it looks like only the STARMON was stopped.
>
> In this case I'm looking for a way to stop the PIPELINE when the
> STARMON stage stops collecting data. Indeed maybe I can rewrite the
> logic using the GATE stage.
>

IPL CMS is a bit rude. The 313 error terminates the STARMON stage,
right? That means the business logic that follows sees end-of-file on
its input stream and should wrap things up, because there is nothing
more to come. You might use JEREMY to figure out which part of the
pipeline is in a catch-22 (something like a CONS stage would already do
that). If nothing else, you could FANOUT and TAKE LAST after STARMON and
use that to fire a GATE that cuts the line further down.

But your (4,3) records are sample data, so you have the entire minute to
finish your work. If that's not enough for this sample, it probably will
not be for the next sample either. My rough guess is that a significant
piece of plumbing might consume 0.5% of a CPU to keep up. You can also
tell STARMON to skip some domains or just get the SAMPLE records.

If you really can't spare 300 ms of CPU per minute (0.5% of a CPU over
a 60-second interval), then your systems are a lot worse than they used
to be :-) Could it be that something else is doing a MONITOR STOP and
MONITOR START behind your back?

If your plumbing takes a lot more resources than that, you might want to
talk to RITA and hear where to rework the code. Feel free to post the
challenges.

Sir Rob the Plumber
