[Boston.pm] Can't alarm() while reading an unbuffered endless line?

Bogart Salzberg Mon, 19 Oct 2009 14:17:59 -0700

Mongers,

I recently encountered a puzzling dilemma. You might find itinteresting, or obvious (probably not both) and it leads to a questionabout how perl handles signals.


Here's the situation:

I teach a class for Portland's adult education class. It's a PHPclass. (boo, hiss). I created a web site on my own server wherestudents can enter php code, submit it, have it executed and see theresult in an output window. (Yes, the security implications aresevere. But that's not what I'm writing about :-).

The CGI script that handles the code submission is a perl script. Itcreates a temp file, builds a command line with some PHP-native safetyfeatures and then hands the file to PHP to execute via qx().

Anyhow, this has worked well for years. Inevitably during our coverageof looping (i.e. with "while") students will write infinite loops. Ithought I was handling these situations pretty well with a timeoutmechanism based on perl's alarm(). Here is basically how it worked, toparaphrase:


                $SIG{'ALRM'} = sub {die "timeout"};

my $command = 'php -d safe_mode=on -d open_basedir=/home/user ' .$file;


                alarm 5;
                eval {
                        $output = qx($command);
                };
                if ($@) {
                        # tell user their process exceeded the time limit
                }

On more than one recent occasion I found this didn't work. (Cut hereto say "I told you so.")

In researching the problem I isolated the conditions that led to theproblem, and also showed that it's not specific to PHP. But I stilldon't fully understand the reason for it.

Here is a test program I wrote (with help from http://perlguru.com/gforum.cgi?post=40217) to emulate the behavior of the CGI script:


----------------------------

#!/usr/bin/perl

use Errno (ESRCH, EPERM);

print "Content-type: text/plain\n\n" if $ENV{'GATEWAY_INTERFACE'};

$SIG{'ALRM'} = sub {die "Throwing time_out"};

my $file = '/path/to/inf_loop.php';

my $command = 'php ' . $file;

my $pid = open(CMD, '-|'); # going to read child's STDOUT

defined ($pid) or die "Can't fork: $!";

if ($pid) { # parent
        alarm 5;
        eval {
                $output = join '', <CMD>;
        };
        if ($@) {
                print "Caught time_out\n";
                kill_pid($pid);
        }
}
else { # child
        exec($command) or die;
}

sub kill_pid {
        my $pid = shift;
        if (kill 0, $pid) {
                print "Killing $pid\n";
                kill 'KILL', $pid;
        }
        elsif ($! == EPERM) {
                print "I'm not allowed to signal $pid!\n";
        }
        elsif ($! == ESRCH) {
                print "$pid is deceased\n";
        }
        else {
                print "Unexpected error, can't kill $pid\n";
        }
}

----------------------------

The contents of the PHP file "inf_loop.php" varied as I trieddifferent things.


TEST 1:

while (1) {
  $n++;
}

RESULT:

After 5 seconds, the browser received this response:

Caught time_out
Killing 7884

----------------------------

TEST 2:

while (1) {
  print $n++;
}

RESULT:

The browser spun its wheels indefinitely, waiting... while in "top" Isee:


  PID USER      PR  NI  VIRT  RES  SHR S %CPU %MEM    TIME+  COMMAND
 8446 ftpbog1   20   0  192m 8640 5496 R   52  0.2   0:02.06 php
 8445 ftpbog1   20   0 38380  23m 1320 S   48  0.6   0:02.00 test2.pl
 8444 ftpbog1   20   0  237m  10m 2156 S    0  0.3   0:00.00 apache2

The test2.pl process is consuming massive amounts of memory veryquickly and continues to run indefinitely. Finally I run...


jet:/home/bogart# sudo -u ftpbog1 kill 8446

... and almost simultaneously the response comes back to the browser:

Caught time_out
Killing 8446

This tells me that the failure to terminate after 5 seconds is not dueto a permissions issue.


----------------------------

TEST 3:

For this test I see if a perl child "inf_loop.pl" would produce thesame result, with (literally) the same code:


while (1) {
  print $n++;
}

RESULT: 5 out of 5 times the process is killed after 5 seconds, thoughnot before consuming 30 MB of RAM.


----------------------------

TEST 4:

For this test I unbuffered STDOUT in "inf_loop.pl" with:

$| = 1;

RESULT: Now the result matches the result from the PHP file. The childperl process runs indefinitely until killed with:


jet:/home/bogart# sudo -u ftpbog1 kill 10190

... at which point the browser receives:

Caught time_out
Killing 10190

NOTE: All of the subsequent tests are run with the perl childunbuffered. The PHP child is unbuffered by default, presumably.


----------------------------

TEST 5:

Added a call to sleep().

while (1) {
  print $n++;
  sleep(1);
}

RESULT: Consistent kill after 5 seconds, for both the PHP child andthe perl child.


----------------------------

TEST 6:

Print newlines:

while (1) {
  print $n++ . "\n";
}

RESULT: Both the PHP and perl children run indefinitely. Granted, thereader was trying to read to EOF all at once...


----------------------------

TEST 7:

Read in a loop:

while (<CMD>) {
        $output .= $_;
}

RESULT: Consistent kills for both children. They're printing newlinesbut not sleeping.


----------------------------

TEST 8:

Read in a loop, but don't have the children print newlines.

RESULT: As you'd suspect, they run indefinitely. The readline operatorkeeps gobbling up RAM as fast as it can.


----------------------------

TEST 9:

Use read():

1 while ($bytes = read(CMD, $output, 1024, $total += $bytes));

RESULT: Consistent kills. Neither child was sleeping or printingnewlines. Both were attempting to print one unbuffered endless line asfast as possible.


----------------------------

TEST 10:

What if I give the child something else to do in the loop? I triedthis several times with additional "extra work" each time. And I wentback to line-based reading, but in a loop, i.e. "while (<CMD>) ..."


while (1) {
        print $n++;
        1;
}

RESULT: Testing just the perl process now, it runs indefinitely.

while (1) {
        print $n++;
        $m = rand();
}

RESULT: Indefinite run time.

while (1) {
        print $n++;
        $m = rand() * rand() * rand() * rand() * rand() / $n;
}

RESULT: Indefinite run time.

while (1) {
        print $n++;
        1 while $m++ % 100;
}

RESULT: Killed on the 1st, 3rd and 5th run. Ran indefinitely on the2nd and 4th run.


while (1) {
        print $n++;
        my $m;
        $m++ until $m == $n;
}

RESULT: Killed 5 times out of 5

while (1) {
        print $n++;
        $m = $n;
        while ($m--) {
                $o = rand() * rand();
        }
}

RESULT: Killed 5 times out of 5

----------------------------

TEST 11:

Had the $SIG{'ALRM'} sub and the exception handler print the time atwhich they executed.


$SIG{'ALRM'} = sub {die "Throwing time_out at " . scalar(localtime())};

... and ...

if ($@) {
        print "Caught error ($@) at " . scalar(localtime()) . "\n";
        kill_pid($pid);
}

RESULT: I reverted the inf_loop.pl script to a known "infinite"personality. After running it for about 30 seconds I killed it fromthe command line and found that the times reported were simultaneouswith the manual kill, long after the 5-second alarm signal handlershould have been executed.


----------------------------

ANALYSIS:

There are some circumstances in which an alarm is unlikely to behandled by perl at the intended time, or soon after. (When I say thetests ran "indefinitely", it means I let them run for 30 seconds to aminute before killing them manually. The alarm was set for fiveseconds). The probability depends on a variety of factors, including:


- Unbuffered output. (Compare tests 3 and 4).
- Line-based reading for the parent. (See test 9).
- The child writing something. (See test 1).

- The child writing without newlines (Compare tests 7 and 8, notearray context for <>).- The child doing something other than "print $n++", though it dependson how much. (See tests 5 and 10).

The problem is not that the child *can't* be signaled. It appears thatthe problem (see test 11) is that under certain circumstances perldoesn't execute the parent's $SIG{'ALRM'} handler when it should. Thesignal is "there", waiting, but it doesn't get through until the childexits.


QUESTIONS:

Does these results seem unusual or unexpected? What bugs me is that,for instance in test 10, the success of the alarm seems somewhatarbitrary. The fourth of the sub-tests under test 10 only ranindefinitely *some* of the time.

What is happening internally that could explain this? I don't know howalarm() is implemented under the hood. I've heard of "uninterruptablesleep" associated with disk I/O problems, but I didn't think it wasrelated to RAM. (Swap is presumably not in use here).

Do any of you typically use a particular "defensive programming"tactic to avoid these kinds of issues? It seems that reading with "read()" might be a good idea.


Thanks.

Bogart




_______________________________________________
Boston-pm mailing list
[email protected]
http://mail.pm.org/mailman/listinfo/boston-pm

[Boston.pm] Can't alarm() while reading an unbuffered endless line?

Reply via email to