Loading subroutines ad-hoc: extension to load.pm

Elizabeth Mattijsen Tue, 14 Oct 2008 13:23:04 -0700

After feedback from the Amsterdam Perl Mongers and some other people,I'm considering some additions to the "load" pragma that I developedand put on CPAN a few years back.

So what does "load" do? Well, it basically (optionally) delaysloading of subroutines until they're actually called (useful insidecronjobs). *Or* it loads all subroutines of a module at compile time(useful in an Apache mod_perl environment). And all of that withminimal changes to the source code.

So what changes would you need to make to the source code? Well,basically there are 2 changes you need to do:


1. add "use load;" near the top of the file

2. put all subroutines that need to be loaded "on demand" after an__END__ marker

What the "use load" does, is that it scans the source file from whichit is being called between the first __END__ and the second __END__or __DATA__ or end of file. In a mod_perl environment it willimmediately "eval" the subroutines. If not in a mod_perlenvironment, it will create a directory of file offsets / lengths ofthe subroutines it finds, which can be used later by an AUTOLOADhandler to read from the source file and "eval" the subroutine whenit is actually being called.

So what are the disadvantages to this approach? Well, the mostnoticeable one is the lack of support of file lexicals. Since eachsubroutine is potentially compiled seperately, it cannot see filelexicals (since they can only be "seen" during compilation).

The second disadvantage to this approach is the fact that it inmod_perl, it is an all or nothing approach: either all subroutinesare loaded on demand, or all are loaded at compile time. I want toachieve a greater granularity. For any given module, only load theseX subroutines on the basic application server, load these Ysubroutines on the XML server, and load these Z subroutines on theadministrative servers. Where X, Y and Z may be supersets, subsetsor intersections.

I was therefore thinking about adding the following genericfunctionality to "load.pm":



1. support for related subroutines, grouped in a block

By adding support for the simple source parser for lexical blocks, itwould become possible to "share" lexicals between subroutines. Thisis in fact no different from what some of us are already doing:


{
my $foo;

sub foo { $foo }
sub bar { $foo + $foo }
}

The source parser of "load.pm" would see this as one entity: whenever"foo" or "bar" would be called, the entire block would be evalled,causing "$foo" to be accessible by both "foo" and "bar".



2. support for "roles"

The concept of a "role" would be the conceptual context in which codeis being compiled. Taking the about X, Y and Z example, a modulemight contain code that should be accessible in all 3 contexts, butalso code that should only be accessible (and compiled) in the Zcontext.

I am thinking of adding a "compile time" directive to the "load.pm"source scanner that would indicate in which "role" or "roles" thecode should be visible. Something like:


#roles: admin,app
sub foo { ... }

#roles: admin
sub bar { ... }
sub baz { ... }

A line that starts with "#roles:" would indicate the roles in whichthe following code should be visible. In the above example, thesubroutine "foo" would only be visible in the "app" and "admin" roles.

In a mod_perl environment, it would compile only those subroutines ofthat role. Outside of a mod_perl environment, the AUTOLOAD handlerwould refuse to load any subroutines of which the role doesn't match.

The actual role for which code should be compiled, is set with anenvironment variable, e.g. $ENV{ROLE}. Whenever code is encounteredthat is not supposed to be available for that ROLE, measures will betaken so that that code is not compiled (and execution errors willensue if you still try to do that).



3. preventing typo's in roles

To prevent typo's in role specifications, the first line with #roles:should contain all possible roles that any subroutine in this filecould live in. This would also serve as a visual cue as to whichroles are supported for the developer. So for the above X, Y and Zexample, we'd probably have a line with:


 #roles: admin,app,xml

near the top of the file.


4. making other code conditional on role

A constant would be exported (e.g. by default ROLE) that would allowyou to actually make conditions on the role:


  if ( ROLE eq 'app' ) {  # optimised away if ROLE ne 'app'
    print STDERR "compiled for 'app' role\n";
  }


5. propagate strict and warnings

Currently, any "use strict" and "use warnings" are not propagated tocode actually being evalled. Within the constraints of the simplesource code parser, it will try to remember the last setting of"use/no strict" and "use/no warnings" seen. Alternately, theimport() routine of load.pm will allow specification of pragma's tobe prefixed to each piece of code being evalled. Something like:


  use load(
    use => 'warnings',
    use => 'strict',
  );

Perhaps that should even be the default setting.

I'm looking into this to scratch the itch of a client. It wassuggested to me to take this to a little bigger audience to seewhether maybe such a beast already exists. Or if it doesn't, to findout whether there would be any suggestions, remarks or other feedbackthat could be of interest. ;-)

Liz

Loading subroutines ad-hoc: extension to load.pm

Reply via email to