RFC 163 (v3) Objects: Autoaccessors for object data structures

Perl6 RFC Librarian Fri, 29 Sep 2000 14:09:05 -0700
This and other RFCs are available on the web at
  http://dev.perl.org/rfc/

=head1 TITLE

Objects: Autoaccessors for object data structures

=head1 VERSION

  Maintainer: Nathan Wiger <[EMAIL PROTECTED]>
  Date: 25 Aug 2000
  Last Modified: 29 Sep 2000
  Mailing List: [EMAIL PROTECTED]
  Number: 163
  Version: 3
  Status: Frozen
  Original Author: James Mastros <[EMAIL PROTECTED]>
  Requires: RFC 279, RFC 337

=head1 ABSTRACT

This RFC proposes three attributes, C<:laccess>, C<:raccess>, and
C<:roacess>, which when used on a blessed object variable,
automatically construct an C<lvalue>, C<rvalue>, or readonly
C<rvalue> autoaccessor, respectively.

=head1 NOTES ON FREEZE

Pretty much everyone liked the basic idea, but the implementation needs
some work. In particular, Michael Schwern pointed out that using
attributes to declare these may not be the best method.  B<RFC 337>
proposes a use of attributes that would allow them to basically act as
specialized subs, so this concern may be solved. Regardless, it is an
issue that needs revisiting; a package-wide pragma may be the best
solution.

=head1 DESCRIPTION

=head2 Overview

Currently, to hide your data - even simply - you must create subs that
often look something like this:

   sub fullname {
       my $self = shift;
       @_ ? $self->{fullname} = $_[0]
          : $self->{fullname};
   }

This allows you to compartmentalize your data so that people can then
use it in their programs as:

   $r->fullname('Nathan Wiger');
   print $r->fullname;

For one or two data pieces, this is ok. But for lots of data, this chews
up a lot of time and code space unnecessarily.

This RFC proposes the attributes C<:laccess>, C<:raccess>, and
$C<:roaccess> which would automate this process. The tags are kept
intentionally separate so that you can manually create an C<rvalue> sub,
but use the attribute to generate an C<lvalue> sub, or vice-versa.

=head2 The :raccess attribute

We'll start with the "easy one", which simply constructs an C<rvalue>
sub. Consider this code:

   package Bob;
   sub new {
       my $class = shift;
       my $self = {};
       $self->{name} :raccess = undef;
       $self->{age}  :raccess = undef;
       return bless $self, $class;
   }

Basically, two new C<rvalue> class methods would be created
automagically:

   sub name {
       my $self = shift;
       @_ ? $self->{name} = $_[0]
          : $self->{name};
   }

   sub age {     
       my $self = shift;
       @_ ? $self->{age} = $_[0]
          : $self->{age};
   }

These two subs could now be used in any C<rvalue> context by anyone who
used your class:

   my $jim = new Bob;
   $jim->name('Joe');
   $jim->age(42);
   print $jim->age;

The benefits are obvious: It's quick and easy, especially if you just
need a simple data wrapper.

=head2 The :roaccess attribute

This works identically to the above sub, with the difference that 
modification of its value is not allowed; it is readonly. As such,
the following:

   $self->{name} :roaccess = "Nate";

Would create the equivalent of the following sub:

   sub name {
       my $self = shift;
       croak "Attempt to assign to readonly rvalue autoaccessor" if @_;
       return $self->{name};
   }

As such, you could fetch values but not set them:

   print $r->name;     # "Nate";
   $r->name('Jim');    # oops!

The second one should result in an exception being thrown.

=head2 The :laccess attribute

This tag works just like the above, only creating an C<lvalue> sub
instead. So this code:

   package JimmyHat;
   sub new {
       my $class = shift;
       my $self = {};
       $self->{size} :laccess = undef;
       return bless $self, $class;
   }

Would basically create the following C<lvalue> sub:

   sub size : lvalue {
       shift->{size};
   }   

Which could be used in any C<lvalue> context:

   my $raincoat = new JimmyHat;
   $raincoat->size = 42;
   $raincoat->size++;

Easy enough.

=head2 Specifying the name of the sub

The above cases are fine for simple situations, but often you will have
nested data structures. No fear, you can optionally specify the name of
the sub as an argument to the tag:

   package Johnson;
   sub new {
       my $class = shift;
       my $self = {};
       $self->{DATA}->{NUMERIC}->{S_size} :raccess('size') = undef;
       $self->{DATA}->{STRING}->{PL_name} :laccess('name') = undef;
       return bless $self, $class;
   }

This would basically have the effect of creating the following two subs:

   sub size {
       my $self = shift;
       @_ ? $self->{DATA}->{NUMERIC}->{S_size} = $_[0]
          : $self->{DATA}->{NUMERIC}->{S_size}
   }

   sub name : lvalue {
       shift->{DATA}->{STRING}->{PL_name};
   }

Which again, can be used in the appropriate contexts. Note this allows
you to maintain arrayref objects automatically as well:

   package Johnson;
   sub new {
       my $class = shift;
       my $self = [];
       $self->[0]->[2]->[3] :raccess('size') = undef;
       $self->[4]->[1] :laccess('name') = undef;
       return bless $self, $class;
   }

Although an arrayref is usually not the best data representation for
direct-access objects.

=head2 Combining attributes

You can, of course, combine the C<:laccess> and C<:raccess> attributes
on a given key to autogenerate accessors that work in both C<lvalue> and
C<rvalue> contexts, if you simply want to hide your data.

The C<:roaccess> can only be combined with the C<:laccess> tag. In fact,
the C<:roaccess> should probably B<not> be combinable with any other
tag, but this RFC does not require this.

=head2 Mixing autoaccessors and real subs

The final case is where these become really powerful, but also makes the
implementation a little tricky. Here, we want to create an lvalue sub
that does something productive, but an rvalue autoaccessor:

   package 3rdLeg;
   sub new {
       my $class = shift;
       my $self = {};
       $self->{State}->{leg} :raccess = undef;
       return bless $self, $class;
   }

   sub leg : lvalue { 
       my $self = shift;
       # do a whole bunch of stuff ...
       $self->{State}->{leg};
   }

This would autogenerate an C<rvalue> autoaccessor for accessing C<leg>,
but would then use the sub that you had explicitly declared as the
C<lvalue> sub for C<leg>.

=head1 IMPLEMENTATION

In order for this to work, a couple things have to be true:

   1. Autoaccessor methods have to take precedence over declared
      methods in some way.

   2. A means for duplication of method namespace has to exist.

   3. The autoaccessor tags :laccess and :raccess must dictate
      that autoaccessors be declared for *only* that context;
      this is different from the meaning of lvalue subs which 
      can be used in either.

   4. There must be a way to set attributes on individual hash keys
      in Perl 6. See RFC 279 and RFC 337.

Note that throughout the RFC it has looked like this is a simple
sub-wrapper proposal, but it's not. As James Mastros (the original
author) said:

   The autoaccessor isn't a sub, it just behaves as if it were.
   This is the central point of autoaccessors.  The idea is that
   they should be faster then a method call, because you never
   change scope.  

Bingo. So, implementationally, it seems all of these could be satisfied
by having this:

   $r->method;

Do the following in order:

   1. Look for a declared autoaccessor for the given context,
      context of course being where $r is.

   2. Look for a class method (same as currently).

A quick lookup in some type of centrally-maintained hash or something
should do the trick.

=head1 EXAMPLES

This example shows how much easier it would have been to write the
example on line 170 of perltoot.pod:

    package Person;
    use strict;
    
    ##################################################
    ## the object constructor (simplistic version)  ##
    ##################################################
    sub new {
        my $self  = {};
        $self->{name}  :laccess :raccess = undef;
        $self->{age}   :laccess :raccess = undef;
        $self->{peers} :laccess :raccess = [];
        return bless $self; 
    }

Note the lack of anything close to the next 15 lines of code; all of
that is taken care of with the simple attribute settings above.

=head1 MIGRATION

None. New functionality.

=head1 NOTES

I ripped the guts out of the original RFC in an attempt to simplify
things and get the concepts out there. However, a few things were lost
in the process:

   1. automatic dereferencing of return values
   2. readonly rvalue subs
   3. mutators

I suspect the first one should not be solved. The real idea behind
autoaccessors, as James pointed out, is speed and simplicity. As such,
requiring you to pass stuff around by reference:

   $r->autoaccessor = \@stuff;
   $r->autoaccessor(\@stuff);
   @stuff = @{$r->autoaccessor};

Is probably not such a bad idea, since references are faster anyways,
and this greatly simplifies what the autoaccessors have to do.

As for the second one, perhaps making the C<:raccess> tag a simple
accessor, unable to handle assignment (as noted below in the archives),
is really what should be done. This gives us back readonly C<rvalue>
subs, plus you can always use C<:laccess> to create an C<lvalue> sub for
assignment (which is probably cleaner anyways). I suspect this may be
the correct approach.

As for mutators, I suspect that most of the important functions should
be able to be handled by the separation and combination of the
C<:laccess> and C<:raccess> attributes, but feel free to point out if
this is completely wrong. :-)

=head1 REFERENCES

RFC 279: my() syntax extensions and attribute declarations

RFC 337: Common attribute system to allow user-defined, extensible attributes

http://www.mail-archive.com/perl6-language-objects@perl.org/msg00417.html

http://www.mail-archive.com/perl6-language-objects@perl.org/msg00418.html

http://www.mail-archive.com/perl6-language-objects@perl.org/msg00098.html

http://www.mail-archive.com/perl6-language-objects@perl.org/msg00099.html
RFC 163 (v3) Objects: Autoaccessors for object data structures

Reply via email to