Re: [PHP-DEV] [RFC] PHP Attributes

Dmitry Stogov Thu, 21 Apr 2016 15:43:43 -0700


On 04/22/2016 12:52 AM, Larry Garfield wrote:

On 4/21/16 4:13 PM, Dmitry Stogov wrote:
Hi,


I would like to present an RFC proposing support for native annotation.
The naming, syntax and behavior are mostly influenced by HHVM Hack,but not exactly the same.
The most interesting difference is an ability to use arbitrary PHPexpressions as attribute values.
These expressions are not evaluated, but stored as Abstract SyntaxTrees, and later may be accessed (node by node) in PHP extensions,preprocessors and PHP scripts their selves. I think this ability maybe useful for "Design By Contract", other formal verificationsystems, Aspect Oriented Programming, etc
https://wiki.php.net/rfc/attributes
Note that this approach is going to be native, in contrast todoc-comment approach that uses not well defined syntax, and even notparsed by PHP itself.
Additional ideas, endorsement and criticism are welcome.


Thanks. Dmitry.
Thanks, Dmitry! In concept I am in favor of syntax-nativeannotations, although I have some concerns with the specifics of theproposal. Thoughts in no particular order:
First, for the getAttributes() reflection method, please oh pleasedon't return array-or-false. That's horrible. Just return an emptyarray if there aren't any, as that makes getAttributes() entirely typesafe and saves all callers from a mandatory if-check. (Seehttp://www.garfieldtech.com/blog/empty-return-values for moreinformation.)

Makes sense. I may change this.

The reflection section further indicates that the type of the resultis variable, which means I cannot know in advance if I'm going to getback a scalar or an array. If we go with this free-form approach, I'dhonestly prefer to always get back an array, even for single value, sothat I can always know the type I'm dealing with. (Since I cannotenforce a given attribute to be single-value.)

I'm not sure yet. both decisions may make sense. If I expect just asingle value, I'll have to check the number of elements (or just ignorevalues above the first).

For the expression example:

<<test($a  +  $b   >  0)>>
function  foo($a,  $b)  {
}
It is not at all clear to me what scope the annotation's $a and $bexist in. Are the they same $a and $b as in the function signature?If so, what happens if I reflect the function before ever calling it?

This is just an AST. It may contain any valid PHP expression syntax, butvariable, functions and constants don't have to be valid.

How can I evaluate test?

I hope this functionality will be provided by php-ast extension.Currently, it is not a problem to reconstruct PHP source from AST andthen use regular eval().

In general, we may find a more efficient way.

Or are they inherited from the global scope at the time ofdeclaration? (That scares me a great deal.) I don't know what tomake of that at all.

AST is going to be mainly used by extension and pre-processors (like AOTand DBC), but in general, they also may be used directly in scripts.


<<test($a  +  $b   >  0)>>
function  foo($a,  $b)  {
ast_eval(RefelectionFunction(__FUNCTION__)->getAttributes()["test"]);
}

DB
In the "Attribute syntax" section, the text says the tokens are theleft and right double-angle character, as used for quotations in someEuropean languages. The rest of the text says it's two left/rightcarrot characters, as seen above the comma and period on USkeyboards. I'm assuming the former is just a typo/auto-correct bug.


yeah, computers think they are too smart :)


If I read correctly, the following two would be semantically identical:

<<One, Two>>
function foo() {}

<<One>>
<<Two>>
function foo() {}


right

Is there a reason you chose the name "attribute" rather than"annotations", which seems at least in PHP to be the more common termfor this type of declaration?


I took the name from HHVM. Personally, I don't care about naming at all.



It appears that the annotations themselves are entirely free-form.

no. they are parsed according to PHP expression syntax rules. syntaxmistakes in attributes are going to be caught at compile time.

At the risk of expanding the typing debate, this concerns me as thenall we're adding is a new way to parse undocumented, undefinedanonymous structs. How can I say what annotations mean what for myORM, or routing system, or whatever? We're back to, essentially,out-of-band documentation of big anonymous structs (aka associativearrays).
A more robust alternative would be something along the same lines thatDoctrine uses: Make annotations actual classes. To wit:
<<AThing>>
<<AnotherThing('stuff')>>
<<MoreThing(1, 2, 3)>>
function foo($a, $b) { }
Where AThing, AnotherThing, and MoreThing are defined classes, andsubject to namespaces and use statements. Then what gets returnedfrom getAttributes() is an array consisting of an instance of AThing,an instance of AnotherThing, and an instance of MoreThing. In thisexample we'd just call their constructors with the listed values andlet them do as they will. Doctrine uses named properties in theannotation that maps to properties on the object, which is even moreflexible and self-documenting although I don't know how feasible thatis without opening up the named properties can of worms globally.

This is just a next level. Attributes are just a storage of meta-data.You may use them as you like :)


function getAttributesAsObjects($r) {
    $ret = array();
    $a = $r->getAttributes();
    foreach ($a as $name => $val) {
        $ret[] = new $name(...$val);
   }
   return $ret;
}

Either way, the advantage then is that I know what annotations areavailable, and the class itself serves as documentation for what itis, what it does, and what its options are. It also helps addresscollisions if two different libraries both want to use the samekeyword; we already have a class name resolution mechanism that worksand everyone is familiar with.
One concern is that not all classes necessarily make sense as anannotation; perhaps only classes with a certain interface can beused. Actually (thinking aloud here), that would be a possiblesolution to the named property issue. To wit:
<<AThing(a => 'a', b => 'b')>>
foo() {}

class AThing implements Attribute {
  public static function attributeCreate(array $params) {
    return new static($param['a'], $param['b']);
  }
}

$r  = new ReflectionFunction('foo');
$a = $r->getAttributes();
$a is now an array of one element, an instance of AThing, created with'a' and 'b'. The specifics here are probably terrible, but thegeneral idea of using classes to define annotations is, I think, a bigstep forward for documentation and avoiding multi-library collisions.
While I know some of the things Drupal 8 is using annotations for arearguably excessive (and I would agree with that argument in somecases), as is I fear the proposed system is too free-form andrudimentary for Drupal to switch to them.

We can't use classes and object in storage directly, because they may bedefined in different PHP script, may be changed between requests, etc.but as I showed, it's very easy to construct corresponding objects onrequest.


Thanks for deep review and good suggestions.

Dmitry.

--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: http://www.php.net/unsub.php

Re: [PHP-DEV] [RFC] PHP Attributes

Reply via email to