Re: [PHP-DEV] NULL Coercion Consistency

Rowan Tommins Mon, 25 Apr 2022 14:07:25 -0700

On 25/04/2022 10:33, Craig Francis wrote:

The fact that internal functions have parameter parsing behaviour that is 
almost impossible to implement in userland, and often not even consistent 
between functions, is a wart of engine internals, not a design decision.

Bit of a tangent, but do you have some examples? would be nice to clean some of 
these up, or at least have them in mind as we discuss this RFC.

Fundamentally, the internal parameter handling system (ZPP) iscompletely separate from the way function signatures work in userland,and evolved based on a different set of requirements. The emphasis ofZPP is on unwrapping zval structs to values that can be manipulateddirectly in C; so, for instance, it has always had support for integerparameters. Since 7.0, userland signatures have evolved an essentiallyparallel set of features with an emphasis on designing a consistent anduseful dynamic typing system.

Increasingly, ZPP is being aligned with the userland language, whichalso allows reflection information to be generated based on PHP stubs.For instance:

* Making rejected parameters throw TypeError rather than raise a Warningand return null* Giving optional parameters an explicit default in the signature ratherthan inspecting the argument count

* Using union types, rather than ad hoc if/switch on zval type

The currently proposed change to how internal functions handle nulls in9.0 is just another part of that process - the userland behaviour iswell-established, and we're making the ZPP behaviour match.

Off the top of my head, I don't know what other inconsistencies remain,but my point was that in every case so far, internal functions have beenadapted to match userland, not vice versa.

So I'll spend 1 more... I think it's fair to say that developers using 
`strict_types=1` are more likely to be using static analysis; and if 
`strict_types=1` is going to eventually disappear, those developers won't lose 
any functionality with the stricter checking being done by static analysis, 
which can check all possible variable types (more reliable than runtime), and 
(with the appropriate level of strictness) static analysis can do things like 
rejecting the string '5' being passed to an integer parameter and null being 
passed to a non-nullable parameter.

There's an unhelpful implication here, and in your discussion oftesting, that PHP users can be divided into two camps: those who checkprogram correctness with static analysis tools, unit tests, etc; andthose who don't care about program correctness.

Instead, how about we think about those who are writing new code andwant PHP to tell them early when they do something silly; and those whoare maintaining large code bases and have to deal with compatibilityproblems. Neither of these groups is helped enough by static analysers -as you've rightly pointed out elsewhere, static checks are *not*reliable in a dynamic language, and are not likely to be built-in anytime soon.

I'm by no means the strongest advocate of strictness in PHP - I thinkthere is a risk of throwing out good features with the bad. But I wouldlove to see strict_types=1 become unnecessary - not because "everyone'srunning static analysers anyway, so who cares" but because the defaultbehaviour provides a good balance of safety and usability.

That makes me very hesitant to use the strict_types modes as a crutchfor this compatibility break - it only puts off the question of what wethink the sensible behaviour actually is.

Thank you; and you're right, if you write new code today, you could do that, 
but that assumes you don't need to tell the difference between an empty value 
vs a missing value

As I've said multiple times now, as soon as you pass it to a functionthat doesn't have specific handling for nulls, you lose that distinctionanyway. There is literally zero difference in behaviour between "$foo =htmlspecialchars($_GET['foo'] ?? null)" and "$foo =htmlspecialchars($_GET['foo'] ?? '')".

Telling users when they've passed null to a non-nullable parameter isprecisely about *preserving* that distinction: if you want null to meansomething specific, treating it as a string is a bug.

But, updating existing code, while that would make automated updates easier, it's likely 
to cause problems, because you're editing the value source, with no idea about checks 
later on (like your example which looks for NULL)... and that's why an automated update 
of existing code would have more luck updating the sinks rather than the sources (e.g. it 
knows which sinks are expecting a string, so it can add in a `strval($var)`, or `(string) 
$var`, or `$var ?? ""`).

That's a fair point, although "sinks" are often themselves the next"source", which is what makes static analysis possible as often as it is.

Despite all of the above, I am honestly torn on this issue. It is adisruptive change, and I'm not a fan of errors for errors' sake; but Ican see the value in the decision made back in 7.0 to exclude nulls bydefault.



Regards,

--
Rowan Tommins
[IMSoP]

--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: https://www.php.net/unsub.php

Re: [PHP-DEV] NULL Coercion Consistency

Reply via email to