Re: [PHP-DEV] [RFC][Discussion] use construct (Block Scoping)

Tim Düsterhus Wed, 17 Dec 2025 11:10:56 -0800

Hi

On 2025-12-11 23:21, Rowan Tommins [IMSoP] wrote:

That sentence you quoted was specifically in the context of the initial paragraph of that section, contrasting PHP - where block scoping is expected to be used comparatively sparingly - against languages where variable declarations are a more “bread and butter” part of the development process, because formally / explicitly declaring variables is a necessity for one reason or another.
I don't think that changes anything I said in my previous reply: as soon as you declare a variable half-way through a block, there is an ambiguity about its range of visibility. Having more variable declarations makes that *more* likely to come up, not *less*, so I'm not sure why you think it "avoids" the problem.

The difference I'm seeing is that for languages where variable declarations (and block scoping) are a core part of the language, the scoping rules are “moulding” (if that word makes sense here) how code in that language is written and how folks reason about the code. This is different for a language where block scoping is added after-the-fact and remains an optional part of the language.

There's also an assumption that if PHP added block scoping, it would only rarely be used. We have no way to know, but I'm not sure that's true. I can easily imagine code styles adding a rule that all local variables be declared at an appropriate level. I can also imagine new users coming from other languages - particularly JS - adding "let" out of habit, even if seasoned PHP coders wouldn't.

From my experience, a majority of functions in modern code bases are reasonably short and single-purpose where intermediate variables are meant to live for the remainder of the function scope. And of course with additions such as the pipe operator, the number of temporaries will likely also go down further. From my own PHP code, I would guess block scoping to be useful for less than 10% of functions. For the ones where it would be useful, it would be very useful, though, since those are the functions that are on the more complex end of things.

I feel that the C99 requirements and syntax would still have more ambiguity compared to the proposed `let()` syntax in cases like this:
    {
        let $foo = bar($baz); // What is $baz referring to? Particularly if it is a by-reference out parameter.
        let $baz = 1;
    }
Probably the simplest solution is to re-use our existing definition of "constant expression". In fact, we already have variable declarations using that rule:
function foo() {
    static $a = 1; // OK
    static $b = $a; // Fatal error: Constant expression contains invalid operations
}

Morgan already correctly noted that `static` supports arbitrary expressions nowadays. I would like to add that supporting arbitrary expressions within the initializer is also something we expect from block scoping to avoid boilerplate, since if we don't store a dynamically computed value in a variable, we might as well use a constant or hardcode the value.
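
For readers of the archive, a minimal sketch of the `static` behaviour mentioned above, assuming PHP 8.3 or later (where static variable initializers may be arbitrary expressions). The function names `computeStart()` and `nextId()` are made up for illustration and do not come from the RFC:

```php
<?php
// Hypothetical helper; stands in for any dynamically computed value.
function computeStart(): int
{
    return 41;
}

function nextId(): int
{
    // PHP 8.3+: the initializer may be an arbitrary expression.
    // It is evaluated once, the first time this statement is reached.
    static $id = computeStart() + 1;

    return $id++;
}

var_dump(nextId()); // int(42)
var_dump(nextId()); // int(43)
```

On older PHP versions the same initializer is rejected at compile time, which is the behaviour the quoted `static $b = $a;` example relies on.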

As an example, is a goto jump label a statement?

    {
        let $foo = 1;
 label:
        let $bar = $foo++;
        goto label;
    }
PHP already limits where "goto" can jump to; I don't know how that's implemented, but I don't think we need to get into philosophical definitions to say "you can't jump into the middle of a declaration list".

Another, perhaps better, example that is not handled well by any C-derived language that we are aware of is block scoping in combination with `switch()`:


    switch ($var) {
        let $tmp;
    case "foo":
        let $tmp2;
        break;
    case "bar":
    case "baz":
        let $tmp2;
        let $tmp3;
        break;
    }

Which of the `$tmp`s is placed at the “start of a block”? What is the end of the block for each of them? Is it legal for `$tmp2` to be declared in two locations?

Or, we could just bite the bullet and answer the "which way does it resolve" question, as loads of other languages have already done.

Other languages have other ecosystems and other user expectations. PHP has extensive “scope introspection” functionality by means of `extract()`, `compact()`, `get_defined_vars()` and variable variables. Folks are used to being able to access arbitrary variables (it's just a Warning, not an Error to access undefined variables) and there's also constructs like `isset()` that can act on plain old local-scope variables. Adding semantics like the “temporal dead zone” from JavaScript that you suggested in the other thread would mean that we would need to have entirely new semantics and interactions with various existing language features that folks already know, adding to the complexity of the language. The RFC, as currently proposed, avoids all that by preserving all the existing semantics about “variable existence” and just adding the “backup and restore old value” semantics that are known from other languages and reasonably intuitive to understand even when not intimately familiar with block scoping.
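
A rough sketch of what “backup and restore old value” could look like when written by hand in today's PHP, together with the scope introspection features named above. This is only an emulation under my own assumptions; the function `demo()` is hypothetical, and the RFC's actual implementation and edge cases may differ:

```php
<?php
// Hand-written emulation of "backup and restore"; not the RFC's implementation.
function demo(): array
{
    $user = 'outer';

    // Back up the old value. array_key_exists() on get_defined_vars()
    // also detects variables that are set to null, unlike isset().
    $existed = array_key_exists('user', get_defined_vars());
    $backup = $existed ? $user : null;

    try {
        $user = 'inner';

        // Scope introspection sees an ordinary local variable.
        $name = 'user';
        $seen = [compact('user')['user'], $$name, isset($user)];
    } finally {
        // Restore the old value, or remove the variable again.
        if ($existed) {
            $user = $backup;
        } else {
            unset($user);
        }
    }

    return [$seen, $user];
}

var_dump(demo());
```

Inside the emulated block `compact()`, variable variables and `isset()` all behave as they do for any other local variable, which is the property the paragraph above describes as preserving the existing “variable existence” semantics.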

    let ($user = $repository->find(1)) if ($user !== null) { }
Skimming down a piece of code, I can spot where code is being run conditionally without reading the condition itself:

For me this works, because the `let()` is preparing me that “this code is doing user processing” and the `if()` is just an “implementation detail” / “means to an end” of that. By the block scoping semantics I know that when I read the closing brace, the user processing is finished. The function is a <h1>, the user processing is a <h2> and the `if()` is a <h3> if that analogy makes sense. If I just want to get an overview over the function, I only care about the <h2> headings.

Maybe it's also because I've dabbled in Perl, which has post-fix conditions, so a very similar line would have a very different meaning:

I understand that some languages have postfix conditions, but being able to place an `if()` after another control structure is not a new thing. The same would apply to:


    foreach ($users as $user) if ($user->isAdmin()) {
        echo "User is admin";
    }

which is already valid PHP.
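
As a small runnable check of that claim, assuming plain arrays in place of user objects (the data and variable names are made up for illustration): a brace-less `foreach` takes a single statement as its body, so the `if` simply becomes that statement and both forms below collect the same result.

```php
<?php
// Hypothetical data; plain arrays keep the example self-contained.
$users = [
    ['name' => 'alice', 'admin' => true],
    ['name' => 'bob', 'admin' => false],
];

// "foreach ... if" on one line: the if statement is the loop body.
$flat = [];
foreach ($users as $user) if ($user['admin']) {
    $flat[] = $user['name'];
}

// The conventional nested spelling of the same loop.
$nested = [];
foreach ($users as $user) {
    if ($user['admin']) {
        $nested[] = $user['name'];
    }
}

var_dump($flat === $nested); // bool(true)
```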

In terms of making it less of a special case, some languages have a "," operator which lets you glue any two expressions together and get the right-hand result.
In Perl, you can write this:

```
my $a = 'outer', $b = 'whatever';
if ( my $a='inner', $b eq 'whatever' ) {
    say $a; # 'inner'
}
say $a; # 'outer'
```
This gives the desired scope for $a, but the if statement is still just accepting a single expression.

The comma would leave ambiguity in cases like `if (let $repository = $container->getRepository(), $user = $repository->find(1))`. Are both $repository and $user block-scoped, or only $repository? Assignments are valid expressions in a condition. That's probably why C++ uses the `;` as a delimiter there.

JavaScript has the same operator, but apparently doesn't allow "let" in an expression, so you can write:
if ( a="inner", b=="whatever" ) { }

but can't use it to declare a local version of "a".
I haven't thought through exactly how to apply that to PHP, but it might give us an option for "both and": a concise and reusable syntax for the if use case, and a separate syntax for cases like the closure example I gave earlier: https://externals.io/message/129059#129075

Adding “inline” support for other control structures certainly is something that can be done as future scope. But we believe the “top of the block” semantics are important for block scoping to work well in PHP due to its unique semantics and 30y history.


Best regards
Tim Düsterhus

PS: With that both Seifeddine and I are going to be enjoying our end-of-the-year vacations and are expected to be back on the list next year.
