Re: Update on String Templates (JEP 459)

Brian Goetz Tue, 12 Mar 2024 10:09:00 -0700

OK, so let's summarize the EG discussion so far. (As a reminder,syntax-heavy features like this are even more subject to "armchairtheorization" than most, so please, take that into account whencommenting. As a further reminder, the best thing we could do right nowis write more API code that manipulates string templates.)

Overall, I think everyone agrees that the "make string templates thestar of the show" approach is a winning direction. No one seems toobusted up at the loss of processors.

I'm going to try and focus for now on "potential problems that mightprompt further adjustment", rather than specific solutions.

There is some ambient discomfort that the "sublanguage" of a templatebecomes a dynamic property of a template, introducing new opportunitiesfor users to make mistakes with unprocessed templates. (This waspresent before as well using the RAW processor, but much lessprominent.) But, I don't think this is a significant issue, its justsomething new to get used to.

Most of the concerns have to do with the visual similarity betweenstring literals and template literals. While this is of courseintended, there are some concerns that they may be "too similar".Concerns raised include:

- In a code-generation scenario that leans on templates, sometimes wewant to use a string literal as a degenerate form of template. It maybe surprising that this doesn't "just work", and alternatives (e.g.,conversion functions, casting, etc) may have varying degrees ofdiscoverability and yuck-factor.

- Given (a) the visual similarity of string and template literals and(b) the lenient treatment of concatenation between strings andeverything else, users may well be tempted to concatenate stringliterals with template literals, and may be surprised at the outcome.

- Because template literals may be broad and wide, and theirevaluation may involve side effects, we may want to give a lexicalheads-up of "weird thing coming", rather than having template literalsbe framed more like "strings with benefits."


Have I covered the concerns raised so far?

Before we get too caught up in solutions, let's try to get on the samepage about which of these are problems that need to be solved right now.

(As a small matter of housekeeping, given that the preview train isalready rolling, we will soon have to make a decision to (a) withdrawthe current preview entirely, (b) re-preview the current design eventhough we know it will change, or (c) gain the requisite confidence in anew design in time to preview that. From my vantage point, (c) isstarting to look increasingly unlikely, and I suspect (a) is a betterchoice than (b). But I bring this up not to start a project managementdiscussions, as much as to raise awareness that there are projectmanagement constraints.)





On 3/8/2024 1:35 PM, Brian Goetz wrote:

Time to check in with where were are with String Templates. We’vegone through two rounds of preview, and have received some feedback.
As a reminder, the primary goal of gathering feedback is to learnthings about the design or implementation that we don’t already know. This could be bug reports, experience reports, code review, carefulanalysis, novel alternatives, etc. And the best feedback usuallycomes from using the feature “in anger” — trying to actually writecode with it. (“Some people would prefer a different syntax” or “somepeople would prefer we focused on string interpolation only” fallsquarely in the “things we already knew” camp.)
In the course of using this feature in the `jextract` project, we didlearn quite a few things we didn’t already know, and this wasconclusive enough that it has motivated us to adjust our approach inthis feature. Specifically, the role of processors is “outsized” tothe value they offer, and, after further exploration, we now believeit is possible to achieve the goals of the feature without an explicit“processor” abstraction at all! This is a very positive development.
First, I want to affirm that that the goals of the project have notchanged. From JEP 459:
Goals
• Simplify the writing of Java programs by making it easy to expressstrings that include values computed at run time.• Enhance the readability of expressions that mix text andexpressions, whether the text fits on a single source line (as withstring literals) or spans several source lines (as with text blocks).• Improve the security of Java programs that compose strings fromuser-provided values and pass them to other systems (e.g., buildingqueries for databases) by supporting validation and transformation ofboth the template and the values of its embedded expressions.• Retain flexibility by allowing Java libraries to define theformatting syntax used in string templates.• Simplify the use of APIs that accept strings written in non-Javalanguages (e.g., SQL, XML, and JSON).• Enable the creation of non-string values computed from literal textand embedded expressions without having to transit through anintermediate string representation.
Non-Goals
• It is not a goal to introduce syntactic sugar for Java's stringconcatenation operator (+), since that would circumvent the goal ofvalidation.• It is not a goal to deprecate or remove the StringBuilder andStringBuffer classes, which have traditionally been used for complexor programmatic string composition.
Another thing that has not changed is our view on the syntax forembedding expressions. While many people did express the opinion of“why not ‘just' do what Kotlin/Scala does”, this issue was more thanfully explored during the initial design round. (In fact, whilesyntax disagreements are often purely subjective, this one was farmore clear — the $-syntax is objectively worse, and would be doubly soif injected into an existing language where there were already stringliterals in the wild. This has all been more than adequately coveredelsewhere, so I won’t rehash it here.)
Now, let’s talk about what we do think should change: the role ofprocessors and the StringTemplate type.
Processors were envisioned as a means to abstract the transformationof templates to their final form (whether string, or something else.) However, Java already has a well established means of abstractingbehavior: methods. (In fact, a processor application can be viewedas merely a new syntax for a method call.) Our experience using thefeature highlighted the question: When converting a SQL queryexpressed as a template to the form required by the database (such asPreparedStatement), why do we need to say:
  DB.”… template …”

When we could use an ordinary Java library:

  Query q = Query.of(“…template…”)
Indeed, one of the worst things about having processors in thelanguage is that API designers are put in the difficult situation ofnot knowing whether to write a processor or an ordinary API, and oftenhave to make that choice before the consequences are fully understood. (To add to this, processors raise similar questions at the use site.)But the real criticism here is that template capture and processingare complected, when they should be separate, composable features.
This motivated us to revisit some of the reasons why processors wereso central to the initial design in the first place. And it turnedout, this choice had been influenced — perhaps overly so — by earlyimplementation experiments. (One of the background design goals wasto enable expensive operations like `String::format` to be (much)cheaper. Without digressing too deeply on performance, String::formatcan be more than an order of magnitude worse than the equivalentconcatenation operation, and this in turn sometimes motivatesdevelopers to use worse idioms for formatting. The FMT processorbrough that cost back in line with the equivalent concatenation.) These early experiments biased the design towards needing to know theprocessor at the point of template capture, but upon reexamination werealized that there are other ways to achieve the desired performancegoals without requiring processors to be known at capture time. This,in turn, enabled us to revisit a point in the design space we hadtransited through earlier, where string templates were “just a newkind of literal” and the job performed by processors could instead beperformed by ordinary APIs.
At this point, a simpler design and implementation emerged that metthe semantic, correctness, and performance goals: template literals(“Hello \{name}”) are simply the literal form of StringTemplate:
  StringTemplate st = “Hello \{name}”;
String and StringTemplate remain unrelated types. (We explored anumber of ways to interconvert them, but they caused more trouble thanthey solved.) Processing of string templates, includinginterpolation, is done by ordinary APIs that deal in StringTemplate,aided by some clever implementation tricks to ensure good performance.
For APIs where interpolation is known to be safe in the domain, suchas PrintWriter, APIs can make that choice on behalf of the domain, byproviding overloads to embody this design choice:
   void println(String) { … }
void println(StringTemplate) { … interpolate and delegate toprintln(String) …. }
The upshot is that for interpolation-safe APIs like println, we canuse a template directly without giving up any safety:
   System.out.println(“Hello \{name}”);
In this example, the string template evaluates to StringTemplate, notString (no implicit interpolation), and chooses the StringTemplateoverload of println, which in turn chooses how to process thetemplate. This stays true to the design principle that interpolationis dangerous enough that it should be an explicit choice in the code —but it allows that choice to be made by libraries when the library iscomfortable doing so.
Similarly, the FMT processor is replaced by an overload ofString::format that interprets templates with embedded formatspecifiers (e.g., “%d”):
String format(String formatString, Object… parameters) { … same astoday … }
  String format(StringTemplate template) {... equivalent of FMT ...}

And users can call this as:

  String s = String.format(“Hello %12s\{name}”);
Here, the String::format API has chosen to interpret string templatesaccording to the rules previously specified in the FMT processor (notordinary interpolation), but that choice is embedded in the librarysemantics so no further explicit choice at the use site is required. The user already chose to pass it to String::format; that’s all theprocessing selection that is needed.
Where APIs do not express a choice of what template expansion means,users continue to be free to process them explicitly before passingthem, using APIs that do (such as String::format or ordinaryinterpolation.).
The result is:
- The need for use-site "goop" (previously, the processor name; now,static or instance methods to process a template) goes away entirelywhen dealing with libraries that are already template-friendly.- Even with libraries that require use-site goop, it is no moreintrusive than before, and can be reduced over time as APIs get withthe program.- StringTemplate is just another type that APIs can support if theywant. The "DB" processor becomes an ordinary factory method thataccepts a string template or an ordinary builder API.- APIs now can have _more_ control over the timing and meaning oftemplate processing, because we are not biasing so strongly towardsearly processing.- It becomes easier to abstract over template processing (i.e.,combine or manipulate templates as templates before processing)- Interpolation remains an explicit choice, but ST-aware libraries canmake this choice on behalf of the user.- The language feature and API surface get considerably smaller, whichis good. Core JDK APIs (e.g., println, format, exceptionconstructors) get upgraded to work with string templates.
The remaining question that everyone is probably asking is: “so how dowe do interpolation.” The answer there is “ordinary library methods”. This might be a static method (String.join(StringTemplate)) or aninstance method (template.join()), shed to be painted (but please, notright now.).
This is a sketch of direction, so feel free to pose questions/commentson the direction. We’ll discuss the details as we go.

Re: Update on String Templates (JEP 459)

Reply via email to