Project Valhalla: Goals

Brian Goetz Fri, 14 Oct 2016 12:24:18 -0700

I've heard a number of people describe Valhalla recently as being "primarily about performance". While it is understandable why people might come to that conclusion -- many of the motivations for Valhalla are, in fact, rooted in performance considerations -- this characterization misses something very important. Yes, performance is an important part of the story -- but so are safety, abstraction, encapsulation, expressiveness, maintainability, and compatible library evolution.


The major goals of Valhalla are:


 - Align JVM memory layout behavior with the cost model of modern hardware;
 - Extend generics to allow abstraction over all types, including primitives, values, and even void;
 - Enable existing libraries -- especially the JDK -- to compatibly evolve to fully take advantage of these features.


Let's take these in turn.

*1. Align JVM memory layout behavior with the cost model of modern hardware.*

Java's approach of "(almost) everything is an object" was a reasonable match for the hardware and compilation technology of the mid-nineties, when the costs of a memory fetch and an arithmetic operation were of comparable magnitude. But since then, these costs have diverged by a factor of several hundred. At the same time, memory costs have come to dominate application provisioning costs. These two considerations combine to make the graph-of-small-objects data representation, which typical Java programs result in, suboptimal in both program performance and provisioning cost.

The root cause of this is a partly accidental one: object identity. Every object has an identity, but not all objects /need/ an identity -- many objects represent values, such as decimal numbers, currency amounts, cursors, or dates and times, and do not need this identity. This need to preserve identity foils many otherwise powerful optimizations.

Our solution for this is /value types/; value types are aggregates, like traditional Java classes, that renounce their identity. In return, this enables us to create data structures that are /flatter/ (because values can be inlined into objects, arrays, and other values, just as primitives are today) and /denser/ (because we don't waste space on object headers and pointers, which can increase memory usage by up to 4x), with a programming model more like objects -- supporting nominal substructure, behavior, subtyping, and encapsulation. /Codes like a class, works like an int./
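
Value types cannot be written in standard Java at the time of this message, so what follows is only a rough sketch, in today's Java, of the two ideas in the paragraph above. PointDemo, Point, and flatten are hypothetical names chosen for illustration. Point is an aggregate whose identity carries no meaning, and flatten builds by hand the kind of header-free, pointer-free layout that a VM could presumably choose on its own once identity is renounced.

```java
import java.util.Arrays;
import java.util.Objects;

class PointDemo {
    // Hypothetical value-like aggregate: immutable and compared by state.
    static final class Point {
        final int x;
        final int y;

        Point(int x, int y) {
            this.x = x;
            this.y = y;
        }

        @Override
        public boolean equals(Object o) {
            return o instanceof Point && ((Point) o).x == x && ((Point) o).y == y;
        }

        @Override
        public int hashCode() {
            return Objects.hash(x, y);
        }
    }

    // Hand-flattened layout: x and y interleaved in a single int[].
    static int[] flatten(Point[] points) {
        int[] flat = new int[points.length * 2];
        for (int i = 0; i < points.length; i++) {
            flat[2 * i] = points[i].x;
            flat[2 * i + 1] = points[i].y;
        }
        return flat;
    }

    public static void main(String[] args) {
        Point a = new Point(1, 2);
        Point b = new Point(1, 2);
        System.out.println(a == b);      // false: two distinct identities
        System.out.println(a.equals(b)); // true: same state
        System.out.println(Arrays.toString(flatten(new Point[] { a, b })));
    }
}
```

The identity that makes a == b false is exactly what a program like this never needs, yet Point[] still has to be an array of pointers to separately allocated objects.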

If you view values as being "faster objects", you could indeed view this fundamental aspect of Valhalla as being primarily about efficiency. But equally, you could view them as "programmable primitives", in which case it also becomes about better abstraction, encapsulation, readability, maintainability, and type safety.

Which is the real point -- that we need not force users to choose between abstraction/encapsulation/safety and performance. We can have both. Whether you call that "cheaper objects" or "richer primitives", the end result is the same.

*2. Extend generics to allow abstraction over all types, including primitives, values, and even void.*

Generics are currently limited to abstracting only over reference types. Sometimes this is merely a performance cost (one can always appeal to boxing), but in reality this not only increases the cost, but decreases the expressiveness, of libraries.

Methods like Arrays.fill() have to be written nine times; this is nine methods to write, nine methods to test, and nine methods for the user to wade through in the docs.
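
To make the duplication concrete, here is a small sketch in today's Java. FillDemo and its fill methods are hypothetical stand-ins, not the JDK's code: the generic method covers every reference type, but int[] still needs a hand-written twin, and so would each of the other seven primitive array types.

```java
import java.util.Arrays;

class FillDemo {
    // One generic method handles every reference type...
    static <T> void fill(T[] array, T value) {
        for (int i = 0; i < array.length; i++) {
            array[i] = value;
        }
    }

    // ...but T cannot be int today, so the same loop is repeated for int[].
    static void fill(int[] array, int value) {
        for (int i = 0; i < array.length; i++) {
            array[i] = value;
        }
    }

    public static void main(String[] args) {
        String[] names = new String[3];
        fill(names, "x");
        int[] counts = new int[3];
        fill(counts, 7);
        System.out.println(Arrays.toString(names));
        System.out.println(Arrays.toString(counts));
    }
}
```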

Real-world libraries like Streams often resort to hand-coded specializations like IntStream; not only is this an inconvenience for the writer, but it was also a significant constraint in the design of the stream library, forcing some undesirable tradeoffs. And this approach rarely provides total coverage; users complain that we have IntStream and LongStream but not CharStream. (For generic types with multiple type variables, like Map, the number of specializations starts to explode out of control.) The functional interfaces introduced in Java 8 are another consequence of the limitations of generics; because generics are boxed and erased, we had to provide a large number of hand-specialized versions (and still, not all the ones people want.) You don't need Predicate if you can simply say Function<T, boolean> and not suffer boxing or erasure.
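
As an illustration of the functional-interface point above, the sketch below puts the hand-specialized form next to the general one as each can be written in today's Java. PredicateDemo and its field names are hypothetical; Predicate, IntPredicate, and Function are the existing java.util.function interfaces. Function<T, boolean> does not compile today, so the general form has to fall back to the boxed Boolean.

```java
import java.util.function.Function;
import java.util.function.IntPredicate;
import java.util.function.Predicate;

class PredicateDemo {
    // Hand-specialized interface: test() returns a primitive boolean.
    static final Predicate<String> IS_BLANK = s -> s.trim().isEmpty();

    // General form: compiles today only with the boxed Boolean result.
    static final Function<String, Boolean> IS_BLANK_FN = s -> s.trim().isEmpty();

    // Yet another specialization, needed to avoid boxing an int argument.
    static final IntPredicate IS_EVEN = n -> n % 2 == 0;

    public static void main(String[] args) {
        System.out.println(IS_BLANK.test("   "));
        System.out.println(IS_BLANK_FN.apply("   "));
        System.out.println(IS_EVEN.test(4));
    }
}
```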

Everyone would be better off if we could write a generic class or method once -- and abstract over all the possible data types, not just reference types. (This includes not only primitives and values, but also void. Treating void uniformly is no mere "abstract all the things"; HashSet is based on HashMap, for implementation convenience, but suffers a lot of wasted space as a result; we could use a HashMap<T, void>. And the XxxConsumer functional interfaces are really just Function<T, void> -- we don't need separate abstractions here.)
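
The wasted space is easier to see in a sketch of the set-on-top-of-a-map pattern. MapBackedSet below is a hypothetical, much-reduced class written for illustration, not the JDK's HashSet source: every entry has to carry a reference to a placeholder value that conveys no information, which is the slot a HashMap<T, void> would presumably not need.

```java
import java.util.HashMap;
import java.util.Map;

class MapBackedSet<T> {
    // Placeholder stored for every key; its only job is to fill the value slot.
    private static final Object PRESENT = new Object();

    private final Map<T, Object> map = new HashMap<>();

    // Returns true if the element was not already present.
    boolean add(T element) {
        return map.put(element, PRESENT) == null;
    }

    boolean contains(T element) {
        return map.containsKey(element);
    }

    int size() {
        return map.size();
    }

    public static void main(String[] args) {
        MapBackedSet<String> set = new MapBackedSet<>();
        System.out.println(set.add("a")); // true: newly added
        System.out.println(set.add("a")); // false: already present
        System.out.println(set.size());   // 1
    }
}
```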

Being able to write things once -- rather than having an ad-hoc explosion of types and implementations (which often propagates into further explosion at the client site) -- means simpler, more expressive, more regular, more testable, more composable libraries. Without giving up performance when dealing with primitives and values, as boxing does today.

Which is to say, again, just at the data-abstraction layer instead of the data-modeling layer: we need not force users to choose between abstraction/encapsulation/safety and performance.

*3. Enable existing libraries -- especially the JDK -- to compatibly evolve to fully take advantage of these features.*

The breadth and quality of Java libraries is one of the core assets of the Java ecosystem. We don't want people to have to replace their libraries to migrate to a value-ful world; nor do we want existing libraries to be "frozen in time." (Imagine if we did lambdas and streams, but didn't do default methods, which allowed the Collection classes to evolve to take advantage of them -- Collections would have instantly looked ten years older.) There should be a straightforward path to extending existing libraries -- especially core JDK libraries -- to supporting values and enhanced generics, in a way that makes them "look built in". This may require additional linguistic tools to allow libraries to evolve while providing compatibility with older clients and subclasses that have not yet migrated.
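
The default-methods precedent mentioned above can be sketched as follows. DefaultMethodDemo and LegacyBag are hypothetical names; Iterable.forEach is the real default method added in Java 8. LegacyBag is written as pre-Java-8 code would have been, implementing only iterator(), and it still picks up forEach without being touched. Whatever linguistic tools Valhalla ends up needing are a separate matter; this only shows the kind of compatible evolution being asked for.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

class DefaultMethodDemo {
    // Written in the pre-Java-8 style: implements only iterator().
    static final class LegacyBag implements Iterable<Integer> {
        private final List<Integer> values;

        LegacyBag(Integer... values) {
            this.values = Arrays.asList(values);
        }

        @Override
        public Iterator<Integer> iterator() {
            return values.iterator();
        }
    }

    static int sum(Iterable<Integer> source) {
        int[] total = { 0 };
        // forEach is a default method on Iterable, inherited by LegacyBag as-is.
        source.forEach(v -> total[0] += v);
        return total[0];
    }

    public static void main(String[] args) {
        System.out.println(sum(new LegacyBag(1, 2, 3))); // 6
    }
}
```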

Just providing these features for new libraries is not enough; if widely used libraries can't compatibly evolve to take advantage of this new world, they are effectively consigned to a slow death. This might be OK for some classes -- deprecating OldX and replacing it with NewX -- but only when uses of the OldX types aren't strewn through lots of other libraries. That rules out starting fresh with Collections, Streams, JSR-310, and many other abstractions, without rewriting the whole JDK (and many third party libraries). So instead, we have to provide a compatible path for such libraries (and their clients) to modernize.

So, to summarize: Valhalla may be motivated by performance considerations, but a better way to view it is as enhancing abstraction, encapsulation, safety, expressiveness, and maintainability -- /without/ giving up performance.
