"Object Layout"

Daniel Latrémolière Thu, 29 Jan 2015 03:04:35 -0800

I just want to quickly summarize my
current findings here and gently ask for feedback in case you think
I've totally misunderstood something. Of course any comments and
additional information are highly welcome as well.

I don't know if this is useful, but here is my point of view as a developer, oriented towards the question: "Which feature solves my problem?". It probably contains some or many errors, but it is another point of view (only mine), in case it helps.

I will not strictly use the list of projects/proposals as the structure of my mail, because the content of the proposals is changing and that is not my target. I am oriented towards the final user, i.e. the developer consuming these projects, not the implementer working on each of them.

I will instead split it into three scopes, following my perceived split of work between the developer and the runtime. The problem is data, so what can the JVM/GC do with an object? I find two possibilities in this domain: move it or clone it.

If the JVM can clone an object, it can also move it, because the clone will not have the same address. The combination "clone but not move" therefore does not exist, which leaves the following three features:

---
1) JVM can clone and move objects (Project Valhalla):

Constraint: no complex constructor and no complex finalizer, because the lifecycle of the object is managed by the JVM (the JVM can clone it, so the JVM can create and destroy the object as it wants). Only a field-assignment constructor, possibly with a simple conversion of data format.
Constraint: immutable, because we don't know which clone is the right one when one is modified, and because modifying all clones simultaneously is slow, complex and parallel-unfriendly.
Constraint: non-null, because cloning a non-existing object is a non-existing problem.

Use-case "Performance": objects to clone in order to be closer to the execution silicon and to get better parallelism (registers or cache of the CPU/GPU)
- Runtime: expose features of the CPU/GPU like SIMD (mostly like a modern version of javax.vecmath).
- Developer: create custom low-level structures for CPU/GPU parallel computing.
- Java language: small tuples, like complex numbers (immutable by performance choice, like SIMD, to be close to the silicon; cloned at each pass by value).
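To make the three constraints concrete, here is a minimal sketch of a complex number written in today's Java as a final class with final fields. It is only a stand-in: the class name `Complex` and its methods are my own illustration, not a Valhalla API, and an ordinary class like this still has identity and a header, which is exactly what a value type would remove.

```java
// Hypothetical stand-in for a value type, written as an ordinary Java class.
final class Complex {
    final double re;
    final double im;

    // Field-assignment constructor only: no side effects, no finalizer.
    Complex(double re, double im) {
        this.re = re;
        this.im = im;
    }

    // Operations return a new instance instead of mutating this one.
    Complex plus(Complex other) {
        return new Complex(re + other.re, im + other.im);
    }

    Complex times(Complex other) {
        return new Complex(re * other.re - im * other.im,
                           re * other.im + im * other.re);
    }

    // Equality depends on the fields only, never on the address,
    // so any copy is as good as any other copy.
    @Override
    public boolean equals(Object o) {
        if (!(o instanceof Complex)) {
            return false;
        }
        Complex c = (Complex) o;
        return Double.compare(re, c.re) == 0 && Double.compare(im, c.im) == 0;
    }

    @Override
    public int hashCode() {
        return 31 * Double.hashCode(re) + Double.hashCode(im);
    }
}
```

Because nothing in the class depends on identity or mutation, a runtime would be free to copy such an object into registers or flatten it into an array without the program noticing.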

Use-case "Language": objects to clone in order to be closer to registers (on the stack, hence fewer allocations in the heap; simpler than escape analysis)
- Java language: multiple return values from a method (immutable because it's a result; cloned, for example, at the return of each delegate, or not even created when stack-only).
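As an illustration of the "multiple return values" case, this is how it has to be written today: a small immutable holder allocated on the heap (unless escape analysis removes it). The names `DivMod` and `Arithmetic` are hypothetical and exist only for this sketch.

```java
// Hypothetical immutable result holder: two values returned as one object.
final class DivMod {
    final int quotient;
    final int remainder;

    DivMod(int quotient, int remainder) {
        this.quotient = quotient;
        this.remainder = remainder;
    }
}

final class Arithmetic {
    // Returns both results at once. With a value type the pair could be
    // passed back in registers instead of being allocated.
    static DivMod divMod(int dividend, int divisor) {
        return new DivMod(dividend / divisor, dividend % divisor);
    }
}
```

A caller reads `Arithmetic.divMod(17, 5).quotient` and `.remainder` and never needs the holder's identity, which is why this pattern looks like a natural candidate for cloning by the JVM.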

Use-case "Efficiency": other immutable non-null objects that could benefit from reduced indirection and better cache behaviour, given by specialization of collection classes
- Database: primary key for a Map (like HashMap), a B-Tree (like MapDB) or SQL (like JPA). A primary key is immutable and non-null by choice of the developer, hence possible gains.
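A sketch of the primary-key case, assuming a composite key made of an order id and a line number (both names, and the `OrderLineStore` class, are invented for the example). Today each key is a separate heap object referenced from the map entry; a specialized collection over a value type could store the two fields directly in the entry.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical composite primary key: immutable and never null by design.
final class OrderLineKey {
    final long orderId;
    final int lineNumber;

    OrderLineKey(long orderId, int lineNumber) {
        this.orderId = orderId;
        this.lineNumber = lineNumber;
    }

    @Override
    public boolean equals(Object o) {
        if (!(o instanceof OrderLineKey)) {
            return false;
        }
        OrderLineKey k = (OrderLineKey) o;
        return orderId == k.orderId && lineNumber == k.lineNumber;
    }

    @Override
    public int hashCode() {
        return 31 * Long.hashCode(orderId) + lineNumber;
    }
}

final class OrderLineStore {
    private final Map<OrderLineKey, String> descriptions = new HashMap<>();

    void put(long orderId, int lineNumber, String description) {
        descriptions.put(new OrderLineKey(orderId, lineNumber), description);
    }

    // A fresh key object is built for each lookup; only its field values matter.
    String get(long orderId, int lineNumber) {
        return descriptions.get(new OrderLineKey(orderId, lineNumber));
    }
}
```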

---
2) JVM can move but not clone objects

This is the current state of Java objects:

Constraint: the developer needs to define the lifecycle in the object, to be triggered by the GC (constructor/finalizer), as in a current Java class.
Constraint: small object, because when the GC moves a big object, there is possibly a noticeable latency.
Constraint: usable directly only in Java code (because native code will need an indirection level to find the real address of the object, which changes after each move).

Improvement by adding custom layout for objects (Project Panama on heap / ObjectLayout):
Specific constraint: objects which are nearly identity-less, i.e. only one other object (the owner) knows their identity/has a pointer to them.
Non-constraint: applicable to all object types, contrary to Project Valhalla. Applicable to complex constructors, because a complex constructor can be inlined in the owner code where it is called. Applicable to mutable objects, because there is no cloning and hence no incoherency. Applicable to nullable objects only by adding a boolean field in the custom layout to store the existence or non-existence of the inlined object, and updating the code that tests nullability to use this boolean.
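The nullable case can be sketched by hand in plain Java. Assume a hypothetical `Marker` that used to own a private, mutable, nullable `Point`; after flattening, the two coordinates live in the owner and a boolean replaces the null test. This is a manual imitation of what a layout-aware runtime might do, not the ObjectLayout API.

```java
// Hypothetical owner with a hand-inlined, nullable, mutable sub-object.
// Before flattening it would have held: private Point position; // may be null
final class Marker {
    private boolean hasPosition; // replaces the test "position != null"
    private double positionX;    // inlined field of the former Point
    private double positionY;    // inlined field of the former Point

    void setPosition(double x, double y) {
        hasPosition = true;
        positionX = x;
        positionY = y;
    }

    // Equivalent of "position = null".
    void clearPosition() {
        hasPosition = false;
    }

    boolean hasPosition() {
        return hasPosition;
    }

    double distanceFromOrigin() {
        if (!hasPosition) {
            throw new IllegalStateException("no position set");
        }
        return Math.hypot(positionX, positionY);
    }
}
```

The flattening is only safe because no other object ever held a reference to the inner point, which is the "nearly identity-less" constraint stated above.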

Use-case "General efficiency": custom layout (inline the sub-object in the object owning it):

- Reduce memory use with fewer objects, hence fewer headers and fewer pointers.

- Improve cache performance with better locality (inlined objects are in the same cache line, so there is no reference to follow).

- Applicable to many fields containing a reference, requiring only that the referenced object be invisible to all objects except one (the owner).

For example, a private field containing an internal ArrayList (without getter/setter) can probably be replaced by the integer containing the used size and the reference to the backing array, with inlining of the few methods of ArrayList really used.
It probably needs to be driven by the developer after real profiling, to find the best ratio between efficiency and code expansion. It will probably have many more use-cases when AOT is available and precisely manageable by the developer (Jigsaw???), because most of the slow work of object-code inlining and the following optimizations can be done at AOT time, while the gains will come at run time.
Probably useful for the hottest code (JIT after this pre-optimization at AOT time) and clearly bad for the coldest code (interpreted, so avoid code expansion), but very useful for the large quantity of code in between, which will gain from AOT if complex optimizations are available. This will very probably require developer help/instructions/annotations using profiler data obtained from functional tests of the application.
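The ArrayList example can be written out by hand to show what such an inlining would produce. The class `EventLog` and its initial capacity of 4 are assumptions made for the sketch; only the three operations the owner really uses are kept.

```java
import java.util.Arrays;

// Hypothetical owner whose private ArrayList has been inlined by hand.
// Before: private final ArrayList<String> events = new ArrayList<>();
final class EventLog {
    private String[] events = new String[4]; // former backing array of the list
    private int size;                        // former size field of the list

    // Inlined equivalent of ArrayList.add(E).
    void add(String event) {
        if (size == events.length) {
            events = Arrays.copyOf(events, size * 2);
        }
        events[size++] = event;
    }

    // Inlined equivalent of ArrayList.get(int), with its bounds check.
    String get(int index) {
        if (index < 0 || index >= size) {
            throw new IndexOutOfBoundsException("index " + index + ", size " + size);
        }
        return events[index];
    }

    int size() {
        return size;
    }
}
```

The backing array is still a separate object, so the gain here is one object header and one level of indirection per owner, paid for with duplicated list code in every class that does this. That trade-off is why the choice should follow profiling.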

---

3) JVM can neither move nor clone objects (Project Panama off heap / PackedObjects)

Constraint: the developer needs to manage the full lifecycle of the object externally and needs to choose when to create or destroy it. The object is off-heap and a handle is on-heap for managing the off-heap part.
Constraint: potential fragmentation of free memory when frequently creating and removing objects that do not have the same size (paying attention to object size vs. page size is probably important).
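The closest thing available in standard Java today is a direct ByteBuffer, so here is a minimal sketch of the "on-heap handle, off-heap data" idea built on it. The class `OffHeapIntArray` is hypothetical. Note the limitation, which I state as an assumption of the sketch: `close()` only drops the handle, and the actual release of the native memory is still left to the JVM, whereas the Panama/PackedObjects proposals aim at a lifecycle really controlled by the developer.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;

// Hypothetical on-heap handle managing a block of memory outside the Java heap.
final class OffHeapIntArray implements AutoCloseable {
    private ByteBuffer data; // small on-heap handle; the contents are off-heap
    private final int length;

    OffHeapIntArray(int length) {
        this.length = length;
        // Direct buffers may reside outside the normal garbage-collected heap,
        // so the GC does not move their contents.
        this.data = ByteBuffer.allocateDirect(length * Integer.BYTES)
                              .order(ByteOrder.nativeOrder());
    }

    int length() {
        return length;
    }

    int get(int index) {
        return buffer().getInt(index * Integer.BYTES);
    }

    void set(int index, int value) {
        buffer().putInt(index * Integer.BYTES, value);
    }

    // Explicit end of life chosen by the developer. This sketch only drops the
    // handle; it does not free the native memory itself.
    @Override
    public void close() {
        data = null;
    }

    private ByteBuffer buffer() {
        if (data == null) {
            throw new IllegalStateException("already closed");
        }
        return data;
    }
}
```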

Use-case "GC Latency": big data structures that would induce GC latency when moved if stored in the heap

- All big chunks of data, like Big Data or textures in games, etc.

- A small number of objects, so that they can be managed more explicitly by the developer (without too much work).


Use-case "Native": communicate with native library
- Modern version of JNI

Only my 2 cents,
Daniel.

Re: What's the status of / relation between "JEP 169: Value Objects" / "Value Types for Java" / "Object Layout"
