I have come around to a similar conclusion. None of the models for
WR(value) are really all that justifiable, other than "throws". But we
dislike that because so much code uses WHM that we are worried about
this code all of a sudden failing. But we are trying to fix that by
distorting WR, rather than enhancing WHM.
WHM has its own set of constraints; it is the closest thing to a cache
in the JDK (with a very rigid eviction policy), and frequently gets used
as such. But implicit in the use of a cache is something that only the
code that uses WHM knows: are we caching because we're trying to avoid
redundant computation, or are we caching because we *cannot* recompute
the mapping? The current eviction strategy is consistent with both --
for identities. But once we get past identity objects, I can very well
imagine some clients wanting to aggressively release mappings from value
keys (to save memory), and some wanting to never release mappings from
value keys (because they need a unique target). Which suggests that
creating a WHM requires a user-specifiable policy; I can think of cases
where each of the following policies are sensible (note that they all
avoid getting into the details of invalidation):
- THROW -- throw when someone uses a value key. This would probably
be the default.
- KEEP -- keep value key mappings forever. This is useful when we are
computing, say, histograms of "how often have we seen key X".
- DISCARD -- clear value key mappings immediately.
- SOFT -- Keep value key mappings around for as long as there is not
excessive heap pressure.
For policy SOFT, we would likely implement by wrapping a SoftReference
around an *entire* side map of key -> value mappings for value keys.
Then we'd provide some new factories for WHM to select one of these
policies.
On 2/9/2022 11:50 AM, Dan Heidinga wrote:
One option is to look at what we can do to help users prepare for IAE
when using value-based classes as keys to WHM. Can we take an
approach similar to JEP 390 [1] and provide JFR events that flag uses
of value-based classes as keys? It's not perfect but similar to JEP
390, it does help users to know if they need to do something to
prepare for this.
--Dan
[1]http://openjdk.java.net/jeps/390
On Thu, Jan 20, 2022 at 1:54 PM Dan Heidinga<heidi...@redhat.com> wrote:
It certainly seems that all the choices are bad.
The "obvious" choice is to simply say that WeakReference<Value> makes no sense,
in that it sidesteps the hard semantics questions.
It's an uncomfortable answer but it seems to provide the most
defensible (and thus understandable) semantics for users. Values don't
have an explicit lifetime and thus there is no way to tell when "this"
copy of a value goes out of scope and can be collected. The object
references (if any) held by the value are not a good proxy for its
lifecycle - they can have either shorter or much longer life spans -
and will make reasoning about when a WeakReference<Value> can be
collected difficult for experts, let alone most users.
My fear is that this will cause new failures, where existing libraries that
toss objects into WHMs to cache derived results, will start to experience new
failures when the value types show up in the heap (after all,
WeakReference::new takes Object.)
This may be a case where the WeakReference constructor needs to be
updated to take an IdentityObject and the old constructor marked as
@Deprecated? Which doesn't solve the immediate problem but helps
justify adding a "fail-fast" check to all WeakReference constructors
so that they throw an IllegalArgumentException if the referent isn't
an IdentityObject.
This won't avoid failures but it does make it very clear what went
wrong rather than introducing "strange", hard to diagnose failures.
And we'll have to have something to tell those users, because they declared a
WeakHashMap<User, UserData>, and someone created a value subtype of User --
which seems entirely valid.
It is possible we could do something special with WHM, since it is layered atop
WR, but that still begs the question -- what?
Starting from the conclusion that WeakReference<Value> is a
meaningless entity, what are the options here?
1) Make it impossible to use a Value as a key in a WeakHashMap.
::put(key, value) & ::pulAll(Map m) will throw if a key is a Value
object. ::containsKey(Object) will always be false if the Object is a
ValueObject. This makes WeakHashMap unusable with Values. The
semantics are clear but all existing uses of WeakHashMap will need to
be adapted to defensively check for Values and do something (tbd) to
avoid the exceptions.
2) Use strong references for Value keys in WeakHashMap.
Treat each Value object used as a key in WeakHashMap as a strong
reference. This means Value keys will never be removed and will keep
their corresponding map value alive forever (or until explicitly
removed). While this will allow WeakHashMaps to continue to be used
as Maps for Values, it will break the contract for WHM and may
introduce memory leaks into otherwise correct programs.
3) Pick some other object to act as the reference when using a Value
key in a WHM.
This is basically the solution we rejected for WeakReference<Value>
and all the same problems apply here. It may allow existing code to
"keep working" when it first deals with Values but introduces strange
failure cases and difficult to reason about rules. It avoids
exceptions but leaves the code doing the wrong thing with no way to
tell.
Anyone see another option here?
--Dan