Hi Stuart,
On 3/06/2020 8:08 am, Stuart Marks wrote:
Hi Jim,
This was mentioned previously in this thread but not discussed very
much. I suggest you take a look at
jdk.internal.util.ArraysSupport.newLength(). Ivan Gerasimov and I worked
this over fairly closely last year, and I'm pretty sure it does what
Martin is saying, which I also think is the right thing.
The intent is that it be used for things that have growable arrays,
where the array might have a larger capacity than the logical number of
elements currently stored. Sometimes the array needs to be grown to
accommodate an immediate need (minGrowth) but it's desired to grow
larger (prefGrowth) in anticipation of future needs. If minGrowth can't
be accommodated, it throws OOME, but if prefGrowth can't be
accommodated, it might be acceptable to provide a smaller amount of growth.
(Of course, all this assumes that there is sufficient memory available
to allocate the actual array. ArraysSupport.newLength doesn't attempt to
ascertain that.)
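To make the shape of the API concrete, a typical caller looks roughly like
this (an illustrative sketch along the lines of what ArrayList's grow() does;
the field names are made up, and ArraysSupport is of course JDK-internal):

    private Object[] elementData;   // the growable backing array

    // assumes the caller only asks to grow, i.e. minCapacity > elementData.length
    private void grow(int minCapacity) {
        int oldCapacity = elementData.length;
        int newCapacity = ArraysSupport.newLength(oldCapacity,
                minCapacity - oldCapacity,   // minGrowth: what we must have right now
                oldCapacity >> 1);           // prefGrowth: prefer ~1.5x growth
        elementData = Arrays.copyOf(elementData, newCapacity);
    }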
One issue is integer wraparound (overflow). This is the primary value
that ArraysSupport.newLength provides. It would be good to centralize
these computations instead of having them be spread all over.
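As a tiny illustration of why the overflow-conscious arithmetic matters and
why it is worth having in exactly one place (the numbers below are only
examples):

    class WraparoundDemo {
        public static void main(String[] args) {
            int oldCapacity = 1_500_000_000;
            int minCapacity = 1_600_000_000;
            int doubled = oldCapacity << 1;   // true value 3_000_000_000 wraps to -1_294_967_296

            System.out.println(doubled < minCapacity);      // true: the naive comparison misfires
            System.out.println(doubled - minCapacity < 0);  // false: the true difference
                                                            // (1_400_000_000) fits in an int,
                                                            // so the subtraction is still exact
        }
    }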
Another issue is the one that MAX_ARRAY_LENGTH (also called
MAX_ARRAY_SIZE) is trying to address. This is sort-of a misnomer. It's
not the actual maximum array size (which in fact isn't known to the
library). It's actually the maximum array size that the library is
fairly confident the VM can provide, assuming that enough memory is
actually available. What the heck does that mean?
The theoretical maximum array size is Integer.MAX_VALUE, since the JLS
and JVMS don't allow anything larger. However, actual VM implementations
will refuse to allocate an array above a certain amount slightly smaller
than that, even if there is enough memory available. In practice, I
believe the values for current Hotspot are Integer.MAX_VALUE-3 or
Integer.MAX_VALUE-2, depending on whether compressed OOPS are in use.
Why is this significant? Consider the following case, where the capacity
of something is currently Integer.MAX_VALUE-100, and it's filled with
elements. The application performs some operation that requires 50
elements (minGrowth) be added. A new array could certainly be allocated
with size Integer.MAX_VALUE-50, but typical growth policies for these
kinds of containers want to increase the current capacity by 1.5x or 2x
(prefGrowth). Doing this multiplication would exceed Integer.MAX_VALUE,
so that won't work. Clearly, we need to clamp the capacity somewhere.
We don't want to clamp the capacity at Integer.MAX_VALUE, because this
allocation would fail on every JVM I'm aware of, even if enough memory
is available. So we don't do that. Instead, we clamp the preferred
growth at some fairly arbitrary value smaller than Integer.MAX_VALUE,
which is here called MAX_ARRAY_LENGTH, and increase the capacity to that
instead. This allows the container's requested allocation to proceed: it
satisfies minGrowth, but it doesn't satisfy prefGrowth. Instead, it
returns a capacity value that's reasonably likely to succeed, given an
unknown JVM implementation limit.
Recall that the container now has Integer.MAX_VALUE-50 elements and the
capacity is MAX_ARRAY_SIZE, which is currently defined somewhat
arbitrarily at Integer.MAX_VALUE-8. Suppose the application now wants to
add 43 elements. What should happen?
We could say, this exceeds MAX_ARRAY_LENGTH, so refuse the request and
throw OOME. This is reasonable and consistent in some sense, but perhaps
not in another. Suppose there is sufficient memory, and the JVM does
allow arrays of Integer.MAX_VALUE-7 to be created. Shouldn't we even try?
That's what hugeLength() does -- it returns a value that attempts an
allocation beyond the max preferential growth, and leaves it up to the
JVM to grant or refuse the request based on its own implementation limits.
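In code, the policy described above comes out roughly as follows (a sketch of
the behaviour for the purposes of this discussion, not the verbatim JDK
source):

    // the "soft limit" discussed above
    static final int MAX_ARRAY_LENGTH = Integer.MAX_VALUE - 8;

    static int newLength(int oldLength, int minGrowth, int prefGrowth) {
        // preconditions: oldLength >= 0, minGrowth > 0
        int prefLength = oldLength + Math.max(minGrowth, prefGrowth); // may overflow
        if (0 < prefLength && prefLength <= MAX_ARRAY_LENGTH) {
            return prefLength;                    // preferred growth fits under the soft limit
        }
        return hugeLength(oldLength, minGrowth);  // otherwise take the "huge" path
    }

    static int hugeLength(int oldLength, int minGrowth) {
        int minLength = oldLength + minGrowth;
        if (minLength < 0) {                      // int overflow: can't satisfy even minGrowth
            throw new OutOfMemoryError("Required array length too large");
        }
        if (minLength <= MAX_ARRAY_LENGTH) {
            return MAX_ARRAY_LENGTH;              // clamp the preferred growth at the soft limit
        }
        return minLength;                         // past the soft limit: let the VM decide
    }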
IIUC what you are saying is that MAX_ARRAY_LENGTH is treated as a soft
limit. A request for prefGrowth won't be allowed to exceed it. But if
minGrowth takes the length past it, then the code tries to do the
allocation that large anyway. If it succeeds we win, and if we get OOME
that is what we would have thrown anyway if we rejected the request as
too big.
So my misunderstanding in this was that MAX_ARRAY_LENGTH is not
attempting to define the actual VM hard limit, just a large value close
to that which is expected to always be valid (actual memory permitting).
Thanks for the detailed explanation.
David
-----
Anyway, this is all quite subtle, and maybe the comments in ArraysSupport
don't describe it adequately. The code that implements this kind of policy
has been copied to several locations around the JDK; it uses somewhat
different terminology, and it might have slightly different bugs, but each
copy is essentially trying to implement the same policy.
Several questions could be asked:
1) Is this the right policy for implementing growable arrays?
2) In cases where a class needs a growable array, can and should it be
refactored to use ArraysSupport.newLength()?
3) Does ArraysSupport.newLength() need to be modified to accommodate
needs of additional call sites?
4) We might want to consider refactoring PriorityBlockingQueue and
ArrayDeque to use ArraysSupport.newLength, in order to provide a
consistent policy for collections. Other growable-array-based
collections (Vector, ArrayList, PriorityQueue) do already.
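As a concrete instance of question 2, the ByteArrayChannel grow()/hugeCapacity()
pair quoted further down in this thread could in principle collapse to something
like the following (just a sketch of the refactoring being discussed, not a
tested patch, and it assumes jdk.zipfs can see jdk.internal.util at all):

    private void grow(int minCapacity) {
        int oldCapacity = buf.length;
        int newCapacity = ArraysSupport.newLength(oldCapacity,
                minCapacity - oldCapacity,   // minGrowth: what this write actually needs
                oldCapacity);                // prefGrowth: keep the existing doubling policy
        buf = Arrays.copyOf(buf, newCapacity);
    }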
s'marks
On 6/1/20 4:47 AM, Jim Laskey wrote:
Thanks David, will run with that.
On May 31, 2020, at 8:34 PM, David Holmes <david.hol...@oracle.com>
wrote:
On 31/05/2020 12:29 am, Jim Laskey wrote:
I'm working through https://bugs.openjdk.java.net/browse/JDK-8230744
"Several classes throw OutOfMemoryError without message".
I'm wondering why hugeCapacity in
src/jdk.zipfs/share/classes/jdk/nio/zipfs/ByteArrayChannel.java is
defined as
    /**
     * The maximum size of array to allocate.
     * Some VMs reserve some header words in an array.
     * Attempts to allocate larger arrays may result in
     * OutOfMemoryError: Requested array size exceeds VM limit
     */
    private static final int MAX_ARRAY_SIZE = Integer.MAX_VALUE - 8;
    /**
     * Increases the capacity to ensure that it can hold at least the
     * number of elements specified by the minimum capacity argument.
     *
     * @param minCapacity the desired minimum capacity
     */
    private void grow(int minCapacity) {
        // overflow-conscious code
        int oldCapacity = buf.length;
        int newCapacity = oldCapacity << 1;
        if (newCapacity - minCapacity < 0)
            newCapacity = minCapacity;
        if (newCapacity - MAX_ARRAY_SIZE > 0)
            newCapacity = hugeCapacity(minCapacity);
        buf = Arrays.copyOf(buf, newCapacity);
    }
    private static int hugeCapacity(int minCapacity) {
        if (minCapacity < 0) // overflow
            throw new OutOfMemoryError();

Not sure how we could have minCapacity < 0 at this point. It should have
been checked before the call to grow, and grow will not make it negative.

        return (minCapacity > MAX_ARRAY_SIZE) ?
            Integer.MAX_VALUE :
            MAX_ARRAY_SIZE;

That's a bug plain and simple. It should never report a size >
MAX_ARRAY_SIZE.

    }
It just seems that it's pushing the inevitable off to
Arrays.copyOf. Shouldn't it be:
    private static int hugeCapacity(int minCapacity) {
        if (minCapacity < 0 || minCapacity > MAX_ARRAY_SIZE) {
            throw new OutOfMemoryError(
                "ByteArrayChannel exceeds maximum size: " + MAX_ARRAY_SIZE);
        }
        return MAX_ARRAY_SIZE;
    }
That seems more appropriate to me - modulo the question mark over
minCapacity being negative.
Real question: is there some hidden purpose behind this kind of logic?
The basic strategy is to double the current capacity unless that will
trigger an unnecessary exception, in which case just use the
requested capacity, but again watch for the implementation limits.
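(E.g., with buf.length == 100 and minCapacity == 250, doubling gives 200,
which is still too small, so newCapacity falls back to 250; and if 250 had
exceeded MAX_ARRAY_SIZE, hugeCapacity() would decide whether to clamp or
throw.)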
Cheers,
David
-----
Cheers,
-- Jim