[GitHub] [spark] viirya commented on a change in pull request #26828: [SPARK-30198][Core] BytesToBytesMap does not grow internal long array as expected

GitBox Sun, 13 Sep 2020 13:57:10 -0700


viirya commented on a change in pull request #26828:
URL: https://github.com/apache/spark/pull/26828#discussion_r487576415




##########
File path: core/src/main/java/org/apache/spark/unsafe/map/BytesToBytesMap.java
##########
@@ -741,7 +741,9 @@ public boolean append(Object kbase, long koff, int klen, 
Object vbase, long voff
         longArray.set(pos * 2 + 1, keyHashcode);
         isDefined = true;
 
-        if (numKeys >= growthThreshold && longArray.size() < MAX_CAPACITY) {
+        // We use two array entries per key, so the array size is twice the 
capacity.
+        // We should compare the current capacity of the array, instead of its 
size.
+        if (numKeys >= growthThreshold && longArray.size() / 2 < MAX_CAPACITY) 
{
           try {
             growAndRehash();

Review comment:
       I think the problem I posted above, is when we reach `MAX_CAPACITY`, a 
forever-loop happens during calling lookup. The previous PR fixed it. Sounds 
like you are encountering another problem?




----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [spark] viirya commented on a change in pull request #26828: [SPARK-30198][Core] BytesToBytesMap does not grow internal long array as expected

Reply via email to