loukey_j created HUDI-5373:
------------------------------

             Summary:  Different fileids are assigned to the same bucket
                 Key: HUDI-5373
                 URL: https://issues.apache.org/jira/browse/HUDI-5373
             Project: Apache Hudi
          Issue Type: Bug
            Reporter: loukey_j


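With partition = 30 and bucketNum = 11:
bucketId = 3011

With partition = 301 and bucketNum = 1:
bucketId = 3011

Both (partition, bucketNum) pairs produce the same bucketId because the two values are concatenated without a separator. As a result, different fileids are assigned to the same bucket.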

final String bucketId = partition + bucketNum;

if (incBucketIndex.contains(bucketId)) {
  location = new HoodieRecordLocation("I", bucketToFileId.get(bucketNum));
} else if (bucketToFileId.containsKey(bucketNum)) {
  location = new HoodieRecordLocation("U", bucketToFileId.get(bucketNum));
} else {
  String newFileId = BucketIdentifier.newBucketFileIdPrefix(bucketNum);
  location = new HoodieRecordLocation("I", newFileId);
  bucketToFileId.put(bucketNum, newFileId);
  incBucketIndex.add(bucketId);
}
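
The collision can be reproduced outside Hudi with a minimal sketch. The class BucketIdCollisionDemo and the methods concatenatedId and separatedId are hypothetical names used only for illustration. The "_" separator is an assumption about one possible fix, not necessarily the change that was or will be made in Hudi.

```java
import java.util.HashSet;
import java.util.Set;

class BucketIdCollisionDemo {

    // Same construction as in the report: plain concatenation, no separator.
    static String concatenatedId(String partition, int bucketNum) {
        return partition + bucketNum;
    }

    // Hypothetical alternative (not taken from the Hudi code base): insert a
    // separator. Because bucketNum is an int, the text after the last '_'
    // is always the bucket number, so the pair can be recovered unambiguously.
    static String separatedId(String partition, int bucketNum) {
        return partition + "_" + bucketNum;
    }

    public static void main(String[] args) {
        Set<String> incBucketIndex = new HashSet<>();

        // First record: partition "30", bucket 11.
        incBucketIndex.add(concatenatedId("30", 11));

        // Second record: partition "301", bucket 1. It is a different
        // (partition, bucket) pair, yet the lookup reports it as already seen.
        System.out.println(incBucketIndex.contains(concatenatedId("301", 1))); // prints true

        // With a separator the two keys are "30_11" and "301_1".
        Set<String> separated = new HashSet<>();
        separated.add(separatedId("30", 11));
        System.out.println(separated.contains(separatedId("301", 1))); // prints false
    }
}
```

Any separator works as long as the resulting key can be split back into exactly one (partition, bucketNum) pair. Keying the set on a pair object instead of a string would avoid the problem as well.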



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
