[ https://issues.apache.org/jira/browse/HUDI-5373?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Danny Chen updated HUDI-5373:
-----------------------------
Fix Version/s: 0.12.2, 0.13.0
> Different fileids are assigned to the same bucket
> --------------------------------------------------
>
> Key: HUDI-5373
> URL: https://issues.apache.org/jira/browse/HUDI-5373
> Project: Apache Hudi
> Issue Type: Bug
> Reporter: loukey_j
> Assignee: loukey_j
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.12.2, 0.13.0
>
>
> partition = 30, bucketNum = 11: bucketId = 3011
> partition = 301, bucketNum = 1: bucketId = 3011
>
> Different fileids are assigned to the same bucket
>     final String bucketId = partition + bucketNum;
>     if (incBucketIndex.contains(bucketId)) {
>       location = new HoodieRecordLocation("I", bucketToFileId.get(bucketNum));
>     } else if (bucketToFileId.containsKey(bucketNum)) {
>       location = new HoodieRecordLocation("U", bucketToFileId.get(bucketNum));
>     } else {
>       String newFileId = BucketIdentifier.newBucketFileIdPrefix(bucketNum);
>       location = new HoodieRecordLocation("I", newFileId);
>       bucketToFileId.put(bucketNum, newFileId);
>       incBucketIndex.add(bucketId);
>     }
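
The collision described above can be reproduced in isolation. The following is only a minimal sketch, not the Hudi code or the actual patch: the class BucketKeyDemo and the methods naiveKey and delimitedKey are hypothetical names introduced for illustration, and the "/" separator is an assumption about one way to make the key unambiguous. Because the bucket number is an integer, the last "/" in the delimited key always marks where the partition ends, even if the partition path itself contains "/".

```java
import java.util.HashSet;
import java.util.Set;

// Hypothetical demo class; not part of Hudi.
final class BucketKeyDemo {

    // Mirrors the concatenation quoted in the report: partition + bucketNum.
    static String naiveKey(String partition, int bucketNum) {
        return partition + bucketNum;
    }

    // Assumed alternative: a separator keeps the boundary between the
    // partition and the bucket number unambiguous.
    static String delimitedKey(String partition, int bucketNum) {
        return partition + "/" + bucketNum;
    }

    public static void main(String[] args) {
        // Stand-in for incBucketIndex using the naive key.
        Set<String> naive = new HashSet<>();
        naive.add(naiveKey("30", 11));
        // add() returns false: "30" + 11 and "301" + 1 are both "3011".
        boolean naiveIsNew = naive.add(naiveKey("301", 1));

        // Same sequence with the delimited key: "30/11" vs "301/1".
        Set<String> delimited = new HashSet<>();
        delimited.add(delimitedKey("30", 11));
        boolean delimitedIsNew = delimited.add(delimitedKey("301", 1));

        System.out.println("naive key treated as new: " + naiveIsNew);
        System.out.println("delimited key treated as new: " + delimitedIsNew);
    }
}
```

Running main prints false for the naive key and true for the delimited key, which matches the report: with plain concatenation, the second (partition, bucketNum) pair is mistaken for one that was already registered.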