vmaster.cc created HUDI-7051:
--------------------------------

             Summary: Incorrect replace operation in compaction strategy filter
                 Key: HUDI-7051
                 URL: https://issues.apache.org/jira/browse/HUDI-7051
             Project: Apache Hudi
          Issue Type: Bug
          Components: compaction
            Reporter: vmaster.cc
         Attachments: image-2023-11-08-16-01-46-166.png, 
image-2023-11-08-16-02-39-291.png

There are some incorrect replace operation to sort all partition paths.
{code:java}
return allPartitionPaths.stream().map(partition -> partition.replace("/", "-"))
    .sorted(Comparator.reverseOrder()).map(partitionPath -> 
partitionPath.replace("-", "/")) {code}
the hive partition before replace is dllr_date=2023-10-10, then after will 
convert to dllr_date=2023/10/10, this is an incorrect partition.
 # org.apache.hudi.table.action.compact.strategy.DayBasedCompactionStrategy
 # 
org.apache.hudi.table.action.compact.strategy.BoundedPartitionAwareCompactionStrategy
 # 
org.apache.hudi.table.action.compact.strategy.UnBoundedPartitionAwareCompactionStrategy

!image-2023-11-08-16-02-39-291.png!



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to