vinothchandar commented on a change in pull request #4428:
URL: https://github.com/apache/hudi/pull/4428#discussion_r831043757



##########
File path: 
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/config/HoodieCompactionConfig.java
##########
@@ -224,6 +224,12 @@
       .withDocumentation("Used by 
org.apache.hudi.io.compact.strategy.DayBasedCompactionStrategy to denote the 
number of "
           + "latest partitions to compact during a compaction run.");
 
+  public static final ConfigProperty<Boolean> PRESERVE_COMMIT_METADATA = 
ConfigProperty
+      .key("hoodie.compaction.preserve.commit.metadata")
+      .defaultValue(false)

Review comment:
       Compaction should not change the original `_hoodie_commit_time` or 
`_hoodie_commit_seqno` values at all. So we should look into making that the 
default behavior as @YannByron suggested.

##########
File path: 
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/table/action/cluster/strategy/PartitionAwareClusteringPlanStrategy.java
##########
@@ -101,7 +101,7 @@ public PartitionAwareClusteringPlanStrategy(HoodieTable 
table, HoodieEngineConte
         .setInputGroups(clusteringGroups)
         .setExtraMetadata(getExtraMetadata())
         .setVersion(getPlanVersion())
-        
.setPreserveHoodieMetadata(getWriteConfig().isPreserveHoodieCommitMetadata())
+        
.setPreserveHoodieMetadata(getWriteConfig().isPreserveHoodieCommitMetadataForClustering())

Review comment:
       clustering should not change the commit time either.

##########
File path: 
hudi-client/hudi-client-common/src/main/java/org/apache/hudi/io/HoodieMergeHandle.java
##########
@@ -291,7 +293,11 @@ protected boolean writeRecord(HoodieRecord<T> 
hoodieRecord, Option<IndexedRecord
       if (indexedRecord.isPresent() && !isDelete) {
         // Convert GenericRecord to GenericRecord with hoodie commit metadata 
in schema
         IndexedRecord recordWithMetadataInSchema = 
rewriteRecord((GenericRecord) indexedRecord.get());
-        fileWriter.writeAvroWithMetadata(recordWithMetadataInSchema, 
hoodieRecord);
+        if (preserveMetadata) {

Review comment:
       let's see. `_hoodie_file_name` could technically change to the base 
file? 




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to