vamshipasunuru commented on issue #14200:
URL: https://github.com/apache/hudi/issues/14200#issuecomment-3493251641

   For the marker path 
   
`.hoodie/.temp/20251102234807244/2025/11/02/56d55f29-b3bf-4c94-8f2d-b5d85e4d9ab4-0_0-1-0_20251102234807244.parquet.marker.CREATE`
   
   
   It seems the rollback plan contains trimmed file path, and that's the reason 
delete don't happen. This doesn't happen always, is there any possibility 
during getRollBackRequests, path parsing can result in this? 
   
   
    `
    % hadoop fs -cat 
gs://uber-prod-ptdo9/wsygp/rawdata/db/table/.hoodie/20251102235705793.rollback.requested
   
   
   
Objavro.schema�{"type":"record","name":"HoodieRollbackPlan","namespace":"org.apache.hudi.avro.model","fields":[{"name":"instantToRollback","type":["null",{"type":"record","name":"HoodieInstantInfo","fields":[{"name":"commitTime","type":{"type":"string","avro.java.string":"String"}},{"name":"action","type":{"type":"string","avro.java.string":"String"}}]}],"doc":"Hoodie
 instant that needs to be rolled 
back","default":null},{"name":"RollbackRequests","type":["null",{"type":"array","items":{"type":"record","name":"HoodieRollbackRequest","fields":[{"name":"partitionPath","type":{"type":"string","avro.java.string":"String"}},{"name":"fileId","type":["null",{"type":"string","avro.java.string":"String"}],"default":null},{"name":"latestBaseInstant","type":["null",{"type":"string","avro.java.string":"String"}],"default":null},{"name":"filesToBeDeleted","type":{"type":"array","items":{"type":"string","avro.java.string":"String"},"default":[]},"default":[]},{"name":"logBlocksToBeDeleted","t
 
ype":["null",{"type":"map","values":"long","avro.java.string":"String"}],"default":null}]}}],"default":null},{"name":"version","type":["int","null"],"default":1}]}Zj���@b;����ҥ(b"20251102234807244
                                                                                
                                                                                
                                                    
commit2025/11/02Zj���@b;����ҥ(Nov 05, 2025 8:19:30 PM 
com.google.cloud.hadoop.fs.gcs.GhfsStorageStatistics updateMinMaxStats
   
   INFO: Detected potential high latency for operation stream_read_operations. 
latencyMs=548; previousMaxLatencyMs=0; operationCount=0; 
context=gs://uber-prod-ptdo9/wsygp/rawdata/db/table/.hoodie/20251102235705793.rollback.requested
   `
   
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to