krisnaru opened a new pull request, #14355:
URL: https://github.com/apache/iceberg/pull/14355

   ### Overview
   Enhanced Apache Iceberg's RewriteTablePathSparkAction to support multiple 
source-target prefix pairs through a fluent chaining API, enabling complex 
table migration scenarios with hierarchical path mappings.
   
   ### Problem Statement
   The original implementation only supported a single source-target prefix 
pair, limiting users to simple one-to-one path transformations. This was 
insufficient for:
   
   Multi-cloud migrations with different storage systems
   Complex data reorganization with multiple path hierarchies
   Cross-environment moves requiring multiple prefix mapping
   
   Multiple hadoop clusters support
   Usage
   // Before: Single prefix only
   .rewriteLocationPrefix(sourcePrefix, targetPrefix)
   
   // After: Chainable multiple prefixes
   .rewriteLocationPrefix("s3://old-bucket/", "s3://new-bucket/")
   .rewriteLocationPrefix("hdfs://cluster/", "s3://data-lake/")
   .rewriteLocationPrefix("/tmp/", "s3://staging/")


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to