krisnaru opened a new pull request, #14355:
URL: https://github.com/apache/iceberg/pull/14355
### Overview
Enhanced Apache Iceberg's RewriteTablePathSparkAction to support multiple
source-target prefix pairs through a fluent chaining API, enabling complex
table migration scenarios with hierarchical path mappings.
### Problem Statement
The original implementation only supported a single source-target prefix
pair, limiting users to simple one-to-one path transformations. This was
insufficient for:
- Multi-cloud migrations across different storage systems
- Complex data reorganization with multiple path hierarchies
- Cross-environment moves requiring multiple prefix mappings
- Tables whose files span multiple Hadoop clusters
### Usage
// Before: Single prefix only
.rewriteLocationPrefix(sourcePrefix, targetPrefix)
// After: Chainable multiple prefixes
.rewriteLocationPrefix("s3://old-bucket/", "s3://new-bucket/")
.rewriteLocationPrefix("hdfs://cluster/", "s3://data-lake/")
.rewriteLocationPrefix("/tmp/", "s3://staging/")
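The description does not say how a path is resolved when several registered prefixes could match it. As a rough illustration only, here is a minimal standalone sketch of a chainable prefix mapper. `PrefixRewriter`, `rewrite`, and `PrefixRewriterDemo` are hypothetical names and are not part of Iceberg or of this PR's implementation. The sketch assumes that the longest matching source prefix wins and that paths matching no prefix are left unchanged; the actual action may behave differently.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, NOT Iceberg code: illustrates chained prefix mappings.
final class PrefixRewriter {
  // Registered source -> target prefixes, kept in insertion order.
  private final Map<String, String> prefixes = new LinkedHashMap<>();

  // Registers one source/target pair and returns this so calls can be chained.
  PrefixRewriter rewriteLocationPrefix(String sourcePrefix, String targetPrefix) {
    if (sourcePrefix == null || sourcePrefix.isEmpty()) {
      throw new IllegalArgumentException("Source prefix must not be empty");
    }
    if (targetPrefix == null) {
      throw new IllegalArgumentException("Target prefix must not be null");
    }
    prefixes.put(sourcePrefix, targetPrefix);
    return this;
  }

  // Assumption: the longest matching source prefix wins;
  // a path that matches no registered prefix is returned unchanged.
  String rewrite(String path) {
    String best = null;
    for (String source : prefixes.keySet()) {
      if (path.startsWith(source) && (best == null || source.length() > best.length())) {
        best = source;
      }
    }
    return best == null ? path : prefixes.get(best) + path.substring(best.length());
  }
}

class PrefixRewriterDemo {
  public static void main(String[] args) {
    PrefixRewriter rewriter = new PrefixRewriter()
        .rewriteLocationPrefix("s3://old-bucket/", "s3://new-bucket/")
        .rewriteLocationPrefix("hdfs://cluster/", "s3://data-lake/")
        .rewriteLocationPrefix("/tmp/", "s3://staging/");

    // Each path is mapped by whichever registered prefix it starts with.
    System.out.println(rewriter.rewrite("hdfs://cluster/db/tbl/data/part-0.parquet"));
    System.out.println(rewriter.rewrite("/tmp/db/tbl/metadata/snap-1.avro"));
  }
}
```

Longest-match resolution is one way to make nested mappings (for example a bucket and a subdirectory of that bucket) independent of registration order; first-match-wins would be an equally plausible choice and is worth stating explicitly in the action's documentation.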