krisnaru opened a new issue, #14318:
URL: https://github.com/apache/iceberg/issues/14318
### Feature Request / Improvement
### Overview
Enhance Apache Iceberg's RewriteTablePathSparkAction to support multiple
source-target prefix pairs through a fluent, chainable API, enabling complex
table migration scenarios with hierarchical path mappings.
### Problem Statement
The current implementation supports only a single source-target prefix
pair, which limits users to simple one-to-one path transformations. This is
insufficient for:
- Multi-cloud migrations across different storage systems
- Complex data reorganizations involving multiple path hierarchies
- Cross-environment moves that require multiple prefix mappings
- Tables whose files span multiple Hadoop clusters
### Usage
// Before: Single prefix only
.rewriteLocationPrefix(sourcePrefix, targetPrefix)
// After: Chainable multiple prefixes
.rewriteLocationPrefix("s3://old-bucket/", "s3://new-bucket/")
.rewriteLocationPrefix("hdfs://cluster/", "s3://data-lake/")
.rewriteLocationPrefix("/tmp/", "s3://staging/")
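To illustrate how chained prefix pairs might be resolved, here is a minimal, self-contained sketch. It is not the Iceberg implementation: the `PrefixRewriter` class and its `rewrite` method are hypothetical names used only for illustration, and the longest-matching-prefix rule is an assumption, since the request above does not say which pair wins when several source prefixes match the same path.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical helper, NOT part of Iceberg: collects prefix pairs and rewrites paths. */
final class PrefixRewriter {
  // Source prefix -> target prefix, kept in registration order.
  private final Map<String, String> prefixes = new LinkedHashMap<>();

  /** Registers one mapping and returns this instance so calls can be chained. */
  PrefixRewriter rewriteLocationPrefix(String sourcePrefix, String targetPrefix) {
    if (sourcePrefix == null || sourcePrefix.isEmpty()) {
      throw new IllegalArgumentException("Source prefix must not be empty");
    }
    if (targetPrefix == null) {
      throw new IllegalArgumentException("Target prefix must not be null");
    }
    prefixes.put(sourcePrefix, targetPrefix);
    return this;
  }

  /**
   * Rewrites a path using the longest matching source prefix (assumed rule).
   * Paths that match no registered prefix are returned unchanged.
   */
  String rewrite(String path) {
    String bestSource = null;
    for (String source : prefixes.keySet()) {
      if (path.startsWith(source)
          && (bestSource == null || source.length() > bestSource.length())) {
        bestSource = source;
      }
    }
    if (bestSource == null) {
      return path;
    }
    return prefixes.get(bestSource) + path.substring(bestSource.length());
  }

  public static void main(String[] args) {
    PrefixRewriter rewriter = new PrefixRewriter()
        .rewriteLocationPrefix("s3://old-bucket/", "s3://new-bucket/")
        .rewriteLocationPrefix("hdfs://cluster/", "s3://data-lake/")
        .rewriteLocationPrefix("/tmp/", "s3://staging/");

    // prints "s3://data-lake/warehouse/db/t/metadata/v1.metadata.json"
    System.out.println(
        rewriter.rewrite("hdfs://cluster/warehouse/db/t/metadata/v1.metadata.json"));
  }
}
```

Choosing the longest match makes the result independent of the order in which the pairs were chained, which matters for hierarchical mappings such as `s3://old-bucket/` and `s3://old-bucket/archive/`. A first-match-wins rule would be an equally plausible design; the actual behavior would need to be specified in the proposal.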
### Query engine
Spark
### Willingness to contribute
- [x] I can contribute this improvement/feature independently
- [x] I would be willing to contribute this improvement/feature with
guidance from the Iceberg community
- [ ] I cannot contribute this improvement/feature at this time
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]