wombatu-kun opened a new pull request, #16592: URL: https://github.com/apache/iceberg/pull/16592
## Summary Refreshes the FileIO concepts page (`docs/docs/fileio.md`) so it matches the FileIO surface actually shipping today, in response to issue #7966. ## What changed - Replaces the inaccurate "Object Service Storage (including https)" entry with a six-row "Built-in implementations" table covering `HadoopFileIO`, `S3FileIO`, `GCSFileIO`, `ADLSFileIO`, `OSSFileIO`, and `EcsFileIO`, each with a plain-English summary of what it can do efficiently (single-file IO, bulk delete, prefix listing, best-effort recovery). Adds Azure / ADLS, which was missing entirely. - Adds a "Routing by URI scheme" section documenting `ResolvingFileIO`'s scheme map (`s3/s3a/s3n` to `S3FileIO`, `gs` to `GCSFileIO`, `abfs/abfss/wasb/wasbs` to `ADLSFileIO`, anything else to `HadoopFileIO`), and notes that `oss://`, `ecs://`, and bare `https://` fall through to `HadoopFileIO`. - Adds "Choosing the FileIO" with the per-catalog defaults that are actually in the code (REST to `ResolvingFileIO`; `HadoopCatalog` and `JdbcCatalog` to `HadoopFileIO`) and the `io-impl` catalog property. - Adds short "Encryption" and "Implementing a custom FileIO" sections that link out to `encryption.md` and `custom-catalog.md` rather than duplicating their content. Closes #7966 -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
