rdettai opened a new pull request #1010:
URL: https://github.com/apache/arrow-datafusion/pull/1010
# Which issue does this PR close?
Closes #1009.
# Rationale for this change
Currently, the `TableProvider` implementations are split by file format
(Parquet, CSV...). An alternative would be to organize `TableProvider`s
by table format (file system listing, Iceberg,
[Delta](https://github.com/delta-io/delta-rs/blob/main/rust/src/delta_datafusion.rs)).
**This is discussed in this [design
document](https://docs.google.com/document/d/1Bd4-PLLH-pHj0BquMDsJ6cVr_awnxTuvwNJuWsTHxAQ/edit?usp=sharing).**
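To make the distinction concrete, here is a minimal sketch of the split between "which files make up the table" (table format) and "how one file is read" (file format). Every name in it (`FileFormat`, `ListingTable`, `CsvFormat`, `ParquetFormat`, the simplified `TableProvider` trait and the `Schema` alias) is a hypothetical stand-in for illustration and not the actual DataFusion API or the code in this PR; schema inference is faked with in-memory values.

```rust
/// Hypothetical stand-in for a schema: just the column names.
type Schema = Vec<String>;

/// File-format concern: how to interpret a single file.
trait FileFormat {
    /// File extension handled by this format (assumed naming convention).
    fn extension(&self) -> &str;
    /// Pretend to read `path` and return its schema.
    fn infer_schema(&self, path: &str) -> Schema;
}

/// Illustration only: the "file content" is a header string held in memory.
struct CsvFormat {
    header: String,
}

impl FileFormat for CsvFormat {
    fn extension(&self) -> &str {
        ".csv"
    }
    fn infer_schema(&self, _path: &str) -> Schema {
        self.header.split(',').map(|c| c.trim().to_string()).collect()
    }
}

/// Illustration only: the column names are held in memory.
struct ParquetFormat {
    columns: Vec<String>,
}

impl FileFormat for ParquetFormat {
    fn extension(&self) -> &str {
        ".parquet"
    }
    fn infer_schema(&self, _path: &str) -> Schema {
        self.columns.clone()
    }
}

/// Table-format concern: which files belong to the table and what it looks like.
trait TableProvider {
    fn files(&self) -> Vec<String>;
    fn schema(&self) -> Schema;
}

/// A table defined by a file system listing; works with any `FileFormat`.
struct ListingTable {
    paths: Vec<String>,
    format: Box<dyn FileFormat>,
}

impl ListingTable {
    /// Keep only the listed paths that match the format's extension.
    fn new(listing: Vec<String>, format: Box<dyn FileFormat>) -> Self {
        let paths: Vec<String> = listing
            .into_iter()
            .filter(|p| p.ends_with(format.extension()))
            .collect();
        ListingTable { paths, format }
    }
}

impl TableProvider for ListingTable {
    fn files(&self) -> Vec<String> {
        self.paths.clone()
    }
    /// Infer the schema from the first file; an empty table has an empty schema.
    fn schema(&self) -> Schema {
        match self.paths.first() {
            Some(path) => self.format.infer_schema(path),
            None => Vec::new(),
        }
    }
}
```

In this sketch, supporting another table format (for example an Iceberg- or Delta-style provider) would mean adding another `TableProvider` implementation that reuses the same `FileFormat` objects, rather than one provider per file format.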
# What changes are included in this PR?
- this change will delete the file-format-specific implementations of
`TableProvider`
- this change will remove the `TableDescriptor` abstraction
- this change will replace the `GetFileMetadata` Ballista RPC endpoint with
`GetSchema`
# Are there any user-facing changes?
The current implementations of `TableProvider` will be replaced, but this
change will be partly hidden behind methods such as `ExecutionContext.read_parquet()`
or `ExecutionContext.read_sql()`.