[ https://issues.apache.org/jira/browse/ARROW-15135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17461078#comment-17461078 ]
Will Jones commented on ARROW-15135:
------------------------------------
I agree, Weston: it should just be a dataset factory, since the metadata
only determines the list of files.

I'm not sure whether this is true of Iceberg, but in my experience with
Delta Lake (which has a similar design), the metadata itself can become quite
large. Since Iceberg stores its manifests as Avro files, we may want to block
this work on an Avro -> Arrow reader (ARROW-1209).
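
To make the "metadata just determines the list of files" point concrete, here is a minimal sketch in plain Python. The `Manifest` and `Snapshot` classes and the `resolve_data_files` helper are hypothetical and heavily simplified; they are not the real Iceberg or Delta Lake metadata layout, and the add/delete replay is closer in spirit to a Delta Lake-style log than to Iceberg's manifest lists.

```python
from dataclasses import dataclass, field


@dataclass
class Manifest:
    """Hypothetical, simplified manifest: data files added and deleted."""
    added: list[str] = field(default_factory=list)
    deleted: list[str] = field(default_factory=list)


@dataclass
class Snapshot:
    """Hypothetical snapshot: an ordered list of manifests."""
    manifests: list[Manifest]


def resolve_data_files(snapshot: Snapshot) -> list[str]:
    """Replay the manifests in order and return the live data files."""
    live: dict[str, None] = {}  # a dict keeps insertion order
    for manifest in snapshot.manifests:
        for path in manifest.added:
            live[path] = None
        for path in manifest.deleted:
            live.pop(path, None)
    return list(live)


# Assumed example data: two manifests, the second one retires a.parquet.
snapshot = Snapshot(manifests=[
    Manifest(added=["data/a.parquet", "data/b.parquet"]),
    Manifest(added=["data/c.parquet"], deleted=["data/a.parquet"]),
])
files = resolve_data_files(snapshot)
print(files)  # ['data/b.parquet', 'data/c.parquet']
```

Once the list is resolved, the rest is an ordinary file-based dataset; in Python, for instance, a list of paths like this can be handed to `pyarrow.dataset.dataset(files, format="parquet")`. The expensive part is reading the metadata itself, which is where the Avro reader comes in.
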
> [C++][R][Python] Support reading from Apache Iceberg tables
> -----------------------------------------------------------
>
> Key: ARROW-15135
> URL: https://issues.apache.org/jira/browse/ARROW-15135
> Project: Apache Arrow
> Issue Type: New Feature
> Components: C++
> Reporter: Will Jones
> Priority: Major
>
> This is an umbrella issue for supporting the [Apache Iceberg table
> format|https://iceberg.apache.org/].
> Dremio has a good overview of the format here:
> https://www.dremio.com/apache-iceberg-an-architectural-look-under-the-covers/