[jira] [Commented] (ARROW-15135) [C++][R][Python] Support reading from Apache Iceberg tables

Will Jones (Jira) Thu, 16 Dec 2021 14:28:08 -0800


    [ 
https://issues.apache.org/jira/browse/ARROW-15135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17461078#comment-17461078
 ]


Will Jones commented on ARROW-15135:
------------------------------------

I agree Weston, it should just be a dataset factory. The metadata just 
determines the list of files.

I'm not sure this is true of Iceberg, but in my experience with Delta Lake 
(which has a similar design) the metadata itself can become quite large. So we 
may want to block this work on an Avro -> Arrow reader (ARROW-1209).

> [C++][R][Python] Support reading from Apache Iceberg tables
> -----------------------------------------------------------
>
>                 Key: ARROW-15135
>                 URL: https://issues.apache.org/jira/browse/ARROW-15135
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>            Reporter: Will Jones
>            Priority: Major
>
> This is an umbrella issue for supporting the [Apache Iceberg table 
> format|https://iceberg.apache.org/].
> Dremio has a good overview of the format here: 
> https://www.dremio.com/apache-iceberg-an-architectural-look-under-the-covers/



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

[jira] [Commented] (ARROW-15135) [C++][R][Python] Support reading from Apache Iceberg tables

Reply via email to