[
https://issues.apache.org/jira/browse/ARROW-989?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16088792#comment-16088792
]
Wes McKinney commented on ARROW-989:
------------------------------------
I think what I'm going to do is implement a stream reader for table that
handles different chunking across columns, producing a sequence of clean record
batches.
> [Python] Write pyarrow.Table to FileWriter or StreamWriter
> ----------------------------------------------------------
>
> Key: ARROW-989
> URL: https://issues.apache.org/jira/browse/ARROW-989
> Project: Apache Arrow
> Issue Type: New Feature
> Components: Python
> Reporter: Wes McKinney
> Fix For: 0.5.0
>
>
> As part of this, we need to be able to get an iterator of record batches from
> a table. We may want to write this iteration logic in C++ as it will be
> generally useful. The chunking between columns may be different, so there is
> some amount of complexity there
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)