[jira] [Commented] (ARROW-1860) [C++] Add data structure to "stage" a sequence of IPC messages from in-memory data

ASF GitHub Bot (JIRA) Mon, 12 Feb 2018 14:55:00 -0800

    [ 
https://issues.apache.org/jira/browse/ARROW-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16361562#comment-16361562
 ]


ASF GitHub Bot commented on ARROW-1860:
---------------------------------------

wesm commented on issue #1500: ARROW-1860: [C++] Introduce ipc::PreparedMessage 
data structure to avoid making multiple passes over record batches
URL: https://github.com/apache/arrow/pull/1500#issuecomment-365091190
 
 
   So it seems that not all Python file objects support the `tell` operation, 
for example the `_io.BufferedReader` that you get calling `makefile` on a 
socket. That's the last issue here -- I think probably the best way to deal 
with this is to implement a `PyInputStream` in 
https://github.com/apache/arrow/blob/master/cpp/src/arrow/python/io.h and 
handle the stream position bookkeeping internally, if that sounds good to folks

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


> [C++] Add data structure to "stage" a sequence of IPC messages from in-memory 
> data
> ----------------------------------------------------------------------------------
>
>                 Key: ARROW-1860
>                 URL: https://issues.apache.org/jira/browse/ARROW-1860
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>            Reporter: Wes McKinney
>            Assignee: Wes McKinney
>            Priority: Major
>              Labels: pull-request-available
>             Fix For: 0.9.0
>
>         Attachments: text.html
>
>
> Currently, when you need to pre-allocate space for a record batch or a stream 
> (schema + dictionaries + record batches), you must make multiple passes over 
> the data structures of interest (and use e.g. {{MockOutputStream}} to compute 
> the size of the output buffer). It would be useful to make a single pass to 
> "prepare" the IPC payload for both sizing and writing to prevent having to 
> make multiple passes



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Commented] (ARROW-1860) [C++] Add data structure to "stage" a sequence of IPC messages from in-memory data

Reply via email to