Re: [Format] [Rust] ChunkedArray, Column and Table

2019-01-28 Thread Sebastien Binet
On Sun, Jan 27, 2019 at 1:08 PM Neville Dipale wrote: > Hi Antoine, > > I've given your response some thought. > > I'm thinking more looking at the computational aspect of Arrow. I agree > that for representing and sharing data, RecordBatches achieve the purpose. > > I came across ChunkedArray,

Re: [Format] [Rust] ChunkedArray, Column and Table

2019-01-27 Thread Wes McKinney
Just to add my two cents: The Arrow specification and Flatbuffers files defines a _binary protocol_ for making data available at the contiguous record batch level either in-process or via some other address space (a memory mapped file, a socket payload / RPC message). Chunked arrays and tables

Re: [Format] [Rust] ChunkedArray, Column and Table

2019-01-27 Thread Antoine Pitrou
Hi Neville, Le 27/01/2019 à 13:07, Neville Dipale a écrit : > Hi Antoine, > > I've given your response some thought. > > I'm thinking more looking at the computational aspect of Arrow. I agree > that for representing and sharing data, RecordBatches achieve the purpose. > > I came across

Re: [Format] [Rust] ChunkedArray, Column and Table

2019-01-25 Thread Antoine Pitrou
Hello Neville, I don't know if Tables need standardizing. Record Batches are part of the spec (*), and they are the basic block for exchanging and sharing tabular data. Depending on your application, you might exchange a stream of Record Batches, or a fixed-length sequence thereof (in which

[Format] [Rust] ChunkedArray, Column and Table

2019-01-25 Thread Neville Dipale
Hi Arrow developers, I've been looking at the various language impls, and although a Table isn't currently part of the spec, it seems to be implemented in CPP, Python, Go, JS (and perhaps other languages). Are there plans of standardising these and adding them to the spec? I'm asking because