Re: Creating column of type LIST from parquet-cpp

2017-12-08 Thread Wes McKinney
Didn't realize this question was on the Arrow mailing list instead of the Parquet mailing list! You can make things much easier on yourself by putting your data in Arrow arrays and using the parquet::arrow APIs. If you want to write the data using the lower-level Parquet column writer API, you wi

Re: Creating column of type LIST from parquet-cpp

2017-12-08 Thread Renato Marroquín Mogrovejo
Thanks Wes! So I create it this way, but I still don't know how to populate and auto element = PrimitiveNode::Make("element", Repetition::OPTIONAL, Type::INT32); auto list = GroupNode::Make("list", Repetition::REPEATED, {element}); auto my_array = GroupNode::Make("my_array", Repetition::REQUIRED,

Re: Coordinating Arrow 0.8.0 end-game

2017-12-08 Thread Wes McKinney
It looks like we are in pretty good shape to release next week. We have a handful of Python / C++ patches in flight, but I'm hopefully we'll be in position to cut the RC1 on Monday or Tuesday. On Thu, Dec 7, 2017 at 11:47 AM, Uwe L. Korn wrote: > The turbodbc side is fixed, we sadly did some brea

[jira] [Created] (ARROW-1907) [C++/Python] Feather format cannot accommodate string columns containing more than a total of 2GB of data

2017-12-08 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1907: --- Summary: [C++/Python] Feather format cannot accommodate string columns containing more than a total of 2GB of data Key: ARROW-1907 URL: https://issues.apache.org/jira/browse/ARROW-1

[jira] [Created] (ARROW-1906) [Python] Creating a pyarrow.Array with timestamp of different unit is not casted

2017-12-08 Thread Bryan Cutler (JIRA)
Bryan Cutler created ARROW-1906: --- Summary: [Python] Creating a pyarrow.Array with timestamp of different unit is not casted Key: ARROW-1906 URL: https://issues.apache.org/jira/browse/ARROW-1906 Project:

Re: Creating column of type LIST from parquet-cpp

2017-12-08 Thread Wes McKinney
You want something like this (didn't test this, but hopefully gives you the idea) to make the "my_array" node: auto element = PrimitiveNode::Make("element", Repetition::OPTIONAL, Type::INT32); auto list = GroupNode::Make("list", Repetition::REPEATED, {element}); auto my_array = GroupNode::Make("my

Creating column of type LIST from parquet-cpp

2017-12-08 Thread Renato Marroquín Mogrovejo
Hi devs, I am trying to create a parquet file that contains an array on int32 for each record. The schema I am trying to implement is as follows: required arr_schema { required int32 id; required group my_array (LIST) { repeated group list { optional int32 element; }

Re: Linking against parquet-cpp

2017-12-08 Thread Renato Marroquín Mogrovejo
Thanks Daniel! I figure it out with some help over the slack channel. I ended up using the following command. g++ -std=c++11 parquet_rw.cc -Iparquet-cpp_install/release/include/ parquet-cpp_install/release/libarrow.so.0 -lparquet But the suggested way to do it would be to just install arrow and