Re: Snappy Compression with red-parquet Ruby Gem

2020-04-23 Thread Sutou Kouhei
Hi, Oh, we forgot to integrate saver interface with the Parquet compression option. You can use the feature by the following code with 0.17.0: -- require "parquet" table = Arrow::Table.new({"count" => [1, 2, 3]}) Arrow::FileOutputStream.open("test.parquet", false) do |output| properties =

Re: Arrow Format vs Feather v2

2020-04-23 Thread Wes McKinney
hi Dan See https://lists.apache.org/thread.html/r0be397a5f901b9dc8787a7dbcb0a34c9ed60ad07ff1e3f064d418a98%40%3Cdev.arrow.apache.org%3E. There is an experimental implementation in C++ which is being used for "Feather V2" The Arrow specification does include a "file" format -- this is exactly

Arrow Format vs Feather v2

2020-04-23 Thread Daniel Nugent
Was just reading the 0.17 release notes (congratulations to the maintainers, btw), and was wondering if there could be some clarification on the language about file formats. The notes mention that the compression support available for Feather 2 will be formalized in the Arrow format at a later

Re: Snappy Compression with red-parquet Ruby Gem

2020-04-23 Thread Wes McKinney
hi David, You don't want to pass the compression option to TableSaver.new -- compression is something that's configured in the Parquet writer. This would need to be an option on save_as_parquet, but it doesn't look like it is exposed right now

Snappy Compression with red-parquet Ruby Gem

2020-04-23 Thread David Lahn
Hi, Does anyone have any examples of how to output a Parquet file with Snappy compression using the Ruby gem? We have tested trying to set compression to “snappy” on the TableSaver, but we get the following: [compressed-output-stream][new]: NotImplemented: Streaming compression unsupported