Re: [VOTE] Accept donation of Rust DataFusion library for Apache Arrow

2019-01-24 Thread Kouhei Sutou
+1 (binding) In "[VOTE] Accept donation of Rust DataFusion library for Apache Arrow" on Tue, 22 Jan 2019 19:05:16 -0600, Wes McKinney wrote: > Dear all, > > The developers of DataFusion, an analytical query engine written > in Rust, based on the Arrow columnar memory format, are

[jira] [Created] (ARROW-4366) [Docs] Change extension from format/README.md to format/README.rst

2019-01-24 Thread Yosuke Shiro (JIRA)
Yosuke Shiro created ARROW-4366: --- Summary: [Docs] Change extension from format/README.md to format/README.rst Key: ARROW-4366 URL: https://issues.apache.org/jira/browse/ARROW-4366 Project: Apache Arrow

[jira] [Created] (ARROW-4365) [Rust] [Parquet] Implement RecordReader

2019-01-24 Thread Renjie Liu (JIRA)
Renjie Liu created ARROW-4365: - Summary: [Rust] [Parquet] Implement RecordReader Key: ARROW-4365 URL: https://issues.apache.org/jira/browse/ARROW-4365 Project: Apache Arrow Issue Type: Bug

Re: [VOTE] Accept donation of Rust DataFusion library for Apache Arrow

2019-01-24 Thread Donald E. Foss
+1 On a side note, +1 for Rust in general. Donald E. Foss (mobile-US ET) > On Jan 23, 2019, at 6:26 AM, Neville Dipale wrote: > > Hi Andy, > > +1 : Accept contribution of DataFusion Rust library > > Thanks > >> On Wed, 23 Jan 2019 at 03:05, Wes McKinney wrote: >> >> Dear all, >> >> The

Re: [VOTE] Accept donation of Rust DataFusion library for Apache Arrow

2019-01-24 Thread Wes McKinney
hi Areg -- no one has worked on a bridge between Rust and C++ but it should definitely be possible. I figure we are destined to end up with multiple query engines in the project from lighter-weight / smaller scope to heavier-weight / larger scope. Having a first-class embeddable query engine

Re: [VOTE] Accept donation of Rust DataFusion library for Apache Arrow

2019-01-24 Thread Antoine Pitrou
Given the interest among the Rust community, +1 from me (binding). Regards Antoine. Le 23/01/2019 à 18:29, Andy Grove a écrit : > As far as I know, the majority of the PMC are not actively using Rust, so > as supporting evidence for interest in this donation from the Rust > community, here

Re: Round-trip of categorical data with Arrow and Parquet

2019-01-24 Thread Hatem Helal
Thanks Wes, Glad to hear this in your plan. I probably should have done this earlier...but here are some JIRA tickets that seem to cover this: https://issues.apache.org/jira/browse/ARROW-3772 https://issues.apache.org/jira/browse/ARROW-3325 https://issues.apache.org/jira/browse/ARROW-3769

Re: A renewed plea for help [was Re: Recruiting more maintainers for Apache Arrow]

2019-01-24 Thread Wes McKinney
hi Antoine, On Wed, Jan 23, 2019 at 4:35 AM Antoine Pitrou wrote: > > On Tue, 22 Jan 2019 16:57:42 -0600 > Wes McKinney wrote: > > > > There were 1540 patches merged into the project in 2018 (excluding the > > Parquet merge) -- that's more than 4 patches per day. Evidence > > suggests that the

Re: Round-trip of categorical data with Arrow and Parquet

2019-01-24 Thread Wes McKinney
hi Hatem, There are several issues open about this already (I'll have to dig them up), so this is something that we have desired for a long time, but have not gotten around to implementing. Since many Parquet writers use dictionary encoding, it would make most sense to have an option to return

Round-trip of categorical data with Arrow and Parquet

2019-01-24 Thread Hatem Helal
Hi everyone, I wanted to gauge interest and feasibility for adding support for natively reading an arrow::DictionaryArray from a parquet file. Currently, writing an arrow::DictionaryArray is read back as the native index type [0]. I came across a prior discussion for this problem in the

[jira] [Created] (ARROW-4364) [C++] Fix -weverything -wextra compilation errors

2019-01-24 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-4364: - Summary: [C++] Fix -weverything -wextra compilation errors Key: ARROW-4364 URL: https://issues.apache.org/jira/browse/ARROW-4364 Project: Apache

[jira] [Created] (ARROW-4362) [Java] Test OpenJDK 11 in CI

2019-01-24 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-4362: -- Summary: [Java] Test OpenJDK 11 in CI Key: ARROW-4362 URL: https://issues.apache.org/jira/browse/ARROW-4362 Project: Apache Arrow Issue Type: Improvement

[jira] [Created] (ARROW-4363) [C++] Add CMake format checks

2019-01-24 Thread Antoine Pitrou (JIRA)
Antoine Pitrou created ARROW-4363: - Summary: [C++] Add CMake format checks Key: ARROW-4363 URL: https://issues.apache.org/jira/browse/ARROW-4363 Project: Apache Arrow Issue Type: Improvement

Re: [VOTE] Accept donation of Rust DataFusion library for Apache Arrow

2019-01-24 Thread Krisztián Szűcs
+1 (binding) On Thu, Jan 24, 2019 at 11:19 AM Uwe L. Korn wrote: > +1 (binding) as the Rust community seems to support this. > > Uwe > > On Thu, Jan 24, 2019, at 7:45 AM, Melik-Adamyan, Areg wrote: > > +1 (non-binding) > > > > Is there a plan for C++ API? > > > > -Original Message- > >

[jira] [Created] (ARROW-4361) [Website] Update commiters list

2019-01-24 Thread Yosuke Shiro (JIRA)
Yosuke Shiro created ARROW-4361: --- Summary: [Website] Update commiters list Key: ARROW-4361 URL: https://issues.apache.org/jira/browse/ARROW-4361 Project: Apache Arrow Issue Type: Improvement

[jira] [Created] (ARROW-4360) [C++] Query homebrew for Thrift

2019-01-24 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-4360: -- Summary: [C++] Query homebrew for Thrift Key: ARROW-4360 URL: https://issues.apache.org/jira/browse/ARROW-4360 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-4359) Column metadata is not saved or loaded in parquet

2019-01-24 Thread Seb Fru (JIRA)
Seb Fru created ARROW-4359: -- Summary: Column metadata is not saved or loaded in parquet Key: ARROW-4359 URL: https://issues.apache.org/jira/browse/ARROW-4359 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-4358) [Gandiva][Crossbow] Trusty build broken

2019-01-24 Thread Praveen Kumar Desabandu (JIRA)
Praveen Kumar Desabandu created ARROW-4358: -- Summary: [Gandiva][Crossbow] Trusty build broken Key: ARROW-4358 URL: https://issues.apache.org/jira/browse/ARROW-4358 Project: Apache Arrow

[jira] [Created] (ARROW-4357) arrow java build broken on trusty

2019-01-24 Thread Pindikura Ravindra (JIRA)
Pindikura Ravindra created ARROW-4357: - Summary: arrow java build broken on trusty Key: ARROW-4357 URL: https://issues.apache.org/jira/browse/ARROW-4357 Project: Apache Arrow Issue Type:

Re: [VOTE] Accept donation of Rust DataFusion library for Apache Arrow

2019-01-24 Thread Uwe L. Korn
+1 (binding) as the Rust community seems to support this. Uwe On Thu, Jan 24, 2019, at 7:45 AM, Melik-Adamyan, Areg wrote: > +1 (non-binding) > > Is there a plan for C++ API? > > -Original Message- > From: Renjie Liu [mailto:liurenjie2...@gmail.com] > Sent: Wednesday, January 23, 2019

[jira] [Created] (ARROW-4356) [CI] Add integration (docker) test for turbodbc

2019-01-24 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-4356: -- Summary: [CI] Add integration (docker) test for turbodbc Key: ARROW-4356 URL: https://issues.apache.org/jira/browse/ARROW-4356 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-4355) [C++] test-util functions are no longer part of libarrow

2019-01-24 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-4355: -- Summary: [C++] test-util functions are no longer part of libarrow Key: ARROW-4355 URL: https://issues.apache.org/jira/browse/ARROW-4355 Project: Apache Arrow