Re: [Discuss] Array Cast Kernels Support Matrix

2019-03-04 Thread Micah Kornfield
Hi Neville, In case it helps you do some digging, most of the allowed casts in C++ can be found at [1]. * It does support Utf8 to boolean but I don't believe it supports boolean to utf8 * It looks like it does support casting List to List. * It doesn't support Struct to struct In general, I'm not
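
As an illustration only, the casts named in this message could be recorded in a small lookup table. This is a sketch, not Arrow's cast registry: the names `CAST_SUPPORT` and `cast_status` and the lowercase type labels are hypothetical, and the entries reflect only what the message states (including its stated uncertainty about boolean to utf8).

```python
# Hypothetical support table covering only the casts named in the message
# above; it does not reflect Arrow's actual C++ cast kernels.
SUPPORTED = "supported"
UNSUPPORTED = "unsupported"
UNKNOWN = "unknown"

CAST_SUPPORT = {
    ("utf8", "boolean"): SUPPORTED,
    ("boolean", "utf8"): UNSUPPORTED,   # the message is not certain about this one
    ("list", "list"): SUPPORTED,
    ("struct", "struct"): UNSUPPORTED,
}


def cast_status(from_type: str, to_type: str) -> str:
    """Return the recorded status for a cast, or UNKNOWN if it is not listed."""
    return CAST_SUPPORT.get((from_type.lower(), to_type.lower()), UNKNOWN)


if __name__ == "__main__":
    for pair in [("Utf8", "Boolean"), ("Struct", "Struct"), ("Int32", "Utf8")]:
        print(pair, "->", cast_status(*pair))
```

Casts that the message does not mention return "unknown" rather than a guess, so the table can be filled in as the C++ source at [1] is checked.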

Depending on non-released Apache projects (C++ Avro)

2019-03-04 Thread Micah Kornfield
I'm looking at incorporating Avro in Arrow C++ [1]. It seems that the Avro C++ library APIs have improved from the last release. However, it is not clear when a new release will be available (I asked on the JIRA Item for the next release [2] and received no response). I was wondering if there

[jira] [Created] (ARROW-4772) Provide new ORC adapter interface that allow user to specify row number

2019-03-04 Thread Yurui Zhou (JIRA)
Yurui Zhou created ARROW-4772: - Summary: Provide new ORC adapter interface that allow user to specify row number Key: ARROW-4772 URL: https://issues.apache.org/jira/browse/ARROW-4772 Project: Apache

[jira] [Created] (ARROW-4773) Enable copy free conversion for dictionary encoded string column

2019-03-04 Thread Yurui Zhou (JIRA)
Yurui Zhou created ARROW-4773: - Summary: Enable copy free conversion for dictionary encoded string column Key: ARROW-4773 URL: https://issues.apache.org/jira/browse/ARROW-4773 Project: Apache Arrow

[jira] [Created] (ARROW-4771) Enable copy free conversion for Composite type

2019-03-04 Thread Yurui Zhou (JIRA)
Yurui Zhou created ARROW-4771: - Summary: Enable copy free conversion for Composite type Key: ARROW-4771 URL: https://issues.apache.org/jira/browse/ARROW-4771 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-4770) Enable copy free conversion for primitive types

2019-03-04 Thread Yurui Zhou (JIRA)
Yurui Zhou created ARROW-4770: - Summary: Enable copy free conversion for primitive types Key: ARROW-4770 URL: https://issues.apache.org/jira/browse/ARROW-4770 Project: Apache Arrow Issue Type:

Re: [Discuss][C++] Hashing floating point numbers

2019-03-04 Thread Micah Kornfield
OK, to summarize my understanding of the thoughts expressed: 1. People really shouldn't be trying to do things like grouping and joining on double valued columns (but they do). 2. The consensus (but not 100% agreement): * Canonicalize NaNs and assume NaN == NaN, for group by/unique kernels
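
A minimal sketch of the NaN canonicalization idea, assuming a "unique" operation keyed on the 64-bit pattern of each double. The function names and the choice of canonical bit pattern are illustrative assumptions, not the Arrow C++ implementation, and the sketch does not address signed zeros.

```python
import math
import struct

# Assumed canonical pattern: a quiet NaN with an empty payload.
CANONICAL_NAN_BITS = 0x7FF8000000000000


def canonical_bits(value: float) -> int:
    """Return the 64-bit pattern used as the hash key for a double.

    Every NaN, whatever its sign or payload, maps to one pattern, so all
    NaNs compare equal for grouping purposes.
    """
    if math.isnan(value):
        return CANONICAL_NAN_BITS
    return struct.unpack("<Q", struct.pack("<d", value))[0]


def unique_doubles(values):
    """Return the first occurrence of each distinct value, treating NaN == NaN."""
    seen = set()
    result = []
    for value in values:
        key = canonical_bits(value)
        if key not in seen:
            seen.add(key)
            result.append(value)
    return result


if __name__ == "__main__":
    data = [1.0, float("nan"), 2.0, -float("nan"), 1.0]
    print(unique_doubles(data))
```

Keying on the bit pattern rather than on `==` is what lets NaN values fall into a single group, since IEEE 754 comparison never reports a NaN as equal to anything.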

Re: [C++] BUILD_WARNING_LEVEL=EVERYTHING?

2019-03-04 Thread Micah Kornfield
After spending a few hours trying to fix the warnings I think I've come around to your points of view :) What got me in the end is DCHECK macros aren't allowed in headers (and I think if we want to clean these up, we should DCHECK before doing casts). Cheers, Micah On Sun, Mar 3, 2019 at 8:11

[jira] [Created] (ARROW-4769) [Rust] Improve array limit function where max records > len

2019-03-04 Thread Neville Dipale (JIRA)
Neville Dipale created ARROW-4769: - Summary: [Rust] Improve array limit function where max records > len Key: ARROW-4769 URL: https://issues.apache.org/jira/browse/ARROW-4769 Project: Apache Arrow

[jira] [Created] (ARROW-4768) [C++][CI] arrow-test-array sometimes gets stuck in MinGW build

2019-03-04 Thread Kouhei Sutou (JIRA)
Kouhei Sutou created ARROW-4768: --- Summary: [C++][CI] arrow-test-array sometimes gets stuck in MinGW build Key: ARROW-4768 URL: https://issues.apache.org/jira/browse/ARROW-4768 Project: Apache Arrow

[jira] [Created] (ARROW-4767) [C#] ArrowStreamReader crashes while reading the end of a stream

2019-03-04 Thread Prashanth Govindarajan (JIRA)
Prashanth Govindarajan created ARROW-4767: - Summary: [C#] ArrowStreamReader crashes while reading the end of a stream Key: ARROW-4767 URL: https://issues.apache.org/jira/browse/ARROW-4767

[jira] [Created] (ARROW-4766) Casting empty boolean array causes segfault

2019-03-04 Thread Keith Kraus (JIRA)
Keith Kraus created ARROW-4766: -- Summary: Casting empty boolean array causes segfault Key: ARROW-4766 URL: https://issues.apache.org/jira/browse/ARROW-4766 Project: Apache Arrow Issue Type: Bug

Re: Flaky Travis CI builds on master

2019-03-04 Thread Wes McKinney
hi Francois, I updated the CI failure page https://cwiki.apache.org/confluence/display/ARROW/Continuous+Integration+failures to split the resolved issues out into a separate filter. This makes things clearer and we can now see there are 7 open issues in the filter. I also changed the language a

[jira] [Created] (ARROW-4765) [JAVA][Flight] Memory leak

2019-03-04 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-4765: - Summary: [JAVA][Flight] Memory leak Key: ARROW-4765 URL: https://issues.apache.org/jira/browse/ARROW-4765 Project: Apache Arrow Issue

[jira] [Created] (ARROW-4764) [C++/Java] conda-built libplasma_java doesn't work with system Java on Ubuntu Xenial

2019-03-04 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-4764: -- Summary: [C++/Java] conda-built libplasma_java doesn't work with system Java on Ubuntu Xenial Key: ARROW-4764 URL: https://issues.apache.org/jira/browse/ARROW-4764

[Discuss] Array Cast Kernels Support Matrix

2019-03-04 Thread Neville Dipale
Hi Arrow devs, I'm currently adding support for casting arrays in Rust, and I'm wondering what casting operations should be supported, and how. Most operations are simple, but I have a few questions below. * Struct to Struct: I am not supporting in Rust as it might not make sense/be easy to

[jira] [Created] (ARROW-4762) [C++] Support RapidJSON<1.1.0

2019-03-04 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-4762: -- Summary: [C++] Support RapidJSON<1.1.0 Key: ARROW-4762 URL: https://issues.apache.org/jira/browse/ARROW-4762 Project: Apache Arrow Issue Type: Improvement

[jira] [Created] (ARROW-4763) [C++/Python] Cannot build Gandiva in conda on OSX due to package conflicts

2019-03-04 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-4763: -- Summary: [C++/Python] Cannot build Gandiva in conda on OSX due to package conflicts Key: ARROW-4763 URL: https://issues.apache.org/jira/browse/ARROW-4763 Project: Apache

Re: [Rust] [DataFusion] Preferences on futures / threading crates?

2019-03-04 Thread Neville Dipale
I'm a fan of using Rayon. Perhaps if it's not too much work, we could compare the two On Mon, 4 Mar 2019 at 15:04, Krisztián Szűcs wrote: > On Mon, Mar 4, 2019 at 5:55 AM Andy Grove wrote: > > > I have been working on a PoC of parallel query execution and it is > working > > well, and I am now

[jira] [Created] (ARROW-4760) [C++] protobuf 3.7 defines EXPECT_OK that clashes with Arrow's macro

2019-03-04 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-4760: -- Summary: [C++] protobuf 3.7 defines EXPECT_OK that clashes with Arrow's macro Key: ARROW-4760 URL: https://issues.apache.org/jira/browse/ARROW-4760 Project: Apache Arrow

[jira] [Created] (ARROW-4761) [C++] Support zstandard<1

2019-03-04 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-4761: -- Summary: [C++] Support zstandard<1 Key: ARROW-4761 URL: https://issues.apache.org/jira/browse/ARROW-4761 Project: Apache Arrow Issue Type: Improvement

Re: Flaky Travis CI builds on master

2019-03-04 Thread Francois Saint-Jacques
Hello, I created a new label named `ci-failure`, which was retroactively applied to most issues triggering a CI failure in other PRs/master (I searched for travis-ci.org/apache/arrow and tagged them). The goal here is to track issues which generate false-positive failures in PRs and ideally

[jira] [Created] (ARROW-4759) [Rust] [DataFusion] It should be possible to share an execution context between threads

2019-03-04 Thread Andy Grove (JIRA)
Andy Grove created ARROW-4759: - Summary: [Rust] [DataFusion] It should be possible to share an execution context between threads Key: ARROW-4759 URL: https://issues.apache.org/jira/browse/ARROW-4759

Re: [Rust] [DataFusion] Preferences on futures / threading crates?

2019-03-04 Thread Krisztián Szűcs
On Mon, Mar 4, 2019 at 5:55 AM Andy Grove wrote: > I have been working on a PoC of parallel query execution and it is working > well, and I am now starting to create PRs for the various refactors > necessary for this in DataFusion. > > I haven't been following the async/await and futures/tokio

Re: [Rust] [DataFusion] Preferences on futures / threading crates?

2019-03-04 Thread paddy horan
No opposition here. P From: Andy Grove Sent: Sunday, March 3, 2019 11:55 PM To: dev@arrow.apache.org Subject: [Rust] [DataFusion] Preferences on futures / threading crates? I have been working on a PoC of parallel

Re: Parquet Shared Library Versioning

2019-03-04 Thread Hatem Helal
Thank you Wes, keeping the same version sounds good to me. On 2/27/19, 9:05 PM, "Wes McKinney" wrote: hi Hatem, Until the Parquet community begins to make C++ releases out of the new monorepo structure, I think we should continue to use the same SO version for all

[jira] [Created] (ARROW-4758) [Flight] Build fails on Mac due to missing Schema_generated.h

2019-03-04 Thread Pindikura Ravindra (JIRA)
Pindikura Ravindra created ARROW-4758: - Summary: [Flight] Build fails on Mac due to missing Schema_generated.h Key: ARROW-4758 URL: https://issues.apache.org/jira/browse/ARROW-4758 Project:

Re: Boost and manylinux CI builds

2019-03-04 Thread Ravindra Pindikura
> On Mar 4, 2019, at 4:45 AM, Wes McKinney wrote: > > hi Ravindra, > > Can we document this (just by copy-pasting what you wrote) on the wiki > or someplace for future work that may touch the manylinux package > builds? This might be a bit more discoverable than going through the > email

[jira] [Created] (ARROW-4757) Nested chunked array support

2019-03-04 Thread Philipp Moritz (JIRA)
Philipp Moritz created ARROW-4757: - Summary: Nested chunked array support Key: ARROW-4757 URL: https://issues.apache.org/jira/browse/ARROW-4757 Project: Apache Arrow Issue Type: Improvement