[jira] [Created] (ARROW-8906) [Rust] Support reading multiple CSV files for schema inference

2020-05-22 Thread QP Hou (Jira)
QP Hou created ARROW-8906: - Summary: [Rust] Support reading multiple CSV files for schema inference Key: ARROW-8906 URL: https://issues.apache.org/jira/browse/ARROW-8906 Project: Apache Arrow Issue

[jira] [Created] (ARROW-8905) [C++] Collapse Take APIs from 8 to 1 or 2

2020-05-22 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8905: --- Summary: [C++] Collapse Take APIs from 8 to 1 or 2 Key: ARROW-8905 URL: https://issues.apache.org/jira/browse/ARROW-8905 Project: Apache Arrow Issue Type: Impr

[jira] [Created] (ARROW-8904) [Python] Fix usages of deprecated C++ APIs related to child/field

2020-05-22 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8904: --- Summary: [Python] Fix usages of deprecated C++ APIs related to child/field Key: ARROW-8904 URL: https://issues.apache.org/jira/browse/ARROW-8904 Project: Apache Arrow

[jira] [Created] (ARROW-8903) [C++] Implement optimized "unsafe take" for use with selection vectors for kernel execution

2020-05-22 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8903: --- Summary: [C++] Implement optimized "unsafe take" for use with selection vectors for kernel execution Key: ARROW-8903 URL: https://issues.apache.org/jira/browse/ARROW-8903

[jira] [Created] (ARROW-8902) [rust][datafusion] optimize count(*) queries on parquet sources

2020-05-22 Thread Alex Gaynor (Jira)
Alex Gaynor created ARROW-8902: -- Summary: [rust][datafusion] optimize count(*) queries on parquet sources Key: ARROW-8902 URL: https://issues.apache.org/jira/browse/ARROW-8902 Project: Apache Arrow

[jira] [Created] (ARROW-8901) [C++] Reduce number of take kernels

2020-05-22 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8901: --- Summary: [C++] Reduce number of take kernels Key: ARROW-8901 URL: https://issues.apache.org/jira/browse/ARROW-8901 Project: Apache Arrow Issue Type: Improvemen

[jira] [Created] (ARROW-8900) Respect HTTP(S)_PROXY for S3 Filesystems and/or expose proxy options as parameters

2020-05-22 Thread Daniel Nugent (Jira)
Daniel Nugent created ARROW-8900: Summary: Respect HTTP(S)_PROXY for S3 Filesystems and/or expose proxy options as parameters Key: ARROW-8900 URL: https://issues.apache.org/jira/browse/ARROW-8900 Proj

Re: [DISCUSS] Need for 0.17.2 patch release?

2020-05-22 Thread Wes McKinney
That sounds fine to me, mostly was curious about what others thought. On Fri, May 22, 2020 at 5:01 PM Neal Richardson wrote: > > I'm not convinced (but open to persuasion) that ARROW-8860 alone merits the > effort of a patch release. It's unfortunate but has a number of > workarounds/alternatives

[jira] [Created] (ARROW-8899) [R] Add R metadata like pandas metadata for round-trip fidelity

2020-05-22 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-8899: -- Summary: [R] Add R metadata like pandas metadata for round-trip fidelity Key: ARROW-8899 URL: https://issues.apache.org/jira/browse/ARROW-8899 Project: Apache Arr

Re: [DISCUSS] Need for 0.17.2 patch release?

2020-05-22 Thread Neal Richardson
I'm not convinced (but open to persuasion) that ARROW-8860 alone merits the effort of a patch release. It's unfortunate but has a number of workarounds/alternatives, and it's arguably not a regression but rather a bug in a new feature. Plus, there are (unofficial) nightly Python and R packages avai

[jira] [Created] (ARROW-8898) [C++] Determine desirable maximum length for ExecBatch in pipelined and parallel execution of kernels

2020-05-22 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8898: --- Summary: [C++] Determine desirable maximum length for ExecBatch in pipelined and parallel execution of kernels Key: ARROW-8898 URL: https://issues.apache.org/jira/browse/ARROW-8898

[jira] [Created] (ARROW-8897) [C++] Determine strategy for propagating failures in initializing built-in function registry in arrow/compute

2020-05-22 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8897: --- Summary: [C++] Determine strategy for propagating failures in initializing built-in function registry in arrow/compute Key: ARROW-8897 URL: https://issues.apache.org/jira/browse/ARR

[jira] [Created] (ARROW-8896) [C++] Reimplement dictionary unpacking in Cast kernels using Take

2020-05-22 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8896: --- Summary: [C++] Reimplement dictionary unpacking in Cast kernels using Take Key: ARROW-8896 URL: https://issues.apache.org/jira/browse/ARROW-8896 Project: Apache Arrow

[jira] [Created] (ARROW-8895) [C++] Add C++ unit tests for filter function on temporal type inputs, including timestamps

2020-05-22 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8895: --- Summary: [C++] Add C++ unit tests for filter function on temporal type inputs, including timestamps Key: ARROW-8895 URL: https://issues.apache.org/jira/browse/ARROW-8895

[jira] [Created] (ARROW-8894) [C++] C++ array kernels framework and execution buildout (umbrella issue)

2020-05-22 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8894: --- Summary: [C++] C++ array kernels framework and execution buildout (umbrella issue) Key: ARROW-8894 URL: https://issues.apache.org/jira/browse/ARROW-8894 Project: Apache

Re: [DISCUSS] Need for 0.17.2 patch release?

2020-05-22 Thread Wes McKinney
Given some of the biggish projects (eg C++ kernels ARROW-8792) ongoing I'd be slightly concerned about cutting a release out of master right away until a little more time has passed, but I can see arguments both ways On Fri, May 22, 2020, 3:40 PM Krisztián Szűcs wrote: > The amount of effort req

Re: [DISCUSS] Need for 0.17.2 patch release?

2020-05-22 Thread Krisztián Szűcs
The amount of effort required to create a patch release is comparable to a minor release. How about we should create a 0.18 minor release instead? On Fri, May 22, 2020 at 3:54 PM Wes McKinney wrote: > > In light of ARROW-8860 and perhaps some other critical bugs that have > been reported, and sin

[jira] [Created] (ARROW-8893) [R] Fix cpplint issues introduced by ARROW-8885

2020-05-22 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8893: --- Summary: [R] Fix cpplint issues introduced by ARROW-8885 Key: ARROW-8893 URL: https://issues.apache.org/jira/browse/ARROW-8893 Project: Apache Arrow Issue Type

[jira] [Created] (ARROW-8892) [C++][CI] CI builds for MSVC do not build benchmarks

2020-05-22 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8892: --- Summary: [C++][CI] CI builds for MSVC do not build benchmarks Key: ARROW-8892 URL: https://issues.apache.org/jira/browse/ARROW-8892 Project: Apache Arrow Issue

[jira] [Created] (ARROW-8891) [C++] Split non-cast compute kernels into a separate shared library

2020-05-22 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8891: --- Summary: [C++] Split non-cast compute kernels into a separate shared library Key: ARROW-8891 URL: https://issues.apache.org/jira/browse/ARROW-8891 Project: Apache Arrow

Re: Arrow Flight connector for SQL Server

2020-05-22 Thread Ryan Murray
Hey Brendan, As Jacques promised here are a few things to act as pointers for your work on Flight: Our early release Flight connector[1] this fully supports single flight streams and partially supports parallel streams I also have a Spark DataSourceV2 client which may be of interest to you[2] Bo

[DISCUSS] Need for 0.17.2 patch release?

2020-05-22 Thread Wes McKinney
In light of ARROW-8860 and perhaps some other critical bugs that have been reported, and since our releases have been going more smoothly, what do people think about doing another patch release in a week or two?

[jira] [Created] (ARROW-8890) [R] Fix C++ lint issue

2020-05-22 Thread Francois Saint-Jacques (Jira)
Francois Saint-Jacques created ARROW-8890: - Summary: [R] Fix C++ lint issue Key: ARROW-8890 URL: https://issues.apache.org/jira/browse/ARROW-8890 Project: Apache Arrow Issue Type: Im

[jira] [Created] (ARROW-8889) [Python] Python 3.7 SIGSEGV when comparing RecordBatch to None

2020-05-22 Thread David Li (Jira)
David Li created ARROW-8889: --- Summary: [Python] Python 3.7 SIGSEGV when comparing RecordBatch to None Key: ARROW-8889 URL: https://issues.apache.org/jira/browse/ARROW-8889 Project: Apache Arrow Is

[NIGHTLY] Arrow Build Report for Job nightly-2020-05-22-0

2020-05-22 Thread Crossbow
Arrow Build Report for Job nightly-2020-05-22-0 All tasks: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-05-22-0 Failed Tasks: - conda-linux-gcc-py36: URL: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-05-22-0-azure-conda-linux-gcc-py36 - cond

[jira] [Created] (ARROW-8888) Heuristic in dataframe_to_arrays that decides to multithread convert cause slow conversions

2020-05-22 Thread Kevin Glasson (Jira)
Kevin Glasson created ARROW-: Summary: Heuristic in dataframe_to_arrays that decides to multithread convert cause slow conversions Key: ARROW- URL: https://issues.apache.org/jira/browse/ARROW-