[jira] [Created] (ARROW-9415) Arrow does not compile on Power9
Jens Glaser created ARROW-9415: -- Summary: Arrow does not compile on Power9 Key: ARROW-9415 URL: https://issues.apache.org/jira/browse/ARROW-9415 Project: Apache Arrow Issue Type: Bug Components: C++ Affects Versions: 0.17.1 Environment: Linux 4.14.0-115.21.2.el7a.ppc64le Reporter: Jens Glaser When compiling the master branch from github on IBM P9 (Summit), I get {{In file included from /ccs/proj/stf006/glaser/rapids/arrow/cpp/src/arrow/array/dict_internal.h:35:0, from /ccs/proj/stf006/glaser/rapids/arrow/cpp/src/arrow/array/builder_dict.cc:22: /ccs/proj/stf006/glaser/rapids/arrow/cpp/src/arrow/util/hashing.h: In static member function 'static uint32_t arrow::internal::SmallScalarTraits<__vector(4) __bool int>::AsIndex(__vector(4) __bool int)': /ccs/proj/stf006/glaser/rapids/arrow/cpp/src/arrow/util/hashing.h:495:60: error: cannot convert '__vector(4) int' to 'uint32_t \{aka unsigned int}' in return static uint32_t AsIndex(bool value) \{ return value ? 1 : 0; } ^}} -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-9414) [C++] apt package includes headers for S3 interface, but no support
Simon Bertron created ARROW-9414: Summary: [C++] apt package includes headers for S3 interface, but no support Key: ARROW-9414 URL: https://issues.apache.org/jira/browse/ARROW-9414 Project: Apache Arrow Issue Type: Bug Components: C++ Affects Versions: 0.17.1 Environment: Ubuntu 18.04.04 LTS Reporter: Simon Bertron I believe that the apt package is built without S3 support. But s3fs.h is exported in filesystem/api.h anyway. This creates undefined reference errors when trying to link to the package. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-9413) [Rust] Fix clippy lint on master
Neville Dipale created ARROW-9413: - Summary: [Rust] Fix clippy lint on master Key: ARROW-9413 URL: https://issues.apache.org/jira/browse/ARROW-9413 Project: Apache Arrow Issue Type: Bug Components: Rust Affects Versions: 0.17.1 Reporter: Neville Dipale There was a clippy lint error with the float sort PR. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-9412) [C++] Add non-BUNDLED dependencies to exported INSTALL_INTERFACE_LIBS of arrow_static and test that it works
Wes McKinney created ARROW-9412: --- Summary: [C++] Add non-BUNDLED dependencies to exported INSTALL_INTERFACE_LIBS of arrow_static and test that it works Key: ARROW-9412 URL: https://issues.apache.org/jira/browse/ARROW-9412 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Wes McKinney As a companion project to ARROW-7605, we must document and test a workflow for statically linking with external static dependencies. When a dependency is not built as BUNDLED, it can be added to "ARROW_STATIC_INSTALL_INTERFACE_LIBS" so that it's included in ArrowTargets-*.cmake. The third party project of course must configure the dependent CMake targets Prior to the patch for ARROW-7605, toolchain libraries were added unconditionally to ARROW_STATIC_INSTALL_INTERFACE_LIBS whether BUNDLED or not (including our private jemalloc), creating a broken CMake "arrow_static" target. So this patch is to partially revert these changes to enable static linking with external toolchain libraries without breaking the BUNDLED static builds. Finally, this must be tested similar to cpp/examples/minimal_build/run_static.sh so that we can verify that each of the build/link scenarios are working correctly -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-9411) [Rust] Update dependencies
Neville Dipale created ARROW-9411: - Summary: [Rust] Update dependencies Key: ARROW-9411 URL: https://issues.apache.org/jira/browse/ARROW-9411 Project: Apache Arrow Issue Type: Improvement Components: Rust Affects Versions: 0.17.1 Reporter: Neville Dipale Update dependencies like tonic and rand (to reduce total dependencies) -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-9410) [CI][Crossbow] Fix homebrew-cpp again
Neal Richardson created ARROW-9410: -- Summary: [CI][Crossbow] Fix homebrew-cpp again Key: ARROW-9410 URL: https://issues.apache.org/jira/browse/ARROW-9410 Project: Apache Arrow Issue Type: Bug Components: Continuous Integration, Packaging Reporter: Neal Richardson Assignee: Neal Richardson Fix For: 1.0.0 brew audit is now failing because python is an alias: https://travis-ci.org/github/ursa-labs/crossbow/builds/706771359 This was changed upstream months ago. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-9409) [CI][Crossbow] Nightly conda-r fails
Neal Richardson created ARROW-9409: -- Summary: [CI][Crossbow] Nightly conda-r fails Key: ARROW-9409 URL: https://issues.apache.org/jira/browse/ARROW-9409 Project: Apache Arrow Issue Type: New Feature Components: Continuous Integration, R Reporter: Neal Richardson Assignee: Neal Richardson Fix For: 1.0.0 Adding {{decor}} to Suggests means we need to add it to the conda env too (even though we don't use it in tests, etc. etc.) -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-9408) [Integration] Tests do not run in Windows due to numpy 64-bit errors
Neville Dipale created ARROW-9408: - Summary: [Integration] Tests do not run in Windows due to numpy 64-bit errors Key: ARROW-9408 URL: https://issues.apache.org/jira/browse/ARROW-9408 Project: Apache Arrow Issue Type: Bug Components: Integration Affects Versions: 0.17.1 Reporter: Neville Dipale We found that the integer range check when generating integration data doesn't work in Windows because the default C integers that numpy uses are 32-bit by default in Windows. This fixes that issue by forcing 64-bit integers. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-9407) [Python] Accept pd.NA as missing value in array constructor
Uwe Korn created ARROW-9407: --- Summary: [Python] Accept pd.NA as missing value in array constructor Key: ARROW-9407 URL: https://issues.apache.org/jira/browse/ARROW-9407 Project: Apache Arrow Issue Type: New Feature Components: Python Reporter: Uwe Korn Fix For: 2.0.0 Currently we don't support using {{pandas.NA}} at all: {code} In [1]: import pyarrow as pa In [2]: import pandas as pd In [3]: pa.array([pd.NA, "A"]) --- ArrowInvalid Traceback (most recent call last) in > 1 pa.array([pd.NA, "A"]) ~/miniconda3/envs/fletcher/lib/python3.8/site-packages/pyarrow/array.pxi in pyarrow.lib.array() ~/miniconda3/envs/fletcher/lib/python3.8/site-packages/pyarrow/array.pxi in pyarrow.lib._sequence_to_array() ~/miniconda3/envs/fletcher/lib/python3.8/site-packages/pyarrow/error.pxi in pyarrow.lib.check_status() ArrowInvalid: Could not convert with type NAType: did not recognize Python value type when inferring an Arrow data type {code} -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-9406) [C#] Show API docs on arrow.apache.org
Eric Erhardt created ARROW-9406: --- Summary: [C#] Show API docs on arrow.apache.org Key: ARROW-9406 URL: https://issues.apache.org/jira/browse/ARROW-9406 Project: Apache Arrow Issue Type: Improvement Components: C# Reporter: Eric Erhardt We should use the XML documentation files that are generated during the build and publish them to arrow.apache.org under the Documentation tab. One option would be to use [https://ejball.com/XmlDocMarkdown/] to generate a markdown file, which could be displayed as a web page. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-9405) [R] Switch to cpp11
Neal Richardson created ARROW-9405: -- Summary: [R] Switch to cpp11 Key: ARROW-9405 URL: https://issues.apache.org/jira/browse/ARROW-9405 Project: Apache Arrow Issue Type: New Feature Components: R Reporter: Neal Richardson Assignee: Romain Francois Fix For: 2.0.0 Now that it's on CRAN. Faster compile time and better UTF-8 handling are among the benefits that make it seem worth trying at least. -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-9404) [C++] Add support for Decimal16, Decimal32 and Decimal64
Artem Alekseev created ARROW-9404: - Summary: [C++] Add support for Decimal16, Decimal32 and Decimal64 Key: ARROW-9404 URL: https://issues.apache.org/jira/browse/ARROW-9404 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Artem Alekseev Assignee: Artem Alekseev It looks like arrow lacks support for decimal16, decimal32 and decimal64 types. Are there any reasons for that? -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-9403) [Python] add .tolist as alias of to_pylist
Maarten Breddels created ARROW-9403: --- Summary: [Python] add .tolist as alias of to_pylist Key: ARROW-9403 URL: https://issues.apache.org/jira/browse/ARROW-9403 Project: Apache Arrow Issue Type: Improvement Components: Python Reporter: Maarten Breddels Assignee: Maarten Breddels As discussed on the mailing list, it helps to write library agnostic code (NumPy/Pyarrow) if arrays support a .tolist(), as alias to .to_pylist(). -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-9402) [C++] Add portable wrappers for __builtin_add_overflow and friends
Antoine Pitrou created ARROW-9402: - Summary: [C++] Add portable wrappers for __builtin_add_overflow and friends Key: ARROW-9402 URL: https://issues.apache.org/jira/browse/ARROW-9402 Project: Apache Arrow Issue Type: Improvement Components: C++ Reporter: Antoine Pitrou Fix For: 2.0.0 -- This message was sent by Atlassian Jira (v8.3.4#803005)
[jira] [Created] (ARROW-9401) [C++/Python] Support necessary functionality to have an Arrow-string type in pandas
Uwe Korn created ARROW-9401: --- Summary: [C++/Python] Support necessary functionality to have an Arrow-string type in pandas Key: ARROW-9401 URL: https://issues.apache.org/jira/browse/ARROW-9401 Project: Apache Arrow Issue Type: Wish Components: C++, Python Reporter: Uwe Korn Assignee: Uwe Korn Fix For: 2.0.0 This should serve as an umbrella issue for the needed functionality to have an Apache Arrow backed string type in {{pandas}}. In addition to the string kernels, we probably need to implement some more support functionality to efficiently the {{pandas}} interfaces. Some of these functions are already present in {{fletcher}} but a native string type in {{pandas}} should not have a hard dependency on {{numba}}. -- This message was sent by Atlassian Jira (v8.3.4#803005)