[jira] [Created] (ARROW-14890) [C++][Dataset] Add support for filter pushdown in the ORC Scanner
xiangxiang Shen created ARROW-14890:

Summary: [C++][Dataset] Add support for filter pushdown in the ORC Scanner
Key: ARROW-14890
URL: https://issues.apache.org/jira/browse/ARROW-14890
Project: Apache Arrow
Issue Type: Improvement
Components: C++
Reporter: xiangxiang Shen

In Arrow Dataset, filter pushdown can greatly improve file-reading performance. The Parquet file format already implements it: https://github.com/apache/arrow/blob/35b3567e73423420a99dbe6116f000e3c77d2a4c/cpp/src/arrow/dataset/file_parquet.cc#L465-L484. The ORC file format does not support filter pushdown yet; it currently ignores the "filter" of ScanOptions.

--
This message was sent by Atlassian Jira (v8.20.1#820001)
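To make the idea behind the request concrete, here is a minimal conceptual sketch of statistics-based pushdown. It is not Arrow's or ORC's API: `Stripe`, `scan_greater_than`, and the min/max fields are hypothetical names chosen for illustration, and the only assumption is that each stripe carries min/max statistics for the filtered column.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Stripe:
    """Hypothetical stand-in for an ORC stripe with min/max column statistics."""
    min_value: int
    max_value: int
    rows: List[int]


def scan_greater_than(stripes: List[Stripe], threshold: int,
                      pushdown: bool) -> Tuple[List[int], int]:
    """Evaluate `value > threshold`; return (matching rows, stripes decoded)."""
    matched: List[int] = []
    decoded = 0
    for stripe in stripes:
        # With pushdown, the statistics prove that no row in this stripe
        # can match, so the stripe is skipped without being decoded.
        if pushdown and stripe.max_value <= threshold:
            continue
        decoded += 1
        matched.extend(v for v in stripe.rows if v > threshold)
    return matched, decoded


stripes = [
    Stripe(0, 9, list(range(0, 10))),
    Stripe(10, 19, list(range(10, 20))),
    Stripe(20, 29, list(range(20, 30))),
]

rows_without, decoded_without = scan_greater_than(stripes, 24, pushdown=False)
rows_with, decoded_with = scan_greater_than(stripes, 24, pushdown=True)
```

Both calls return the same rows; the difference is only in how many stripes have to be decoded, which is where the performance gain would come from.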
[jira] [Created] (ARROW-14889) [C++] GCFS tests hang if testbench not installed
Antoine Pitrou created ARROW-14889:

Summary: [C++] GCFS tests hang if testbench not installed
Key: ARROW-14889
URL: https://issues.apache.org/jira/browse/ARROW-14889
Project: Apache Arrow
Issue Type: Bug
Components: C++
Reporter: Antoine Pitrou

They should probably error out instead of hanging.

{code}
Running main() from /home/antoine/arrow/dev/cpp/build-preset/googletest_ep-prefix/src/googletest_ep/googletest/src/gtest_main.cc
[==========] Running 22 tests from 2 test suites.
[----------] Global test environment set-up.
[----------] 13 tests from GcsFileSystem
[ RUN      ] GcsFileSystem.OptionsCompare
[       OK ] GcsFileSystem.OptionsCompare (0 ms)
[ RUN      ] GcsFileSystem.ToArrowStatusOK
[       OK ] GcsFileSystem.ToArrowStatusOK (0 ms)
[ RUN      ] GcsFileSystem.ToArrowStatus
[       OK ] GcsFileSystem.ToArrowStatus (0 ms)
[ RUN      ] GcsFileSystem.FileSystemCompare
[       OK ] GcsFileSystem.FileSystemCompare (2 ms)
[ RUN      ] GcsFileSystem.ToEncryptionKey
[       OK ] GcsFileSystem.ToEncryptionKey (0 ms)
[ RUN      ] GcsFileSystem.ToEncryptionKeyEmpty
[       OK ] GcsFileSystem.ToEncryptionKeyEmpty (0 ms)
[ RUN      ] GcsFileSystem.ToKmsKeyName
[       OK ] GcsFileSystem.ToKmsKeyName (0 ms)
[ RUN      ] GcsFileSystem.ToKmsKeyNameEmpty
[       OK ] GcsFileSystem.ToKmsKeyNameEmpty (0 ms)
[ RUN      ] GcsFileSystem.ToPredefinedAcl
[       OK ] GcsFileSystem.ToPredefinedAcl (0 ms)
[ RUN      ] GcsFileSystem.ToPredefinedAclEmpty
[       OK ] GcsFileSystem.ToPredefinedAclEmpty (0 ms)
[ RUN      ] GcsFileSystem.ToObjectMetadata
[       OK ] GcsFileSystem.ToObjectMetadata (0 ms)
[ RUN      ] GcsFileSystem.ToObjectMetadataEmpty
[       OK ] GcsFileSystem.ToObjectMetadataEmpty (0 ms)
[ RUN      ] GcsFileSystem.ToObjectMetadataInvalidCustomTime
[       OK ] GcsFileSystem.ToObjectMetadataInvalidCustomTime (0 ms)
[----------] 13 tests from GcsFileSystem (3 ms total)
[----------] 9 tests from GcsIntegrationTest
[ RUN      ] GcsIntegrationTest.GetFileInfoBucket
/home/antoine/miniconda3/envs/pyarrow/bin/python3: No module named testbench
^C
{code}
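One way to "error out instead of hanging" is to check that the module can be found before trying to launch it. The sketch below only illustrates that idea with Python's standard `importlib.util.find_spec`; `require_module` and `check` are hypothetical helpers and do not reflect how the Arrow test fixture is actually written.

```python
import importlib.util


def require_module(name: str) -> None:
    """Raise immediately if the top-level module `name` cannot be found."""
    if importlib.util.find_spec(name) is None:
        raise RuntimeError(
            f"Python module {name!r} is not installed; cannot start the test server"
        )


def check(name: str) -> str:
    """Report the outcome as a string instead of waiting on a server that never starts."""
    try:
        require_module(name)
    except RuntimeError as exc:
        return f"error: {exc}"
    return "ok"
```

Calling `check("testbench")` on a machine without the module would return an error string right away rather than blocking.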
[jira] [Created] (ARROW-14888) Configuring Arrow
David Dali Susanibar Arce created ARROW-14888:

Summary: Configuring Arrow
Key: ARROW-14888
URL: https://issues.apache.org/jira/browse/ARROW-14888
Project: Apache Arrow
Issue Type: Sub-task
Components: Java
Reporter: David Dali Susanibar Arce
Assignee: David Dali Susanibar Arce

Create a recipe for Configuring Arrow based on https://github.com/apache/arrow-cookbook
[jira] [Created] (ARROW-14887) Filtering Data
David Dali Susanibar Arce created ARROW-14887:

Summary: Filtering Data
Key: ARROW-14887
URL: https://issues.apache.org/jira/browse/ARROW-14887
Project: Apache Arrow
Issue Type: Sub-task
Components: Java
Reporter: David Dali Susanibar Arce
Assignee: David Dali Susanibar Arce

Create a recipe for Filtering Data based on https://github.com/apache/arrow-cookbook
[jira] [Created] (ARROW-14886) Manipulating Data
David Dali Susanibar Arce created ARROW-14886:

Summary: Manipulating Data
Key: ARROW-14886
URL: https://issues.apache.org/jira/browse/ARROW-14886
Project: Apache Arrow
Issue Type: Sub-task
Components: Java
Reporter: David Dali Susanibar Arce
Assignee: David Dali Susanibar Arce

Create a recipe for Manipulating Data based on https://github.com/apache/arrow-cookbook
[jira] [Created] (ARROW-14885) Defining Data Types
David Dali Susanibar Arce created ARROW-14885:

Summary: Defining Data Types
Key: ARROW-14885
URL: https://issues.apache.org/jira/browse/ARROW-14885
Project: Apache Arrow
Issue Type: Sub-task
Components: Java
Reporter: David Dali Susanibar Arce
Assignee: David Dali Susanibar Arce

Create a recipe for Defining Data Types based on https://github.com/apache/arrow-cookbook
[jira] [Created] (ARROW-14884) Creating Arrow Objects
David Dali Susanibar Arce created ARROW-14884:

Summary: Creating Arrow Objects
Key: ARROW-14884
URL: https://issues.apache.org/jira/browse/ARROW-14884
Project: Apache Arrow
Issue Type: Sub-task
Components: Java
Reporter: David Dali Susanibar Arce
Assignee: David Dali Susanibar Arce

Create a recipe for Creating Arrow Objects based on https://github.com/apache/arrow-cookbook
[jira] [Created] (ARROW-14883) Working with Arrow in both Python, R, Java
David Dali Susanibar Arce created ARROW-14883:

Summary: Working with Arrow in both Python, R, Java
Key: ARROW-14883
URL: https://issues.apache.org/jira/browse/ARROW-14883
Project: Apache Arrow
Issue Type: Sub-task
Components: Java
Reporter: David Dali Susanibar Arce
Assignee: David Dali Susanibar Arce

Create a recipe for Working with Arrow in Python, R, and Java, according to the template at https://github.com/apache/arrow-cookbook
[jira] [Created] (ARROW-14882) Reading and Writing Data
David Dali Susanibar Arce created ARROW-14882:

Summary: Reading and Writing Data
Key: ARROW-14882
URL: https://issues.apache.org/jira/browse/ARROW-14882
Project: Apache Arrow
Issue Type: Sub-task
Components: Java
Reporter: David Dali Susanibar Arce
Assignee: David Dali Susanibar Arce

Create a recipe for Reading and Writing Data according to the template at https://github.com/apache/arrow-cookbook
[jira] [Created] (ARROW-14881) [C++][Doc] Warnings in Doxygen
Alessandro Molina created ARROW-14881:

Summary: [C++][Doc] Warnings in Doxygen
Key: ARROW-14881
URL: https://issues.apache.org/jira/browse/ARROW-14881
Project: Apache Arrow
Issue Type: Bug
Components: Documentation
Reporter: Alessandro Molina

When building the Doxygen API docs for C++, I get a few warnings that forced me to disable the {{WARN_AS_ERROR}} option in order to proceed with building the docs.

Is this an actual issue or something stray in my local environment?

Here is a preview of the warnings:

{code:java}
$ doxygen
warning: Tag 'COLS_IN_ALPHA_INDEX' at line 1118 of file 'Doxyfile' has become obsolete.
  To avoid this warning please remove this line from your configuration file or upgrade it using "doxygen -u"
/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2741: warning: no uniquely matching class member found for
  void arrow::flight::protocol::HandshakeRequest::clear_protocol_version()
/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2744: warning: no uniquely matching class member found for
  PROTOBUF_NAMESPACE_ID::uint64 arrow::flight::protocol::HandshakeRequest::_internal_protocol_version() const
/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2747: warning: no uniquely matching class member found for
  PROTOBUF_NAMESPACE_ID::uint64 arrow::flight::protocol::HandshakeRequest::protocol_version() const
/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2751: warning: no uniquely matching class member found for
  void arrow::flight::protocol::HandshakeRequest::_internal_set_protocol_version(::PROTOBUF_NAMESPACE_ID::uint64 value)
/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2755: warning: no uniquely matching class member found for
  void arrow::flight::protocol::HandshakeRequest::set_protocol_version(::PROTOBUF_NAMESPACE_ID::uint64 value)
/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2761: warning: no uniquely matching class member found for
  void arrow::flight::protocol::HandshakeRequest::clear_payload()
/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2764: warning: no uniquely matching class member found for
  const std::string & arrow::flight::protocol::HandshakeRequest::payload() const
/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2768: warning: no uniquely matching class member found for
  void arrow::flight::protocol::HandshakeRequest::set_payload(const std::string )
/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2772: warning: no uniquely matching class member found for
  std::string * arrow::flight::protocol::HandshakeRequest::mutable_payload()
{code}
[jira] [Created] (ARROW-14880) [C++][CI] Enable ccache on macOS CI builds
Antoine Pitrou created ARROW-14880:

Summary: [C++][CI] Enable ccache on macOS CI builds
Key: ARROW-14880
URL: https://issues.apache.org/jira/browse/ARROW-14880
Project: Apache Arrow
Issue Type: Improvement
Components: C++, Continuous Integration
Reporter: Antoine Pitrou

We could install and enable ccache on the macOS GitHub Actions jobs. It would probably speed up a lot of builds.

Apparently ccache is available on Homebrew. Here is an example using it:
https://github.com/azerothcore/azerothcore-wotlk/blob/master/.github/workflows/macos_build.yml#L26-L33

{code:yaml}
- name: Cache
  uses: actions/cache@v2
  with:
    path: ~/Library/Caches/ccache
    key: ccache:${{ matrix.os }}:${{ github.ref }}:${{ github.sha }}
    restore-keys: |
      ccache:${{ matrix.os }}:${{ github.ref }}
      ccache:${{ matrix.os }}
{code}
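The `key` / `restore-keys` pair in the example above relies on prefix fallback: an exact key is tried first, then each restore key in order. The sketch below models only that lookup order, assuming prefix matching as described in the actions/cache documentation; `pick_cache` is a hypothetical helper, the key strings are invented, and details such as cache recency and branch scoping are deliberately left out.

```python
from typing import List, Optional


def pick_cache(key: str, restore_keys: List[str], stored: List[str]) -> Optional[str]:
    """Return the stored cache key that would be restored, or None on a full miss."""
    if key in stored:
        return key  # exact hit on the primary key
    for prefix in restore_keys:
        # Fall back to the first stored entry whose key starts with this prefix.
        for candidate in stored:
            if candidate.startswith(prefix):
                return candidate
    return None


# Invented example keys following the "ccache:<os>:<ref>:<sha>" layout.
stored = ["ccache:macos:refs/heads/master:abc123"]
restore_keys = ["ccache:macos:refs/heads/feature", "ccache:macos"]

hit = pick_cache("ccache:macos:refs/heads/feature:def456", restore_keys, stored)
miss = pick_cache("ccache:linux:refs/heads/feature:def456", ["ccache:linux"], stored)
```

Here a build on a new branch finds no cache for its own ref and falls back to the one saved from another branch on the same OS, which is typically still useful for ccache.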
[jira] [Created] (ARROW-14879) [Python][Packaging] Remove manylinux2010 wheels
Krisztian Szucs created ARROW-14879:

Summary: [Python][Packaging] Remove manylinux2010 wheels
Key: ARROW-14879
URL: https://issues.apache.org/jira/browse/ARROW-14879
Project: Apache Arrow
Issue Type: Improvement
Components: Packaging, Python
Reporter: Krisztian Szucs
Fix For: 7.0.0

Recent vcpkg versions are not compatible with the older glibc shipped by manylinux2010, so we won't be able to update the dependencies regularly. In addition, manylinux2010 has reached EOL.
[jira] [Created] (ARROW-14877) [R] Implement bindings for stringr::str_view/str_view_all
Dragoș Moldovan-Grünfeld created ARROW-14877:

Summary: [R] Implement bindings for stringr::str_view/str_view_all
Key: ARROW-14877
URL: https://issues.apache.org/jira/browse/ARROW-14877
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14878) [R] Implement bindings for stringr::word
Dragoș Moldovan-Grünfeld created ARROW-14878:

Summary: [R] Implement bindings for stringr::word
Key: ARROW-14878
URL: https://issues.apache.org/jira/browse/ARROW-14878
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14875) [R] Implement bindings for stringr::str_trunc
Dragoș Moldovan-Grünfeld created ARROW-14875:

Summary: [R] Implement bindings for stringr::str_trunc
Key: ARROW-14875
URL: https://issues.apache.org/jira/browse/ARROW-14875
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14876) [R] Implement bindings for stringr::str_unique
Dragoș Moldovan-Grünfeld created ARROW-14876:

Summary: [R] Implement bindings for stringr::str_unique
Key: ARROW-14876
URL: https://issues.apache.org/jira/browse/ARROW-14876
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14874) [R] Implement bindings for stringr::`str_sub<-`
Dragoș Moldovan-Grünfeld created ARROW-14874:

Summary: [R] Implement bindings for stringr::`str_sub<-`
Key: ARROW-14874
URL: https://issues.apache.org/jira/browse/ARROW-14874
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14873) [R] Implement bindings for stringr::str_replace_na
Dragoș Moldovan-Grünfeld created ARROW-14873:

Summary: [R] Implement bindings for stringr::str_replace_na
Key: ARROW-14873
URL: https://issues.apache.org/jira/browse/ARROW-14873
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14872) [R] Implement bindings for stringr::str_like
Dragoș Moldovan-Grünfeld created ARROW-14872:

Summary: [R] Implement bindings for stringr::str_like
Key: ARROW-14872
URL: https://issues.apache.org/jira/browse/ARROW-14872
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14871) [R] Implement bindings for stringr::str_conv
Dragoș Moldovan-Grünfeld created ARROW-14871:

Summary: [R] Implement bindings for stringr::str_conv
Key: ARROW-14871
URL: https://issues.apache.org/jira/browse/ARROW-14871
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14870) [R] Implement bindings for stringr::invert_match
Dragoș Moldovan-Grünfeld created ARROW-14870:

Summary: [R] Implement bindings for stringr::invert_match
Key: ARROW-14870
URL: https://issues.apache.org/jira/browse/ARROW-14870
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14869) [R] Implement bindings for stringr's other helper functions
Dragoș Moldovan-Grünfeld created ARROW-14869:

Summary: [R] Implement bindings for stringr's other helper functions
Key: ARROW-14869
URL: https://issues.apache.org/jira/browse/ARROW-14869
Project: Apache Arrow
Issue Type: New Feature
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14868) [R] Implement bindings for stringr::str_sort
Dragoș Moldovan-Grünfeld created ARROW-14868:

Summary: [R] Implement bindings for stringr::str_sort
Key: ARROW-14868
URL: https://issues.apache.org/jira/browse/ARROW-14868
Project: Apache Arrow
Issue Type: Sub-task
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14867) [R] Implement bindings for stringr::str_order
Dragoș Moldovan-Grünfeld created ARROW-14867:

Summary: [R] Implement bindings for stringr::str_order
Key: ARROW-14867
URL: https://issues.apache.org/jira/browse/ARROW-14867
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14866) [R] Implement bindings for stringr::str_equal
Dragoș Moldovan-Grünfeld created ARROW-14866:

Summary: [R] Implement bindings for stringr::str_equal
Key: ARROW-14866
URL: https://issues.apache.org/jira/browse/ARROW-14866
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14865) [R] Implement bindings for stringr's locale aware functions
Dragoș Moldovan-Grünfeld created ARROW-14865:

Summary: [R] Implement bindings for stringr's locale aware functions
Key: ARROW-14865
URL: https://issues.apache.org/jira/browse/ARROW-14865
Project: Apache Arrow
Issue Type: New Feature
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14864) [R] Implement bindings for stringr::str_wrap
Dragoș Moldovan-Grünfeld created ARROW-14864:

Summary: [R] Implement bindings for stringr::str_wrap
Key: ARROW-14864
URL: https://issues.apache.org/jira/browse/ARROW-14864
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14863) [R] Implement bindings for stringr::str_squish
Dragoș Moldovan-Grünfeld created ARROW-14863:

Summary: [R] Implement bindings for stringr::str_squish
Key: ARROW-14863
URL: https://issues.apache.org/jira/browse/ARROW-14863
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14862) [R] Implement bindings for stringr's whitespace functions
Dragoș Moldovan-Grünfeld created ARROW-14862:

Summary: [R] Implement bindings for stringr's whitespace functions
Key: ARROW-14862
URL: https://issues.apache.org/jira/browse/ARROW-14862
Project: Apache Arrow
Issue Type: New Feature
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14861) Implement bindings for stringr::str_glue_data
Dragoș Moldovan-Grünfeld created ARROW-14861:

Summary: Implement bindings for stringr::str_glue_data
Key: ARROW-14861
URL: https://issues.apache.org/jira/browse/ARROW-14861
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14860) Implement bindings for stringr::str_glue
Dragoș Moldovan-Grünfeld created ARROW-14860:

Summary: Implement bindings for stringr::str_glue
Key: ARROW-14860
URL: https://issues.apache.org/jira/browse/ARROW-14860
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld
[jira] [Created] (ARROW-14859) Implement bindings for stringr::str_flatten
Dragoș Moldovan-Grünfeld created ARROW-14859:

Summary: Implement bindings for stringr::str_flatten
Key: ARROW-14859
URL: https://issues.apache.org/jira/browse/ARROW-14859
Project: Apache Arrow
Issue Type: Sub-task
Components: R
Reporter: Dragoș Moldovan-Grünfeld