[jira] [Created] (ARROW-14890) [C++][Dataset] Add support for filter pushdown in the ORC Scanner

2021-11-25 Thread xiangxiang Shen (Jira)
xiangxiang Shen created ARROW-14890:
---

 Summary: [C++][Dataset] Add support for filter pushdown in the ORC 
Scanner
 Key: ARROW-14890
 URL: https://issues.apache.org/jira/browse/ARROW-14890
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C++
Reporter: xiangxiang Shen


In arrow dataset, Filter pushdown can improve reading files performance 
greatly. We notice parquet has implemented, 
https://github.com/apache/arrow/blob/35b3567e73423420a99dbe6116f000e3c77d2a4c/cpp/src/arrow/dataset/file_parquet.cc#L465-L484.
But ORC fileformat has not supported Filter pushdown. It ignores the "filter" 
of  ScanOptions now.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14889) [C++] GCFS tests hang if testbench not installed

2021-11-25 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-14889:
--

 Summary: [C++] GCFS tests hang if testbench not installed
 Key: ARROW-14889
 URL: https://issues.apache.org/jira/browse/ARROW-14889
 Project: Apache Arrow
  Issue Type: Bug
  Components: C++
Reporter: Antoine Pitrou


They should probably error out instead of hanging.
{code}
Running main() from 
/home/antoine/arrow/dev/cpp/build-preset/googletest_ep-prefix/src/googletest_ep/googletest/src/gtest_main.cc
[==] Running 22 tests from 2 test suites.
[--] Global test environment set-up.
[--] 13 tests from GcsFileSystem
[ RUN  ] GcsFileSystem.OptionsCompare
[   OK ] GcsFileSystem.OptionsCompare (0 ms)
[ RUN  ] GcsFileSystem.ToArrowStatusOK
[   OK ] GcsFileSystem.ToArrowStatusOK (0 ms)
[ RUN  ] GcsFileSystem.ToArrowStatus
[   OK ] GcsFileSystem.ToArrowStatus (0 ms)
[ RUN  ] GcsFileSystem.FileSystemCompare
[   OK ] GcsFileSystem.FileSystemCompare (2 ms)
[ RUN  ] GcsFileSystem.ToEncryptionKey
[   OK ] GcsFileSystem.ToEncryptionKey (0 ms)
[ RUN  ] GcsFileSystem.ToEncryptionKeyEmpty
[   OK ] GcsFileSystem.ToEncryptionKeyEmpty (0 ms)
[ RUN  ] GcsFileSystem.ToKmsKeyName
[   OK ] GcsFileSystem.ToKmsKeyName (0 ms)
[ RUN  ] GcsFileSystem.ToKmsKeyNameEmpty
[   OK ] GcsFileSystem.ToKmsKeyNameEmpty (0 ms)
[ RUN  ] GcsFileSystem.ToPredefinedAcl
[   OK ] GcsFileSystem.ToPredefinedAcl (0 ms)
[ RUN  ] GcsFileSystem.ToPredefinedAclEmpty
[   OK ] GcsFileSystem.ToPredefinedAclEmpty (0 ms)
[ RUN  ] GcsFileSystem.ToObjectMetadata
[   OK ] GcsFileSystem.ToObjectMetadata (0 ms)
[ RUN  ] GcsFileSystem.ToObjectMetadataEmpty
[   OK ] GcsFileSystem.ToObjectMetadataEmpty (0 ms)
[ RUN  ] GcsFileSystem.ToObjectMetadataInvalidCustomTime
[   OK ] GcsFileSystem.ToObjectMetadataInvalidCustomTime (0 ms)
[--] 13 tests from GcsFileSystem (3 ms total)

[--] 9 tests from GcsIntegrationTest
[ RUN  ] GcsIntegrationTest.GetFileInfoBucket
/home/antoine/miniconda3/envs/pyarrow/bin/python3: No module named testbench
^C
{code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14888) Configuring Arrow

2021-11-25 Thread David Dali Susanibar Arce (Jira)
David Dali Susanibar Arce created ARROW-14888:
-

 Summary: Configuring Arrow
 Key: ARROW-14888
 URL: https://issues.apache.org/jira/browse/ARROW-14888
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: Java
Reporter: David Dali Susanibar Arce
Assignee: David Dali Susanibar Arce


Create recipe for Configuring Arrow base on 
https://github.com/apache/arrow-cookbook



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14887) Filtering Data

2021-11-25 Thread David Dali Susanibar Arce (Jira)
David Dali Susanibar Arce created ARROW-14887:
-

 Summary: Filtering Data
 Key: ARROW-14887
 URL: https://issues.apache.org/jira/browse/ARROW-14887
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: Java
Reporter: David Dali Susanibar Arce
Assignee: David Dali Susanibar Arce


Create recipe for Filtering Data base on 
https://github.com/apache/arrow-cookbook



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14886) Manipulating Data

2021-11-25 Thread David Dali Susanibar Arce (Jira)
David Dali Susanibar Arce created ARROW-14886:
-

 Summary: Manipulating Data
 Key: ARROW-14886
 URL: https://issues.apache.org/jira/browse/ARROW-14886
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: Java
Reporter: David Dali Susanibar Arce
Assignee: David Dali Susanibar Arce


Create recipe for Manipulating Data base on 
https://github.com/apache/arrow-cookbook



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14885) Defining Data Types

2021-11-25 Thread David Dali Susanibar Arce (Jira)
David Dali Susanibar Arce created ARROW-14885:
-

 Summary: Defining Data Types
 Key: ARROW-14885
 URL: https://issues.apache.org/jira/browse/ARROW-14885
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: Java
Reporter: David Dali Susanibar Arce
Assignee: David Dali Susanibar Arce


Create recipe for Defining Data Types base on 
https://github.com/apache/arrow-cookbook



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14884) Creating Arrow Objects

2021-11-25 Thread David Dali Susanibar Arce (Jira)
David Dali Susanibar Arce created ARROW-14884:
-

 Summary: Creating Arrow Objects
 Key: ARROW-14884
 URL: https://issues.apache.org/jira/browse/ARROW-14884
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: Java
Reporter: David Dali Susanibar Arce
Assignee: David Dali Susanibar Arce


Create recipe for Creating Arrow Objects base on 
https://github.com/apache/arrow-cookbook



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14883) Working with Arrow in both Python, R, Java

2021-11-25 Thread David Dali Susanibar Arce (Jira)
David Dali Susanibar Arce created ARROW-14883:
-

 Summary: Working with Arrow in both Python, R, Java
 Key: ARROW-14883
 URL: https://issues.apache.org/jira/browse/ARROW-14883
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: Java
Reporter: David Dali Susanibar Arce
Assignee: David Dali Susanibar Arce


Create recipe for Working with Arrow in both Python, R, Java according template 
at https://github.com/apache/arrow-cookbook



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14882) Reading and Writing Data

2021-11-25 Thread David Dali Susanibar Arce (Jira)
David Dali Susanibar Arce created ARROW-14882:
-

 Summary: Reading and Writing Data
 Key: ARROW-14882
 URL: https://issues.apache.org/jira/browse/ARROW-14882
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: Java
Reporter: David Dali Susanibar Arce
Assignee: David Dali Susanibar Arce


Create recipe for Reading and Writing Data according template at 
https://github.com/apache/arrow-cookbook



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14881) [C++][Doc] Warnings in Doxygen

2021-11-25 Thread Alessandro Molina (Jira)
Alessandro Molina created ARROW-14881:
-

 Summary: [C++][Doc] Warnings in Doxygen
 Key: ARROW-14881
 URL: https://issues.apache.org/jira/browse/ARROW-14881
 Project: Apache Arrow
  Issue Type: Bug
  Components: Documentation
Reporter: Alessandro Molina


When building the doxygen apidoc for C++ I get a few warnings that forced me to 
disable {{WARN_AS_ERROR}} option to be able to proceed with building the docs.

Is it an actual issue or something stray in my local environment?

Here is a preview of the warnings
{code:java}
$ doxygen
warning: Tag 'COLS_IN_ALPHA_INDEX' at line 1118 of file 'Doxyfile' has become 
obsolete.
         To avoid this warning please remove this line from your configuration 
file or upgrade it using "doxygen -u"
/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2741: warning: no 
uniquely matching class member found for 
  void arrow::flight::protocol::HandshakeRequest::clear_protocol_version()


/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2744: warning: no 
uniquely matching class member found for 
  PROTOBUF_NAMESPACE_ID::uint64 
arrow::flight::protocol::HandshakeRequest::_internal_protocol_version() const


/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2747: warning: no 
uniquely matching class member found for 
  PROTOBUF_NAMESPACE_ID::uint64 
arrow::flight::protocol::HandshakeRequest::protocol_version() const


/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2751: warning: no 
uniquely matching class member found for 
  void 
arrow::flight::protocol::HandshakeRequest::_internal_set_protocol_version(::PROTOBUF_NAMESPACE_ID::uint64
 value)


/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2755: warning: no 
uniquely matching class member found for 
  void 
arrow::flight::protocol::HandshakeRequest::set_protocol_version(::PROTOBUF_NAMESPACE_ID::uint64
 value)


/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2761: warning: no 
uniquely matching class member found for 
  void arrow::flight::protocol::HandshakeRequest::clear_payload()


/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2764: warning: no 
uniquely matching class member found for 
  const std::string & arrow::flight::protocol::HandshakeRequest::payload() const


/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2768: warning: no 
uniquely matching class member found for 
  void arrow::flight::protocol::HandshakeRequest::set_payload(const std::string 
)


/Users/amol/ARROW/arrow/cpp/src/arrow/flight/Flight.pb.h:2772: warning: no 
uniquely matching class member found for 
  std::string * arrow::flight::protocol::HandshakeRequest::mutable_payload() 
{code}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14880) [C++][CI] Enable ccache on macOS CI builds

2021-11-25 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-14880:
--

 Summary: [C++][CI] Enable ccache on macOS CI builds
 Key: ARROW-14880
 URL: https://issues.apache.org/jira/browse/ARROW-14880
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C++, Continuous Integration
Reporter: Antoine Pitrou


We could install and enable ccache on macOS Github Actions jobs. It would 
probably speed up a lot of builds.

Apparently ccache is available on Homebrew.  Here is an example using it:
https://github.com/azerothcore/azerothcore-wotlk/blob/master/.github/workflows/macos_build.yml#L26-L33

{code:yaml}
  - name: Cache
uses: actions/cache@v2
with:
  path: ~/Library/Caches/ccache
  key: ccache:${{ matrix.os }}:${{ github.ref }}:${{ github.sha }}
  restore-keys: |
ccache:${{ matrix.os }}:${{ github.ref }}
ccache:${{ matrix.os }}
{code}




--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14879) [Python][Packaging] Remove manylinux2010 wheels

2021-11-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-14879:
---

 Summary: [Python][Packaging] Remove manylinux2010 wheels
 Key: ARROW-14879
 URL: https://issues.apache.org/jira/browse/ARROW-14879
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Packaging, Python
Reporter: Krisztian Szucs
 Fix For: 7.0.0


More recent vcpkg is not compatible with older glibc shipped by manylinux2010 
so we won't be able to regularly update the dependencies. Besides that 
manylinux2010 has reached EOL.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14877) [R] Implement bindings for stringr::str_view/str_view_all

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14877:


 Summary: [R] Implement bindings for stringr::str_view/str_view_all
 Key: ARROW-14877
 URL: https://issues.apache.org/jira/browse/ARROW-14877
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14878) [R] Implement bindings for stringr::word

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14878:


 Summary: [R] Implement bindings for stringr::word
 Key: ARROW-14878
 URL: https://issues.apache.org/jira/browse/ARROW-14878
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14875) [R] Implement bindings for stringr::str_trunc

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14875:


 Summary: [R] Implement bindings for stringr::str_trunc
 Key: ARROW-14875
 URL: https://issues.apache.org/jira/browse/ARROW-14875
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14876) [R] Implement bindings for stringr::str_unique

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14876:


 Summary: [R] Implement bindings for stringr::str_unique
 Key: ARROW-14876
 URL: https://issues.apache.org/jira/browse/ARROW-14876
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14874) [R] Implement bindings for stringr::`str_sub<-`

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14874:


 Summary: [R] Implement bindings for stringr::`str_sub<-`
 Key: ARROW-14874
 URL: https://issues.apache.org/jira/browse/ARROW-14874
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14873) [R] Implement bindings for stringr::str_replace_na

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14873:


 Summary: [R] Implement bindings for stringr::str_replace_na
 Key: ARROW-14873
 URL: https://issues.apache.org/jira/browse/ARROW-14873
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14872) [R] Implement bindings for stringr::str_like

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14872:


 Summary: [R] Implement bindings for stringr::str_like
 Key: ARROW-14872
 URL: https://issues.apache.org/jira/browse/ARROW-14872
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14871) [R] Implement bindings for stringr::str_conv

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14871:


 Summary: [R] Implement bindings for stringr::str_conv
 Key: ARROW-14871
 URL: https://issues.apache.org/jira/browse/ARROW-14871
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14870) [R] Implement bindings for stringr::invert_match

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14870:


 Summary: [R] Implement bindings for stringr::invert_match
 Key: ARROW-14870
 URL: https://issues.apache.org/jira/browse/ARROW-14870
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14869) [R] Implement bindings for stringr's other helper functions

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14869:


 Summary: [R] Implement bindings for stringr's other helper 
functions
 Key: ARROW-14869
 URL: https://issues.apache.org/jira/browse/ARROW-14869
 Project: Apache Arrow
  Issue Type: New Feature
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14868) [R] Implement bindings for stringr::str_sort

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14868:


 Summary: [R] Implement bindings for stringr::str_sort
 Key: ARROW-14868
 URL: https://issues.apache.org/jira/browse/ARROW-14868
 Project: Apache Arrow
  Issue Type: Sub-task
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14867) [R] Implement bindings for stringr::str_order

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14867:


 Summary: [R] Implement bindings for stringr::str_order
 Key: ARROW-14867
 URL: https://issues.apache.org/jira/browse/ARROW-14867
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14866) [R] Implement bindings for stringr::str_equal

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14866:


 Summary: [R] Implement bindings for stringr::str_equal
 Key: ARROW-14866
 URL: https://issues.apache.org/jira/browse/ARROW-14866
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14865) [R] Implement bindings for stringr's locale aware functions

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14865:


 Summary: [R] Implement bindings for stringr's locale aware 
functions
 Key: ARROW-14865
 URL: https://issues.apache.org/jira/browse/ARROW-14865
 Project: Apache Arrow
  Issue Type: New Feature
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14864) [R] Implement bindings for stringr::str_wrap

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14864:


 Summary: [R] Implement bindings for stringr::str_wrap
 Key: ARROW-14864
 URL: https://issues.apache.org/jira/browse/ARROW-14864
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14863) [R] Implement bindings for stringr::str_squish

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14863:


 Summary: [R] Implement bindings for stringr::str_squish
 Key: ARROW-14863
 URL: https://issues.apache.org/jira/browse/ARROW-14863
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14862) [R] Implement bindings for stringr's whitespace functions

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14862:


 Summary: [R] Implement bindings for stringr's whitespace functions
 Key: ARROW-14862
 URL: https://issues.apache.org/jira/browse/ARROW-14862
 Project: Apache Arrow
  Issue Type: New Feature
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14861) Implement bindings for stringr::str_glue_data

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14861:


 Summary: Implement bindings for stringr::str_glue_data
 Key: ARROW-14861
 URL: https://issues.apache.org/jira/browse/ARROW-14861
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14860) Implement bindings for stringr::str_glue

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14860:


 Summary: Implement bindings for stringr::str_glue
 Key: ARROW-14860
 URL: https://issues.apache.org/jira/browse/ARROW-14860
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)


[jira] [Created] (ARROW-14859) Implement bindings for stringr::str_flatten

2021-11-25 Thread Jira
Dragoș Moldovan-Grünfeld created ARROW-14859:


 Summary: Implement bindings for stringr::str_flatten
 Key: ARROW-14859
 URL: https://issues.apache.org/jira/browse/ARROW-14859
 Project: Apache Arrow
  Issue Type: Sub-task
  Components: R
Reporter: Dragoș Moldovan-Grünfeld






--
This message was sent by Atlassian Jira
(v8.20.1#820001)