[jira] [Resolved] (ARROW-7979) [C++] Implement experimental buffer compression in IPC messages

2020-03-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-7979. - Resolution: Fixed Issue resolved by pull request 6638

[jira] [Updated] (ARROW-8216) filter method for Dataset doesn't distinguish between empty strings and NAs

2020-03-25 Thread Sam Albers (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sam Albers updated ARROW-8216: -- Description:   I have just noticed some slightly odd behaviour with the filter method for Dataset. 

[jira] [Created] (ARROW-8216) filter method for Dataset doesn't distinguish between empty strings and NAs

2020-03-25 Thread Sam Albers (Jira)
Sam Albers created ARROW-8216: - Summary: filter method for Dataset doesn't distinguish between empty strings and NAs Key: ARROW-8216 URL: https://issues.apache.org/jira/browse/ARROW-8216 Project: Apache

[jira] [Updated] (ARROW-7925) [C++][Documentation] Instructions about running IWYU and other tasks in cpp/development.rst have gone stale

2020-03-25 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-7925: Fix Version/s: (was: 0.17.0) 1.0.0 > [C++][Documentation] Instructions

[jira] [Updated] (ARROW-7894) [C++] DefineOptions should invoke add_definitions

2020-03-25 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-7894: Fix Version/s: (was: 0.17.0) 1.0.0 > [C++] DefineOptions should invoke

[jira] [Resolved] (ARROW-8204) [Rust] [DataFusion] Add support for aliased expressions in SQL

2020-03-25 Thread Paddy Horan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paddy Horan resolved ARROW-8204. Resolution: Fixed Issue resolved by pull request 6713 [https://github.com/apache/arrow/pull/6713]

[jira] [Resolved] (ARROW-8058) [C++][Python][Dataset] Provide an option to toggle validation and schema inference in FileSystemDatasetFactoryOptions

2020-03-25 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman resolved ARROW-8058. - Resolution: Fixed Issue resolved by pull request 6687

[jira] [Resolved] (ARROW-8059) [Python] Make FileSystem objects serializable

2020-03-25 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-8059. Resolution: Fixed Issue resolved by pull request 6644

[jira] [Created] (ARROW-8215) [CI][Glib] Meson install fails in the macOS build

2020-03-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8215: -- Summary: [CI][Glib] Meson install fails in the macOS build Key: ARROW-8215 URL: https://issues.apache.org/jira/browse/ARROW-8215 Project: Apache Arrow

[jira] [Updated] (ARROW-8184) [Packaging] Use arrow-nightlies organization name on Anaconda and Gemfury to host the nightlies

2020-03-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8184: -- Labels: pull-request-available (was: ) > [Packaging] Use arrow-nightlies organization name on

[jira] [Updated] (ARROW-8184) [Packaging] Use arrow-nightlies organization name on Anaconda and Gemfury to host the nightlies

2020-03-25 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8184: --- Summary: [Packaging] Use arrow-nightlies organization name on Anaconda and Gemfury to host

[jira] [Updated] (ARROW-8185) [Packaging] Document the available nightly wheels, conda and R packages under the development section

2020-03-25 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8185: --- Fix Version/s: 0.17.0 > [Packaging] Document the available nightly wheels, conda and R

[jira] [Updated] (ARROW-8184) [Packaging] Use arrow-nightlies (or similar) organization name on Anaconda and Gemfury to host the nightlies

2020-03-25 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8184?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-8184: --- Fix Version/s: 0.17.0 > [Packaging] Use arrow-nightlies (or similar) organization name on

[jira] [Assigned] (ARROW-7850) [Packaging][Python] Document how to install nightly built wheels

2020-03-25 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-7850: -- Assignee: Krisztian Szucs > [Packaging][Python] Document how to install nightly built

[jira] [Assigned] (ARROW-7771) [Release] Use ARROW_TMPDIR environment variable in the verification scripts instead of TMPDIR

2020-03-25 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-7771: -- Assignee: Krisztian Szucs > [Release] Use ARROW_TMPDIR environment variable in the

[jira] [Created] (ARROW-8214) [C++] Flatbuffers based serialization protocol for Expressions

2020-03-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8214: -- Summary: [C++] Flatbuffers based serialization protocol for Expressions Key: ARROW-8214 URL: https://issues.apache.org/jira/browse/ARROW-8214 Project: Apache

[jira] [Created] (ARROW-8213) [Python][Dataste] Opening a dataset with a local incorrect path gives confusing error message

2020-03-25 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-8213: Summary: [Python][Dataste] Opening a dataset with a local incorrect path gives confusing error message Key: ARROW-8213 URL:

[jira] [Created] (ARROW-8212) [Python][Dataset] Consider adding Cast like operation

2020-03-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8212: -- Summary: [Python][Dataset] Consider adding Cast like operation Key: ARROW-8212 URL: https://issues.apache.org/jira/browse/ARROW-8212 Project: Apache Arrow

[jira] [Commented] (ARROW-6947) [Rust] [DataFusion] Add support for scalar UDFs

2020-03-25 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17066755#comment-17066755 ] Andy Grove commented on ARROW-6947: --- It's important to remember that these UDFs will operate on

[jira] [Commented] (ARROW-6947) [Rust] [DataFusion] Add support for scalar UDFs

2020-03-25 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17066754#comment-17066754 ] Andy Grove commented on ARROW-6947: --- Sorry for taking so long to get to this. I think it will be

[jira] [Updated] (ARROW-8210) [C++][Dataset] Handling of duplicate columns in Dataset factory and scanning

2020-03-25 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8210: - Description: While testing duplicate column names, I ran into multiple issues:

[jira] [Created] (ARROW-8211) [C++] Sanitize hdfs host when creating HadoopFileSystem from endpoint

2020-03-25 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8211: -- Summary: [C++] Sanitize hdfs host when creating HadoopFileSystem from endpoint Key: ARROW-8211 URL: https://issues.apache.org/jira/browse/ARROW-8211 Project:

[jira] [Updated] (ARROW-8210) [C++][Dataset] Handling of duplicate columns in Dataset factory and scanning

2020-03-25 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8210: - Description: While testing duplicate column names, I ran into multiple issues:

[jira] [Updated] (ARROW-8210) [C++][Dataset] Handling of duplicate columns in Dataset factory and scanning

2020-03-25 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8210: - Summary: [C++][Dataset] Handling of duplicate columns in Dataset factory and

[jira] [Updated] (ARROW-8210) [C++][Dataset] Handling of duplicate columns in Dataset factory and scanning

2020-03-25 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8210: - Component/s: C++ > [C++][Dataset] Handling of duplicate columns in Dataset

[jira] [Updated] (ARROW-8210) [C++][Dataset] Handling of duplicate columns in Dataset factory and scanning

2020-03-25 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8210: - Component/s: C++ - Dataset > [C++][Dataset] Handling of duplicate columns in

[jira] [Created] (ARROW-8210) [C++]

2020-03-25 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-8210: Summary: [C++] Key: ARROW-8210 URL: https://issues.apache.org/jira/browse/ARROW-8210 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-8209) [Python] Accessing duplicate column of Table by name gives wrong error

2020-03-25 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-8209: Summary: [Python] Accessing duplicate column of Table by name gives wrong error Key: ARROW-8209 URL: https://issues.apache.org/jira/browse/ARROW-8209

[jira] [Comment Edited] (ARROW-2672) [Python] Build ORC extension in manylinux1 wheels

2020-03-25 Thread Leandro Ferrado (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17066723#comment-17066723 ] Leandro Ferrado edited comment on ARROW-2672 at 3/25/20, 2:46 PM: -- As I

[jira] [Commented] (ARROW-2672) [Python] Build ORC extension in manylinux1 wheels

2020-03-25 Thread Leandro Ferrado (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17066723#comment-17066723 ] Leandro Ferrado commented on ARROW-2672: As I mentioned

[jira] [Closed] (ARROW-5234) [Rust] [DataFusion] Create Python bindings for DataFusion

2020-03-25 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-5234. - Resolution: Won't Fix > [Rust] [DataFusion] Create Python bindings for DataFusion >

[jira] [Updated] (ARROW-4941) [Rust] Enhance documentation for parquet

2020-03-25 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4941?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-4941: -- Component/s: (was: Rust - DataFusion) Rust > [Rust] Enhance documentation for

[jira] [Reopened] (ARROW-5234) [Rust] [DataFusion] Create Python bindings for DataFusion

2020-03-25 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove reopened ARROW-5234: --- Assignee: Andy Grove > [Rust] [DataFusion] Create Python bindings for DataFusion >

[jira] [Closed] (ARROW-4816) [Rust] [DataFusion] Add support for repartitioning

2020-03-25 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-4816. - Resolution: Won't Fix I am closing this because I feel that it is beyond the scope of Arrow/DataFusion.

[jira] [Assigned] (ARROW-4957) [Rust] [DataFusion] Implement get_supertype correctly

2020-03-25 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4957?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove reassigned ARROW-4957: - Assignee: Andy Grove > [Rust] [DataFusion] Implement get_supertype correctly >

[jira] [Closed] (ARROW-5234) [Rust] [DataFusion] Create Python bindings for DataFusion

2020-03-25 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove closed ARROW-5234. - Resolution: Fixed I no longer see the value in adding this since there are other great options for

[jira] [Updated] (ARROW-4815) [Rust] [DataFusion] Add support for * in SQL projection

2020-03-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-4815: -- Labels: pull-request-available (was: ) > [Rust] [DataFusion] Add support for * in SQL

[jira] [Assigned] (ARROW-4815) [Rust] [DataFusion] Add support for * in SQL projection

2020-03-25 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4815?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove reassigned ARROW-4815: - Assignee: Andy Grove > [Rust] [DataFusion] Add support for * in SQL projection >

[jira] [Commented] (ARROW-8148) [Packaging][C++] Add google-cloud-cpp to conda-forge

2020-03-25 Thread Uwe Korn (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17066707#comment-17066707 ] Uwe Korn commented on ARROW-8148: - This is more than a single package, we need at least 

[jira] [Assigned] (ARROW-8148) [Packaging][C++] Add google-cloud-cpp to conda-forge

2020-03-25 Thread Uwe Korn (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Korn reassigned ARROW-8148: --- Assignee: Uwe Korn > [Packaging][C++] Add google-cloud-cpp to conda-forge >

[jira] [Assigned] (ARROW-7898) [Python] Reduce the number docstring violations using numpydoc

2020-03-25 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-7898: -- Assignee: Krisztian Szucs > [Python] Reduce the number docstring violations using

[jira] [Resolved] (ARROW-7898) [Python] Reduce the number docstring violations using numpydoc

2020-03-25 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-7898. Fix Version/s: 0.17.0 Resolution: Fixed Issue resolved by pull request 6444

[jira] [Resolved] (ARROW-8197) [Rust] DataFusion "create_physical_plan" returns incorrect schema?

2020-03-25 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove resolved ARROW-8197. --- Resolution: Fixed Issue resolved by pull request 6703 [https://github.com/apache/arrow/pull/6703] >

[jira] [Updated] (ARROW-8208) [PYTHON] Row Group Filtering With ParquetDataset

2020-03-25 Thread Christophe Clienti (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christophe Clienti updated ARROW-8208: -- Description: Hello, I tried to use the row_group filtering at the file level with an

[jira] [Updated] (ARROW-8208) [PYTHON] Row Group Filtering With ParquetDataset

2020-03-25 Thread Christophe Clienti (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christophe Clienti updated ARROW-8208: -- Description: Hello, I tried to use the row_group filtering at the file level with an

[jira] [Assigned] (ARROW-7798) [R] Refactor vector to Array conversion

2020-03-25 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-7798: - Assignee: (was: Francois Saint-Jacques) > [R] Refactor vector to

[jira] [Assigned] (ARROW-7818) [C++][Gandiva] Generate Filter kernels from gandiva code at compile time

2020-03-25 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-7818: - Assignee: (was: Francois Saint-Jacques) > [C++][Gandiva] Generate

[jira] [Resolved] (ARROW-8192) [C++] script for unpack avx512 intrinsics code

2020-03-25 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-8192. --- Fix Version/s: 0.17.0 Resolution: Fixed Issue resolved by pull request 6697

[jira] [Resolved] (ARROW-8193) [C++] arrow-future-test fails to compile on gcc 4.8

2020-03-25 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-8193. --- Fix Version/s: 0.17.0 Resolution: Fixed Issue resolved by pull request 6706

[jira] [Resolved] (ARROW-8207) [Packaging][wheel] Use LLVM 8 in manylinux2010 and manylinux2014

2020-03-25 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8207?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-8207. --- Fix Version/s: 0.17.0 Resolution: Fixed Issue resolved by pull request 6714

[jira] [Created] (ARROW-8208) [PYTHON] RowGroup filtering with ParquetDataset

2020-03-25 Thread Christophe Clienti (Jira)
Christophe Clienti created ARROW-8208: - Summary: [PYTHON] RowGroup filtering with ParquetDataset Key: ARROW-8208 URL: https://issues.apache.org/jira/browse/ARROW-8208 Project: Apache Arrow

[jira] [Updated] (ARROW-8208) [PYTHON] Row Group Filtering With ParquetDataset

2020-03-25 Thread Christophe Clienti (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Christophe Clienti updated ARROW-8208: -- Summary: [PYTHON] Row Group Filtering With ParquetDataset (was: [PYTHON] RowGroup

<    1   2