[jira] [Commented] (ARROW-9697) [C++][Dataset] num_rows method for Dataset/Scanner

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17176829#comment-17176829 ] Joris Van den Bossche commented on ARROW-9697: -- It might be worth adding some method for

[jira] [Updated] (ARROW-9686) [Python] Parquet table schema missing columns when created from Pandas DataFrame with List data column

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9686: - Summary: [Python] Parquet table schema missing columns when created from Pandas

[jira] [Commented] (ARROW-9687) [C++][Compute] Optionally ignore NaN in sum kernel

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9687?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17176790#comment-17176790 ] Joris Van den Bossche commented on ARROW-9687: -- Note that the discussion on ARROW-9054 is

[jira] [Commented] (ARROW-9686) [Python] Parquet table schema missing columns when created from Pandas DataFrame with List data column

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17176818#comment-17176818 ] Joris Van den Bossche commented on ARROW-9686: -- So it is the _Parquet_ schema where the

[jira] [Commented] (ARROW-9686) [Python] Parquet table schema missing columns when created from Pandas DataFrame with List data column

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9686?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17176831#comment-17176831 ] Joris Van den Bossche commented on ARROW-9686: -- The description of nested types in the

[jira] [Created] (ARROW-9725) [Rust] [DataFusion] LimitExec and SortExec should use MergeExec

2020-08-13 Thread Andy Grove (Jira)
Andy Grove created ARROW-9725: - Summary: [Rust] [DataFusion] LimitExec and SortExec should use MergeExec Key: ARROW-9725 URL: https://issues.apache.org/jira/browse/ARROW-9725 Project: Apache Arrow

[jira] [Updated] (ARROW-9725) [Rust] [DataFusion] LimitExec and SortExec should use MergeExec

2020-08-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9725: -- Labels: pull-request-available (was: ) > [Rust] [DataFusion] LimitExec and SortExec should

[jira] [Created] (ARROW-9718) [Python] Make pyarrow.parquet work with the new filesystem interfaces

2020-08-13 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-9718: Summary: [Python] Make pyarrow.parquet work with the new filesystem interfaces Key: ARROW-9718 URL: https://issues.apache.org/jira/browse/ARROW-9718

[jira] [Updated] (ARROW-9718) [Python] Make pyarrow.parquet work with the new filesystem interfaces

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9718: - Parent: ARROW-9645 Issue Type: Sub-task (was: Improvement) > [Python]

[jira] [Created] (ARROW-9721) [Packaging][Python] Update wheel dependency files

2020-08-13 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-9721: -- Summary: [Packaging][Python] Update wheel dependency files Key: ARROW-9721 URL: https://issues.apache.org/jira/browse/ARROW-9721 Project: Apache Arrow

[jira] [Resolved] (ARROW-9721) [Packaging][Python] Update wheel dependency files

2020-08-13 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-9721. Resolution: Fixed Issue resolved by pull request 7922

[jira] [Created] (ARROW-9722) [Rust]: Shorten key lifetime for reverse lookup for dictionary arrays

2020-08-13 Thread Mahmut Bulut (Jira)
Mahmut Bulut created ARROW-9722: --- Summary: [Rust]: Shorten key lifetime for reverse lookup for dictionary arrays Key: ARROW-9722 URL: https://issues.apache.org/jira/browse/ARROW-9722 Project: Apache

[jira] [Assigned] (ARROW-9698) [C++] Revert "Add -NDEBUG flag to arrow.pc"

2020-08-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-9698: - Assignee: Brian Dunlay > [C++] Revert "Add -NDEBUG flag to arrow.pc" >

[jira] [Updated] (ARROW-9698) [C++] Revert "Add -NDEBUG flag to arrow.pc"

2020-08-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9698: -- Component/s: Packaging > [C++] Revert "Add -NDEBUG flag to arrow.pc" >

[jira] [Resolved] (ARROW-9698) [C++] Revert "Add -NDEBUG flag to arrow.pc"

2020-08-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9698?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-9698. --- Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 7939

[jira] [Commented] (ARROW-9723) [C++] Expected behaviour of "mode" kernel with NaNs ?

2020-08-13 Thread Yibo Cai (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17177037#comment-17177037 ] Yibo Cai commented on ARROW-9723: - +1 for {{(mode: NaN, count: 3)}} scipy's behaviour is

[jira] [Created] (ARROW-9726) [Rust] [DataFusion] ParquetScanExec launches threads too early

2020-08-13 Thread Andy Grove (Jira)
Andy Grove created ARROW-9726: - Summary: [Rust] [DataFusion] ParquetScanExec launches threads too early Key: ARROW-9726 URL: https://issues.apache.org/jira/browse/ARROW-9726 Project: Apache Arrow

[jira] [Created] (ARROW-9724) [Flight][C++] Compilation error with protobuf 3.12.4

2020-08-13 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-9724: - Summary: [Flight][C++] Compilation error with protobuf 3.12.4 Key: ARROW-9724 URL: https://issues.apache.org/jira/browse/ARROW-9724 Project: Apache Arrow

[jira] [Closed] (ARROW-9724) [Flight][C++] Compilation error with protobuf 3.12.4

2020-08-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou closed ARROW-9724. - Resolution: Not A Bug Ok, after wiping the build dir the error went away. > [Flight][C++]

[jira] [Assigned] (ARROW-9725) [Rust] [DataFusion] LimitExec and SortExec should use MergeExec

2020-08-13 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9725: Assignee: Apache Arrow JIRA Bot (was: Andy Grove) > [Rust] [DataFusion]

[jira] [Commented] (ARROW-9697) [C++][Dataset] num_rows method for Dataset/Scanner

2020-08-13 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17177065#comment-17177065 ] Neal Richardson commented on ARROW-9697: There are a few cases I see that we could optimize like

[jira] [Resolved] (ARROW-9644) [C++][Dataset] Do not check for ignore_prefixes in the base path

2020-08-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9644?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-9644. --- Resolution: Fixed Issue resolved by pull request 7907

[jira] [Resolved] (ARROW-9665) [R] head/tail/take for Datasets

2020-08-13 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9665?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman resolved ARROW-9665. - Resolution: Fixed Issue resolved by pull request 7913

[jira] [Resolved] (ARROW-9712) [Rust] [DataFusion] ParquetScanExec panics on error

2020-08-13 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9712?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove resolved ARROW-9712. --- Resolution: Fixed Issue resolved by pull request 7947 [https://github.com/apache/arrow/pull/7947] >

[jira] [Assigned] (ARROW-6154) [Rust] Too many open files (os error 24)

2020-08-13 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove reassigned ARROW-6154: - Assignee: Andy Grove > [Rust] Too many open files (os error 24) >

[jira] [Commented] (ARROW-5756) [Python] Remove manylinux1 support

2020-08-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17177050#comment-17177050 ] Wes McKinney commented on ARROW-5756: - Have other projects begun dropping them? Are there any known

[jira] [Updated] (ARROW-9718) [Python] Make pyarrow.parquet work with the new filesystem interfaces

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9718: - Description: The place internally where the "legacy" `pyarrow.filesystem`

[jira] [Created] (ARROW-9720) [Python] Long-term fate of pyarrow.parquet.ParquetDataset

2020-08-13 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-9720: Summary: [Python] Long-term fate of pyarrow.parquet.ParquetDataset Key: ARROW-9720 URL: https://issues.apache.org/jira/browse/ARROW-9720 Project:

[jira] [Assigned] (ARROW-9556) [Python][C++] Segfaults in UnionArray with null values

2020-08-13 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-9556: -- Assignee: Krisztian Szucs (was: Wes McKinney) > [Python][C++] Segfaults in

[jira] [Updated] (ARROW-9556) [Python][C++] Segfaults in UnionArray with null values

2020-08-13 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9556: --- Fix Version/s: 2.0.0 1.0.1 > [Python][C++] Segfaults in UnionArray with

[jira] [Updated] (ARROW-9556) [Python][C++] Segfaults in UnionArray with null values

2020-08-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9556?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9556: -- Labels: pull-request-available (was: ) > [Python][C++] Segfaults in UnionArray with null

[jira] [Commented] (ARROW-9556) [Python][C++] Segfaults in UnionArray with null values

2020-08-13 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17176904#comment-17176904 ] Krisztian Szucs commented on ARROW-9556: Thanks for the detailed bug report! > [Python][C++]

[jira] [Resolved] (ARROW-9713) [Rust][DataFusion] Remove explicit panics

2020-08-13 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove resolved ARROW-9713. --- Resolution: Fixed Issue resolved by pull request 7948 [https://github.com/apache/arrow/pull/7948] >

[jira] [Updated] (ARROW-9719) [Python] Better document the new pa.fs.HadoopFileSystem

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9719: - Parent: ARROW-9645 Issue Type: Sub-task (was: Improvement) > [Python]

[jira] [Created] (ARROW-9719) [Python] Better document the new pa.fs.HadoopFileSystem

2020-08-13 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-9719: Summary: [Python] Better document the new pa.fs.HadoopFileSystem Key: ARROW-9719 URL: https://issues.apache.org/jira/browse/ARROW-9719 Project: Apache

[jira] [Assigned] (ARROW-9722) [Rust]: Shorten key lifetime for reverse lookup for dictionary arrays

2020-08-13 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9722: Assignee: Apache Arrow JIRA Bot (was: Mahmut Bulut) > [Rust]: Shorten

[jira] [Assigned] (ARROW-9722) [Rust]: Shorten key lifetime for reverse lookup for dictionary arrays

2020-08-13 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9722: Assignee: Mahmut Bulut (was: Apache Arrow JIRA Bot) > [Rust]: Shorten

[jira] [Closed] (ARROW-8066) [Python] Specify behavior for converting tz-aware datetime.datetime objects to Arrow format

2020-08-13 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8066?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs closed ARROW-8066. -- Resolution: Duplicate > [Python] Specify behavior for converting tz-aware datetime.datetime

[jira] [Created] (ARROW-9723) [C++] Expected behaviour of "mode" kernel with NaNs ?

2020-08-13 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-9723: Summary: [C++] Expected behaviour of "mode" kernel with NaNs ? Key: ARROW-9723 URL: https://issues.apache.org/jira/browse/ARROW-9723 Project: Apache

[jira] [Updated] (ARROW-9325) [C++][Dataset][Python] ParquetDataset typecast on read

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9325: - Labels: dataset (was: ) > [C++][Dataset][Python] ParquetDataset typecast on

[jira] [Resolved] (ARROW-7118) [CI] [Dev] "docker-compose run ubuntu-python" fails

2020-08-13 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-7118. Resolution: Fixed > [CI] [Dev] "docker-compose run ubuntu-python" fails >

[jira] [Updated] (ARROW-9700) [Python] create_library_symlinks doesn't work in macos

2020-08-13 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9700?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9700: --- Fix Version/s: 1.0.1 > [Python] create_library_symlinks doesn't work in macos >

[jira] [Commented] (ARROW-7118) [CI] [Dev] "docker-compose run ubuntu-python" fails

2020-08-13 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17176930#comment-17176930 ] Krisztian Szucs commented on ARROW-7118: {{archery docker run ubuntu-python}} works, so we can

[jira] [Updated] (ARROW-9722) [Rust]: Shorten key lifetime for reverse lookup for dictionary arrays

2020-08-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9722: -- Labels: pull-request-available (was: ) > [Rust]: Shorten key lifetime for reverse lookup for

[jira] [Updated] (ARROW-9723) [C++] Expected behaviour of "mode" kernel with NaNs ?

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9723?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9723: - Description: ARROW-9638 added a "mode" kernel to arrow::compute. There was some

[jira] [Updated] (ARROW-9721) [Packaging][Python] Update wheel dependency files

2020-08-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9721: -- Labels: pull-request-available (was: ) > [Packaging][Python] Update wheel dependency files >

[jira] [Assigned] (ARROW-9721) [Packaging][Python] Update wheel dependency files

2020-08-13 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs reassigned ARROW-9721: -- Assignee: Krisztian Szucs > [Packaging][Python] Update wheel dependency files >

[jira] [Commented] (ARROW-9723) [C++] Expected behaviour of "mode" kernel with NaNs ?

2020-08-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9723?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17176981#comment-17176981 ] Antoine Pitrou commented on ARROW-9723: --- The question is: which behaviour would be useful? IMHO,

[jira] [Updated] (ARROW-9325) [C++][Dataset][Python] ParquetDataset typecast on read

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9325: - Component/s: C++ > [C++][Dataset][Python] ParquetDataset typecast on read >

[jira] [Commented] (ARROW-5756) [Python] Remove manylinux1 support

2020-08-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17177124#comment-17177124 ] Antoine Pitrou commented on ARROW-5756: --- > Have other projects begun dropping them? I'm not sure.

[jira] [Commented] (ARROW-5756) [Python] Remove manylinux1 support

2020-08-13 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5756?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17177125#comment-17177125 ] Antoine Pitrou commented on ARROW-5756: --- That said, we seem to have had less trouble producing

[jira] [Created] (ARROW-9728) [Rust] [Parquet] Compute nested spacing

2020-08-13 Thread Neville Dipale (Jira)
Neville Dipale created ARROW-9728: - Summary: [Rust] [Parquet] Compute nested spacing Key: ARROW-9728 URL: https://issues.apache.org/jira/browse/ARROW-9728 Project: Apache Arrow Issue Type:

[jira] [Resolved] (ARROW-9722) [Rust]: Shorten key lifetime for reverse lookup for dictionary arrays

2020-08-13 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9722?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-9722. --- Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 7953

[jira] [Updated] (ARROW-9654) [Rust][DataFusion] Add an EXPLAIN command to the datafusion CLI

2020-08-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9654?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9654: -- Labels: pull-request-available (was: ) > [Rust][DataFusion] Add an EXPLAIN command to the

[jira] [Resolved] (ARROW-8289) [Rust] [Parquet] Implement minimal Arrow Parquet writer as starting point for full writer

2020-08-13 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-8289. --- Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 7319

[jira] [Created] (ARROW-9727) [C++] Fix crash on invalid IPC input (OSS-Fuzz)

2020-08-13 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-9727: - Summary: [C++] Fix crash on invalid IPC input (OSS-Fuzz) Key: ARROW-9727 URL: https://issues.apache.org/jira/browse/ARROW-9727 Project: Apache Arrow Issue

[jira] [Created] (ARROW-9729) Error Prone causes other annotation processors to not work with Eclipse

2020-08-13 Thread Laurent Goujon (Jira)
Laurent Goujon created ARROW-9729: - Summary: Error Prone causes other annotation processors to not work with Eclipse Key: ARROW-9729 URL: https://issues.apache.org/jira/browse/ARROW-9729 Project:

[jira] [Resolved] (ARROW-9615) [Rust] Add kernel to compute length of string array

2020-08-13 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9615?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-9615. --- Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 7876

[jira] [Updated] (ARROW-9727) [C++] Fix crash on invalid IPC input (OSS-Fuzz)

2020-08-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9727: -- Labels: pull-request-available (was: ) > [C++] Fix crash on invalid IPC input (OSS-Fuzz) >

[jira] [Assigned] (ARROW-9727) [C++] Fix crash on invalid IPC input (OSS-Fuzz)

2020-08-13 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9727: Assignee: Apache Arrow JIRA Bot (was: Antoine Pitrou) > [C++] Fix crash

[jira] [Assigned] (ARROW-9727) [C++] Fix crash on invalid IPC input (OSS-Fuzz)

2020-08-13 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9727: Assignee: Antoine Pitrou (was: Apache Arrow JIRA Bot) > [C++] Fix crash

[jira] [Updated] (ARROW-9729) Error Prone causes other annotation processors to not work with Eclipse

2020-08-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9729: -- Labels: pull-request-available (was: ) > Error Prone causes other annotation processors to

[jira] [Assigned] (ARROW-9729) Error Prone causes other annotation processors to not work with Eclipse

2020-08-13 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9729: Assignee: Apache Arrow JIRA Bot (was: Laurent Goujon) > Error Prone

[jira] [Assigned] (ARROW-9729) Error Prone causes other annotation processors to not work with Eclipse

2020-08-13 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9729?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9729: Assignee: Laurent Goujon (was: Apache Arrow JIRA Bot) > Error Prone

[jira] [Updated] (ARROW-9730) [C++][Dataset] Parsing statistics of Parquet FileMetadata is expensive

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9730?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9730: - Description: >From a discussion in dask

[jira] [Created] (ARROW-9730) [C++][Dataset] Parsing statistics of Parquet FileMetadata is expensive

2020-08-13 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-9730: Summary: [C++][Dataset] Parsing statistics of Parquet FileMetadata is expensive Key: ARROW-9730 URL: https://issues.apache.org/jira/browse/ARROW-9730

[jira] [Commented] (ARROW-9471) [C++] Scan Dataset in reverse

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9471?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17177252#comment-17177252 ] Joris Van den Bossche commented on ARROW-9471: -- Another use case for a "reverse scan" is an

[jira] [Created] (ARROW-9732) [Rust] [DataFusion] Add "Physical Planner" type thing which can do optimizations

2020-08-13 Thread Andrew Lamb (Jira)
Andrew Lamb created ARROW-9732: -- Summary: [Rust] [DataFusion] Add "Physical Planner" type thing which can do optimizations Key: ARROW-9732 URL: https://issues.apache.org/jira/browse/ARROW-9732 Project:

[jira] [Assigned] (ARROW-9651) [C++][Dataset] Debug segfault in dataset writing on 32-bit mingw (RTools 35)

2020-08-13 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9651: Assignee: Apache Arrow JIRA Bot (was: Ben Kietzman) > [C++][Dataset]

[jira] [Assigned] (ARROW-9651) [C++][Dataset] Debug segfault in dataset writing on 32-bit mingw (RTools 35)

2020-08-13 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9651: Assignee: Ben Kietzman (was: Apache Arrow JIRA Bot) > [C++][Dataset]

[jira] [Updated] (ARROW-9651) [C++][Dataset] Debug segfault in dataset writing on 32-bit mingw (RTools 35)

2020-08-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9651: -- Labels: dataset pull-request-available (was: dataset) > [C++][Dataset] Debug segfault in

[jira] [Created] (ARROW-9731) [C++][Dataset] Port "head" method from R to C++ Dataset Scanner

2020-08-13 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-9731: Summary: [C++][Dataset] Port "head" method from R to C++ Dataset Scanner Key: ARROW-9731 URL: https://issues.apache.org/jira/browse/ARROW-9731

[jira] [Created] (ARROW-9733) [Rust][DataFusion] Aggregates COUNT/MIN/MAX don't work on VARCHAR columns

2020-08-13 Thread Andrew Lamb (Jira)
Andrew Lamb created ARROW-9733: -- Summary: [Rust][DataFusion] Aggregates COUNT/MIN/MAX don't work on VARCHAR columns Key: ARROW-9733 URL: https://issues.apache.org/jira/browse/ARROW-9733 Project: Apache

[jira] [Resolved] (ARROW-3764) [C++] Port Python "ParquetDataset" business logic to C++

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-3764. -- Resolution: Fixed > [C++] Port Python "ParquetDataset" business logic to C++ >

[jira] [Commented] (ARROW-9697) [C++][Dataset] num_rows method for Dataset/Scanner

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9697?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17177312#comment-17177312 ] Joris Van den Bossche commented on ARROW-9697: -- Ah, yes, I didn't consider the "filtered

[jira] [Assigned] (ARROW-9651) [C++][Dataset] Debug segfault in dataset writing on 32-bit mingw (RTools 35)

2020-08-13 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9651?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-9651: -- Assignee: Neal Richardson (was: Ben Kietzman) > [C++][Dataset] Debug segfault in

[jira] [Updated] (ARROW-3764) [C++] Port Python "ParquetDataset" business logic to C++

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-3764: - Fix Version/s: (was: 2.0.0) 1.0.0 > [C++] Port Python

[jira] [Commented] (ARROW-3764) [C++] Port Python "ParquetDataset" business logic to C++

2020-08-13 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17177262#comment-17177262 ] Joris Van den Bossche commented on ARROW-3764: -- Since the actual issue (porting the logic to

[jira] [Updated] (ARROW-9733) [Rust][DataFusion] Aggregates COUNT/MIN/MAX don't work on VARCHAR columns

2020-08-13 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove updated ARROW-9733: -- Component/s: Rust - DataFusion Rust > [Rust][DataFusion] Aggregates COUNT/MIN/MAX

[jira] [Assigned] (ARROW-9734) [Rust] [DataFusion] TableProvider.scan executing partitions prematurely

2020-08-13 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9734: Assignee: Andy Grove (was: Apache Arrow JIRA Bot) > [Rust] [DataFusion]

[jira] [Assigned] (ARROW-9734) [Rust] [DataFusion] TableProvider.scan executing partitions prematurely

2020-08-13 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9734: Assignee: Apache Arrow JIRA Bot (was: Andy Grove) > [Rust] [DataFusion]

[jira] [Resolved] (ARROW-9714) [Rust] [DataFusion] TypeCoercionRule not implemented for Limit or Sort

2020-08-13 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove resolved ARROW-9714. --- Resolution: Fixed Issue resolved by pull request 7949 [https://github.com/apache/arrow/pull/7949] >

[jira] [Updated] (ARROW-9734) [Rust] [DataFusion] TableProvider.scan executing partitions prematurely

2020-08-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9734: -- Labels: pull-request-available (was: ) > [Rust] [DataFusion] TableProvider.scan executing

[jira] [Created] (ARROW-9734) [Rust] [DataFusion] TableProvider.scan executing partitions prematurely

2020-08-13 Thread Andy Grove (Jira)
Andy Grove created ARROW-9734: - Summary: [Rust] [DataFusion] TableProvider.scan executing partitions prematurely Key: ARROW-9734 URL: https://issues.apache.org/jira/browse/ARROW-9734 Project: Apache

[jira] [Resolved] (ARROW-9693) [CI][Docs] Nightly docs build fails

2020-08-13 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9693?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-9693. - Resolution: Fixed Issue resolved by pull request 7941

[jira] [Assigned] (ARROW-9726) [Rust] [DataFusion] ParquetScanExec launches threads too early

2020-08-13 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9726: Assignee: Andy Grove (was: Apache Arrow JIRA Bot) > [Rust] [DataFusion]

[jira] [Resolved] (ARROW-9727) [C++] Fix crash on invalid IPC input (OSS-Fuzz)

2020-08-13 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-9727. - Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 7956

[jira] [Commented] (ARROW-9732) [Rust] [DataFusion] Add "Physical Planner" type thing which can do optimizations

2020-08-13 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17177341#comment-17177341 ] Andy Grove commented on ARROW-9732: --- See https://issues.apache.org/jira/browse/ARROW-9464 as well >

[jira] [Resolved] (ARROW-9725) [Rust] [DataFusion] LimitExec and SortExec should use MergeExec

2020-08-13 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9725?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove resolved ARROW-9725. --- Resolution: Fixed Issue resolved by pull request 7958 [https://github.com/apache/arrow/pull/7958] >

[jira] [Commented] (ARROW-9676) [R] Error converting Table with nested structs

2020-08-13 Thread Nick DiQuattro (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17177377#comment-17177377 ] Nick DiQuattro commented on ARROW-9676: --- Makes sense about converting to data.frame not being the

[jira] [Commented] (ARROW-9676) [R] Error converting Table with nested structs

2020-08-13 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9676?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17177386#comment-17177386 ] Neal Richardson commented on ARROW-9676: Thanks for trying! That's a shame, but it's still useful

[jira] [Updated] (ARROW-9736) Ruby gem no doc tweak

2020-08-13 Thread Pratik Raj (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pratik Raj updated ARROW-9736: -- Description: Optimization tweak for ruby gems no doc while gems installation using "-no-document" .

[jira] [Created] (ARROW-9736) Ruby gem no doc tweak

2020-08-13 Thread Pratik Raj (Jira)
Pratik Raj created ARROW-9736: - Summary: Ruby gem no doc tweak Key: ARROW-9736 URL: https://issues.apache.org/jira/browse/ARROW-9736 Project: Apache Arrow Issue Type: Improvement

[jira] [Updated] (ARROW-9736) Ruby gem no doc tweak

2020-08-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9736: -- Labels: pull-request-available (was: ) > Ruby gem no doc tweak > - > >

[jira] [Resolved] (ARROW-9706) [Java] Tests in TestLargeListVector fails on big endian platform

2020-08-13 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micah Kornfield resolved ARROW-9706. Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 7943

[jira] [Assigned] (ARROW-9681) [Java] Failed Arrow Memory - Core on big-endian platform

2020-08-13 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micah Kornfield reassigned ARROW-9681: -- Assignee: Kazuaki Ishizaki > [Java] Failed Arrow Memory - Core on big-endian platform

[jira] [Resolved] (ARROW-9681) [Java] Failed Arrow Memory - Core on big-endian platform

2020-08-13 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micah Kornfield resolved ARROW-9681. Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 7923

[jira] [Closed] (ARROW-9736) [Ruby] Don't install document by gem

2020-08-13 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou closed ARROW-9736. --- Resolution: Won't Do > [Ruby] Don't install document by gem > >

[jira] [Updated] (ARROW-9736) [Ruby] Don't install document by gem

2020-08-13 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-9736: Summary: [Ruby] Don't install document by gem (was: Ruby gem no doc tweak) > [Ruby] Don't install

[jira] [Updated] (ARROW-9736) [Ruby] Don't install document by gem

2020-08-13 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-9736: Component/s: Ruby > [Ruby] Don't install document by gem > >

  1   2   >