[jira] [Assigned] (ARROW-9809) [Rust] [DataFusion] logical schema = physical schema is not true

2020-09-12 Thread Jorge (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge reassigned ARROW-9809: Assignee: Jorge > [Rust] [DataFusion] logical schema = physical schema is not true >

[jira] [Updated] (ARROW-9974) t

2020-09-12 Thread Ashish Gupta (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashish Gupta updated ARROW-9974: Summary: t (was: [Python][C++] pyarrow version 1.0.1 throws Out Of Memory exception while reading

[jira] [Commented] (ARROW-8394) [JS] Typescript compiler errors for arrow d.ts files, when using es2015-esm package

2020-09-12 Thread Joao (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17194700#comment-17194700 ] Joao commented on ARROW-8394: - Hi. We are also facing issues using apache arrow with Typescript 3.9.x and

[jira] [Comment Edited] (ARROW-8394) [JS] Typescript compiler errors for arrow d.ts files, when using es2015-esm package

2020-09-12 Thread Joao (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17194700#comment-17194700 ] Joao edited comment on ARROW-8394 at 9/12/20, 10:17 AM: Hi. We are also facing

[jira] [Commented] (ARROW-9937) [Rust] [DataFusion] Average is not correct

2020-09-12 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17194706#comment-17194706 ] Andrew Lamb commented on ARROW-9937: This is bad correctness bug -- we should definitely fix this >

[jira] [Updated] (ARROW-9937) [Rust] [DataFusion] Average is not correct

2020-09-12 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb updated ARROW-9937: --- Description: The current design of aggregates makes the calculation of the average incorrect.

[jira] [Closed] (ARROW-9918) [Rust] [DataFusion] Favor "from" to create arrays for improved performance

2020-09-12 Thread Jorge (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge closed ARROW-9918. Resolution: Won't Fix While this is true, in general we should use buffers for operations, as they increase

[jira] [Updated] (ARROW-9974) [Python][C++] pyarrow version 1.0.1 throws Out Of Memory exception while reading large number of files using ParquetDataset

2020-09-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9974: Priority: Critical (was: Major) > [Python][C++] pyarrow version 1.0.1 throws Out Of Memory

[jira] [Updated] (ARROW-9974) [Python][C++] pyarrow version 1.0.1 throws Out Of Memory exception while reading large number of files using ParquetDataset

2020-09-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9974: Summary: [Python][C++] pyarrow version 1.0.1 throws Out Of Memory exception while reading large

[jira] [Updated] (ARROW-9974) [Python][C++] pyarrow version 1.0.1 throws Out Of Memory exception while reading large number of files using ParquetDataset

2020-09-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9974: Fix Version/s: 2.0.0 > [Python][C++] pyarrow version 1.0.1 throws Out Of Memory exception while >

[jira] [Updated] (ARROW-9982) IterableArrayLike should support map

2020-09-12 Thread Dominik Moritz (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9982?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dominik Moritz updated ARROW-9982: -- Priority: Minor (was: Major) > IterableArrayLike should support map >

[jira] [Created] (ARROW-9982) IterableArrayLike should support map

2020-09-12 Thread Dominik Moritz (Jira)
Dominik Moritz created ARROW-9982: - Summary: IterableArrayLike should support map Key: ARROW-9982 URL: https://issues.apache.org/jira/browse/ARROW-9982 Project: Apache Arrow Issue Type:

[jira] [Resolved] (ARROW-9980) [Rust] Fix parquet crate clippy lints

2020-09-12 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-9980. --- Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 8173

[jira] [Assigned] (ARROW-9937) [Rust] [DataFusion] Average is not correct

2020-09-12 Thread Jorge (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jorge reassigned ARROW-9937: Assignee: Jorge > [Rust] [DataFusion] Average is not correct > --

[jira] [Created] (ARROW-9978) [Rust] Umbrella issue for clippy integration

2020-09-12 Thread Neville Dipale (Jira)
Neville Dipale created ARROW-9978: - Summary: [Rust] Umbrella issue for clippy integration Key: ARROW-9978 URL: https://issues.apache.org/jira/browse/ARROW-9978 Project: Apache Arrow Issue

[jira] [Updated] (ARROW-9296) [CI][Rust] Enable more clippy lint checks

2020-09-12 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale updated ARROW-9296: -- Parent: ARROW-9978 Issue Type: Sub-task (was: Improvement) > [CI][Rust] Enable more

[jira] [Created] (ARROW-9979) [Rust] Fix arrow crate clippy lints

2020-09-12 Thread Neville Dipale (Jira)
Neville Dipale created ARROW-9979: - Summary: [Rust] Fix arrow crate clippy lints Key: ARROW-9979 URL: https://issues.apache.org/jira/browse/ARROW-9979 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-8394) [JS] Typescript compiler errors for arrow d.ts files, when using es2015-esm package

2020-09-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8394?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17194835#comment-17194835 ] Wes McKinney commented on ARROW-8394: - Don't think anyone is looking at it. > [JS] Typescript

[jira] [Updated] (ARROW-9974) t

2020-09-12 Thread Ashish Gupta (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ashish Gupta updated ARROW-9974: Attachment: legacy_true.txt legacy_false.txt > t > - > > Key:

[jira] [Updated] (ARROW-9980) [Rust] Fix parquet crate clippy lints

2020-09-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9980: -- Labels: pull-request-available (was: ) > [Rust] Fix parquet crate clippy lints >

[jira] [Resolved] (ARROW-9961) [Rust][DataFusion] to_timestamp function parses timestamp without timezone offset as UTC rather than local

2020-09-12 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove resolved ARROW-9961. --- Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 8161

[jira] [Assigned] (ARROW-9980) [Rust] Fix parquet crate clippy lints

2020-09-12 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale reassigned ARROW-9980: - Assignee: Neville Dipale > [Rust] Fix parquet crate clippy lints >

[jira] [Updated] (ARROW-9983) [C++][Dataset][Python] Use larger default batch size than 32K for Datasets API

2020-09-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9983: Labels: dataset (was: ) > [C++][Dataset][Python] Use larger default batch size than 32K for

[jira] [Updated] (ARROW-9983) [C++][Dataset][Python] Use larger default batch size than 32K for Datasets API

2020-09-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9983: Summary: [C++][Dataset][Python] Use larger default batch size than 32K for Datasets API (was:

[jira] [Commented] (ARROW-9924) [Python] Performance regression reading individual Parquet files using Dataset interface

2020-09-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17194843#comment-17194843 ] Wes McKinney commented on ARROW-9924: - I found that with a larger dataset with more columns, the

[jira] [Updated] (ARROW-9338) [Rust] Add instructions for running clippy locally

2020-09-12 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9338?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale updated ARROW-9338: -- Parent: ARROW-9978 Issue Type: Sub-task (was: Improvement) > [Rust] Add instructions

[jira] [Updated] (ARROW-9848) [Rust] Implement changes to ensure flatbuffer alignment

2020-09-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9848?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9848: -- Labels: pull-request-available (was: ) > [Rust] Implement changes to ensure flatbuffer

[jira] [Resolved] (ARROW-9979) [Rust] Fix arrow crate clippy lints

2020-09-12 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove resolved ARROW-9979. --- Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 8168

[jira] [Commented] (ARROW-9924) [Python] Performance regression reading individual Parquet files using Dataset interface

2020-09-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17194834#comment-17194834 ] Wes McKinney commented on ARROW-9924: - I took a look into this since I was curious what's wrong. So

[jira] [Assigned] (ARROW-9979) [Rust] Fix arrow crate clippy lints

2020-09-12 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale reassigned ARROW-9979: - Assignee: Neville Dipale > [Rust] Fix arrow crate clippy lints >

[jira] [Assigned] (ARROW-9979) [Rust] Fix arrow crate clippy lints

2020-09-12 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9979: Assignee: Neville Dipale (was: Apache Arrow JIRA Bot) > [Rust] Fix arrow

[jira] [Updated] (ARROW-9979) [Rust] Fix arrow crate clippy lints

2020-09-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9979: -- Labels: pull-request-available (was: ) > [Rust] Fix arrow crate clippy lints >

[jira] [Created] (ARROW-9980) [Rust] Fix parquet crate clippy lints

2020-09-12 Thread Neville Dipale (Jira)
Neville Dipale created ARROW-9980: - Summary: [Rust] Fix parquet crate clippy lints Key: ARROW-9980 URL: https://issues.apache.org/jira/browse/ARROW-9980 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-9974) t

2020-09-12 Thread Ashish Gupta (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17194764#comment-17194764 ] Ashish Gupta commented on ARROW-9974: - 1) Please find attached full traceback of both casesĀ 

[jira] [Resolved] (ARROW-9950) [Rust] [DataFusion] Allow UDF usage without registry

2020-09-12 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove resolved ARROW-9950. --- Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 8144

[jira] [Resolved] (ARROW-9954) [Rust] [DataFusion] Simplify code of aggregate planning

2020-09-12 Thread Andy Grove (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andy Grove resolved ARROW-9954. --- Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 8155

[jira] [Commented] (ARROW-9924) [Python] Performance regression reading individual Parquet files using Dataset interface

2020-09-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9924?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17194853#comment-17194853 ] Wes McKinney commented on ARROW-9924: - Think I found the problem. I expanded the chunk size to 10M so

[jira] [Created] (ARROW-9981) [Rust] Allow configuring flight IPC with IpcWriteOptions

2020-09-12 Thread Neville Dipale (Jira)
Neville Dipale created ARROW-9981: - Summary: [Rust] Allow configuring flight IPC with IpcWriteOptions Key: ARROW-9981 URL: https://issues.apache.org/jira/browse/ARROW-9981 Project: Apache Arrow

[jira] [Created] (ARROW-9983) [C++][Dataset] Use larger default batch size than 32K for Datasets API

2020-09-12 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-9983: --- Summary: [C++][Dataset] Use larger default batch size than 32K for Datasets API Key: ARROW-9983 URL: https://issues.apache.org/jira/browse/ARROW-9983 Project: Apache

[jira] [Commented] (ARROW-7288) [C++][R] read_parquet() freezes on Windows with Japanese locale

2020-09-12 Thread Hiroaki Yutani (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7288?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17194918#comment-17194918 ] Hiroaki Yutani commented on ARROW-7288: --- > To be clear, I believe the issue is in the parquet C++

[jira] [Updated] (ARROW-9984) [Rust] [DataFusion] DRY of function to string

2020-09-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9984: -- Labels: pull-request-available (was: ) > [Rust] [DataFusion] DRY of function to string >

[jira] [Assigned] (ARROW-9984) [Rust] [DataFusion] DRY of function to string

2020-09-12 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9984: Assignee: Jorge (was: Apache Arrow JIRA Bot) > [Rust] [DataFusion] DRY

[jira] [Assigned] (ARROW-9984) [Rust] [DataFusion] DRY of function to string

2020-09-12 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-9984: Assignee: Apache Arrow JIRA Bot (was: Jorge) > [Rust] [DataFusion] DRY

[jira] [Created] (ARROW-9985) [C++][Parquet] Add bitmap based validity bitmap and nested reconstruction

2020-09-12 Thread Micah Kornfield (Jira)
Micah Kornfield created ARROW-9985: -- Summary: [C++][Parquet] Add bitmap based validity bitmap and nested reconstruction Key: ARROW-9985 URL: https://issues.apache.org/jira/browse/ARROW-9985 Project:

[jira] [Updated] (ARROW-8494) [C++] Implement vectorized array reassembly logic

2020-09-12 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8494: -- Labels: pull-request-available (was: ) > [C++] Implement vectorized array reassembly logic >

[jira] [Assigned] (ARROW-8494) [C++] Implement vectorized array reassembly logic

2020-09-12 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-8494: Assignee: Apache Arrow JIRA Bot (was: Micah Kornfield) > [C++] Implement

[jira] [Assigned] (ARROW-8494) [C++] Implement vectorized array reassembly logic

2020-09-12 Thread Apache Arrow JIRA Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8494?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Apache Arrow JIRA Bot reassigned ARROW-8494: Assignee: Micah Kornfield (was: Apache Arrow JIRA Bot) > [C++] Implement

[jira] [Created] (ARROW-9984) [Rust] [DataFusion] DRY of function to string

2020-09-12 Thread Jorge (Jira)
Jorge created ARROW-9984: Summary: [Rust] [DataFusion] DRY of function to string Key: ARROW-9984 URL: https://issues.apache.org/jira/browse/ARROW-9984 Project: Apache Arrow Issue Type: Improvement

[jira] [Updated] (ARROW-8883) [Rust] [Integration Testing] Enable passing tests and update spec doc

2020-09-12 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8883?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale updated ARROW-8883: -- Summary: [Rust] [Integration Testing] Enable passing tests and update spec doc (was: [Rust]