[jira] [Resolved] (ARROW-16807) [C++] count_distinct aggregates incorrectly across row groups

2022-07-16 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-16807. -- Resolution: Fixed Issue resolved by pull request 13583 [https://github.com/apache/arrow/pull/1

[jira] [Assigned] (ARROW-16929) [C++] Remove ExecBatchIterator and usages thereof

2022-07-17 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-16929: Assignee: Wes McKinney > [C++] Remove ExecBatchIterator and usages thereof >

[jira] [Created] (ARROW-17099) [Python] pyarrow build does not support RELWITHDEBINFO build type

2022-07-17 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-17099: Summary: [Python] pyarrow build does not support RELWITHDEBINFO build type Key: ARROW-17099 URL: https://issues.apache.org/jira/browse/ARROW-17099 Project: Apache Arr

[jira] [Created] (ARROW-17100) [C++][Parquet] Fix backwards compatibility for ParquetV2 data pages written prior to 3.0.0 per ARROW-10353

2022-07-17 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-17100: Summary: [C++][Parquet] Fix backwards compatibility for ParquetV2 data pages written prior to 3.0.0 per ARROW-10353 Key: ARROW-17100 URL: https://issues.apache.org/jira/browse/ARR

[jira] [Created] (ARROW-17129) [C++][Compute] Improve memory efficiency in Grouper

2022-07-19 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-17129: Summary: [C++][Compute] Improve memory efficiency in Grouper Key: ARROW-17129 URL: https://issues.apache.org/jira/browse/ARROW-17129 Project: Apache Arrow Is

[jira] [Resolved] (ARROW-16852) [C++] Migrate SCALAR_AGGREGATE, HASH_AGGREGATE functions to use ExecSpan

2022-07-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-16852. -- Fix Version/s: 9.0.0 Resolution: Fixed Issue resolved by pull request 13630 [https://gi

[jira] [Created] (ARROW-17135) [C++] Reduce code size in arrow/compute/kernels/scalar_compare.cc

2022-07-19 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-17135: Summary: [C++] Reduce code size in arrow/compute/kernels/scalar_compare.cc Key: ARROW-17135 URL: https://issues.apache.org/jira/browse/ARROW-17135 Project: Apache Arr

[jira] [Resolved] (ARROW-17135) [C++] Reduce code size in arrow/compute/kernels/scalar_compare.cc

2022-07-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-17135. -- Fix Version/s: 9.0.0 Resolution: Fixed Issue resolved by pull request 13654 [https://gi

[jira] [Assigned] (ARROW-17213) [C++] Compute kernel change introduced test-r-linux-valgrind failure

2022-07-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-17213: Assignee: Wes McKinney > [C++] Compute kernel change introduced test-r-linux-valgrind fai

[jira] [Commented] (ARROW-17213) [C++] Compute kernel change introduced test-r-linux-valgrind failure

2022-07-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17571520#comment-17571520 ] Wes McKinney commented on ARROW-17213: -- I tried to reproduce this locally on the ma

[jira] [Updated] (ARROW-17213) [C++] Compute kernel change introduced test-r-linux-valgrind failure

2022-07-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-17213: - Fix Version/s: 9.0.0 > [C++] Compute kernel change introduced test-r-linux-valgrind failure > --

[jira] [Resolved] (ARROW-17213) [C++] Compute kernel change introduced test-r-linux-valgrind failure

2022-07-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-17213. -- Resolution: Fixed Issue resolved by pull request 13715 [https://github.com/apache/arrow/pull/1

[jira] [Commented] (ARROW-17225) [C++] Possible memory leak when registering compare functions

2022-07-27 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17571998#comment-17571998 ] Wes McKinney commented on ARROW-17225: -- This is really strange, because the line th

[jira] [Commented] (ARROW-17225) [C++] Possible memory leak when registering compare functions

2022-07-27 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17572141#comment-17572141 ] Wes McKinney commented on ARROW-17225: -- For the InputType issue, I think it's leaki

[jira] [Resolved] (ARROW-16929) [C++] Remove ExecBatchIterator and usages thereof

2022-07-29 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-16929. -- Fix Version/s: 9.0.0 Resolution: Fixed Resolved in a related PR > [C++] Remove ExecBat

[jira] [Created] (ARROW-17259) [C++] Use shared_ptr less throughout arrow/compute

2022-07-29 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-17259: Summary: [C++] Use shared_ptr less throughout arrow/compute Key: ARROW-17259 URL: https://issues.apache.org/jira/browse/ARROW-17259 Project: Apache Arrow Is

[jira] [Assigned] (ARROW-17259) [C++] Use shared_ptr less throughout arrow/compute

2022-07-29 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17259?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-17259: Assignee: Wes McKinney > [C++] Use shared_ptr less throughout arrow/compute > ---

[jira] [Created] (ARROW-17296) [Python] Doctest failure in pyarrow.parquet.read_metadata after 10.0.0 dev version update

2022-08-03 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-17296: Summary: [Python] Doctest failure in pyarrow.parquet.read_metadata after 10.0.0 dev version update Key: ARROW-17296 URL: https://issues.apache.org/jira/browse/ARROW-17296

[jira] [Resolved] (ARROW-17296) [Python] Doctest failure in pyarrow.parquet.read_metadata after 10.0.0 dev version update

2022-08-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-17296. -- Resolution: Fixed Issue resolved by pull request 13790 [https://github.com/apache/arrow/pull/1

[jira] [Assigned] (ARROW-17296) [Python] Doctest failure in pyarrow.parquet.read_metadata after 10.0.0 dev version update

2022-08-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17296?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-17296: Assignee: Wes McKinney > [Python] Doctest failure in pyarrow.parquet.read_metadata after

[jira] [Updated] (ARROW-12030) [C++] Change dataset readahead to be based on available RAM/CPU instead of fixed constants/options

2021-03-22 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-12030: - Summary: [C++] Change dataset readahead to be based on available RAM/CPU instead of fixed consta

[jira] [Commented] (ARROW-12100) [C#] Cannot round-trip record batch with PyArrow

2021-03-29 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12100?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17310949#comment-17310949 ] Wes McKinney commented on ARROW-12100: -- It seems okay to treat a null field as a le

[jira] [Commented] (ARROW-12114) [C++] Dataset to table filter expression API change

2021-04-04 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17314642#comment-17314642 ] Wes McKinney commented on ARROW-12114: -- The exception looks correct to me. It would

[jira] [Created] (ARROW-12280) [Developer] Remove @-mentions from commit messages in merge tool

2021-04-07 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-12280: Summary: [Developer] Remove @-mentions from commit messages in merge tool Key: ARROW-12280 URL: https://issues.apache.org/jira/browse/ARROW-12280 Project: Apache Arro

[jira] [Created] (ARROW-12495) [C++][Python] NumPy buffer sets is_mutable_ to true but does not set mutable_data_ when the NumPy array is writable

2021-04-21 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-12495: Summary: [C++][Python] NumPy buffer sets is_mutable_ to true but does not set mutable_data_ when the NumPy array is writable Key: ARROW-12495 URL: https://issues.apache.org/jira/b

[jira] [Updated] (ARROW-12529) [R] Writing to Parquet from tibble Consumes Large Amount of Memory

2021-04-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-12529: - Summary: [R] Writing to Parquet from tibble Consumes Large Amount of Memory (was: Writing to Pa

[jira] [Updated] (ARROW-12526) [Python] Pre-generate pyarrow.compute members

2021-04-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12526?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-12526: - Summary: [Python] Pre-generate pyarrow.compute members (was: Pre-generate pyarrow.compute mem

[jira] [Created] (ARROW-12530) [C++] Remove Buffer::mutable_data_ member and use const_cast on data_ only if is_mutable_ is true

2021-04-24 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-12530: Summary: [C++] Remove Buffer::mutable_data_ member and use const_cast on data_ only if is_mutable_ is true Key: ARROW-12530 URL: https://issues.apache.org/jira/browse/ARROW-12530

[jira] [Updated] (ARROW-12243) [C++] Datasets/Fragment/ScanOptions should be immutable

2021-04-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-12243: - Summary: [C++] Datasets/Fragment/ScanOptions should be immutable (was: Datasets/Fragment/ScanOp

[jira] [Updated] (ARROW-11172) [Python] NumPyBuffer does not set mutable_data_

2021-04-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-11172: - Fix Version/s: 5.0.0 > [Python] NumPyBuffer does not set mutable_data_ > ---

[jira] [Updated] (ARROW-12578) [JS] Simplify UTF8 handling in NodeJS

2021-04-28 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12578?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-12578: - Fix Version/s: 4.0.1 > [JS] Simplify UTF8 handling in NodeJS > -

[jira] [Commented] (ARROW-12739) [C++] Function to combine Arrays row-wise into ListArray

2021-05-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12739?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17343262#comment-17343262 ] Wes McKinney commented on ARROW-12739: -- Since naming is hard, my suggested name for

[jira] [Created] (ARROW-12849) [C++] Implement scalar kernel function that computes "isin" for each element in a List array

2021-05-21 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-12849: Summary: [C++] Implement scalar kernel function that computes "isin" for each element in a List array Key: ARROW-12849 URL: https://issues.apache.org/jira/browse/ARROW-12849

[jira] [Updated] (ARROW-2328) [C++] Writing a slice with feather ignores the offset

2021-05-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-2328: Summary: [C++] Writing a slice with feather ignores the offset (was: Writing a slice with feather

[jira] [Created] (ARROW-12884) [Flight] Data checksumming support

2021-05-26 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-12884: Summary: [Flight] Data checksumming support Key: ARROW-12884 URL: https://issues.apache.org/jira/browse/ARROW-12884 Project: Apache Arrow Issue Type: New Fea

[jira] [Commented] (ARROW-12888) Implement arrow::Table::GetSizeInBytes()

2021-05-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17352040#comment-17352040 ] Wes McKinney commented on ARROW-12888: -- What should this return? > Implement arro

[jira] [Commented] (ARROW-12890) Implement arrow::Table::clone() -> std::shared_ptr

2021-05-27 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17352596#comment-17352596 ] Wes McKinney commented on ARROW-12890: -- I think it would be better for an applicati

[jira] [Commented] (ARROW-12888) Implement arrow::Table::GetSizeInBytes()

2021-05-27 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17352599#comment-17352599 ] Wes McKinney commented on ARROW-12888: -- I'm still not clear on what the function sh

[jira] [Updated] (ARROW-12888) [C++] Implement arrow::Table::GetSizeInBytes()

2021-05-27 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-12888: - Summary: [C++] Implement arrow::Table::GetSizeInBytes() (was: Implement arrow::Table::GetSizeIn

[jira] [Updated] (ARROW-12888) [C++] Implement arrow::Table::GetSizeInBytes() that returns the sum of all Buffer sizes from the internal columns

2021-06-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-12888: - Summary: [C++] Implement arrow::Table::GetSizeInBytes() that returns the sum of all Buffer sizes

[jira] [Updated] (ARROW-12888) [C++] Implement arrow::Table::GetSizeInBytes() that returns the sum of all Buffer sizes from the internal ArrayData objects

2021-06-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-12888: - Summary: [C++] Implement arrow::Table::GetSizeInBytes() that returns the sum of all Buffer sizes

[jira] [Commented] (ARROW-12888) [C++] Implement arrow::Table::GetSizeInBytes()

2021-06-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17355138#comment-17355138 ] Wes McKinney commented on ARROW-12888: -- C++ objects take up space, so a pedantic in

[jira] [Updated] (ARROW-12888) [C++] Implement arrow::Table::GetSizeInBytes() that returns the sum of all Buffer sizes from the internal ArrayData objects

2021-06-01 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-12888: - Description: This function will return the sum of {{Buffer::size()}} as int64 for all the ArrayD

[jira] [Commented] (ARROW-1009) [C++] Create asynchronous version of StreamReader

2021-06-03 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1009?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17356978#comment-17356978 ] Wes McKinney commented on ARROW-1009: - The intent was to provide an asynchronous API

[jira] [Commented] (ARROW-16519) [C++] ASAN/UBSAN build fails linking with conda-forge clang

2022-05-14 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16519?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17537102#comment-17537102 ] Wes McKinney commented on ARROW-16519: -- On gitter I was told to install https://ana

[jira] [Created] (ARROW-16643) [C++] Fix -Werror CHECKIN build with clang-14

2022-05-24 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-16643: Summary: [C++] Fix -Werror CHECKIN build with clang-14 Key: ARROW-16643 URL: https://issues.apache.org/jira/browse/ARROW-16643 Project: Apache Arrow Issue Ty

[jira] [Assigned] (ARROW-16643) [C++] Fix -Werror CHECKIN build with clang-14

2022-05-24 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-16643: Assignee: Wes McKinney > [C++] Fix -Werror CHECKIN build with clang-14 >

[jira] [Commented] (ARROW-12626) [C++] Build doesn't find external xsimd, and then installs a bundled one into a wrong path

2022-05-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12626?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17542638#comment-17542638 ] Wes McKinney commented on ARROW-12626: -- [~assignUser] the C++ build system does not

[jira] [Updated] (ARROW-12626) [C++] Support non-BUNDLED xsimd dependency

2022-05-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-12626: - Fix Version/s: 9.0.0 > [C++] Support non-BUNDLED xsimd dependency >

[jira] [Updated] (ARROW-12626) [C++] Support non-BUNDLED xsimd dependency

2022-05-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-12626: - Summary: [C++] Support non-BUNDLED xsimd dependency (was: [C++] Build doesn't find external xsi

[jira] [Assigned] (ARROW-12626) [C++] Support non-BUNDLED xsimd dependency

2022-05-26 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-12626: Assignee: Wes McKinney > [C++] Support non-BUNDLED xsimd dependency > ---

[jira] [Created] (ARROW-16755) [C++] Improve array expression and kernel evaluation performance on small inputs

2022-06-06 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-16755: Summary: [C++] Improve array expression and kernel evaluation performance on small inputs Key: ARROW-16755 URL: https://issues.apache.org/jira/browse/ARROW-16755 Proj

[jira] [Commented] (ARROW-16562) [C++] Avoid slicing array inputs in ExecBatchIterator that would result in one slice

2022-06-06 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17550547#comment-17550547 ] Wes McKinney commented on ARROW-16562: -- As a result of work connected to ARROW-1675

[jira] [Created] (ARROW-16756) [C++] Introduce initial ArraySpan, ExecSpan non-owning / shared_ptr-free data structures for kernel execution, refactor scalar kernels

2022-06-06 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-16756: Summary: [C++] Introduce initial ArraySpan, ExecSpan non-owning / shared_ptr-free data structures for kernel execution, refactor scalar kernels Key: ARROW-16756 URL: https://issue

[jira] [Created] (ARROW-16757) [C++] Remove "scalar" output modality from array kernels

2022-06-06 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-16757: Summary: [C++] Remove "scalar" output modality from array kernels Key: ARROW-16757 URL: https://issues.apache.org/jira/browse/ARROW-16757 Project: Apache Arrow

[jira] [Created] (ARROW-16758) [C++] Rewrite ExecuteScalarExpression to not use ScalarExecutor

2022-06-06 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-16758: Summary: [C++] Rewrite ExecuteScalarExpression to not use ScalarExecutor Key: ARROW-16758 URL: https://issues.apache.org/jira/browse/ARROW-16758 Project: Apache Arrow

[jira] [Created] (ARROW-16819) [C++] arrow::compute::CallFunction needs a batch length for nullary functions

2022-06-12 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-16819: Summary: [C++] arrow::compute::CallFunction needs a batch length for nullary functions Key: ARROW-16819 URL: https://issues.apache.org/jira/browse/ARROW-16819 Project

[jira] [Updated] (ARROW-16819) [C++] arrow::compute::CallFunction needs a batch length for nullary functions

2022-06-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-16819: - Description: This is a design deficiency in {{CallFunction}}. If a function is nullary (zero inp

[jira] [Updated] (ARROW-16819) [C++] arrow::compute::CallFunction needs a batch length for nullary functions

2022-06-12 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-16819: - Description: This is a design deficiency in {{CallFunction}}. If a function is nullary (zero inp

[jira] [Created] (ARROW-16824) [C++] Migrate non-ScalarKernel implementations to use ExecSpan, ArraySpan

2022-06-13 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-16824: Summary: [C++] Migrate non-ScalarKernel implementations to use ExecSpan, ArraySpan Key: ARROW-16824 URL: https://issues.apache.org/jira/browse/ARROW-16824 Project: Ap

[jira] [Resolved] (ARROW-16756) [C++] Introduce initial ArraySpan, ExecSpan non-owning / shared_ptr-free data structures for kernel execution, refactor scalar kernels

2022-06-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-16756. -- Resolution: Fixed Issue resolved by pull request 13364 [https://github.com/apache/arrow/pull/1

[jira] [Resolved] (ARROW-16819) [C++] arrow::compute::CallFunction needs a batch length for nullary functions

2022-06-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-16819. -- Resolution: Fixed Done in https://github.com/apache/arrow/commit/53752adc6b81166cd4ee7db5a819

[jira] [Updated] (ARROW-16827) [C++] Refactor internal array sorting code to use ArraySpan

2022-06-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16827?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-16827: - Description: I won't be tackling this in ARROW-16824 since this code will require more work to p

[jira] [Created] (ARROW-16827) [C++] Refactor internal array sorting code to use ArraySpan

2022-06-13 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-16827: Summary: [C++] Refactor internal array sorting code to use ArraySpan Key: ARROW-16827 URL: https://issues.apache.org/jira/browse/ARROW-16827 Project: Apache Arrow

[jira] [Assigned] (ARROW-16590) [C++] Consolidate files dealing with row-major storage, add some helper methods

2022-06-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-16590: Assignee: Weston Pace > [C++] Consolidate files dealing with row-major storage, add some

[jira] [Resolved] (ARROW-16590) [C++] Consolidate files dealing with row-major storage, add some helper methods

2022-06-13 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16590?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-16590. -- Fix Version/s: 9.0.0 Resolution: Fixed Issue resolved by pull request 13218 [https://gi

[jira] [Created] (ARROW-16837) [C++] Investigate performance regressions observed in Unique, VisitArraySpanInline

2022-06-15 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-16837: Summary: [C++] Investigate performance regressions observed in Unique, VisitArraySpanInline Key: ARROW-16837 URL: https://issues.apache.org/jira/browse/ARROW-16837 Pr

[jira] [Created] (ARROW-16845) [C++] ArraySpan::IsNull/IsValid implementations are incorrect for union types

2022-06-16 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-16845: Summary: [C++] ArraySpan::IsNull/IsValid implementations are incorrect for union types Key: ARROW-16845 URL: https://issues.apache.org/jira/browse/ARROW-16845 Project

[jira] [Created] (ARROW-16847) [C++] Rename or fix compute/kernels/aggregate_{mode, quantile}.cc modules to actually be aggregate functions

2022-06-16 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-16847: Summary: [C++] Rename or fix compute/kernels/aggregate_{mode, quantile}.cc modules to actually be aggregate functions Key: ARROW-16847 URL: https://issues.apache.org/jira/browse/A

[jira] [Updated] (ARROW-16824) [C++] Migrate VectorKernel implementations to use ExecSpan, ArraySpan

2022-06-17 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-16824: - Summary: [C++] Migrate VectorKernel implementations to use ExecSpan, ArraySpan (was: [C++] Migr

[jira] [Updated] (ARROW-16824) [C++] Migrate VectorKernel implementations to use ExecSpan, ArraySpan

2022-06-17 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-16824: - Description: ARROW-16756 handles the scalar kernels. Migrate the rest of the kernels and remove

[jira] [Created] (ARROW-16852) [C++] Migrate SCALAR_AGGREGATE, HASH_AGGREGATE functions to use ExecSpan

2022-06-17 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-16852: Summary: [C++] Migrate SCALAR_AGGREGATE, HASH_AGGREGATE functions to use ExecSpan Key: ARROW-16852 URL: https://issues.apache.org/jira/browse/ARROW-16852 Project: Apa

[jira] [Resolved] (ARROW-16824) [C++] Migrate VectorKernel implementations to use ExecSpan, ArraySpan

2022-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-16824. -- Resolution: Fixed Issue resolved by pull request 13398 [https://github.com/apache/arrow/pull/1

[jira] [Resolved] (ARROW-16757) [C++] Remove "scalar" output modality from array kernels

2022-07-07 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16757?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-16757. -- Fix Version/s: 9.0.0 Resolution: Fixed Issue resolved by pull request 13521 [https://gi

[jira] [Updated] (ARROW-10611) [Python] Deletion of existing file when write_table fails

2021-06-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-10611: - Summary: [Python] Deletion of existing file when write_table fails (was: Deletion of existing f

[jira] [Updated] (ARROW-11197) [C++] Add support for the dictionary type in the C++ ORC writer

2021-06-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-11197: - Summary: [C++] Add support for the dictionary type in the C++ ORC writer (was: [C++]Add support

[jira] [Updated] (ARROW-9880) [Python] Lose access to indices & dictionary roundtripping DictionaryArray to parquet file

2021-06-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9880: Summary: [Python] Lose access to indices & dictionary roundtripping DictionaryArray to parquet file

[jira] [Commented] (ARROW-3978) [C++] Implement hashing, dictionary-encoding for StructArray

2021-06-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17360064#comment-17360064 ] Wes McKinney commented on ARROW-3978: - [~bkietz] [~michalno] seems like this could be

[jira] [Created] (ARROW-13021) [C++] Add/improve documentation about employing Arrow in downstream CMake projects

2021-06-09 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-13021: Summary: [C++] Add/improve documentation about employing Arrow in downstream CMake projects Key: ARROW-13021 URL: https://issues.apache.org/jira/browse/ARROW-13021 Pr

[jira] [Created] (ARROW-13023) [Go] Upgrade "text" dependency to mitigate CVE

2021-06-09 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-13023: Summary: [Go] Upgrade "text" dependency to mitigate CVE Key: ARROW-13023 URL: https://issues.apache.org/jira/browse/ARROW-13023 Project: Apache Arrow Issue T

[jira] [Updated] (ARROW-645) [Format] Mitigating the cost of random access in "wide" record batches

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-645: --- Fix Version/s: (was: 5.0.0) > [Format] Mitigating the cost of random access in "wide" record batch

[jira] [Updated] (ARROW-1013) [C++] Add asynchronous RecordBatchStreamWriter

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1013?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1013: Fix Version/s: (was: 5.0.0) > [C++] Add asynchronous RecordBatchStreamWriter >

[jira] [Updated] (ARROW-567) [C++] File and stream APIs for interacting with "large" schemas

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-567: --- Fix Version/s: (was: 5.0.0) > [C++] File and stream APIs for interacting with "large" schemas > --

[jira] [Updated] (ARROW-1009) [C++] Create asynchronous version of StreamReader

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1009: Fix Version/s: (was: 5.0.0) > [C++] Create asynchronous version of StreamReader > -

[jira] [Updated] (ARROW-981) [C++] Write comparable columnar serialization benchmarks versus Protocol Buffers / gRPC

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-981?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-981: --- Fix Version/s: (was: 5.0.0) > [C++] Write comparable columnar serialization benchmarks versus Prot

[jira] [Updated] (ARROW-1699) [C++] Forward, backward fill kernel functions

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1699: Fix Version/s: (was: 5.0.0) > [C++] Forward, backward fill kernel functions > -

[jira] [Updated] (ARROW-1761) [C++] Multi argument operator kernel behavior for decimal columns

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1761?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1761: Fix Version/s: (was: 5.0.0) > [C++] Multi argument operator kernel behavior for decimal columns

[jira] [Updated] (ARROW-2290) [C++/Python] Add ability to set codec options for lz4 codec

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-2290: Fix Version/s: (was: 5.0.0) > [C++/Python] Add ability to set codec options for lz4 codec > ---

[jira] [Updated] (ARROW-2366) [Python][C++][Parquet] Support reading Parquet files having a permutation of column order

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-2366: Fix Version/s: (was: 5.0.0) > [Python][C++][Parquet] Support reading Parquet files having a per

[jira] [Updated] (ARROW-1790) [Format] Define logical data type that represents a "packed C struct" composed from other fixed-size primitive types

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1790: Fix Version/s: (was: 5.0.0) > [Format] Define logical data type that represents a "packed C str

[jira] [Updated] (ARROW-1565) [C++] Implement TopK/BottomK streaming execution nodes

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1565: Fix Version/s: (was: 5.0.0) > [C++] Implement TopK/BottomK streaming execution nodes >

[jira] [Updated] (ARROW-1106) [C++] Native result set adapter for PostgreSQL / libpq

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1106: Fix Version/s: (was: 5.0.0) > [C++] Native result set adapter for PostgreSQL / libpq >

[jira] [Updated] (ARROW-3120) [C++] Parallelize execution of ScalarAggregateFunction

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3120: Fix Version/s: (was: 5.0.0) > [C++] Parallelize execution of ScalarAggregateFunction >

[jira] [Assigned] (ARROW-3822) [C++] parquet::arrow::FileReader::GetRecordBatchReader may not iterate through chunked columns completely

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-3822: --- Assignee: (was: Ben Kietzman) > [C++] parquet::arrow::FileReader::GetRecordBatchReader m

[jira] [Updated] (ARROW-2860) [Python][Parquet][C++] Null values in a single partition of Parquet dataset, results in invalid schema on read

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-2860: Fix Version/s: (was: 5.0.0) > [Python][Parquet][C++] Null values in a single partition of Parqu

[jira] [Updated] (ARROW-3822) [C++] parquet::arrow::FileReader::GetRecordBatchReader may not iterate through chunked columns completely

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3822: Fix Version/s: (was: 5.0.0) > [C++] parquet::arrow::FileReader::GetRecordBatchReader may not it

[jira] [Updated] (ARROW-2967) [Python] Add option to treat invalid PyObject* values as null in pyarrow.array

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-2967: Fix Version/s: (was: 5.0.0) > [Python] Add option to treat invalid PyObject* values as null in

[jira] [Updated] (ARROW-3155) [C++] Native client interface to SQL Server / TDS protocol

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3155: Fix Version/s: (was: 5.0.0) > [C++] Native client interface to SQL Server / TDS protocol >

[jira] [Updated] (ARROW-3156) [C++] Native client interface to Clickhouse

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3156: Fix Version/s: (was: 5.0.0) > [C++] Native client interface to Clickhouse > ---

[jira] [Updated] (ARROW-3102) [C++] Native Arrow interface to sqlite3

2021-06-21 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3102: Fix Version/s: (was: 5.0.0) > [C++] Native Arrow interface to sqlite3 > ---

  1   2   3   4   5   6   7   8   9   10   >