[jira] [Updated] (ARROW-8462) [Python] Crash in lib.concat_tables on Windows

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8462: Summary: [Python] Crash in lib.concat_tables on Windows (was: Crash in lib.concat_tables on

[jira] [Updated] (ARROW-8293) [Python] Run flake8 on python/examples also

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8293?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8293: Fix Version/s: 1.0.0 > [Python] Run flake8 on python/examples also

[jira] [Commented] (ARROW-8214) [C++] Flatbuffers based serialization protocol for Expressions

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116250#comment-17116250 ] Wes McKinney commented on ARROW-8214: - We will need to create a serialization scheme for general

[jira] [Closed] (ARROW-8180) [C++] Should default_memory_pool() be in arrow/type_fwd.h?

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8180?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-8180. --- Resolution: Not A Problem Closing as not a problem > [C++] Should default_memory_pool() be in

[jira] [Commented] (ARROW-8173) [C++] Validate ChunkedArray()'s arguments

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116246#comment-17116246 ] Wes McKinney commented on ARROW-8173: - {{ChunkedArray::MakeSafe}}? > [C++] Validate ChunkedArray()'s

[jira] [Commented] (ARROW-7871) [Python] Expose more compute kernels

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116245#comment-17116245 ] Wes McKinney commented on ARROW-7871: - I unassigned the issue from myself. Perhaps some others can

[jira] [Commented] (ARROW-7871) [Python] Expose more compute kernels

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7871?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116244#comment-17116244 ] Wes McKinney commented on ARROW-7871: - This is extremely easy now since functions/kernels can now be

[jira] [Assigned] (ARROW-7871) [Python] Expose more compute kernels

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-7871: --- Assignee: (was: Wes McKinney) > [Python] Expose more compute kernels

[jira] [Commented] (ARROW-7822) [C++] Allocation free error Status constants

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7822?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116243#comment-17116243 ] Wes McKinney commented on ARROW-7822: - I'm not sure that non-OK Status should ever be found on a

[jira] [Commented] (ARROW-7784) [C++] diff.cc is extremely slow to compile

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116238#comment-17116238 ] Wes McKinney commented on ARROW-7784: - "QuadraticSpaceMyersDiff" is being instantiated for every

[jira] [Commented] (ARROW-7409) [C++][Python] Windows link error LNK1104: cannot open file 'python37_d.lib'

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116234#comment-17116234 ] Wes McKinney commented on ARROW-7409: - [~rbocanegra] any update? > [C++][Python] Windows link error

[jira] [Updated] (ARROW-8939) [C++] Arrow-native C++ Data Frame-style programming interface for analytics (umbrella issue)

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8939?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8939: Summary: [C++] Arrow-native C++ Data Frame-style programming interface for analytics (umbrella

[jira] [Created] (ARROW-8939) [C++] Arrow C++ Data Frame-style programming interface for analytics (umbrella issue)

2020-05-25 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8939: --- Summary: [C++] Arrow C++ Data Frame-style programming interface for analytics (umbrella issue) Key: ARROW-8939 URL: https://issues.apache.org/jira/browse/ARROW-8939

[jira] [Assigned] (ARROW-7394) [C++][DataFrame] Implement zero-copy optimizations when performing Filter

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-7394: --- Assignee: Wes McKinney > [C++][DataFrame] Implement zero-copy optimizations when performing

[jira] [Commented] (ARROW-7245) [C++] Allow automatic String -> LargeString promotions when concatenating tables

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7245?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116231#comment-17116231 ] Wes McKinney commented on ARROW-7245: - Perhaps Concatenate can be reimplemented as a vector kernel,

[jira] [Closed] (ARROW-7316) [C++] compile error due to incomplete type for unique_ptr

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7316?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-7316. --- Resolution: Cannot Reproduce > [C++] compile error due to incomplete type for unique_ptr

[jira] [Resolved] (ARROW-7230) [C++] Use vendored std::optional instead of boost::optional in Gandiva

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-7230. - Assignee: Neal Richardson (was: Projjal Chanda) Resolution: Fixed This was done in

[jira] [Commented] (ARROW-7179) [C++][Compute] Coalesce kernel

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7179?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116227#comment-17116227 ] Wes McKinney commented on ARROW-7179: - We can implement this either as a Binary or VarArgs scalar

[jira] [Commented] (ARROW-7083) [C++] Determine the feasibility and build a prototype to replace compute/kernels with gandiva kernels

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116225#comment-17116225 ] Wes McKinney commented on ARROW-7083: - Note that we should be able to add Gandiva-generated kernels

[jira] [Commented] (ARROW-7083) [C++] Determine the feasibility and build a prototype to replace compute/kernels with gandiva kernels

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116224#comment-17116224 ] Wes McKinney commented on ARROW-7083: - I'm inclined to close this issue. After much study, I believe

[jira] [Resolved] (ARROW-7075) [C++] Boolean kernels should not allocate in Call()

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-7075. - Resolution: Fixed This was done in ARROW-8792 > [C++] Boolean kernels should not allocate in

[jira] [Comment Edited] (ARROW-7017) [C++] Refactor AddKernel to support other operations and types

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116221#comment-17116221 ] Wes McKinney edited comment on ARROW-7017 at 5/25/20, 7:56 PM: --- I think the

[jira] [Commented] (ARROW-7017) [C++] Refactor AddKernel to support other operations and types

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116221#comment-17116221 ] Wes McKinney commented on ARROW-7017: - I think the path forward here is to refactor to utilize common

[jira] [Commented] (ARROW-7012) [C++] Clarify ChunkedArray chunking strategy and policy

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116220#comment-17116220 ] Wes McKinney commented on ARROW-7012: - In general, this is not something that users should be too

[jira] [Closed] (ARROW-8905) [C++] Collapse Take APIs from 8 to 1 or 2

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-8905. --- Fix Version/s: (was: 1.0.0) Resolution: Duplicate dup of ARROW-7009 > [C++] Collapse

[jira] [Created] (ARROW-8938) [R] Provide binding and argument packing to use arrow::compute::CallFunction to use any compute kernel from R dynamically

2020-05-25 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8938: --- Summary: [R] Provide binding and argument packing to use arrow::compute::CallFunction to use any compute kernel from R dynamically Key: ARROW-8938 URL:

[jira] [Commented] (ARROW-6982) [R] Add bindings for compare and boolean kernels

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6982?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116218#comment-17116218 ] Wes McKinney commented on ARROW-6982: - Like ARROW-6978, wrapping {{CallFunction}} would allow dynamic

[jira] [Commented] (ARROW-6978) [R] Add bindings for sum and mean compute kernels

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116215#comment-17116215 ] Wes McKinney commented on ARROW-6978: - R should expose {{arrow::compute::CallFunction}} so that

[jira] [Closed] (ARROW-6959) [C++] Clarify what signatures are preferred for compute kernels

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-6959. --- Assignee: Wes McKinney Resolution: Fixed This is addressed by the new

[jira] [Closed] (ARROW-6956) [C++] Status should use unique_ptr

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6956?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-6956. --- Resolution: Won't Fix I'm not comfortable with this. I think this falls into the "if it ain't broke"

[jira] [Updated] (ARROW-6856) [C++] Use ArrayData instead of Array for ArrayData::dictionary

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6856?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6856: Fix Version/s: 1.0.0 > [C++] Use ArrayData instead of Array for ArrayData::dictionary

[jira] [Updated] (ARROW-6923) [C++] Option for Filter kernel how to handle nulls in the selection vector

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6923: Fix Version/s: 1.0.0 > [C++] Option for Filter kernel how to handle nulls in the selection vector

[jira] [Commented] (ARROW-6856) [C++] Use ArrayData instead of Array for ArrayData::dictionary

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116211#comment-17116211 ] Wes McKinney commented on ARROW-6856: - Yes. I just added to the milestone > [C++] Use ArrayData

[jira] [Closed] (ARROW-6799) [C++] Plasma JNI component links to flatbuffers::flatbuffers (unnecessarily?)

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-6799. --- Resolution: Cannot Reproduce This is no longer an issue because Flatbuffers is not in our toolchain

[jira] [Updated] (ARROW-6523) [C++][Dataset] arrow_dataset target does not depend on anything

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6523: Fix Version/s: 1.0.0 > [C++][Dataset] arrow_dataset target does not depend on anything

[jira] [Closed] (ARROW-6514) [Developer][C++][CMake] LLVM tools are restricted to the exact version 7.0

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6514?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-6514. --- Resolution: Not A Problem Closing since we've moved on from LLVM 7 > [Developer][C++][CMake] LLVM

[jira] [Updated] (ARROW-6548) [Python] consistently handle conversion of all-NaN arrays across types

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6548?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6548: Fix Version/s: 1.0.0 > [Python] consistently handle conversion of all-NaN arrays across types

[jira] [Assigned] (ARROW-6456) [C++] Possible to reduce object code generated in compute/kernels/take.cc?

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6456: --- Assignee: Wes McKinney > [C++] Possible to reduce object code generated in

[jira] [Commented] (ARROW-6456) [C++] Possible to reduce object code generated in compute/kernels/take.cc?

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6456?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116202#comment-17116202 ] Wes McKinney commented on ARROW-6456: - I will take care of this. > [C++] Possible to reduce object

[jira] [Closed] (ARROW-6261) [C++] Install any bundled components and add installed CMake or pkgconfig configuration to enable downstream linkers to utilize bundled libraries when statically linking

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-6261. --- Fix Version/s: (was: 2.0.0) Resolution: Won't Fix Closing in favor of the approach of

[jira] [Closed] (ARROW-6124) [C++] ArgSort kernel should sort in a single pass (with nulls)

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-6124. --- Fix Version/s: (was: 2.0.0) Resolution: Won't Fix Sorting on large or chunked inputs will

[jira] [Commented] (ARROW-6123) [C++] ArgSort kernel should not materialize the output internal

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116197#comment-17116197 ] Wes McKinney commented on ARROW-6123: - [~fsaintjacques] could you clarify what you mean? > [C++]

[jira] [Updated] (ARROW-6122) [C++] SortToIndices kernel must support FixedSizeBinary

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6122: Summary: [C++] SortToIndices kernel must support FixedSizeBinary (was: [C++] ArgSort kernel must

[jira] [Closed] (ARROW-5980) [Python] Missing libarrow.so and libarrow_python.so in wheel file

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-5980. --- Resolution: Not A Problem Our current wheels don't have this problem > [Python] Missing libarrow.so

[jira] [Closed] (ARROW-5916) [C++] Allow RecordBatch.length to be less than array lengths

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-5916. --- Resolution: Later We didn't reach a conclusion on this so closing for now > [C++] Allow

[jira] [Commented] (ARROW-5760) [C++] Optimize Take and Filter

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5760?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116112#comment-17116112 ] Wes McKinney commented on ARROW-5760: - I'd like to work on this next week if it's alright > [C++]

[jira] [Updated] (ARROW-5854) [Python] Expose compare kernels on Array class

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5854: Fix Version/s: (was: 2.0.0) 1.0.0 > [Python] Expose compare kernels on

[jira] [Commented] (ARROW-5854) [Python] Expose compare kernels on Array class

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116113#comment-17116113 ] Wes McKinney commented on ARROW-5854: - This should be fairly trivial now > [Python] Expose compare

[jira] [Updated] (ARROW-5760) [C++] Optimize Take and Filter

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5760: Fix Version/s: (was: 2.0.0) 1.0.0 > [C++] Optimize Take and Filter

[jira] [Commented] (ARROW-5530) [C++] Add options to ValueCount/Unique/DictEncode kernel to toggle null behavior

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116111#comment-17116111 ] Wes McKinney commented on ARROW-5530: - a HashOptions would also need to be introduced > [C++] Add

[jira] [Assigned] (ARROW-5760) [C++] Optimize Take and Filter

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-5760: --- Assignee: Wes McKinney (was: Ben Kietzman) > [C++] Optimize Take and Filter

[jira] [Resolved] (ARROW-5489) [C++] Normalize kernels and ChunkedArray behavior

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-5489. - Fix Version/s: 1.0.0 Assignee: Wes McKinney Resolution: Fixed This is done in

[jira] [Updated] (ARROW-5506) [C++] "Shredder" and "stitcher" functionality

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5506: Fix Version/s: (was: 2.0.0) > [C++] "Shredder" and "stitcher" functionality

[jira] [Closed] (ARROW-5506) [C++] "Shredder" and "stitcher" functionality

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-5506. --- Resolution: Won't Fix > [C++] "Shredder" and "stitcher" functionality

[jira] [Closed] (ARROW-5193) [C++] Linker error with bundled zlib

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-5193. --- Resolution: Fixed I believe this is fixed now > [C++] Linker error with bundled zlib

[jira] [Created] (ARROW-8933) [C++] Reduce generated code in vector_hash.cc

2020-05-25 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8933: --- Summary: [C++] Reduce generated code in vector_hash.cc Key: ARROW-8933 URL: https://issues.apache.org/jira/browse/ARROW-8933 Project: Apache Arrow Issue Type:

[jira] [Updated] (ARROW-5005) [C++] Implement support for using selection vectors in scalar aggregate function kernels

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5005: Summary: [C++] Implement support for using selection vectors in scalar aggregate function kernels

[jira] [Commented] (ARROW-5005) [C++] Implement support for using selection vectors in scalar aggregate function kernels

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116107#comment-17116107 ] Wes McKinney commented on ARROW-5005: - I believe the best approach right now is to use selection

[jira] [Comment Edited] (ARROW-5002) [C++] Implement Hash Aggregation query execution node

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116106#comment-17116106 ] Wes McKinney edited comment on ARROW-5002 at 5/25/20, 3:10 PM: --- I renamed

[jira] [Commented] (ARROW-5002) [C++] Implement Hash Aggregation query execution node

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116106#comment-17116106 ] Wes McKinney commented on ARROW-5002: - I renamed the issue. I need to be able to execute hash

[jira] [Updated] (ARROW-5002) [C++] Implement Hash Aggregation query execution node

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5002: Labels: query-engine (was: ) > [C++] Implement Hash Aggregation query execution node

[jira] [Updated] (ARROW-5002) [C++] Implement Hash Aggregation query execution node

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-5002: Summary: [C++] Implement Hash Aggregation query execution node (was: [C++] Implement GroupBy)

[jira] [Closed] (ARROW-4798) [C++] Re-enable runtime/references cpplint check

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-4798. --- Fix Version/s: (was: 2.0.0) Resolution: Won't Fix The benchmark thing is enough of a

[jira] [Updated] (ARROW-4633) [Python] ParquetFile.read(use_threads=False) creates ThreadPool anyway

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4633: Fix Version/s: 1.0.0 > [Python] ParquetFile.read(use_threads=False) creates ThreadPool anyway

[jira] [Commented] (ARROW-4530) [C++] Review Aggregate kernel state allocation/ownership semantics

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116099#comment-17116099 ] Wes McKinney commented on ARROW-4530: - You may have noticed that the aggregation API was iterated in

[jira] [Commented] (ARROW-4333) [C++] Sketch out design for kernels and "query" execution in compute layer

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116096#comment-17116096 ] Wes McKinney commented on ARROW-4333: - I partially addressed some of these questions in ARROW-8792,

[jira] [Assigned] (ARROW-4333) [C++] Sketch out design for kernels and "query" execution in compute layer

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-4333: --- Assignee: (was: Wes McKinney) > [C++] Sketch out design for kernels and "query"

[jira] [Assigned] (ARROW-4333) [C++] Sketch out design for kernels and "query" execution in compute layer

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-4333: --- Assignee: Wes McKinney > [C++] Sketch out design for kernels and "query" execution in

[jira] [Commented] (ARROW-4097) [C++] Add function to "conform" a dictionary array to a target new dictionary

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4097?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116095#comment-17116095 ] Wes McKinney commented on ARROW-4097: - This can be implemented as a ScalarFunction I think > [C++]

[jira] [Updated] (ARROW-3978) [C++] Implement hashing, dictionary-encoding for StructArray

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3978?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3978: Labels: query-engine (was: ) > [C++] Implement hashing, dictionary-encoding for StructArray

[jira] [Updated] (ARROW-3822) [C++] parquet::arrow::FileReader::GetRecordBatchReader has logical error on row groups with chunked columns

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3822: Fix Version/s: 1.0.0 > [C++] parquet::arrow::FileReader::GetRecordBatchReader has logical error on

[jira] [Created] (ARROW-8937) [C++] Add "parse_strptime" function for string to timestamp conversions using the kernels framework

2020-05-25 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8937: --- Summary: [C++] Add "parse_strptime" function for string to timestamp conversions using the kernels framework Key: ARROW-8937 URL: https://issues.apache.org/jira/browse/ARROW-8937

[jira] [Closed] (ARROW-3372) [C++] Introduce SlicedBuffer class

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3372?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-3372. --- Resolution: Won't Fix > [C++] Introduce SlicedBuffer class

[jira] [Commented] (ARROW-1846) [C++] Implement "any" reduction kernel for boolean data

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116060#comment-17116060 ] Wes McKinney commented on ARROW-1846: - With fresh eyes and ARROW-8792 in the rear view mirror, I

[jira] [Comment Edited] (ARROW-971) [C++/Python] Implement Array.isvalid/notnull/isnull as scalar functions

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116043#comment-17116043 ] Wes McKinney edited comment on ARROW-971 at 5/25/20, 2:03 PM: -- The correct

[jira] [Commented] (ARROW-1888) [C++] Implement casts from one struct type to another (with same field names and number of fields)

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116062#comment-17116062 ] Wes McKinney commented on ARROW-1888: - This should be implemented in scalar_cast_nested.cc > [C++]

[jira] [Commented] (ARROW-971) [C++/Python] Implement Array.isvalid/notnull/isnull as scalar functions

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116043#comment-17116043 ] Wes McKinney commented on ARROW-971: The correct way to implement is as

[jira] [Resolved] (ARROW-1570) [C++] Define API for creating a kernel instance from function of scalar input and output with a particular signature

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-1570. - Fix Version/s: 1.0.0 Assignee: Wes McKinney Resolution: Fixed This was basically

[jira] [Commented] (ARROW-1574) [C++] Implement kernel function that converts a dense array to dictionary given known dictionary

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1574?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116054#comment-17116054 ] Wes McKinney commented on ARROW-1574: - This would be a useful expansion of the functions in

[jira] [Commented] (ARROW-1568) [C++] Implement "drop null" kernels that return array without nulls

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1568?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116050#comment-17116050 ] Wes McKinney commented on ARROW-1568: - This can be implemented as a

[jira] [Commented] (ARROW-1761) [C++] Multi argument operator kernel behavior for decimal columns

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116057#comment-17116057 ] Wes McKinney commented on ARROW-1761: - This will need to be resolved once adding Decimal support to

[jira] [Commented] (ARROW-1699) [C++] Forward, backward fill kernel functions

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116055#comment-17116055 ] Wes McKinney commented on ARROW-1699: - These are VECTOR functions > [C++] Forward, backward fill

[jira] [Commented] (ARROW-1569) [C++] Kernel functions for determining monotonicity (ascending or descending) for well-ordered types

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116051#comment-17116051 ] Wes McKinney commented on ARROW-1569: - This can be implemented as a {{ScalarAggregateFunction}}. We

[jira] [Created] (ARROW-8936) [C++] Parallelize execution of arrow::compute::ScalarFunction

2020-05-25 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8936: --- Summary: [C++] Parallelize execution of arrow::compute::ScalarFunction Key: ARROW-8936 URL: https://issues.apache.org/jira/browse/ARROW-8936 Project: Apache Arrow

[jira] [Updated] (ARROW-3120) [C++] Parallelize execution of ScalarAggregateFunction

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3120: Description: After ARROW-8972, we have a generic chunk-based executor for aggregate functions. It

[jira] [Updated] (ARROW-3120) [C++] Parallelize execution of ScalarAggregateFunction

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3120: Summary: [C++] Parallelize execution of ScalarAggregateFunction (was: [C++] Thread-safe parallel

[jira] [Closed] (ARROW-2685) [C++] Implement kernels for in-place sorting of fixed-width contiguous arrays

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-2685. --- Fix Version/s: (was: 2.0.0) Resolution: Won't Fix Since we have SortToIndices and Arrow

[jira] [Closed] (ARROW-3079) [C++] Create initial collection of "ill-behaved CSVs" in apache/arrow-testing

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3079?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-3079. --- Resolution: Later Let's address such issues incrementally as they arise > [C++] Create initial

[jira] [Commented] (ARROW-1567) [C++] Implement "fill null" kernels that replace null values with some scalar replacement value

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116049#comment-17116049 ] Wes McKinney commented on ARROW-1567: - We can implement a new function with kernels of form

[jira] [Commented] (ARROW-1565) [C++] Implement TopK/BottomK streaming execution nodes

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116048#comment-17116048 ] Wes McKinney commented on ARROW-1565: - I reframed this issue as a query processing task. We need to

[jira] [Commented] (ARROW-1489) [C++] Add casting option to set unsafe casts to null rather than some garbage value

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1489?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116047#comment-17116047 ] Wes McKinney commented on ARROW-1489: - This might yield some code bloat but would still be useful to

[jira] [Commented] (ARROW-1329) [C++] Define "virtual table" interface

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116046#comment-17116046 ] Wes McKinney commented on ARROW-1329: - Some time has passed but I plan to make some progress on this

[jira] [Updated] (ARROW-1565) [C++] Implement TopK/BottomK streaming execution nodes

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1565: Summary: [C++] Implement TopK/BottomK streaming execution nodes (was: [C++] "argtopk" and

[jira] [Updated] (ARROW-1565) [C++] Implement TopK/BottomK streaming execution nodes

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1565: Labels: Analytics query-engine (was: Analytics) > [C++] Implement TopK/BottomK streaming

[jira] [Closed] (ARROW-1133) [C++] Convert all non-accessor function names to PascalCase

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-1133. --- Resolution: Not A Problem We can go about fixing any mis-named APIs as we run across them > [C++]

[jira] [Commented] (ARROW-8935) [Python] Add necessary plumbing to enable Numba-generated functions to be registered as functions in the global C++ function/kernels registry

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116080#comment-17116080 ] Wes McKinney commented on ARROW-8935: - cc @uwe > [Python] Add necessary plumbing to enable

[jira] [Created] (ARROW-8935) [Python] Add necessary plumbing to enable Numba-generated functions to be registered as functions in the global C++ function/kernels registry

2020-05-25 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8935: --- Summary: [Python] Add necessary plumbing to enable Numba-generated functions to be registered as functions in the global C++ function/kernels registry Key: ARROW-8935 URL:

[jira] [Commented] (ARROW-2665) [Python/C++] Add index() method to find first occurrence of Python scalar

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116079#comment-17116079 ] Wes McKinney commented on ARROW-2665: - I suggest implementing this as a short-circuiting aggregate

[jira] [Commented] (ARROW-488) [Python] Implement conversion between integer coded as floating points with NaN to an Arrow integer type

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-488?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17116039#comment-17116039 ] Wes McKinney commented on ARROW-488: This could be implemented as a standalone function in the new

[jira] [Updated] (ARROW-971) [C++/Python] Implement Array.isvalid/notnull/isnull as scalar functions

2020-05-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-971: --- Summary: [C++/Python] Implement Array.isvalid/notnull/isnull as scalar functions (was: [C++/Python]
