[jira] [Updated] (ARROW-6839) [Java] Add APIs to read and write "custom_metadata" field of IPC file footer

2020-06-15 Thread Ji Liu (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ji Liu updated ARROW-6839: -- Fix Version/s: 1.0.0 > [Java] Add APIs to read and write "custom_metadata" field of IPC file footer > -

[jira] [Resolved] (ARROW-6839) [Java] Add APIs to read and write "custom_metadata" field of IPC file footer

2020-06-15 Thread Ji Liu (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ji Liu resolved ARROW-6839. --- Resolution: Fixed Issue resolved by pull request 7231: [https://github.com/apache/arrow/pull/7231] > [Java]

[jira] [Commented] (ARROW-2882) [C++][Python] Support AWS Firehose partition_scheme implementation for Parquet datasets

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17136341#comment-17136341 ] Joris Van den Bossche commented on ARROW-2882: -- Let's declare this as resolv

[jira] [Resolved] (ARROW-2882) [C++][Python] Support AWS Firehose partition_scheme implementation for Parquet datasets

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-2882. -- Fix Version/s: (was: 2.0.0) 0.17.0 Resolution: Fix

[jira] [Created] (ARROW-9142) [C++] random::RandomArrayGenerator::Boolean "probability" misdocumented / incorrect

2020-06-15 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-9142: --- Summary: [C++] random::RandomArrayGenerator::Boolean "probability" misdocumented / incorrect Key: ARROW-9142 URL: https://issues.apache.org/jira/browse/ARROW-9142 Proje

[jira] [Created] (ARROW-9141) [R] Update cross-package documentation links

2020-06-15 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-9141: -- Summary: [R] Update cross-package documentation links Key: ARROW-9141 URL: https://issues.apache.org/jira/browse/ARROW-9141 Project: Apache Arrow Issue T

[jira] [Commented] (ARROW-9117) [Python] Is there Pandas circular dependency problem?

2020-06-15 Thread SEUNGMIN HEO (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17136142#comment-17136142 ] SEUNGMIN HEO commented on ARROW-9117: - [~jorisvandenbossche]  Hi :) I solved this p

[jira] [Updated] (ARROW-3446) [R] Document mapping of Arrow <-> R types

2020-06-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-3446: -- Labels: pull-request-available (was: ) > [R] Document mapping of Arrow <-> R types > -

[jira] [Updated] (ARROW-8631) [C++][Dataset] Add ConvertOptions and ReadOptions to CsvFileFormat

2020-06-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8631: -- Labels: dataset pull-request-available (was: dataset) > [C++][Dataset] Add ConvertOptions and

[jira] [Assigned] (ARROW-8769) [C++] Add convenience methods to access fields by name in StructScalar

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman reassigned ARROW-8769: --- Assignee: Ben Kietzman > [C++] Add convenience methods to access fields by name in StructSca

[jira] [Commented] (ARROW-9140) [R] Zero-copy Arrow to R where possible

2020-06-15 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9140?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17136114#comment-17136114 ] Francois Saint-Jacques commented on ARROW-9140: --- That's already mentioned i

[jira] [Updated] (ARROW-9081) [C++] Upgrade default LLVM version to 10

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-9081: Fix Version/s: (was: 1.0.0) 2.0.0 > [C++] Upgrade default LLVM version to 10

[jira] [Updated] (ARROW-4309) [Documentation] Add a docker-compose entry which builds the documentation with CUDA enabled

2020-06-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4309?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-4309: -- Labels: docker pull-request-available (was: docker) > [Documentation] Add a docker-compose ent

[jira] [Updated] (ARROW-9105) [C++] ParquetFileFragment scanning doesn't handle filter on partition field

2020-06-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9105: -- Labels: dataset dataset-dask-integration pull-request-available (was: dataset dataset-dask-int

[jira] [Updated] (ARROW-7798) [R] Refactor R <-> Array conversion

2020-06-15 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-7798: -- Description: There's a bit of technical debt accumulated in array_to_vector and

[jira] [Created] (ARROW-9140) [R] Zero-copy Arrow to R where possible

2020-06-15 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-9140: -- Summary: [R] Zero-copy Arrow to R where possible Key: ARROW-9140 URL: https://issues.apache.org/jira/browse/ARROW-9140 Project: Apache Arrow Issue Type:

[jira] [Updated] (ARROW-8943) [C++] Add support for Partitioning to ParquetDatasetFactory

2020-06-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8943?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8943: -- Labels: dataset dataset-dask-integration pull-request-available (was: dataset dataset-dask-int

[jira] [Assigned] (ARROW-9009) [C++][Dataset] ARROW:schema should be removed from schema's metadata when reading Parquet files

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-9009: --- Assignee: Wes McKinney > [C++][Dataset] ARROW:schema should be removed from schema's metadat

[jira] [Updated] (ARROW-9105) [C++] ParquetFileFragment scanning doesn't handle filter on partition field

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-9105: Description: When splitting a fragment into row group fragments, filtering on the partition field

[jira] [Updated] (ARROW-9094) [Python] Bump versions of compiled dependencies in manylinux wheels

2020-06-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9094: -- Labels: pull-request-available (was: ) > [Python] Bump versions of compiled dependencies in ma

[jira] [Updated] (ARROW-8631) [C++][Dataset] Add ConvertOptions and ReadOptions to CsvFileFormat

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-8631: --- Fix Version/s: (was: 1.0.0) 2.0.0 > [C++][Dataset] Add ConvertOptions

[jira] [Commented] (ARROW-9081) [C++] Upgrade default LLVM version to 10

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17136073#comment-17136073 ] Neal Richardson commented on ARROW-9081: Does this need to happen for 1.0? Seems

[jira] [Updated] (ARROW-8613) [C++][Dataset] Raise error for unparsable partition value

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-8613: --- Fix Version/s: (was: 1.0.0) 2.0.0 > [C++][Dataset] Raise error for unp

[jira] [Commented] (ARROW-8651) [Python][Dataset] Support pickling of Dataset objects

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17136071#comment-17136071 ] Neal Richardson commented on ARROW-8651: [~jorisvandenbossche] is this required f

[jira] [Assigned] (ARROW-9105) [C++] ParquetFileFragment scanning doesn't handle filter on partition field

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman reassigned ARROW-9105: --- Assignee: Ben Kietzman > [C++] ParquetFileFragment scanning doesn't handle filter on partiti

[jira] [Updated] (ARROW-9009) [C++][Dataset] ARROW:schema should be removed from schema's metadata when reading Parquet files

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-9009: --- Fix Version/s: (was: 1.0.0) 2.0.0 > [C++][Dataset] ARROW:schema should

[jira] [Created] (ARROW-9139) [Python] parquet read_table should not use_legacy_dataset

2020-06-15 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-9139: -- Summary: [Python] parquet read_table should not use_legacy_dataset Key: ARROW-9139 URL: https://issues.apache.org/jira/browse/ARROW-9139 Project: Apache Arrow

[jira] [Updated] (ARROW-9139) [Python] parquet read_table should not use_legacy_dataset

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-9139: --- Labels: dataset-parquet-read parquet (was: parquet) > [Python] parquet read_table should not

[jira] [Updated] (ARROW-3764) [C++] Port Python "ParquetDataset" business logic to C++

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-3764: --- Fix Version/s: (was: 1.0.0) 2.0.0 > [C++] Port Python "ParquetDataset"

[jira] [Assigned] (ARROW-8779) [R] Implement conversion to List

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-8779: -- Assignee: Neal Richardson (was: Ben Kietzman) > [R] Implement conversion to List > --

[jira] [Assigned] (ARROW-8779) [R] Implement conversion to List

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-8779: -- Assignee: Romain Francois (was: Neal Richardson) > [R] Implement conversion to List >

[jira] [Updated] (ARROW-9078) [C++] Parquet writing of extension type with nested storage type fails

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-9078: --- Fix Version/s: (was: 1.0.0) 2.0.0 > [C++] Parquet writing of extension

[jira] [Updated] (ARROW-8977) [R] Table$create with schema crashes with some dictionary index types

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-8977: --- Summary: [R] Table$create with schema crashes with some dictionary index types (was: [R] Tab

[jira] [Assigned] (ARROW-8977) [R] Table$create with schema crashes with some dictionary index types

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-8977: -- Assignee: Romain Francois > [R] Table$create with schema crashes with some dictionary

[jira] [Updated] (ARROW-6075) [FlightRPC] Handle uncaught exceptions in middleware

2020-06-15 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-6075: Fix Version/s: (was: 1.0.0) 2.0.0 > [FlightRPC] Handle uncaught exceptions in middle

[jira] [Updated] (ARROW-7579) [FlightRPC] Make Handshake optional

2020-06-15 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-7579: Fix Version/s: (was: 1.0.0) 2.0.0 > [FlightRPC] Make Handshake optional > --

[jira] [Assigned] (ARROW-7068) [C++] Expose the offsets of a ListArray as a Int32Array

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7068?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-7068: --- Assignee: (was: Wes McKinney) > [C++] Expose the offsets of a ListArray as a Int32Array

[jira] [Resolved] (ARROW-6456) [C++] Possible to reduce object code generated in compute/kernels/take.cc?

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6456. - Fix Version/s: 1.0.0 Resolution: Fixed This was done in ARROW-5760 > [C++] Possible to re

[jira] [Assigned] (ARROW-9118) [C++] Add more general BoundsCheck function that also checks for arbitrary lower limits in integer arrays

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-9118: --- Assignee: (was: Wes McKinney) > [C++] Add more general BoundsCheck function that also ch

[jira] [Assigned] (ARROW-9128) [C++] Implement string space trimming kernels: trim, ltrim, and rtrim

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-9128: --- Assignee: Wes McKinney > [C++] Implement string space trimming kernels: trim, ltrim, and rtr

[jira] [Assigned] (ARROW-8989) [C++] Document available functions in compute::FunctionRegistry

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8989: --- Assignee: Wes McKinney > [C++] Document available functions in compute::FunctionRegistry > -

[jira] [Updated] (ARROW-8961) [C++] Vendor utf8proc library

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8961: Fix Version/s: (was: 1.0.0) > [C++] Vendor utf8proc library > - > >

[jira] [Assigned] (ARROW-8969) [C++] Reduce generated code in compute/kernels/scalar_compare.cc

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8969: --- Assignee: Wes McKinney > [C++] Reduce generated code in compute/kernels/scalar_compare.cc >

[jira] [Assigned] (ARROW-8991) [C++][Compute] Add scalar_hash function

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8991: --- Assignee: Wes McKinney > [C++][Compute] Add scalar_hash function > -

[jira] [Updated] (ARROW-9003) [C++] Add VectorFunction wrapping arrow::Concatenate

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9003: Fix Version/s: (was: 1.0.0) > [C++] Add VectorFunction wrapping arrow::Concatenate > --

[jira] [Updated] (ARROW-8919) [C++] Add "DispatchBest" APIs to compute::Function that selects a kernel that may require implicit casts to invoke

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8919: Fix Version/s: (was: 1.0.0) 2.0.0 > [C++] Add "DispatchBest" APIs to compute

[jira] [Updated] (ARROW-8928) [C++] Measure microperformance associated with data structure access interactions with arrow::compute::ExecBatch

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8928: Fix Version/s: (was: 1.0.0) > [C++] Measure microperformance associated with data structure acc

[jira] [Assigned] (ARROW-8936) [C++] Parallelize execution of arrow::compute::ScalarFunction

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8936: --- Assignee: Wes McKinney > [C++] Parallelize execution of arrow::compute::ScalarFunction > ---

[jira] [Assigned] (ARROW-8933) [C++] Reduce generated code in vector_hash.cc

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8933: --- Assignee: Wes McKinney > [C++] Reduce generated code in vector_hash.cc > ---

[jira] [Assigned] (ARROW-8921) [C++] Add "TypeResolver" class interface to replace current OutputType::Resolver pattern

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8921: --- Assignee: Wes McKinney > [C++] Add "TypeResolver" class interface to replace current > Outp

[jira] [Commented] (ARROW-8895) [C++] Add C++ unit tests for filter and take functions on temporal type inputs, including timestamps

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17136016#comment-17136016 ] Wes McKinney commented on ARROW-8895: - I'll add these types to the random data test b

[jira] [Assigned] (ARROW-8863) [C++] Array subclass constructors must set ArrayData::null_count to 0 when there is no validity bitmap

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8863: --- Assignee: Wes McKinney > [C++] Array subclass constructors must set ArrayData::null_count to

[jira] [Assigned] (ARROW-8895) [C++] Add C++ unit tests for filter and take functions on temporal type inputs, including timestamps

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8895: --- Assignee: Wes McKinney > [C++] Add C++ unit tests for filter and take functions on temporal

[jira] [Commented] (ARROW-8769) [C++] Add convenience methods to access fields by name in StructScalar

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17136015#comment-17136015 ] Wes McKinney commented on ARROW-8769: - This should have corresponding work done in bo

[jira] [Updated] (ARROW-8769) [C++] Add convenience methods to access fields by name in StructScalar

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8769: Priority: Critical (was: Major) > [C++] Add convenience methods to access fields by name in Struct

[jira] [Assigned] (ARROW-8762) [C++][Gandiva] Replace Gandiva's BitmapAnd with common implementation

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8762: --- Assignee: Wes McKinney > [C++][Gandiva] Replace Gandiva's BitmapAnd with common implementati

[jira] [Commented] (ARROW-8762) [C++][Gandiva] Replace Gandiva's BitmapAnd with common implementation

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17136013#comment-17136013 ] Wes McKinney commented on ARROW-8762: - I see, thanks. If this optimization is benefic

[jira] [Updated] (ARROW-8749) [C++] IpcFormatWriter writes dictionary batches with wrong ID

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8749: Fix Version/s: (was: 1.0.0) 2.0.0 > [C++] IpcFormatWriter writes dictionary

[jira] [Updated] (ARROW-8667) [C++] Add multi-consumer Scheduler API to sit one layer above ThreadPool

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8667: Fix Version/s: (was: 1.0.0) 2.0.0 > [C++] Add multi-consumer Scheduler API t

[jira] [Updated] (ARROW-8618) [C++] ASSIGN_OR_RAISE should move its argument

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8618: Fix Version/s: (was: 1.0.0) 2.0.0 > [C++] ASSIGN_OR_RAISE should move its ar

[jira] [Updated] (ARROW-7586) [C++][Dataset] Read feather 1.0 files

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-7586: Fix Version/s: (was: 1.0.0) 2.0.0 > [C++][Dataset] Read feather 1.0 files >

[jira] [Updated] (ARROW-7586) [C++][Dataset] Read feather 1.0 files

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-7586: Summary: [C++][Dataset] Read feather 1.0 files (was: [C++][Dataset] Read feather files) > [C++][D

[jira] [Commented] (ARROW-8152) [C++] IO: split large coalesced reads into smaller ones

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17136009#comment-17136009 ] Wes McKinney commented on ARROW-8152: - I removed any milestone. This can be pursued a

[jira] [Updated] (ARROW-8152) [C++] IO: split large coalesced reads into smaller ones

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8152: Fix Version/s: (was: 1.0.0) > [C++] IO: split large coalesced reads into smaller ones > ---

[jira] [Updated] (ARROW-7855) [Python] Always raise ArrowTypeError instead of TypeError inside pyarrow.array

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7855: Fix Version/s: (was: 1.0.0) 2.0.0 > [Python] Always raise ArrowTypeError ins

[jira] [Assigned] (ARROW-7925) [C++][Documentation] Instructions about running IWYU and other tasks in cpp/development.rst have gone stale

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-7925: --- Assignee: Wes McKinney > [C++][Documentation] Instructions about running IWYU and other task

[jira] [Updated] (ARROW-7853) [CI][Packaging] Add nightly test that pip-installs nightly wheels

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7853: Fix Version/s: (was: 1.0.0) 2.0.0 > [CI][Packaging] Add nightly test that pi

[jira] [Updated] (ARROW-8655) [C++][Dataset][Python][R] Preserve partitioning information for a discovered Dataset

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8655: - Fix Version/s: (was: 1.0.0) 2.0.0 > [C++][Dataset][Python]

[jira] [Commented] (ARROW-9135) [C++][Compute] Provide a kernel property testing API

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17136006#comment-17136006 ] Ben Kietzman commented on ARROW-9135: - That's accurate; currently other random tests

[jira] [Updated] (ARROW-7218) [Python] Conversion from boolean numpy scalars not working

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7218: Fix Version/s: (was: 1.0.0) 2.0.0 > [Python] Conversion from boolean numpy s

[jira] [Updated] (ARROW-6979) [R] Enable jemalloc in autobrew formula

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6979: Priority: Blocker (was: Major) > [R] Enable jemalloc in autobrew formula > ---

[jira] [Commented] (ARROW-6979) [R] Enable jemalloc in autobrew formula

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17136005#comment-17136005 ] Wes McKinney commented on ARROW-6979: - I'd like to push to make this happen for the r

[jira] [Resolved] (ARROW-9067) [C++] Create reusable branchless / vectorized index boundschecking functions

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-9067. - Resolution: Fixed This was done in ARROW-5760 > [C++] Create reusable branchless / vectorized in

[jira] [Assigned] (ARROW-4633) [Python] ParquetFile.read(use_threads=False) creates ThreadPool anyway

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-4633: --- Assignee: (was: Wes McKinney) > [Python] ParquetFile.read(use_threads=False) creates Thr

[jira] [Assigned] (ARROW-6235) [R] Conversion from arrow::BinaryArray to R character vector not implemented

2020-06-15 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-6235: - Assignee: Romain Francois (was: Francois Saint-Jacques) > [R] Conversio

[jira] [Resolved] (ARROW-5158) [Packaging][Wheel] Symlink libraries in wheels

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-5158. - Fix Version/s: 1.0.0 Resolution: Fixed This was resolved by ARROW-5082 > [Packaging][Whee

[jira] [Resolved] (ARROW-9124) [Rust][Datafusion] DFParser should consume sql query as &str instead of String

2020-06-15 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-9124. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7428 [https://gi

[jira] [Updated] (ARROW-7394) [C++][DataFrame] Implement zero-copy optimizations when performing Filter

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7394: Fix Version/s: 2.0.0 > [C++][DataFrame] Implement zero-copy optimizations when performing Filter >

[jira] [Updated] (ARROW-8001) [C++][Dataset] R and Python bindings for dataset writing

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8001: - Fix Version/s: (was: 1.0.0) 2.0.0 > [C++][Dataset] R and P

[jira] [Updated] (ARROW-9044) [Go][Packaging] Revisit the license file attachment to the go packages

2020-06-15 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9044: --- Priority: Minor (was: Major) > [Go][Packaging] Revisit the license file attachment to the go

[jira] [Created] (ARROW-9138) [Docs][Format] Make sure format version is hard coded in the docs

2020-06-15 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-9138: -- Summary: [Docs][Format] Make sure format version is hard coded in the docs Key: ARROW-9138 URL: https://issues.apache.org/jira/browse/ARROW-9138 Project: Apache A

[jira] [Resolved] (ARROW-9005) [Rust] [DataFusion] Support sort expression

2020-06-15 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-9005. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7324 [https://gi

[jira] [Updated] (ARROW-8733) [C++][Dataset][Python] ParquetFileFragment should provide access to parquet FileMetadata

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8733: - Fix Version/s: (was: 1.0.0) > [C++][Dataset][Python] ParquetFileFragment shou

[jira] [Resolved] (ARROW-9127) [Rust] Update thrift library dependencies

2020-06-15 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-9127. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7432 [https://gi

[jira] [Updated] (ARROW-8074) [C++][Dataset] Support for file-like objects (buffers) in FileSystemDataset?

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-8074: Labels: dataset dataset-dask-integration pull-request-available (was: dataset pull-request-availab

[jira] [Commented] (ARROW-9108) [C++][Dataset] Add Parquet Statistics conversion for timestamp columns

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17135992#comment-17135992 ] Wes McKinney commented on ARROW-9108: - While important, this should not block the rel

[jira] [Updated] (ARROW-9108) [C++][Dataset] Add Parquet Statistics conversion for timestamp columns

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9108: Priority: Major (was: Blocker) > [C++][Dataset] Add Parquet Statistics conversion for timestamp co

[jira] [Commented] (ARROW-9107) [C++][Dataset] Time-based types support

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17135990#comment-17135990 ] Wes McKinney commented on ARROW-9107: - While important, this is not a blocker > [C++

[jira] [Updated] (ARROW-9107) [C++][Dataset] Time-based types support

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9107: Priority: Critical (was: Blocker) > [C++][Dataset] Time-based types support >

[jira] [Created] (ARROW-9137) Allow to read Parquet files in chunks (by RowGroup)

2020-06-15 Thread Dimitrij Denissenko (Jira)
Dimitrij Denissenko created ARROW-9137: -- Summary: Allow to read Parquet files in chunks (by RowGroup) Key: ARROW-9137 URL: https://issues.apache.org/jira/browse/ARROW-9137 Project: Apache Arrow

[jira] [Updated] (ARROW-7179) [C++][Compute] Coalesce kernel

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-7179: Fix Version/s: (was: 1.0.0) 2.0.0 > [C++][Compute] Coalesce kernel > ---

[jira] [Updated] (ARROW-6256) [Rust] parquet-format should be released by Apache process

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6256: Fix Version/s: (was: 1.0.0) > [Rust] parquet-format should be released by Apache process >

[jira] [Updated] (ARROW-8802) [C++][Dataset] Schema metadata are lost when reading a subset of columns

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8802: - Fix Version/s: 1.0.0 > [C++][Dataset] Schema metadata are lost when reading a sub

[jira] [Commented] (ARROW-6256) [Rust] parquet-format should be released by Apache process

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17135980#comment-17135980 ] Wes McKinney commented on ARROW-6256: - I haven't seen any movement on this so I'm rem

[jira] [Created] (ARROW-9136) pandas index information gets lost when partition_cols are used

2020-06-15 Thread Hans Pirnay (Jira)
Hans Pirnay created ARROW-9136: -- Summary: pandas index information gets lost when partition_cols are used Key: ARROW-9136 URL: https://issues.apache.org/jira/browse/ARROW-9136 Project: Apache Arrow

[jira] [Commented] (ARROW-9134) [Python] Parquet partitioning degrades Int32 to float64

2020-06-15 Thread Nicholas Palko (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17135948#comment-17135948 ] Nicholas Palko commented on ARROW-9134: --- Makes sense to me. Thank you for the incre

[jira] [Updated] (ARROW-9132) [C++] Support unique kernel for dictionary type

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9132: - Summary: [C++] Support unique kernel for dictionary type (was: [Python] Support

[jira] [Updated] (ARROW-9132) [C++] Support unique kernel for dictionary type

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9132: - Component/s: C++ > [C++] Support unique kernel for dictionary type >

[jira] [Commented] (ARROW-9134) [Python] Parquet partitioning degrades Int32 to float64

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17135944#comment-17135944 ] Joris Van den Bossche commented on ARROW-9134: -- It might still be good to ad

[jira] [Updated] (ARROW-9132) [Python] Support unique for dictionary type

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9132: - Description: Enabling [`strings_as_dictionary`](https://turbodbc.readthedocs.io/

  1   2   >