[jira] [Updated] (ARROW-8977) [R] Table$create with schema crashes with some dictionary index types

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-8977: --- Summary: [R] Table$create with schema crashes with some dictionary index types (was: [R]

[jira] [Updated] (ARROW-9078) [C++] Parquet writing of extension type with nested storage type fails

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-9078: --- Fix Version/s: (was: 1.0.0) 2.0.0 > [C++] Parquet writing of

[jira] [Updated] (ARROW-8613) [C++][Dataset] Raise error for unparsable partition value

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-8613: --- Fix Version/s: (was: 1.0.0) 2.0.0 > [C++][Dataset] Raise error for

[jira] [Updated] (ARROW-9105) [C++] ParquetFileFragment scanning doesn't handle filter on partition field

2020-06-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9105: -- Labels: dataset dataset-dask-integration pull-request-available (was: dataset

[jira] [Commented] (ARROW-8651) [Python][Dataset] Support pickling of Dataset objects

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17136071#comment-17136071 ] Neal Richardson commented on ARROW-8651: [~jorisvandenbossche] is this required for anything in

[jira] [Updated] (ARROW-8943) [C++] Add support for Partitioning to ParquetDatasetFactory

2020-06-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8943?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8943: -- Labels: dataset dataset-dask-integration pull-request-available (was: dataset

[jira] [Assigned] (ARROW-8769) [C++] Add convenience methods to access fields by name in StructScalar

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman reassigned ARROW-8769: --- Assignee: Ben Kietzman > [C++] Add convenience methods to access fields by name in

[jira] [Assigned] (ARROW-9105) [C++] ParquetFileFragment scanning doesn't handle filter on partition field

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman reassigned ARROW-9105: --- Assignee: Ben Kietzman > [C++] ParquetFileFragment scanning doesn't handle filter on

[jira] [Updated] (ARROW-9009) [C++][Dataset] ARROW:schema should be removed from schema's metadata when reading Parquet files

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-9009: --- Fix Version/s: (was: 1.0.0) 2.0.0 > [C++][Dataset] ARROW:schema

[jira] [Updated] (ARROW-9139) [Python] parquet read_table should not use_legacy_dataset

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-9139: --- Labels: dataset-parquet-read parquet (was: parquet) > [Python] parquet read_table should

[jira] [Updated] (ARROW-3764) [C++] Port Python "ParquetDataset" business logic to C++

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-3764: --- Fix Version/s: (was: 1.0.0) 2.0.0 > [C++] Port Python

[jira] [Assigned] (ARROW-8779) [R] Implement conversion to List

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-8779: -- Assignee: Romain Francois (was: Neal Richardson) > [R] Implement conversion to List

[jira] [Assigned] (ARROW-8779) [R] Implement conversion to List

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-8779: -- Assignee: Neal Richardson (was: Ben Kietzman) > [R] Implement conversion to List >

[jira] [Updated] (ARROW-9094) [Python] Bump versions of compiled dependencies in manylinux wheels

2020-06-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9094?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9094: -- Labels: pull-request-available (was: ) > [Python] Bump versions of compiled dependencies in

[jira] [Commented] (ARROW-9117) [Python] Is there Pandas circular dependency problem?

2020-06-15 Thread SEUNGMIN HEO (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17136142#comment-17136142 ] SEUNGMIN HEO commented on ARROW-9117: - [~jorisvandenbossche]  Hi :) I solved this problem. It was

[jira] [Assigned] (ARROW-8977) [R] Table$create with schema crashes with some dictionary index types

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-8977: -- Assignee: Romain Francois > [R] Table$create with schema crashes with some dictionary

[jira] [Commented] (ARROW-9081) [C++] Upgrade default LLVM version to 10

2020-06-15 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9081?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17136073#comment-17136073 ] Neal Richardson commented on ARROW-9081: Does this need to happen for 1.0? Seems a little late to

[jira] [Assigned] (ARROW-9009) [C++][Dataset] ARROW:schema should be removed from schema's metadata when reading Parquet files

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-9009: --- Assignee: Wes McKinney > [C++][Dataset] ARROW:schema should be removed from schema's

[jira] [Created] (ARROW-9140) [R] Zero-copy Arrow to R where possible

2020-06-15 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-9140: -- Summary: [R] Zero-copy Arrow to R where possible Key: ARROW-9140 URL: https://issues.apache.org/jira/browse/ARROW-9140 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-9114) [C++][Packaging] Illegal instruction crash in arrow.dll

2020-06-15 Thread Uwe Korn (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135613#comment-17135613 ] Uwe Korn commented on ARROW-9114: - [~mparry] I pushed {{arrow-cpp/pyarrow 0.17}} build number 5 to

[jira] [Created] (ARROW-9131) [C++] Faster ascii_lower and ascii_upper

2020-06-15 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-9131: --- Summary: [C++] Faster ascii_lower and ascii_upper Key: ARROW-9131 URL: https://issues.apache.org/jira/browse/ARROW-9131 Project: Apache Arrow Issue

[jira] [Updated] (ARROW-9131) [C++] Faster ascii_lower and ascii_upper

2020-06-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9131: -- Labels: pull-request-available (was: ) > [C++] Faster ascii_lower and ascii_upper >

[jira] [Commented] (ARROW-9114) [C++][Packaging] Illegal instruction crash in arrow.dll

2020-06-15 Thread Morgan Parry (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9114?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135727#comment-17135727 ] Morgan Parry commented on ARROW-9114: - Thanks - I have posted updates (including results from those

[jira] [Created] (ARROW-9132) Support unique for dictionary type

2020-06-15 Thread Dave Hirschfeld (Jira)
Dave Hirschfeld created ARROW-9132: -- Summary: Support unique for dictionary type Key: ARROW-9132 URL: https://issues.apache.org/jira/browse/ARROW-9132 Project: Apache Arrow Issue Type:

[jira] [Created] (ARROW-9133) [C++] Add utf8_upper and utf_lower

2020-06-15 Thread Maarten Breddels (Jira)
Maarten Breddels created ARROW-9133: --- Summary: [C++] Add utf8_upper and utf_lower Key: ARROW-9133 URL: https://issues.apache.org/jira/browse/ARROW-9133 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-9133) [C++] Add utf8_upper and utf_lower

2020-06-15 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135809#comment-17135809 ] Antoine Pitrou commented on ARROW-9133: --- I think we shouldn't overthink this. Preallocating the

[jira] [Updated] (ARROW-9133) [C++] Add utf8_upper and utf_lower

2020-06-15 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9133: -- Component/s: C++ > [C++] Add utf8_upper and utf_lower > -- > >

[jira] [Assigned] (ARROW-9131) [C++] Faster ascii_lower and ascii_upper

2020-06-15 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9131?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-9131: - Assignee: Maarten Breddels > [C++] Faster ascii_lower and ascii_upper >

[jira] [Commented] (ARROW-9133) [C++] Add utf8_upper and utf_lower

2020-06-15 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135813#comment-17135813 ] Antoine Pitrou commented on ARROW-9133: --- As for utf8proc vs. unilib, I think we've determined in

[jira] [Created] (ARROW-9134) Parquet partitioning degrades Int32 to float64

2020-06-15 Thread Nicholas Palko (Jira)
Nicholas Palko created ARROW-9134: - Summary: Parquet partitioning degrades Int32 to float64 Key: ARROW-9134 URL: https://issues.apache.org/jira/browse/ARROW-9134 Project: Apache Arrow Issue

[jira] [Updated] (ARROW-8779) [R] Implement conversion to List

2020-06-15 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8779?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8779: -- Labels: pull-request-available (was: ) > [R] Implement conversion to List >

[jira] [Commented] (ARROW-9133) [C++] Add utf8_upper and utf_lower

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135875#comment-17135875 ] Wes McKinney commented on ARROW-9133: - > As for utf8proc vs. unilib, I think we've determined in

[jira] [Updated] (ARROW-9132) [Python] Support unique for dictionary type

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9132: Fix Version/s: 1.0.0 > [Python] Support unique for dictionary type >

[jira] [Updated] (ARROW-9132) [Python] Support unique for dictionary type

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9132: Summary: [Python] Support unique for dictionary type (was: Support unique for dictionary type) >

[jira] [Commented] (ARROW-9134) Parquet partitioning degrades Int32 to float64

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135877#comment-17135877 ] Wes McKinney commented on ARROW-9134: - Curious. We correctly handle converting to Arrow from the

[jira] [Updated] (ARROW-9134) Parquet partitioning degrades Int32 to float64

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9134: Fix Version/s: 1.0.0 > Parquet partitioning degrades Int32 to float64 >

[jira] [Updated] (ARROW-9134) [Python] Parquet partitioning degrades Int32 to float64

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9134: Summary: [Python] Parquet partitioning degrades Int32 to float64 (was: Parquet partitioning

[jira] [Updated] (ARROW-8802) [C++][Dataset] Schema metadata are lost when reading a subset of columns

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8802?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8802: - Fix Version/s: 1.0.0 > [C++][Dataset] Schema metadata are lost when reading a

[jira] [Commented] (ARROW-6256) [Rust] parquet-format should be released by Apache process

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6256?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135980#comment-17135980 ] Wes McKinney commented on ARROW-6256: - I haven't seen any movement on this so I'm removing it from

[jira] [Updated] (ARROW-6256) [Rust] parquet-format should be released by Apache process

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6256?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6256: Fix Version/s: (was: 1.0.0) > [Rust] parquet-format should be released by Apache process >

[jira] [Updated] (ARROW-7179) [C++][Compute] Coalesce kernel

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-7179: Fix Version/s: (was: 1.0.0) 2.0.0 > [C++][Compute] Coalesce kernel >

[jira] [Updated] (ARROW-9108) [C++][Dataset] Add Parquet Statistics conversion for timestamp columns

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9108?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9108: Priority: Major (was: Blocker) > [C++][Dataset] Add Parquet Statistics conversion for timestamp

[jira] [Commented] (ARROW-9107) [C++][Dataset] Time-based types support

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9107?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135990#comment-17135990 ] Wes McKinney commented on ARROW-9107: - While important, this is not a blocker > [C++][Dataset]

[jira] [Updated] (ARROW-8733) [C++][Dataset][Python] ParquetFileFragment should provide access to parquet FileMetadata

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8733: - Fix Version/s: (was: 1.0.0) > [C++][Dataset][Python] ParquetFileFragment

[jira] [Commented] (ARROW-9108) [C++][Dataset] Add Parquet Statistics conversion for timestamp columns

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9108?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135992#comment-17135992 ] Wes McKinney commented on ARROW-9108: - While important, this should not block the release >

[jira] [Resolved] (ARROW-9127) [Rust] Update thrift library dependencies

2020-06-15 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-9127. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7432

[jira] [Updated] (ARROW-8074) [C++][Dataset] Support for file-like objects (buffers) in FileSystemDataset?

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-8074: Labels: dataset dataset-dask-integration pull-request-available (was: dataset

[jira] [Resolved] (ARROW-9005) [Rust] [DataFusion] Support sort expression

2020-06-15 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-9005. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7324

[jira] [Updated] (ARROW-9044) [Go][Packaging] Revisit the license file attachment to the go packages

2020-06-15 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs updated ARROW-9044: --- Priority: Minor (was: Major) > [Go][Packaging] Revisit the license file attachment to the

[jira] [Updated] (ARROW-8001) [C++][Dataset] R and Python bindings for dataset writing

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8001?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8001: - Fix Version/s: (was: 1.0.0) 2.0.0 > [C++][Dataset] R and

[jira] [Created] (ARROW-9138) [Docs][Format] Make sure format version is hard coded in the docs

2020-06-15 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-9138: -- Summary: [Docs][Format] Make sure format version is hard coded in the docs Key: ARROW-9138 URL: https://issues.apache.org/jira/browse/ARROW-9138 Project: Apache

[jira] [Resolved] (ARROW-9067) [C++] Create reusable branchless / vectorized index boundschecking functions

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-9067. - Resolution: Fixed This was done in ARROW-5760 > [C++] Create reusable branchless / vectorized

[jira] [Resolved] (ARROW-9124) [Rust][Datafusion] DFParser should consume sql query as instead of String

2020-06-15 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-9124. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7428

[jira] [Assigned] (ARROW-4633) [Python] ParquetFile.read(use_threads=False) creates ThreadPool anyway

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-4633: --- Assignee: (was: Wes McKinney) > [Python] ParquetFile.read(use_threads=False) creates

[jira] [Updated] (ARROW-7394) [C++][DataFrame] Implement zero-copy optimizations when performing Filter

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7394?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7394: Fix Version/s: 2.0.0 > [C++][DataFrame] Implement zero-copy optimizations when performing Filter >

[jira] [Assigned] (ARROW-6235) [R] Conversion from arrow::BinaryArray to R character vector not implemented

2020-06-15 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-6235: - Assignee: Romain Francois (was: Francois Saint-Jacques) > [R]

[jira] [Resolved] (ARROW-5158) [Packaging][Wheel] Symlink libraries in wheels

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-5158. - Fix Version/s: 1.0.0 Resolution: Fixed This was resolved by ARROW-5082 >

[jira] [Updated] (ARROW-6979) [R] Enable jemalloc in autobrew formula

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6979?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6979: Priority: Blocker (was: Major) > [R] Enable jemalloc in autobrew formula >

[jira] [Commented] (ARROW-6979) [R] Enable jemalloc in autobrew formula

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6979?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17136005#comment-17136005 ] Wes McKinney commented on ARROW-6979: - I'd like to push to make this happen for the release. Plenty

[jira] [Updated] (ARROW-7218) [Python] Conversion from boolean numpy scalars not working

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7218: Fix Version/s: (was: 1.0.0) 2.0.0 > [Python] Conversion from boolean numpy

[jira] [Commented] (ARROW-9135) [C++][Compute] Provide a kernel property testing API

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17136006#comment-17136006 ] Ben Kietzman commented on ARROW-9135: - That's accurate; currently other random tests do the same (for

[jira] [Updated] (ARROW-8655) [C++][Dataset][Python][R] Preserve partitioning information for a discovered Dataset

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-8655: - Fix Version/s: (was: 1.0.0) 2.0.0 >

[jira] [Commented] (ARROW-9135) [C++][Compute] Provide a kernel property testing API

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135935#comment-17135935 ] Wes McKinney commented on ARROW-9135: - If I have understood correctly, the implementation of property

[jira] [Commented] (ARROW-9134) [Python] Parquet partitioning degrades Int32 to float64

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135942#comment-17135942 ] Joris Van den Bossche commented on ARROW-9134: -- This is working correctly on pyarrow master

[jira] [Updated] (ARROW-9132) [C++] Support unique kernel for dictionary type

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9132: - Component/s: C++ > [C++] Support unique kernel for dictionary type >

[jira] [Updated] (ARROW-9132) [C++] Support unique kernel for dictionary type

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9132: - Summary: [C++] Support unique kernel for dictionary type (was: [Python] Support

[jira] [Commented] (ARROW-9134) [Python] Parquet partitioning degrades Int32 to float64

2020-06-15 Thread Nicholas Palko (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135948#comment-17135948 ] Nicholas Palko commented on ARROW-9134: --- Makes sense to me. Thank you for the incredible software

[jira] [Created] (ARROW-9135) [C++][Compute] Provide a kernel property testing API

2020-06-15 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-9135: --- Summary: [C++][Compute] Provide a kernel property testing API Key: ARROW-9135 URL: https://issues.apache.org/jira/browse/ARROW-9135 Project: Apache Arrow

[jira] [Commented] (ARROW-9117) [Python] Is there Pandas circular dependency problem?

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135937#comment-17135937 ] Joris Van den Bossche commented on ARROW-9117: -- [~Seungmin] this was solved for you? >

[jira] [Commented] (ARROW-9019) [Python] hdfs fails to connect to for HDFS 3.x cluster

2020-06-15 Thread Thomas Graves (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135940#comment-17135940 ] Thomas Graves commented on ARROW-9019: -- can you give more details on what was missing?  I used the

[jira] [Updated] (ARROW-9132) [Python] Support unique for dictionary type

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9132: - Description: Enabling

[jira] [Commented] (ARROW-9134) [Python] Parquet partitioning degrades Int32 to float64

2020-06-15 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17135944#comment-17135944 ] Joris Van den Bossche commented on ARROW-9134: -- It might still be good to add a test for

[jira] [Created] (ARROW-9136) pandas index information gets lost when partition_cols are used

2020-06-15 Thread Hans Pirnay (Jira)
Hans Pirnay created ARROW-9136: -- Summary: pandas index information gets lost when partition_cols are used Key: ARROW-9136 URL: https://issues.apache.org/jira/browse/ARROW-9136 Project: Apache Arrow

[jira] [Updated] (ARROW-7853) [CI][Packaging] Add nightly test that pip-installs nightly wheels

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7853?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7853: Fix Version/s: (was: 1.0.0) 2.0.0 > [CI][Packaging] Add nightly test that

[jira] [Updated] (ARROW-7855) [Python] Always raise ArrowTypeError instead of TypeError inside pyarrow.array

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7855: Fix Version/s: (was: 1.0.0) 2.0.0 > [Python] Always raise ArrowTypeError

[jira] [Assigned] (ARROW-7925) [C++][Documentation] Instructions about running IWYU and other tasks in cpp/development.rst have gone stale

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-7925: --- Assignee: Wes McKinney > [C++][Documentation] Instructions about running IWYU and other

[jira] [Updated] (ARROW-8152) [C++] IO: split large coalesced reads into smaller ones

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8152?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8152: Fix Version/s: (was: 1.0.0) > [C++] IO: split large coalesced reads into smaller ones >

[jira] [Commented] (ARROW-8152) [C++] IO: split large coalesced reads into smaller ones

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8152?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17136009#comment-17136009 ] Wes McKinney commented on ARROW-8152: - I removed any milestone. This can be pursued at any time >

[jira] [Updated] (ARROW-8618) [C++] ASSIGN_OR_RAISE should move its argument

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8618: Fix Version/s: (was: 1.0.0) 2.0.0 > [C++] ASSIGN_OR_RAISE should move its

[jira] [Updated] (ARROW-7586) [C++][Dataset] Read feather 1.0 files

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-7586: Fix Version/s: (was: 1.0.0) 2.0.0 > [C++][Dataset] Read feather 1.0 files >

[jira] [Updated] (ARROW-7586) [C++][Dataset] Read feather 1.0 files

2020-06-15 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7586?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-7586: Summary: [C++][Dataset] Read feather 1.0 files (was: [C++][Dataset] Read feather files) >

[jira] [Updated] (ARROW-8667) [C++] Add multi-consumer Scheduler API to sit one layer above ThreadPool

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8667: Fix Version/s: (was: 1.0.0) 2.0.0 > [C++] Add multi-consumer Scheduler API

[jira] [Updated] (ARROW-8749) [C++] IpcFormatWriter writes dictionary batches with wrong ID

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8749: Fix Version/s: (was: 1.0.0) 2.0.0 > [C++] IpcFormatWriter writes dictionary

[jira] [Assigned] (ARROW-8762) [C++][Gandiva] Replace Gandiva's BitmapAnd with common implementation

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8762?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8762: --- Assignee: Wes McKinney > [C++][Gandiva] Replace Gandiva's BitmapAnd with common

[jira] [Commented] (ARROW-8762) [C++][Gandiva] Replace Gandiva's BitmapAnd with common implementation

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8762?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17136013#comment-17136013 ] Wes McKinney commented on ARROW-8762: - I see, thanks. If this optimization is beneficial then it

[jira] [Updated] (ARROW-8769) [C++] Add convenience methods to access fields by name in StructScalar

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8769: Priority: Critical (was: Major) > [C++] Add convenience methods to access fields by name in

[jira] [Commented] (ARROW-8769) [C++] Add convenience methods to access fields by name in StructScalar

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8769?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17136015#comment-17136015 ] Wes McKinney commented on ARROW-8769: - This should have corresponding work done in both Python and R

[jira] [Assigned] (ARROW-8921) [C++] Add "TypeResolver" class interface to replace current OutputType::Resolver pattern

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8921: --- Assignee: Wes McKinney > [C++] Add "TypeResolver" class interface to replace current >

[jira] [Updated] (ARROW-8928) [C++] Measure microperformance associated with data structure access interactions with arrow::compute::ExecBatch

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8928: Fix Version/s: (was: 1.0.0) > [C++] Measure microperformance associated with data structure

[jira] [Assigned] (ARROW-8936) [C++] Parallelize execution of arrow::compute::ScalarFunction

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8936: --- Assignee: Wes McKinney > [C++] Parallelize execution of arrow::compute::ScalarFunction >

[jira] [Assigned] (ARROW-8863) [C++] Array subclass constructors must set ArrayData::null_count to 0 when there is no validity bitmap

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8863: --- Assignee: Wes McKinney > [C++] Array subclass constructors must set ArrayData::null_count

[jira] [Commented] (ARROW-8895) [C++] Add C++ unit tests for filter and take functions on temporal type inputs, including timestamps

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17136016#comment-17136016 ] Wes McKinney commented on ARROW-8895: - I'll add these types to the random data test battery > [C++]

[jira] [Assigned] (ARROW-8895) [C++] Add C++ unit tests for filter and take functions on temporal type inputs, including timestamps

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8895: --- Assignee: Wes McKinney > [C++] Add C++ unit tests for filter and take functions on temporal

[jira] [Assigned] (ARROW-8933) [C++] Reduce generated code in vector_hash.cc

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8933: --- Assignee: Wes McKinney > [C++] Reduce generated code in vector_hash.cc >

[jira] [Assigned] (ARROW-8989) [C++] Document available functions in compute::FunctionRegistry

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8989: --- Assignee: Wes McKinney > [C++] Document available functions in compute::FunctionRegistry >

[jira] [Updated] (ARROW-8961) [C++] Vendor utf8proc library

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-8961: Fix Version/s: (was: 1.0.0) > [C++] Vendor utf8proc library > - >

[jira] [Assigned] (ARROW-8969) [C++] Reduce generated code in compute/kernels/scalar_compare.cc

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8969: --- Assignee: Wes McKinney > [C++] Reduce generated code in compute/kernels/scalar_compare.cc >

[jira] [Assigned] (ARROW-8991) [C++][Compute] Add scalar_hash function

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-8991: --- Assignee: Wes McKinney > [C++][Compute] Add scalar_hash function >

[jira] [Updated] (ARROW-9003) [C++] Add VectorFunction wrapping arrow::Concatenate

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9003: Fix Version/s: (was: 1.0.0) > [C++] Add VectorFunction wrapping arrow::Concatenate >

[jira] [Assigned] (ARROW-9128) [C++] Implement string space trimming kernels: trim, ltrim, and rtrim

2020-06-15 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-9128: --- Assignee: Wes McKinney > [C++] Implement string space trimming kernels: trim, ltrim, and

  1   2   >