[jira] [Commented] (ARROW-8992) [CI][C++] march not passing correctly for docker-compose run

2020-06-09 Thread Elliott Kipp (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17130014#comment-17130014 ] Elliott Kipp commented on ARROW-8992: - Above "dirty workaround" worked, except I applied it to

[jira] [Created] (ARROW-9087) Missing HDFS options parsing

2020-06-09 Thread Yuan Zhou (Jira)
Yuan Zhou created ARROW-9087: Summary: Missing HDFS options parsing Key: ARROW-9087 URL: https://issues.apache.org/jira/browse/ARROW-9087 Project: Apache Arrow Issue Type: Bug

[jira] [Updated] (ARROW-9086) [CI][Homebrew] Enable Gandiva

2020-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9086?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9086: -- Labels: pull-request-available (was: ) > [CI][Homebrew] Enable Gandiva >

[jira] [Created] (ARROW-9086) [CI][Homebrew] Enable Gandiva

2020-06-09 Thread Kouhei Sutou (Jira)
Kouhei Sutou created ARROW-9086: --- Summary: [CI][Homebrew] Enable Gandiva Key: ARROW-9086 URL: https://issues.apache.org/jira/browse/ARROW-9086 Project: Apache Arrow Issue Type: Improvement

[jira] [Updated] (ARROW-9085) [C++][CI] Appveyor CI test failures

2020-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9085: -- Labels: pull-request-available (was: ) > [C++][CI] Appveyor CI test failures >

[jira] [Updated] (ARROW-9075) [C++] Optimize Filter implementation

2020-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9075?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9075: -- Labels: pull-request-available (was: ) > [C++] Optimize Filter implementation >

[jira] [Updated] (ARROW-9084) [C++] CMake is unable to find zstd target when ZSTD_SOURCE=SYSTEM

2020-06-09 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-9084: Component/s: C++ > [C++] CMake is unable to find zstd target when ZSTD_SOURCE=SYSTEM >

[jira] [Updated] (ARROW-9084) [C++] CMake is unable to find zstd target when ZSTD_SOURCE=SYSTEM

2020-06-09 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-9084: Summary: [C++] CMake is unable to find zstd target when ZSTD_SOURCE=SYSTEM (was: [C++] cmake is

[jira] [Assigned] (ARROW-9085) [C++][CI] Appveyor CI test failures

2020-06-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-9085: --- Assignee: Wes McKinney > [C++][CI] Appveyor CI test failures >

[jira] [Commented] (ARROW-9085) [C++][CI] Appveyor CI test failures

2020-06-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9085?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129888#comment-17129888 ] Wes McKinney commented on ARROW-9085: - I'm going to revert the Compression changes that caused this

[jira] [Updated] (ARROW-9085) [C++][CI] Appveyor CI test failures

2020-06-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9085?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9085: Priority: Blocker (was: Major) > [C++][CI] Appveyor CI test failures >

[jira] [Created] (ARROW-9085) [C++][CI] Appveyor CI test failures

2020-06-09 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-9085: --- Summary: [C++][CI] Appveyor CI test failures Key: ARROW-9085 URL: https://issues.apache.org/jira/browse/ARROW-9085 Project: Apache Arrow Issue Type: Bug

[jira] [Updated] (ARROW-9084) [C++] cmake is unable to find zstd target when ZSTD_SOURCE=SYSTEM

2020-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9084: -- Labels: pull-request-available (was: ) > [C++] cmake is unable to find zstd target when

[jira] [Created] (ARROW-9084) [C++] cmake is unable to find zstd target when ZSTD_SOURCE=SYSTEM

2020-06-09 Thread Dmitry Kalinkin (Jira)
Dmitry Kalinkin created ARROW-9084: -- Summary: [C++] cmake is unable to find zstd target when ZSTD_SOURCE=SYSTEM Key: ARROW-9084 URL: https://issues.apache.org/jira/browse/ARROW-9084 Project: Apache

[jira] [Commented] (ARROW-7175) [Website] Add a security page to track when vulnerabilities are patched

2020-06-09 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129857#comment-17129857 ] Neal Richardson commented on ARROW-7175: Nm, looks like the 2 from 0.15.1 are the only ones we've

[jira] [Updated] (ARROW-5377) [C++] Develop interface for writing a RecordBatch IPC stream into pre-allocated space (e.g. memory map) that avoids unnecessary serialization

2020-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5377: -- Labels: pull-request-available (was: ) > [C++] Develop interface for writing a RecordBatch

[jira] [Commented] (ARROW-7893) [Developer][GLib] Document GLib development workflow when using conda environment on GTK-based Linux systems

2020-06-09 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129787#comment-17129787 ] Kouhei Sutou commented on ARROW-7893: - Sure. It's the {{configure}} stage in

[jira] [Comment Edited] (ARROW-7893) [Developer][GLib] Document GLib development workflow when using conda environment on GTK-based Linux systems

2020-06-09 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17128670#comment-17128670 ] Kouhei Sutou edited comment on ARROW-7893 at 6/9/20, 8:40 PM: --

[jira] [Updated] (ARROW-9071) [C++] MakeArrayOfNull makes invalid ListArray

2020-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9071: -- Labels: pull-request-available (was: ) > [C++] MakeArrayOfNull makes invalid ListArray >

[jira] [Resolved] (ARROW-9046) [C++][R] Put more things in type_fwds

2020-06-09 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman resolved ARROW-9046. - Resolution: Fixed Issue resolved by pull request 7281

[jira] [Created] (ARROW-9083) [R] collect int64 as R integer type if not out of bounds

2020-06-09 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-9083: -- Summary: [R] collect int64 as R integer type if not out of bounds Key: ARROW-9083 URL: https://issues.apache.org/jira/browse/ARROW-9083 Project: Apache Arrow

[jira] [Updated] (ARROW-9081) [C++] Upgrade default LLVM version to 10

2020-06-09 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-9081: Description: Upgrade llvm dependencies to default to version 10. There are several obstacles

[jira] [Resolved] (ARROW-9077) [C++] Fix aggregate/scalar-compare benchmark null_percent calculation

2020-06-09 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-9077. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7383

[jira] [Updated] (ARROW-9081) [C++] Upgrade default LLVM version to 10

2020-06-09 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-9081: Summary: [C++] Upgrade default LLVM version to 10 (was: [C++] Upgrade to LLVM 10) > [C++]

[jira] [Resolved] (ARROW-8766) [Python] A FileSystem implementation based on Python callbacks

2020-06-09 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-8766. --- Resolution: Fixed Issue resolved by pull request 7349

[jira] [Updated] (ARROW-9081) [C++] Upgrade to LLVM 10

2020-06-09 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9081?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-9081: Description: Upgrade default llvm dependencies to use version 10 (was: Upgrade llvm dependencies

[jira] [Resolved] (ARROW-3089) [Rust] Add ArrayBuilder for different Arrow arrays

2020-06-09 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neville Dipale resolved ARROW-3089. --- Assignee: Neville Dipale Resolution: Implemented The remaining task was completed >

[jira] [Updated] (ARROW-8430) [CI] Configure self-hosted runners for Github Actions

2020-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8430: -- Labels: pull-request-available (was: ) > [CI] Configure self-hosted runners for Github

[jira] [Closed] (ARROW-9080) [C++] arrow::AllocateBuffer returns a Result>

2020-06-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9080?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-9080. --- Resolution: Not A Problem I misunderstood that implicit casts from unique_ptr to shared_ptr work, so

[jira] [Updated] (ARROW-9082) [Rust] - Stream reader fail when steam not ended with (optional) 0xFFFFFFFF 0x00000000"

2020-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9082: -- Labels: pull-request-available (was: ) > [Rust] - Stream reader fail when steam not ended

[jira] [Updated] (ARROW-9074) [GLib] Add missing arrow-json check

2020-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9074: -- Labels: pull-request-available (was: ) > [GLib] Add missing arrow-json check >

[jira] [Updated] (ARROW-5760) [C++] Optimize Take implementation

2020-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5760: -- Labels: pull-request-available (was: ) > [C++] Optimize Take implementation >

[jira] [Updated] (ARROW-8726) [C++][Dataset] Mis-specified DirectoryPartitioning incorrectly uses the file name as value

2020-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8726?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8726: -- Labels: dataset pull-request-available (was: dataset) > [C++][Dataset] Mis-specified

[jira] [Updated] (ARROW-9073) [C++] RapidJSON include directory detection doesn't work with RapidJSONConfig.cmake

2020-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9073: -- Labels: pull-request-available (was: ) > [C++] RapidJSON include directory detection doesn't

[jira] [Updated] (ARROW-9077) [C++] Fix aggregate/scalar-compare benchmark null_percent calculation

2020-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9077?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9077: -- Labels: pull-request-available (was: ) > [C++] Fix aggregate/scalar-compare benchmark

[jira] [Updated] (ARROW-8866) [C++] Split Type::UNION into Type::SPARSE_UNION and Type::DENSE_UNION

2020-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8866: -- Labels: pull-request-available (was: ) > [C++] Split Type::UNION into Type::SPARSE_UNION and

[jira] [Updated] (ARROW-9062) [Rust] Support to read JSON into dictionary type

2020-06-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9062?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9062: -- Labels: pull-request-available (was: ) > [Rust] Support to read JSON into dictionary type >

[jira] [Created] (ARROW-9082) [Rust] - Stream reader fail when steam not ended with (optional) 0xFFFFFFFF 0x00000000"

2020-06-09 Thread Eyal Leshem (Jira)
Eyal Leshem created ARROW-9082: -- Summary: [Rust] - Stream reader fail when steam not ended with (optional) 0x 0x" Key: ARROW-9082 URL: https://issues.apache.org/jira/browse/ARROW-9082

[jira] [Created] (ARROW-9081) [C++] Upgrade to LLVM 10

2020-06-09 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-9081: --- Summary: [C++] Upgrade to LLVM 10 Key: ARROW-9081 URL: https://issues.apache.org/jira/browse/ARROW-9081 Project: Apache Arrow Issue Type: Improvement

[jira] [Commented] (ARROW-9080) [C++] arrow::AllocateBuffer returns a Result>

2020-06-09 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9080?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129369#comment-17129369 ] Antoine Pitrou commented on ARROW-9080: --- Uh, I think we discussed this already... and when I asked

[jira] [Closed] (ARROW-8487) [FlightRPC][C++] Make it possible to target a specific payload size

2020-06-09 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li closed ARROW-8487. --- Resolution: Duplicate We can expose IpcPayload instead in ARROW-5377. > [FlightRPC][C++] Make it possible

[jira] [Created] (ARROW-9080) [C++] arrow::AllocateBuffer returns a Result>

2020-06-09 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-9080: --- Summary: [C++] arrow::AllocateBuffer returns a Result> Key: ARROW-9080 URL: https://issues.apache.org/jira/browse/ARROW-9080 Project: Apache Arrow Issue

[jira] [Updated] (ARROW-7579) [FlightRPC] Make Handshake optional

2020-06-09 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7579?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-7579: Issue Type: Improvement (was: Bug) > [FlightRPC] Make Handshake optional >

[jira] [Updated] (ARROW-8888) [Python] Heuristic in dataframe_to_arrays that decides to multithread convert cause slow conversions

2020-06-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-: Fix Version/s: 1.0.0 > [Python] Heuristic in dataframe_to_arrays that decides to multithread

[jira] [Assigned] (ARROW-9067) [C++] Create reusable branchless / vectorized index boundschecking functions

2020-06-09 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-9067: --- Assignee: Wes McKinney > [C++] Create reusable branchless / vectorized index boundschecking

[jira] [Resolved] (ARROW-9074) [GLib] Add missing arrow-json check

2020-06-09 Thread Uwe Korn (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9074?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Korn resolved ARROW-9074. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7381

[jira] [Resolved] (ARROW-9073) [C++] RapidJSON include directory detection doesn't work with RapidJSONConfig.cmake

2020-06-09 Thread Uwe Korn (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Korn resolved ARROW-9073. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7380

[jira] [Created] (ARROW-9079) [C++] Write benchmark for arithmetic kernels

2020-06-09 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-9079: -- Summary: [C++] Write benchmark for arithmetic kernels Key: ARROW-9079 URL: https://issues.apache.org/jira/browse/ARROW-9079 Project: Apache Arrow Issue

[jira] [Commented] (ARROW-8888) [Python] Heuristic in dataframe_to_arrays that decides to multithread convert cause slow conversions

2020-06-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129062#comment-17129062 ] Joris Van den Bossche commented on ARROW-: -- Trying with strings, I don't see much

[jira] [Commented] (ARROW-8888) [Python] Heuristic in dataframe_to_arrays that decides to multithread convert cause slow conversions

2020-06-09 Thread Kevin Glasson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129047#comment-17129047 ] Kevin Glasson commented on ARROW-: -- Yeah - so off the top of my head it was about 6 million rows

[jira] [Commented] (ARROW-8773) [Python] pyarrow schema.empty_table() does not preserve nullability of fields

2020-06-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129039#comment-17129039 ] Joris Van den Bossche commented on ARROW-8773: -- [~Al Taylor] Thanks for the report. That's

[jira] [Commented] (ARROW-8888) [Python] Heuristic in dataframe_to_arrays that decides to multithread convert cause slow conversions

2020-06-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129033#comment-17129033 ] Joris Van den Bossche commented on ARROW-: -- BTW, you mention that for your real use

[jira] [Commented] (ARROW-8888) [Python] Heuristic in dataframe_to_arrays that decides to multithread convert cause slow conversions

2020-06-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129030#comment-17129030 ] Joris Van den Bossche commented on ARROW-: -- There is currently a heuristic based on the

[jira] [Reopened] (ARROW-8964) [Python][Parquet] improve reading of partitioned parquet datasets whose schema changed

2020-06-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reopened ARROW-8964: -- > [Python][Parquet] improve reading of partitioned parquet datasets whose >

[jira] [Comment Edited] (ARROW-8964) [Python][Parquet] improve reading of partitioned parquet datasets whose schema changed

2020-06-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129009#comment-17129009 ] Joris Van den Bossche edited comment on ARROW-8964 at 6/9/20, 8:52 AM:

[jira] [Closed] (ARROW-8964) [Python][Parquet] improve reading of partitioned parquet datasets whose schema changed

2020-06-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche closed ARROW-8964. Resolution: Duplicate > [Python][Parquet] improve reading of partitioned parquet

[jira] [Closed] (ARROW-8964) [Python][Parquet] improve reading of partitioned parquet datasets whose schema changed

2020-06-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8964?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche closed ARROW-8964. Resolution: Duplicate > [Python][Parquet] improve reading of partitioned parquet

[jira] [Commented] (ARROW-8964) [Python][Parquet] improve reading of partitioned parquet datasets whose schema changed

2020-06-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129009#comment-17129009 ] Joris Van den Bossche commented on ARROW-8964: -- I am not familiar with Impala. Looking at

[jira] [Updated] (ARROW-9020) [Python] read_json won't respect explicit_schema in parse_options

2020-06-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9020: - Fix Version/s: (was: 0.17.1) 1.0.0 > [Python] read_json

[jira] [Commented] (ARROW-9020) [Python] read_json won't respect explicit_schema in parse_options

2020-06-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17129001#comment-17129001 ] Joris Van den Bossche commented on ARROW-9020: -- Ah, I see that in C++ there is an additional

[jira] [Commented] (ARROW-9020) [Python] read_json won't respect explicit_schema in parse_options

2020-06-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9020?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17128996#comment-17128996 ] Joris Van den Bossche commented on ARROW-9020: -- [~felipegssantos] thanks for the report!

[jira] [Commented] (ARROW-9078) [C++] Parquet writing of extension type with nested storage type fails

2020-06-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9078?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17128989#comment-17128989 ] Joris Van den Bossche commented on ARROW-9078: -- We run into this bug in pandas when trying

[jira] [Created] (ARROW-9078) [C++] Parquet writing of extension type with nested storage type fails

2020-06-09 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-9078: Summary: [C++] Parquet writing of extension type with nested storage type fails Key: ARROW-9078 URL: https://issues.apache.org/jira/browse/ARROW-9078

[jira] [Updated] (ARROW-9078) [C++] Parquet writing of extension type with nested storage type fails

2020-06-09 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9078: - Fix Version/s: 1.0.0 > [C++] Parquet writing of extension type with nested

[jira] [Created] (ARROW-9077) [C++] Fix aggregate/scalar-compare benchmark null_percent calculation

2020-06-09 Thread Frank Du (Jira)
Frank Du created ARROW-9077: --- Summary: [C++] Fix aggregate/scalar-compare benchmark null_percent calculation Key: ARROW-9077 URL: https://issues.apache.org/jira/browse/ARROW-9077 Project: Apache Arrow

[jira] [Updated] (ARROW-9076) [Rust] Async CSV reader

2020-06-09 Thread Sergey Todyshev (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Todyshev updated ARROW-9076: --- Component/s: Rust > [Rust] Async CSV reader > --- > >

[jira] [Commented] (ARROW-7893) [Developer][GLib] Document GLib development workflow when using conda environment on GTK-based Linux systems

2020-06-09 Thread Uwe Korn (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7893?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17128898#comment-17128898 ] Uwe Korn commented on ARROW-7893: - [~kou] Can you give me a pointer at which stage the library is loaded,