[jira] [Updated] (ARROW-6302) [Python][Parquet] Reading dictionary type with serialized Arrow schema does not restore "ordered" type property

2019-08-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6302: -- Labels: parquet pull-request-available (was: parquet) > [Python][Parquet] Reading dictionary

[jira] [Commented] (ARROW-4359) [Python] Column metadata is not saved or loaded in parquet

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4359?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914733#comment-16914733 ] Wes McKinney commented on ARROW-4359: - This is tricky since there is field metadata found in each row

[jira] [Commented] (ARROW-3221) [C++][Python] Add a virtual Slice method to buffers

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914732#comment-16914732 ] Wes McKinney commented on ARROW-3221: - To do this, we'd have to make Buffer inherit from

[jira] [Updated] (ARROW-3221) [C++][Python] Add a virtual Slice method to buffers

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3221: Fix Version/s: (was: 0.15.0) > [C++][Python] Add a virtual Slice method to buffers >

[jira] [Updated] (ARROW-5906) [CI] Set -DARROW_VERBOSE_THIRDPARTY_BUILD=OFF in builds running in Travis CI, maybe all docker-compose builds by default

2019-08-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5906: -- Labels: pull-request-available (was: ) > [CI] Set -DARROW_VERBOSE_THIRDPARTY_BUILD=OFF in

[jira] [Assigned] (ARROW-5906) [CI] Set -DARROW_VERBOSE_THIRDPARTY_BUILD=OFF in builds running in Travis CI, maybe all docker-compose builds by default

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-5906: --- Assignee: Wes McKinney > [CI] Set -DARROW_VERBOSE_THIRDPARTY_BUILD=OFF in builds running in

[jira] [Updated] (ARROW-5910) [Python] read_tensor() fails on non-seekable streams

2019-08-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5910: -- Labels: pull-request-available (was: ) > [Python] read_tensor() fails on non-seekable streams

[jira] [Assigned] (ARROW-5910) [Python] read_tensor() fails on non-seekable streams

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-5910: --- Assignee: Wes McKinney > [Python] read_tensor() fails on non-seekable streams >

[jira] [Closed] (ARROW-6168) [C++] IWYU docker-compose job is broken

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-6168. --- Resolution: Cannot Reproduce I removed my hack and then was not able to reproduce this, so closing

[jira] [Created] (ARROW-6342) [Python] Add pyarrow.record_batch factory function with same basic API / semantics as pyarrow.table

2019-08-23 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-6342: --- Summary: [Python] Add pyarrow.record_batch factory function with same basic API / semantics as pyarrow.table Key: ARROW-6342 URL: https://issues.apache.org/jira/browse/ARROW-6342

[jira] [Updated] (ARROW-6279) [Python] Add Table.slice method or allow slices in __getitem__

2019-08-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6279: -- Labels: pull-request-available (was: ) > [Python] Add Table.slice method or allow slices in

[jira] [Assigned] (ARROW-6279) [Python] Add Table.slice method or allow slices in __getitem__

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6279: --- Assignee: Wes McKinney > [Python] Add Table.slice method or allow slices in __getitem__ >

[jira] [Updated] (ARROW-6312) [C++] Declare required Libs.private in arrow.pc package config

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6312: Summary: [C++] Declare required Libs.private in arrow.pc package config (was: Declare required

[jira] [Resolved] (ARROW-6328) Click.option-s should have help text

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6328. - Fix Version/s: 0.15.0 Resolution: Fixed Issue resolved by pull request 5159

[jira] [Resolved] (ARROW-6271) [Rust] [DataFusion] Add example for running SQL against Parquet

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6271. - Resolution: Fixed Issue resolved by pull request 5161

[jira] [Resolved] (ARROW-6325) [Python] wrong conversion of DataFrame with boolean values

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6325. - Resolution: Fixed Issue resolved by pull request 5176

[jira] [Resolved] (ARROW-6319) [C++] Extract the core of NumericTensor::Value as Tensor::Value

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6319. - Fix Version/s: 0.15.0 Resolution: Fixed Issue resolved by pull request 5160

[jira] [Updated] (ARROW-6203) [GLib] Add garrow_array_sort_to_indices()

2019-08-23 Thread Yosuke Shiro (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yosuke Shiro updated ARROW-6203: Summary: [GLib] Add garrow_array_sort_to_indices() (was: [GLib] Add garrow_array_argsort()) >

[jira] [Resolved] (ARROW-6232) [C++] Rename Argsort kernel to SortToIndices

2019-08-23 Thread Sutou Kouhei (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6232?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sutou Kouhei resolved ARROW-6232. - Fix Version/s: 0.15.0 Resolution: Fixed Issue resolved by pull request 5080

[jira] [Resolved] (ARROW-5686) [R] Review R Windows CI build

2019-08-23 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-5686. --- Resolution: Fixed Issue resolved by pull request 5170

[jira] [Assigned] (ARROW-6340) [R] Implements low-level bindings to Dataset classes

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6340?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-6340: -- Assignee: Romain François > [R] Implements low-level bindings to Dataset classes >

[jira] [Created] (ARROW-6341) [Python] Implements low-level bindings to Dataset classes:

2019-08-23 Thread Francois Saint-Jacques (Jira)
Francois Saint-Jacques created ARROW-6341: - Summary: [Python] Implements low-level bindings to Dataset classes: Key: ARROW-6341 URL: https://issues.apache.org/jira/browse/ARROW-6341 Project:

[jira] [Created] (ARROW-6340) [R] Implements low-level bindings to Dataset classes

2019-08-23 Thread Francois Saint-Jacques (Jira)
Francois Saint-Jacques created ARROW-6340: - Summary: [R] Implements low-level bindings to Dataset classes Key: ARROW-6340 URL: https://issues.apache.org/jira/browse/ARROW-6340 Project: Apache

[jira] [Assigned] (ARROW-6339) [Python][C++] Rowgroup statistics for pd.NaT array ill defined

2019-08-23 Thread Florian Jetter (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Florian Jetter reassigned ARROW-6339: - Assignee: Florian Jetter > [Python][C++] Rowgroup statistics for pd.NaT array ill

[jira] [Created] (ARROW-6339) [Python][C++] Rowgroup statistics for pd.NaT array ill defined

2019-08-23 Thread Florian Jetter (Jira)
Florian Jetter created ARROW-6339: - Summary: [Python][C++] Rowgroup statistics for pd.NaT array ill defined Key: ARROW-6339 URL: https://issues.apache.org/jira/browse/ARROW-6339 Project: Apache Arrow

[jira] [Assigned] (ARROW-6242) [C++] Implements basic Dataset/Scanner/ScannerBuilder

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-6242: -- Assignee: Neal Richardson (was: Francois Saint-Jacques) > [C++] Implements basic

[jira] [Assigned] (ARROW-6242) [C++] Implements basic Dataset/Scanner/ScannerBuilder

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-6242: -- Assignee: Francois Saint-Jacques (was: Neal Richardson) > [C++] Implements basic

[jira] [Updated] (ARROW-6328) Click.option-s should have help text

2019-08-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6328: -- Labels: pull-request-available (was: ) > Click.option-s should have help text >

[jira] [Commented] (ARROW-6338) [R] Type function names don't match type names

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914517#comment-16914517 ] Neal Richardson commented on ARROW-6338: We are using those factory functions currently: 

[jira] [Comment Edited] (ARROW-6338) [R] Type function names don't match type names

2019-08-23 Thread Benjamin Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914516#comment-16914516 ] Benjamin Kietzman edited comment on ARROW-6338 at 8/23/19 5:58 PM: ---

[jira] [Assigned] (ARROW-1299) [Doc] Publish nightly documentation against master somewhere

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1299?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-1299: -- Assignee: Neal Richardson > [Doc] Publish nightly documentation against master

[jira] [Updated] (ARROW-3054) [Packaging] Tooling to enable nightly conda packages to be updated to some anaconda.org channel

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-3054: --- Labels: conda (was: ) > [Packaging] Tooling to enable nightly conda packages to be updated

[jira] [Updated] (ARROW-5158) [Packaging][Wheel] Symlink libraries in wheels

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-5158: --- Labels: wheel (was: ) > [Packaging][Wheel] Symlink libraries in wheels >

[jira] [Updated] (ARROW-3277) [Python] Validate manylinux1 builds with crossbow instead of each Travis CI build

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-3277: --- Labels: wheel (was: ) > [Python] Validate manylinux1 builds with crossbow instead of each

[jira] [Updated] (ARROW-1581) [Python] Set up nightly wheel builds for Linux, macOS

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-1581: --- Labels: nightly wheel (was: nightly) > [Python] Set up nightly wheel builds for Linux,

[jira] [Updated] (ARROW-5522) [Packaging] Comments out of date in python/manylinux1/build_arrow.sh

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5522?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-5522: --- Labels: wheel (was: ) > [Packaging] Comments out of date in

[jira] [Updated] (ARROW-5082) [Python][Packaging] Reduce size of macOS and manylinux1 wheels

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-5082: --- Labels: pull-request-available wheel (was: pull-request-available) > [Python][Packaging]

[jira] [Updated] (ARROW-5101) [Packaging] Avoid bundling static libraries in Windows conda packages

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-5101: --- Labels: conda (was: ) > [Packaging] Avoid bundling static libraries in Windows conda

[jira] [Updated] (ARROW-6015) [Python] pyarrow: `DLL load failed` when importing on windows

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6015: --- Labels: wheel (was: ) > [Python] pyarrow: `DLL load failed` when importing on windows >

[jira] [Updated] (ARROW-6015) [Python] pyarrow: `DLL load failed` when importing on windows

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6015: --- Issue Type: Bug (was: Improvement) > [Python] pyarrow: `DLL load failed` when importing on

[jira] [Updated] (ARROW-6015) [Python] pyarrow: `DLL load failed` when importing on windows

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6015: --- Component/s: Python > [Python] pyarrow: `DLL load failed` when importing on windows >

[jira] [Updated] (ARROW-6119) [Python] PyArrow import fails on Windows Python 3.7

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6119: --- Labels: wheel (was: ) > [Python] PyArrow import fails on Windows Python 3.7 >

[jira] [Commented] (ARROW-6338) [R] Type function names don't match type names

2019-08-23 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914484#comment-16914484 ] Francois Saint-Jacques commented on ARROW-6338: --- The names you want to follow are the

[jira] [Resolved] (ARROW-6202) [Java] Exception in thread "main" org.apache.arrow.memory.OutOfMemoryException: Unable to allocate buffer of size 4 due to memory limit. Current allocation: 2147483646

2019-08-23 Thread Bryan Cutler (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler resolved ARROW-6202. - Fix Version/s: 0.15.0 Resolution: Fixed Issue resolved by pull request 5134

[jira] [Assigned] (ARROW-6271) [Rust] [DataFusion] Add example for running SQL against Parquet

2019-08-23 Thread Paddy Horan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6271?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paddy Horan reassigned ARROW-6271: -- Assignee: Andrew Schoenberger > [Rust] [DataFusion] Add example for running SQL against

[jira] [Created] (ARROW-6338) [R] Type function names don't match type names

2019-08-23 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-6338: -- Summary: [R] Type function names don't match type names Key: ARROW-6338 URL: https://issues.apache.org/jira/browse/ARROW-6338 Project: Apache Arrow

[jira] [Resolved] (ARROW-6126) [C++] IPC stream reader handling of empty streams potentially not robust

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6126. - Resolution: Fixed Issue resolved by pull request 5146

[jira] [Commented] (ARROW-6317) [JS] Implement changes to ensure flatbuffer alignment

2019-08-23 Thread Brian Hulette (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914459#comment-16914459 ] Brian Hulette commented on ARROW-6317: -- oh! so he did, thanks for letting me know. I'll assign this

[jira] [Assigned] (ARROW-6317) [JS] Implement changes to ensure flatbuffer alignment

2019-08-23 Thread Brian Hulette (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brian Hulette reassigned ARROW-6317: Assignee: Paul Taylor (was: Brian Hulette) > [JS] Implement changes to ensure flatbuffer

[jira] [Commented] (ARROW-6337) [R] as_tibble in R API is a misnomer

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914457#comment-16914457 ] Neal Richardson commented on ARROW-6337: Ben and François are making them. > [R] as_tibble in R

[jira] [Resolved] (ARROW-6311) [Java] Make ApproxEqualsVisitor accept DiffFunction to make it more flexible

2019-08-23 Thread Pindikura Ravindra (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6311?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pindikura Ravindra resolved ARROW-6311. --- Fix Version/s: 0.15.0 Resolution: Fixed Issue resolved by pull request 5155

[jira] [Commented] (ARROW-6317) [JS] Implement changes to ensure flatbuffer alignment

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914453#comment-16914453 ] Wes McKinney commented on ARROW-6317: - [~paul.e.taylor] wrote on the mailing list that he would also

[jira] [Assigned] (ARROW-6317) [JS] Implement changes to ensure flatbuffer alignment

2019-08-23 Thread Brian Hulette (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6317?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Brian Hulette reassigned ARROW-6317: Assignee: Brian Hulette > [JS] Implement changes to ensure flatbuffer alignment >

[jira] [Commented] (ARROW-6317) [JS] Implement changes to ensure flatbuffer alignment

2019-08-23 Thread Brian Hulette (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6317?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914451#comment-16914451 ] Brian Hulette commented on ARROW-6317: -- I can take a look tonight. > [JS] Implement changes to

[jira] [Commented] (ARROW-6337) [R] as_tibble in R API is a misnomer

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914450#comment-16914450 ] Wes McKinney commented on ARROW-6337: - [~npr] are there associated JIRA issues? > [R] as_tibble in R

[jira] [Commented] (ARROW-5691) [C++] Relocate src/parquet/arrow code to src/arrow/dataset/parquet

2019-08-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914439#comment-16914439 ] Antoine Pitrou commented on ARROW-5691: --- Ah... Then perhaps the Parquet-Arrow bridge needs to be in

[jira] [Commented] (ARROW-5691) [C++] Relocate src/parquet/arrow code to src/arrow/dataset/parquet

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914438#comment-16914438 ] Wes McKinney commented on ARROW-5691: - I'm OK to leave things as they are now, since it's not really

[jira] [Commented] (ARROW-6337) [R] as_tibble in R API is a misnomer

2019-08-23 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914436#comment-16914436 ] Neal Richardson commented on ARROW-6337: It is slightly more nuanced than that. It is a tibble,

[jira] [Comment Edited] (ARROW-5691) [C++] Relocate src/parquet/arrow code to src/arrow/dataset/parquet

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914433#comment-16914433 ] Wes McKinney edited comment on ARROW-5691 at 8/23/19 4:26 PM: -- Correct. This

[jira] [Commented] (ARROW-5691) [C++] Relocate src/parquet/arrow code to src/arrow/dataset/parquet

2019-08-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914384#comment-16914384 ] Antoine Pitrou commented on ARROW-5691: --- If we have a "formats" directory I'd still like to have

[jira] [Commented] (ARROW-5691) [C++] Relocate src/parquet/arrow code to src/arrow/dataset/parquet

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914378#comment-16914378 ] Wes McKinney commented on ARROW-5691: - We can also flatten these directories and instead use

[jira] [Commented] (ARROW-5691) [C++] Relocate src/parquet/arrow code to src/arrow/dataset/parquet

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914381#comment-16914381 ] Wes McKinney commented on ARROW-5691: - We still have the problem of which shared library to put the

[jira] [Updated] (ARROW-6332) [Java][C++][Gandiva] Handle size of varchar vectors correctly

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6332: Summary: [Java][C++][Gandiva] Handle size of varchar vectors correctly (was: [Java] [CPP] Handle

[jira] [Updated] (ARROW-6332) [Java][C++][Gandiva] Handle size of varchar vectors correctly

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6332: Component/s: Java C++ - Gandiva > [Java][C++][Gandiva] Handle size of varchar

[jira] [Commented] (ARROW-5691) [C++] Relocate src/parquet/arrow code to src/arrow/dataset/parquet

2019-08-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914337#comment-16914337 ] Antoine Pitrou commented on ARROW-5691: --- Then "formats" plural, because "format" singular sounds

[jira] [Commented] (ARROW-5691) [C++] Relocate src/parquet/arrow code to src/arrow/dataset/parquet

2019-08-23 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914327#comment-16914327 ] Francois Saint-Jacques commented on ARROW-5691: --- I prefer `src/arrow/format`, where we'd

[jira] [Comment Edited] (ARROW-5691) [C++] Relocate src/parquet/arrow code to src/arrow/dataset/parquet

2019-08-23 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914327#comment-16914327 ] Francois Saint-Jacques edited comment on ARROW-5691 at 8/23/19 3:01 PM:

[jira] [Commented] (ARROW-5691) [C++] Relocate src/parquet/arrow code to src/arrow/dataset/parquet

2019-08-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914320#comment-16914320 ] Antoine Pitrou commented on ARROW-5691: --- Inside "src/arrow/dataset" directory, intuitively I'd only

[jira] [Commented] (ARROW-5691) [C++] Relocate src/parquet/arrow code to src/arrow/dataset/parquet

2019-08-23 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914317#comment-16914317 ] Antoine Pitrou commented on ARROW-5691: --- I also dislike the "adapters" name (too generic, it could

[jira] [Commented] (ARROW-5691) [C++] Relocate src/parquet/arrow code to src/arrow/dataset/parquet

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914304#comment-16914304 ] Wes McKinney commented on ARROW-5691: - FWIW I dislike the "adapters" name and I don't know that the

[jira] [Assigned] (ARROW-1456) [Python] Run s3fs unit tests in Travis CI

2019-08-23 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1456?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc reassigned ARROW-1456: - Assignee: Rok Mihevc > [Python] Run s3fs unit tests in Travis CI >

[jira] [Assigned] (ARROW-4208) [CI/Python] Have automatized tests for S3

2019-08-23 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc reassigned ARROW-4208: - Assignee: Rok Mihevc > [CI/Python] Have automatized tests for S3 >

[jira] [Resolved] (ARROW-4111) [Python] Create time types from Python sequences of integers

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4111?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-4111. - Resolution: Fixed Resolved as part of ARROW-6227 in

[jira] [Resolved] (ARROW-6227) [Python] pyarrow.array() shouldn't coerce np.nan to string

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6227. - Resolution: Fixed Issue resolved by pull request 5150

[jira] [Created] (ARROW-6336) [Python] Clarify pyarrow.serialize/deserialize docstrings viz-a-viz relationship with Arrow IPC protocol

2019-08-23 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-6336: --- Summary: [Python] Clarify pyarrow.serialize/deserialize docstrings viz-a-viz relationship with Arrow IPC protocol Key: ARROW-6336 URL:

[GitHub] [arrow-testing] wesm commented on issue #9: ARROW-6318: add generated files from integration test to testing

2019-08-23 Thread GitBox
wesm commented on issue #9: ARROW-6318: add generated files from integration test to testing URL: https://github.com/apache/arrow-testing/pull/9#issuecomment-524328481 Can we gzip the json? I also wonder if we can trim the size of the decimal test

[jira] [Resolved] (ARROW-6330) [C++] Include missing headers in api.h

2019-08-23 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6330. - Fix Version/s: 0.15.0 Resolution: Fixed Issue resolved by pull request 5175

[jira] [Updated] (ARROW-6332) [Java] [CPP] Handle size of varchar vectors correctly

2019-08-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6332?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6332: -- Labels: pull-request-available (was: ) > [Java] [CPP] Handle size of varchar vectors

[jira] [Commented] (ARROW-2428) [Python] Add API to map Arrow types (including extension types) to pandas ExtensionArray instances for to_pandas conversions

2019-08-23 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914231#comment-16914231 ] Joris Van den Bossche commented on ARROW-2428: -- I am working on the actual ability to create

[jira] [Updated] (ARROW-6334) [Java] Improve the dictionary builder API to return the position of the value in the dictionary

2019-08-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6334: -- Labels: pull-request-available (was: ) > [Java] Improve the dictionary builder API to return

[jira] [Created] (ARROW-6334) [Java] Improve the dictionary builder API to return the position of the value in the dictionary

2019-08-23 Thread Liya Fan (Jira)
Liya Fan created ARROW-6334: --- Summary: [Java] Improve the dictionary builder API to return the position of the value in the dictionary Key: ARROW-6334 URL: https://issues.apache.org/jira/browse/ARROW-6334

[jira] [Created] (ARROW-6333) [C++] Third party download URLs are duplicated

2019-08-23 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-6333: - Summary: [C++] Third party download URLs are duplicated Key: ARROW-6333 URL: https://issues.apache.org/jira/browse/ARROW-6333 Project: Apache Arrow Issue

[jira] [Updated] (ARROW-5220) [Python] index / unknown columns in specified schema in Table.from_pandas

2019-08-23 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-5220: - Fix Version/s: 0.15.0 > [Python] index / unknown columns in specified schema in

[jira] [Commented] (ARROW-5220) [Python] index / unknown columns in specified schema in Table.from_pandas

2019-08-23 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5220?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914040#comment-16914040 ] Joris Van den Bossche commented on ARROW-5220: -- What I suggested here (about

[jira] [Comment Edited] (ARROW-5494) [Python] Create FileSystem bindings

2019-08-23 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914027#comment-16914027 ] Joris Van den Bossche edited comment on ARROW-5494 at 8/23/19 7:48 AM:

[jira] [Commented] (ARROW-5494) [Python] Create FileSystem bindings

2019-08-23 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5494?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16914027#comment-16914027 ] Joris Van den Bossche commented on ARROW-5494: -- I would happy to help here, although I will

[jira] [Updated] (ARROW-6325) [Python] wrong conversion of DataFrame with boolean values

2019-08-23 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6325: -- Labels: pull-request-available (was: ) > [Python] wrong conversion of DataFrame with boolean

[jira] [Assigned] (ARROW-6325) [Python] wrong conversion of DataFrame with boolean values

2019-08-23 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-6325: Assignee: Joris Van den Bossche > [Python] wrong conversion of DataFrame

[jira] [Created] (ARROW-6332) [Java] [CPP] Handle size of varchar vectors correctly

2019-08-23 Thread Praveen Kumar Desabandu (Jira)
Praveen Kumar Desabandu created ARROW-6332: -- Summary: [Java] [CPP] Handle size of varchar vectors correctly Key: ARROW-6332 URL: https://issues.apache.org/jira/browse/ARROW-6332 Project:

[GitHub] [arrow-testing] emkornfield opened a new pull request #9: ARROW-6318: add generated files from integration test to testing

2019-08-23 Thread GitBox
emkornfield opened a new pull request #9: ARROW-6318: add generated files from integration test to testing URL: https://github.com/apache/arrow-testing/pull/9 First part in incorporating these into the integration test is getting them checked in. CC @wesm

[jira] [Commented] (ARROW-5337) [C++] Add RecordBatch::field method, possibly deprecate "column"

2019-08-23 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16913991#comment-16913991 ] Joris Van den Bossche commented on ARROW-5337: -- Since there is also a {{arrow::Field}} which

[jira] [Commented] (ARROW-5630) [Python] Table of nested arrays doesn't round trip

2019-08-23 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5630?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16913988#comment-16913988 ] Joris Van den Bossche commented on ARROW-5630: -- Yes, get the same error on latest master. >

[jira] [Updated] (ARROW-6144) [C++][Gandiva] Implement random function in Gandiva

2019-08-23 Thread Prudhvi Porandla (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Prudhvi Porandla updated ARROW-6144: Description: Implement random(), random(int seed) functions. The values are sampled from a