[jira] [Commented] (ARROW-2709) [Python] write_to_dataset poor performance when splitting

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16729054#comment-16729054 ] Wes McKinney commented on ARROW-2709: - We do plan to implement group-by operations on Arrow tables

[jira] [Created] (ARROW-4116) [Python] Clarify in development.rst that virtualenv cannot be used with miniconda/Anaconda

2018-12-26 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-4116: --- Summary: [Python] Clarify in development.rst that virtualenv cannot be used with miniconda/Anaconda Key: ARROW-4116 URL: https://issues.apache.org/jira/browse/ARROW-4116

[jira] [Created] (ARROW-4115) [Gandiva] valgrind complains that boolean output data buffer has uninited data

2018-12-26 Thread Pindikura Ravindra (JIRA)
Pindikura Ravindra created ARROW-4115: - Summary: [Gandiva] valgrind complains that boolean output data buffer has uninited data Key: ARROW-4115 URL: https://issues.apache.org/jira/browse/ARROW-4115

[jira] [Updated] (ARROW-4115) [Gandiva] valgrind complains that boolean output data buffer has uninited data

2018-12-26 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-4115: -- Labels: pull-request-available (was: ) > [Gandiva] valgrind complains that boolean output

[jira] [Commented] (ARROW-2709) [Python] write_to_dataset poor performance when splitting

2018-12-26 Thread Lee June Woo (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16728948#comment-16728948 ] Lee June Woo commented on ARROW-2709: - Hello, May I ask you a simple question about the improvement? I

[jira] [Comment Edited] (ARROW-2709) [Python] write_to_dataset poor performance when splitting

2018-12-26 Thread Lee June Woo (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2709?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16728948#comment-16728948 ] Lee June Woo edited comment on ARROW-2709 at 12/26/18 9:20 AM: --- Hello, May

[jira] [Commented] (ARROW-3133) [C++] Logical boolean kernels in kernels/boolean.cc cannot write into preallocated memory

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16729096#comment-16729096 ] Wes McKinney commented on ARROW-3133: - No, there is unavoidable memory allocation in all of the

[jira] [Commented] (ARROW-3324) [Python] Users reporting memory leaks using pa.pq.ParquetDataset

2018-12-26 Thread Tanya Schlusser (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16729131#comment-16729131 ] Tanya Schlusser commented on ARROW-3324: The file

[jira] [Closed] (ARROW-3968) Standalone CSV to Arrow Conversion Tool

2018-12-26 Thread Bhaskar Mookerji (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bhaskar Mookerji closed ARROW-3968. Resolution: Won't Do. See discussion around adding C++ CLI tools in

[jira] [Updated] (ARROW-4118) [Python] Error with "asv run"

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4118: Fix Version/s: (was: 0.12.0) 0.13.0 > [Python] Error with "asv run" >

[jira] [Updated] (ARROW-4117) [Python] "asv dev" command fails with latest revision

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4117: Fix Version/s: (was: 0.12.0) 0.13.0 > [Python] "asv dev" command fails with

[jira] [Commented] (ARROW-4117) [Python] "asv dev" command fails with latest revision

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16729185#comment-16729185 ] Wes McKinney commented on ARROW-4117: - I implemented a workaround in

[jira] [Updated] (ARROW-4118) [Python] Error with "asv run"

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4118: Summary: [Python] Error with "asv run" (was: [Python] More detailed benchmarking documentation)

[jira] [Assigned] (ARROW-4102) [C++] FixedSizeBinary identity cast not implemented

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-4102: --- Assignee: Wes McKinney (was: Francois Saint-Jacques) > [C++] FixedSizeBinary identity cast

[jira] [Commented] (ARROW-3133) [C++] Logical boolean kernels in kernels/boolean.cc cannot write into preallocated memory

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16729118#comment-16729118 ] Wes McKinney commented on ARROW-3133: - It's not urgent so feel free to have a hack at it. You can

[jira] [Resolved] (ARROW-4100) [Gandiva][C++] Fix regex to ignore "." character

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-4100. Resolution: Fixed. Issue resolved by pull request 3241

[jira] [Updated] (ARROW-3324) [Python] Users reporting memory leaks using pa.pq.ParquetDataset

2018-12-26 Thread Tanya Schlusser (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tanya Schlusser updated ARROW-3324: --- Attachment: arrow_3324_leak_on_write.py > [Python] Users reporting memory leaks using

[jira] [Updated] (ARROW-4118) [Python] More detailed benchmarking documentation

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4118: Issue Type: Bug (was: Improvement) > [Python] More detailed benchmarking documentation >

[jira] [Updated] (ARROW-4118) [Python] Error with "asv run"

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4118: Fix Version/s: (was: 0.13.0) 0.12.0 > [Python] Error with "asv run" >

[jira] [Commented] (ARROW-4118) [Python] More detailed benchmarking documentation

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16729178#comment-16729178 ] Wes McKinney commented on ARROW-4118: - I'm not able to run "asv run" (getting the error above) and I

[jira] [Created] (ARROW-4117) [Python] "asv dev" command fails with latest revision

2018-12-26 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-4117: --- Summary: [Python] "asv dev" command fails with latest revision Key: ARROW-4117 URL: https://issues.apache.org/jira/browse/ARROW-4117 Project: Apache Arrow

[jira] [Updated] (ARROW-4078) [CI] Run Travis job where documentation is built when docs/ is changed

2018-12-26 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-4078: -- Labels: pull-request-available (was: ) > [CI] Run Travis job where documentation is built

[jira] [Resolved] (ARROW-4114) [C++][DOCUMENTATION]

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4114?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-4114. Resolution: Fixed; Fix Version/s: 0.12.0. Issue resolved by pull request 3260

[jira] [Created] (ARROW-4118) [Python] More detailed benchmarking documentation

2018-12-26 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-4118: --- Summary: [Python] More detailed benchmarking documentation Key: ARROW-4118 URL: https://issues.apache.org/jira/browse/ARROW-4118 Project: Apache Arrow Issue

[jira] [Commented] (ARROW-4118) [Python] More detailed benchmarking documentation

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16729176#comment-16729176 ] Wes McKinney commented on ARROW-4118: - Hm that actually turned out to not be the problem. Still

[jira] [Assigned] (ARROW-4116) [Python] Clarify in development.rst that virtualenv cannot be used with miniconda/Anaconda

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-4116: --- Assignee: Wes McKinney > [Python] Clarify in development.rst that virtualenv cannot be used

[jira] [Assigned] (ARROW-4078) [CI] Run Travis job where documentation is built when docs/ is changed

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-4078: --- Assignee: Wes McKinney > [CI] Run Travis job where documentation is built when docs/ is

[jira] [Updated] (ARROW-4102) [C++] FixedSizeBinary identity cast not implemented

2018-12-26 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4102?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-4102: -- Labels: pull-request-available (was: ) > [C++] FixedSizeBinary identity cast not implemented

[jira] [Updated] (ARROW-4118) [Python] More detailed benchmarking documentation

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4118?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-4118: Description: We should write more documentation about common asv workflows. Having not run them

[jira] [Resolved] (ARROW-4103) [Documentation] Add README to docs/ root

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-4103. Resolution: Fixed. Issue resolved by pull request 3243

[jira] [Assigned] (ARROW-4103) [Documentation] Add README to docs/ root

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-4103: --- Assignee: Wes McKinney > [Documentation] Add README to docs/ root >

[jira] [Commented] (ARROW-3133) [C++] Logical boolean kernels in kernels/boolean.cc cannot write into preallocated memory

2018-12-26 Thread Micah Kornfield (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16729111#comment-16729111 ] Micah Kornfield commented on ARROW-3133: Missed those. If this isn't urgent I can take a look at

[jira] [Resolved] (ARROW-4115) [Gandiva] valgrind complains that boolean output data buffer has uninited data

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4115?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-4115. Resolution: Fixed; Fix Version/s: 0.12.0. Issue resolved by pull request 3263

[jira] [Updated] (ARROW-4116) [Python] Clarify in development.rst that virtualenv cannot be used with miniconda/Anaconda

2018-12-26 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-4116: -- Labels: pull-request-available (was: ) > [Python] Clarify in development.rst that virtualenv

[jira] [Created] (ARROW-4119) [C++] Clean up cast implementation from null to other types

2018-12-26 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-4119: --- Summary: [C++] Clean up cast implementation from null to other types Key: ARROW-4119 URL: https://issues.apache.org/jira/browse/ARROW-4119 Project: Apache Arrow

[jira] [Resolved] (ARROW-4116) [Python] Clarify in development.rst that virtualenv cannot be used with miniconda/Anaconda

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4116?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-4116. Resolution: Fixed. Issue resolved by pull request 3264

[jira] [Resolved] (ARROW-4112) [Packaging][Gandiva] Add support for deb packages

2018-12-26 Thread Kouhei Sutou (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-4112. Resolution: Fixed; Fix Version/s: 0.12.0. Issue resolved by pull request 3258

[jira] [Assigned] (ARROW-3324) [Python] Users reporting memory leaks using pa.pq.ParquetDataset

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-3324: --- Assignee: Wes McKinney > [Python] Users reporting memory leaks using pa.pq.ParquetDataset >

[jira] [Resolved] (ARROW-3324) [Python] Users reporting memory leaks using pa.pq.ParquetDataset

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-3324. Resolution: Fixed. Issue resolved by pull request 3261

[jira] [Commented] (ARROW-4120) [Python] Define process for testing procedures that check for no macro-level memory leaks

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16729203#comment-16729203 ] Wes McKinney commented on ARROW-4120: - I'm implementing something very simple for ARROW-3324, but we

[jira] [Created] (ARROW-4120) [Python] Define process for testing procedures that check for no macro-level memory leaks

2018-12-26 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-4120: --- Summary: [Python] Define process for testing procedures that check for no macro-level memory leaks Key: ARROW-4120 URL: https://issues.apache.org/jira/browse/ARROW-4120

[jira] [Assigned] (ARROW-3133) [C++] Logical boolean kernels in kernels/boolean.cc cannot write into preallocated memory

2018-12-26 Thread Micah Kornfield (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micah Kornfield reassigned ARROW-3133: -- Assignee: Micah Kornfield > [C++] Logical boolean kernels in kernels/boolean.cc

[jira] [Created] (ARROW-4121) [C++] Remove memory allocation from InvertKernel

2018-12-26 Thread Micah Kornfield (JIRA)
Micah Kornfield created ARROW-4121: -- Summary: [C++] Remove memory allocation from InvertKernel Key: ARROW-4121 URL: https://issues.apache.org/jira/browse/ARROW-4121 Project: Apache Arrow

[jira] [Commented] (ARROW-4121) [C++] Remove memory allocation from InvertKernel

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16729330#comment-16729330 ] Wes McKinney commented on ARROW-4121: - To be clear, we need to have a usable code path for both the

[jira] [Resolved] (ARROW-4078) [CI] Run Travis job where documentation is built when docs/ is changed

2018-12-26 Thread Kouhei Sutou (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-4078. Resolution: Fixed. Issue resolved by pull request 3266

[jira] [Updated] (ARROW-3324) [Parquet] Free more internal resources when writing multiple row groups

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3324: Summary: [Parquet] Free more internal resources when writing multiple row groups (was: [Python]

[jira] [Commented] (ARROW-4120) [Python] Define process for testing procedures that check for no macro-level memory leaks

2018-12-26 Thread Antoine Pitrou (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16729209#comment-16729209 ] Antoine Pitrou commented on ARROW-4120: --- Is it specifically about Python reference leaks? I have

[jira] [Updated] (ARROW-3324) [Python] Free more internal resources when writing multiple row groups

2018-12-26 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-3324: Summary: [Python] Free more internal resources when writing multiple row groups (was: [Python]