[jira] [Updated] (ARROW-4739) [Rust] [DataFusion] It should be possible to share a logical plan between threads

2019-03-03 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4739?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4739:
--
Labels: pull-request-available  (was: )

> [Rust] [DataFusion] It should be possible to share a logical plan between 
> threads
> -
>
> Key: ARROW-4739
> URL: https://issues.apache.org/jira/browse/ARROW-4739
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust, Rust - DataFusion
>Affects Versions: 0.12.0
>Reporter: Andy Grove
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> I want to be able to compile sql to a logical plan and then share that plan 
> with other threads ( so I can run the same query in parallel on partitions of 
> my input relation).
>  
> A/C
>  * LogicalPlan uses Arc instead of Rc
>  * ExecutionContext has a create_logical_plan method
>  * ExecutionContext.sql() is refactored to call create_logical_plan



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4745) [C++][Documentation] Document process for replicating static_crt builds on windows

2019-03-03 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4745?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4745:
--
Labels: pull-request-available  (was: )

> [C++][Documentation] Document process for replicating static_crt builds on 
> windows
> --
>
> Key: ARROW-4745
> URL: https://issues.apache.org/jira/browse/ARROW-4745
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++, Documentation
>Reporter: Micah Kornfield
>Assignee: Micah Kornfield
>Priority: Minor
>  Labels: pull-request-available
>
> Based on collective wisdom of the mailing list. Give some step by step 
> instructions to getting things to build.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4749) [Rust] RecordBatch::new() should return result instead of panicking

2019-03-03 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4749:
--
Labels: pull-request-available  (was: )

> [Rust] RecordBatch::new() should return result instead of panicking
> ---
>
> Key: ARROW-4749
> URL: https://issues.apache.org/jira/browse/ARROW-4749
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust
>Affects Versions: 0.12.0
>Reporter: Andy Grove
>Assignee: Neville Dipale
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> RecordBatch::new() has some good validation checks, but calls assert_eq 
> instead of returning a Result



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4754) [CI][Java] Flaky TestAuth Flight test

2019-03-03 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4754:
--
Labels: pull-request-available  (was: )

> [CI][Java] Flaky TestAuth Flight test
> -
>
> Key: ARROW-4754
> URL: https://issues.apache.org/jira/browse/ARROW-4754
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Continuous Integration, FlightRPC, Java
>Reporter: Micah Kornfield
>Assignee: Micah Kornfield
>Priority: Blocker
>  Labels: pull-request-available
>
> org.apache.arrow.flight.auth.TestAuth
> [ERROR] invalidAuth(org.apache.arrow.flight.auth.TestAuth) Time elapsed: 
> 0.013 s <<< ERROR!
> java.io.IOException: Failed to bind
>  at org.apache.arrow.flight.auth.TestAuth.setup(TestAuth.java:108)
> Caused by: java.net.BindException: Address already in use



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4756) [CI] document the procedure to update docker image for manylinux1 builds

2019-03-03 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4756?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4756:
--
Labels: pull-request-available  (was: )

> [CI] document the procedure to update docker image for manylinux1 builds
> 
>
> Key: ARROW-4756
> URL: https://issues.apache.org/jira/browse/ARROW-4756
> Project: Apache Arrow
>  Issue Type: Task
>  Components: Continuous Integration
>Reporter: Pindikura Ravindra
>Assignee: Pindikura Ravindra
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4758) [Flight] Build fails on Mac due to missing Schema_generated.h

2019-03-04 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4758:
--
Labels: pull-request-available  (was: )

> [Flight] Build fails on Mac due to missing Schema_generated.h
> -
>
> Key: ARROW-4758
> URL: https://issues.apache.org/jira/browse/ARROW-4758
> Project: Apache Arrow
>  Issue Type: Task
>  Components: FlightRPC
>Reporter: Pindikura Ravindra
>Priority: Major
>  Labels: pull-request-available
>
> I saw this on CI, a retrigger of the build fixed the issue and I am not able 
> to get the link of the previous build failure.
> The error happened for the file flight/client.cc, which includes 
> -ipc/metadata--internal.h, which includes arrow/ipc/Schema_generated.h
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4707) [C++] move BitsetStack to bit-util.h

2019-03-04 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4707?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4707:
--
Labels: pull-request-available  (was: )

> [C++] move BitsetStack to bit-util.h
> 
>
> Key: ARROW-4707
> URL: https://issues.apache.org/jira/browse/ARROW-4707
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: C++
>Reporter: Benjamin Kietzman
>Assignee: Benjamin Kietzman
>Priority: Trivial
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> BitsetStack was written for use in the JSON parser, but it's useful enough 
> that it should be made available in bit-util.h



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4448) [JAVA][Flight] Flaky Flight java test

2019-03-04 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4448:
--
Labels: ci-failure pull-request-available  (was: ci-failure)

> [JAVA][Flight] Flaky Flight java test
> -
>
> Key: ARROW-4448
> URL: https://issues.apache.org/jira/browse/ARROW-4448
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: FlightRPC, Java
>Reporter: Micah Kornfield
>Assignee: Francois Saint-Jacques
>Priority: Major
>  Labels: ci-failure, pull-request-available
> Fix For: 0.13.0
>
>
> Pull requests rebeased off of  the laster master seem to pass so this is 
> probably a flake: [https://travis-ci.org/apache/arrow/jobs/487275065]  
>  {{[INFO] Running org.apache.arrow.flight.TestBackPressure
> [ERROR] Tests run: 2, Failures: 0, Errors: 1, Skipped: 1, Time elapsed: 1.396 
> s <<< FAILURE! - in org.apache.arrow.flight.TestBackPressure
> [ERROR] ensureIndependentSteams(org.apache.arrow.flight.TestBackPressure)  
> Time elapsed: 1.394 s  <<< ERROR!
> java.lang.IllegalStateException: 
> Memory was leaked by query. Memory leaked: (131072)
> Allocator(perf-server) 0/131072/589824/9223372036854775807 
> (res/actual/peak/limit)
>   at 
> org.apache.arrow.flight.TestBackPressure.ensureIndependentSteams(TestBackPressure.java:76)
> [INFO] 
> [INFO] Results:
> [INFO] 
> [ERROR] Errors: 
> [ERROR]   TestBackPressure.ensureIndependentSteams:76 » IllegalState Memory 
> was leaked b...
> [INFO] 
> [ERROR] Tests run: 13, Failures: 0, Errors: 1, Skipped: 3
> [INFO] 
> [INFO] 
> 
> [INFO] Reactor Summary:
> [INFO] 
> [INFO] Apache Arrow Java Root POM 0.13.0-SNAPSHOT . SUCCESS [  8.590 
> s]
> [INFO] Arrow Format ... SUCCESS [  5.985 
> s]
> [INFO] Arrow Memory ... SUCCESS [ 12.750 
> s]
> [INFO] Arrow Vectors .. SUCCESS [01:13 
> min]
> [INFO] Arrow Tools  SUCCESS [ 15.462 
> s]
> [INFO] Arrow JDBC Adapter . SUCCESS [ 11.906 
> s]
> [INFO] Arrow Plasma Client  SUCCESS [  3.967 
> s]
> [INFO] Arrow Flight 0.13.0-SNAPSHOT ... FAILURE [ 18.237 
> s]
> [INFO] 
> 
> [INFO] BUILD FAILURE
> [INFO] 
> 
> [INFO] Total time: 02:30 min
> [INFO] Finished at: 2019-02-01T05:15:34Z
> [INFO] 
> 
> [ERROR] Failed to execute goal 
> org.apache.maven.plugins:maven-surefire-plugin:2.20:test (default-test) on 
> project arrow-flight: There are test failures.
> [ERROR] 
> [ERROR] Please refer to 
> /home/travis/build/apache/arrow/java/flight/target/surefire-reports for the 
> individual test results.
> [ERROR] Please refer to dump files (if any exist) [date]-jvmRun[N].dump, 
> [date].dumpstream and [date]-jvmRun[N].dumpstream.
> [ERROR] -> [Help 1]
> [ERROR] 
> [ERROR] To see the full stack trace of the errors, re-run Maven with the -e 
> switch.
> [ERROR] Re-run Maven using the -X switch to enable full debug logging.
> [ERROR] 
> [ERROR] For more information about the errors and possible solutions, please 
> read the following articles:
> [ERROR] [Help 1] 
> http://cwiki.apache.org/confluence/display/MAVEN/MojoFailureException
> [ERROR] 
> [ERROR] After correcting the problems, you can resume the build with the 
> command
> [ERROR]   mvn  -rf :arrow-flight
> }}
>  
>  
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4719) [C#] Implement ChunkedArray, Column and Table in C#

2019-03-04 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4719:
--
Labels: pull-request-available  (was: )

> [C#] Implement ChunkedArray, Column and Table in C#
> ---
>
> Key: ARROW-4719
> URL: https://issues.apache.org/jira/browse/ARROW-4719
> Project: Apache Arrow
>  Issue Type: Task
>  Components: C#
>Reporter: Prashanth Govindarajan
>Priority: Major
>  Labels: pull-request-available
>
> Implement "ChunkedArray", "Column" and "Table" in arrow/csharp. They are 
> useful types to represent pieces in a dataset



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4767) [C#] ArrowStreamReader crashes while reading the end of a stream

2019-03-04 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4767:
--
Labels: pull-request-available  (was: )

> [C#] ArrowStreamReader crashes while reading the end of a stream
> 
>
> Key: ARROW-4767
> URL: https://issues.apache.org/jira/browse/ARROW-4767
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C#
>Reporter: Prashanth Govindarajan
>Priority: Major
>  Labels: pull-request-available
>
> ReadRecordBatchAsync crashes at the end of a stream when messageLength is 0. 
> "0" indicates the end of the stream, so we should just return null. The call 
> Flatbug.Message.GetRootAsMessage seems to be crashing. The fix is simple and 
> safe. I'll have a PR up soon. 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4768) [C++][CI] arrow-test-array sometimes gets stuck in MinGW build

2019-03-04 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4768?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4768:
--
Labels: pull-request-available  (was: )

> [C++][CI] arrow-test-array sometimes gets stuck in MinGW build
> --
>
> Key: ARROW-4768
> URL: https://issues.apache.org/jira/browse/ARROW-4768
> Project: Apache Arrow
>  Issue Type: Test
>  Components: C++, Continuous Integration
>Reporter: Kouhei Sutou
>Assignee: Kouhei Sutou
>Priority: Minor
>  Labels: pull-request-available
>
> Example: 
> https://ci.appveyor.com/project/ApacheSoftwareFoundation/arrow/builds/22804493/job/6mbpslm97p4yj31c#L726
> {noformat}
>   Start  2: arrow-array-test
> {noformat}
> isn't finished.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4335) [C++] Better document sparse tensor support

2019-03-04 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4335:
--
Labels: pull-request-available  (was: )

> [C++] Better document sparse tensor support
> ---
>
> Key: ARROW-4335
> URL: https://issues.apache.org/jira/browse/ARROW-4335
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Affects Versions: 0.12.0
>Reporter: Antoine Pitrou
>Assignee: Kenta Murata
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> Currently the documentation (including docstrings) for the sparse tensor 
> classes and methods is very... sparse. It would be nice to make those 
> approachable.
> (also, a suggestion: rename {{SparseCSRIndex::indptr()}} to something else? 
> perhaps {{SparseCSRIndex::row_indices()}}?)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4769) [Rust] Improve array limit function where max records > len

2019-03-04 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4769:
--
Labels: pull-request-available  (was: )

> [Rust] Improve array limit function where max records > len
> ---
>
> Key: ARROW-4769
> URL: https://issues.apache.org/jira/browse/ARROW-4769
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust
>Affects Versions: 0.12.0
>Reporter: Neville Dipale
>Assignee: Neville Dipale
>Priority: Trivial
>  Labels: pull-request-available
>
> When we have an array of n records, and we want to take a limit that's higher 
> or equat to n, we still iterate through the array values and create a new 
> array.
> We could improve this by returning a copy of the array as-is.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-2022) [Format] Add custom metadata field specific to a RecordBatch message

2019-03-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-2022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-2022:
--
Labels: pull-request-available  (was: )

> [Format] Add custom metadata field specific to a RecordBatch message
> 
>
> Key: ARROW-2022
> URL: https://issues.apache.org/jira/browse/ARROW-2022
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Format
>Reporter: Wes McKinney
>Assignee: Micah Kornfield
>Priority: Major
>  Labels: pull-request-available
>
> While we can have schema- and field-level custom metadata, we cannot send 
> metadata at the record batch level. This could include things like statistics 
> (although statistics isn't a great example, because this might be something 
> we want to eventually standardize), but other things too
> See message definitions in 
> https://github.com/apache/arrow/blob/master/format/Message.fbs



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4206) [Gandiva] Implement decimal divide

2019-03-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4206?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4206:
--
Labels: pull-request-available  (was: )

> [Gandiva] Implement decimal divide
> --
>
> Key: ARROW-4206
> URL: https://issues.apache.org/jira/browse/ARROW-4206
> Project: Apache Arrow
>  Issue Type: Task
>  Components: C++ - Gandiva
>Reporter: Pindikura Ravindra
>Assignee: Pindikura Ravindra
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4775) [Website] Site navbar cannot be expanded

2019-03-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4775:
--
Labels: pull-request-available  (was: )

> [Website] Site navbar cannot be expanded
> 
>
> Key: ARROW-4775
> URL: https://issues.apache.org/jira/browse/ARROW-4775
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Website
>Reporter: Kenta Murata
>Assignee: Kenta Murata
>Priority: Major
>  Labels: pull-request-available
>
> I found that the navbar at the top of the page cannot be expanded when the 
> page is narrow.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4766) [C++] Casting empty boolean array causes segfault

2019-03-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4766?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4766:
--
Labels: pull-request-available  (was: )

> [C++] Casting empty boolean array causes segfault
> -
>
> Key: ARROW-4766
> URL: https://issues.apache.org/jira/browse/ARROW-4766
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C++
>Affects Versions: 0.12.0
>Reporter: Keith Kraus
>Assignee: Francois Saint-Jacques
>Priority: Blocker
>  Labels: pull-request-available
>
> Reproducer:
> {code:python}
> import pyarrow as pa
> test = pa.array([], type=pa.bool_())
> test2 = test.cast(pa.int8())
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-3770) [C++] Validate or add option to validate arrow::Table schema in parquet::arrow::FileWriter::WriteTable

2019-03-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3770?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-3770:
--
Labels: parquet pull-request-available  (was: parquet)

> [C++] Validate or add option to validate arrow::Table schema in 
> parquet::arrow::FileWriter::WriteTable
> --
>
> Key: ARROW-3770
> URL: https://issues.apache.org/jira/browse/ARROW-3770
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Priority: Major
>  Labels: parquet, pull-request-available
> Fix For: 0.13.0
>
>
> Failing to validate will cause a segfault when the passed table does not 
> match the schema used to instantiate the writer. See ARROW-2926 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4777) [C++/Python] manylinux1: Update lz4 to 1.8.3

2019-03-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4777:
--
Labels: pull-request-available  (was: )

> [C++/Python] manylinux1: Update lz4 to 1.8.3
> 
>
> Key: ARROW-4777
> URL: https://issues.apache.org/jira/browse/ARROW-4777
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++, Packaging, Python
>Reporter: Uwe L. Korn
>Assignee: Uwe L. Korn
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4780) [JS] Package sourcemap files, update default package JS version

2019-03-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4780:
--
Labels: pull-request-available  (was: )

> [JS] Package sourcemap files, update default package JS version
> ---
>
> Key: ARROW-4780
> URL: https://issues.apache.org/jira/browse/ARROW-4780
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: JavaScript
>Affects Versions: JS-0.4.0
>Reporter: Paul Taylor
>Assignee: Paul Taylor
>Priority: Minor
>  Labels: pull-request-available
> Fix For: JS-0.4.1
>
>
> The build should split the sourcemaps out to speed up client builds, and 
> include a "module" entry in the package.json for @pika/web, and the main 
> package should ship the latest ESNext JS versions.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4781) [JS] Ensure empty data initializes empty typed arrays

2019-03-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4781:
--
Labels: pull-request-available  (was: )

> [JS] Ensure empty data initializes empty typed arrays
> -
>
> Key: ARROW-4781
> URL: https://issues.apache.org/jira/browse/ARROW-4781
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: JavaScript
>Affects Versions: JS-0.4.0
>Reporter: Paul Taylor
>Assignee: Paul Taylor
>Priority: Major
>  Labels: pull-request-available
> Fix For: JS-0.4.1
>
>
> Empty ArrayData instances should initialize with the appropriate 0-length 
> buffers.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4783) [C++][CI] Mingw32 builds sometimes timeout

2019-03-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4783:
--
Labels: ci-failure pull-request-available  (was: ci-failure)

> [C++][CI] Mingw32 builds sometimes timeout
> --
>
> Key: ARROW-4783
> URL: https://issues.apache.org/jira/browse/ARROW-4783
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C++, Continuous Integration
>Reporter: Micah Kornfield
>Assignee: Micah Kornfield
>Priority: Major
>  Labels: ci-failure, pull-request-available
>
> From 
> [https://ci.appveyor.com/project/ApacheSoftwareFoundation/arrow/builds/22853367/job/e461la5roy7edpsb]
>  
> It looks like the thread-pool test.  I'm going to disable it for the time 
> being.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4782) [C++] Prototype scalar and array expression types for developing deferred operator algebra

2019-03-05 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4782:
--
Labels: pull-request-available  (was: )

> [C++] Prototype scalar and array expression types for developing deferred 
> operator algebra
> --
>
> Key: ARROW-4782
> URL: https://issues.apache.org/jira/browse/ARROW-4782
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> I am beginning to develop a C++ API for describing analytical expressions on 
> scalars, arrays, and tables in the general style of Ibis 
> (https://docs.ibis-project.org/sql.html) which has been used to successfully 
> model SQL relational algebra as well as a broader variety of analytical 
> operations. This is a large project so I'm starting small



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4778) [C++/Python] manylinux1: Update Thrift to 0.12.0

2019-03-06 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4778:
--
Labels: pull-request-available  (was: )

> [C++/Python] manylinux1: Update Thrift to 0.12.0
> 
>
> Key: ARROW-4778
> URL: https://issues.apache.org/jira/browse/ARROW-4778
> Project: Apache Arrow
>  Issue Type: Task
>  Components: C++, Packaging, Python
>Reporter: Uwe L. Korn
>Assignee: Uwe L. Korn
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4786) [C++/Python] Support better parallelisation in manylinux1 base build

2019-03-06 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4786:
--
Labels: pull-request-available  (was: )

> [C++/Python] Support better parallelisation in manylinux1 base build
> 
>
> Key: ARROW-4786
> URL: https://issues.apache.org/jira/browse/ARROW-4786
> Project: Apache Arrow
>  Issue Type: Task
>  Components: C++, Packaging, Python
>Reporter: Uwe L. Korn
>Assignee: Uwe L. Korn
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> Currently we're building some dependencies single-threaded but could build 
> them with much higher parallelisation.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4791) Unused dependencies in arrow and datafusion

2019-03-06 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4791?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4791:
--
Labels: pull-request-available  (was: )

> Unused dependencies in arrow and datafusion
> ---
>
> Key: ARROW-4791
> URL: https://issues.apache.org/jira/browse/ARROW-4791
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust
>Reporter: Yu Ding
>Priority: Minor
>  Labels: pull-request-available
>
> One unused dependency found in rust/arrow
> {{[dependencies]}}
> {{ bytes = "0.4"}}
> Another unused dependency found in rust/datafusion
> {{[dependencies]}}
> {{ clap = "2.31.2"}}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4793) [Ruby] Suppress unused variable warning

2019-03-06 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4793:
--
Labels: pull-request-available  (was: )

> [Ruby] Suppress unused variable warning
> ---
>
> Key: ARROW-4793
> URL: https://issues.apache.org/jira/browse/ARROW-4793
> Project: Apache Arrow
>  Issue Type: Test
>  Components: Ruby
>Reporter: Kouhei Sutou
>Assignee: Kouhei Sutou
>Priority: Minor
>  Labels: pull-request-available
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4792) [Ruby] Don't raise an error by #==

2019-03-06 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4792?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4792:
--
Labels: pull-request-available  (was: )

> [Ruby] Don't raise an error by #==
> --
>
> Key: ARROW-4792
> URL: https://issues.apache.org/jira/browse/ARROW-4792
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Ruby
>Affects Versions: 0.12.1
>Reporter: Kouhei Sutou
>Assignee: Kouhei Sutou
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4790) [Python/Packaging] Update manylinux docker image in crossbow task

2019-03-07 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4790:
--
Labels: pull-request-available  (was: )

> [Python/Packaging] Update manylinux docker image in crossbow task
> -
>
> Key: ARROW-4790
> URL: https://issues.apache.org/jira/browse/ARROW-4790
> Project: Apache Arrow
>  Issue Type: Task
>  Components: Python
>Reporter: Krisztian Szucs
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> to {{ARROW - 4778}} see 
> https://github.com/apache/arrow/pull/3823#issuecomment-470129575



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4789) [C++] Deprecate and and later remove arrow::io::ReadableFileInterface

2019-03-07 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4789:
--
Labels: pull-request-available  (was: )

> [C++] Deprecate and and later remove arrow::io::ReadableFileInterface
> -
>
> Key: ARROW-4789
> URL: https://issues.apache.org/jira/browse/ARROW-4789
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Krisztian Szucs
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> See arrow/io/interfaces.h. This is a legacy alias



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4645) [C++/Packaging] Ship Gandiva with OSX and Windows wheels

2019-03-07 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4645:
--
Labels: pull-request-available  (was: )

> [C++/Packaging] Ship Gandiva with OSX and Windows wheels
> 
>
> Key: ARROW-4645
> URL: https://issues.apache.org/jira/browse/ARROW-4645
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++ - Gandiva, Packaging
>Reporter: Krisztian Szucs
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> Gandiva is only installed via the linux wheels, We should support it on all 
> platforms.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4640) [Python] Add docker-compose configuration to build and test the project without pandas installed

2019-03-07 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4640:
--
Labels: pull-request-available  (was: )

> [Python] Add docker-compose configuration to build and test the project 
> without pandas installed
> 
>
> Key: ARROW-4640
> URL: https://issues.apache.org/jira/browse/ARROW-4640
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Python
>Reporter: Wes McKinney
>Assignee: Krisztian Szucs
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> This will help detect problems where we have unintentionally introduced a 
> hard dependency



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4796) [Flight][Python] segfault in simple server implementation

2019-03-07 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4796:
--
Labels: flight pull-request-available  (was: flight)

> [Flight][Python] segfault in simple server implementation
> -
>
> Key: ARROW-4796
> URL: https://issues.apache.org/jira/browse/ARROW-4796
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: FlightRPC, Python
>Reporter: David Li
>Assignee: David Li
>Priority: Major
>  Labels: flight, pull-request-available
> Fix For: 0.13.0
>
>
> Python segfaults if you implement a Flight server that returns a data stream 
> but does not keep a reference to the underlying data source (the Table, 
> RecordBatch, etc). The Flight bindings themselves do not keep a reference to 
> the object, so the server will segfault as the memory has been reclaimed.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4797) [Plasma] Avoid store crash if not enough memory is available

2019-03-07 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4797:
--
Labels: pull-request-available  (was: )

> [Plasma] Avoid store crash if not enough memory is available
> 
>
> Key: ARROW-4797
> URL: https://issues.apache.org/jira/browse/ARROW-4797
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++ - Plasma
>Reporter: Philipp Moritz
>Assignee: Philipp Moritz
>Priority: Major
>  Labels: pull-request-available
>
> Currently, the plasma server exists with a fatal check if not enough memory 
> is available. This can lead to errors that are hard to diagnose, see
> [https://github.com/ray-project/ray/issues/3670]
> Instead, we should keep the store alive in these circumstances, taking up 
> some of the remaining memory and allow the client to check if enough memory 
> has been allocating.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4801) [GLib] Suppress pkgconfig.generate() warnings

2019-03-07 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4801:
--
Labels: pull-request-available  (was: )

> [GLib] Suppress pkgconfig.generate() warnings
> -
>
> Key: ARROW-4801
> URL: https://issues.apache.org/jira/browse/ARROW-4801
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: GLib
>Affects Versions: 0.12.1
>Reporter: Kouhei Sutou
>Assignee: Kouhei Sutou
>Priority: Minor
>  Labels: pull-request-available
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4807) [Rust] Fix csv_writer benchmark

2019-03-07 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4807?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4807:
--
Labels: pull-request-available  (was: )

> [Rust] Fix csv_writer benchmark
> ---
>
> Key: ARROW-4807
> URL: https://issues.apache.org/jira/browse/ARROW-4807
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Rust
>Reporter: Chao Sun
>Assignee: Chao Sun
>Priority: Trivial
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> CSV writer benchmark suite isn't working due to `RecordBatch::try_new` 
> returns a `Result` now.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4713) [C++] Improve C++ Orc Adapter performance and memory footprint

2019-03-07 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4713?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4713:
--
Labels: pull-request-available  (was: )

> [C++] Improve C++ Orc Adapter performance and memory footprint
> --
>
> Key: ARROW-4713
> URL: https://issues.apache.org/jira/browse/ARROW-4713
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Yurui Zhou
>Assignee: Yurui Zhou
>Priority: Major
>  Labels: pull-request-available
>
> Currently the Arrow C++ provide a naive adapter implementation that allow 
> user to read orc file to Arrow RecordBatch. However, this implementation have 
> several drawbacks:
>  * Inefficient conversion that incurs huge memcpy overhead
>  ** currently the ORC adapter are performing byte to byte memcpy to move data 
> to ORC VectorBatch to Arrow RecordBatch regardless of the fact that ORC 
> VectorBatch shares the same memory layout with Arrow in most of the Data Types
>  * Huge memory footprint because the lack of TableReader implementation
>  ** The ORC adapter currently only allow user to read data with the unit of 
> stripe. However, as a columnar format with high compression ration, data read 
> from a ORC stripe can potential takes over gigabytes of memory, which makes 
> the ORC adapter not quite usable in production environment.
> Here we propose a new ORC adapter implementation to fix the issues mentioned 
> above:
>  * To reduce conversion overhead, instead of performing naive data copy, the 
> new adapter would be able to fully taking advantage of the memory layout 
> similarity between ORC VectorBatch and Arrow RecordBatch. Namely the new 
> adapter will perform pointer manipulation to transfer the memory ownership 
> from VectorBatch to Arrow RecordBatch whenever possible.
>  * The new ORC Adapter would be able to provide user a row level granularity 
> when reading data from Orc File. The user should be able to specify how many 
> rows should be expected on output RecordBatch and the ORC Adapter should make 
> sure no more the requested number of rows would be returned.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4808) [Java][Vector] Convenience methods for setting decimal vector

2019-03-08 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4808?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4808:
--
Labels: pull-request-available  (was: )

> [Java][Vector] Convenience methods for setting decimal vector
> -
>
> Key: ARROW-4808
> URL: https://issues.apache.org/jira/browse/ARROW-4808
> Project: Apache Arrow
>  Issue Type: Task
>Reporter: Praveen Kumar Desabandu
>Assignee: Praveen Kumar Desabandu
>Priority: Major
>  Labels: pull-request-available
>
> Provide more convenience methods to set decimal vector, specifically
>  # Accept arrow buffers encode in little endian bytes that are of size < 16 
> bytes
>  # Accept arrow buffers that are encoded in big endian and could of size <=16 
> bytes



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4774) [C++][Parquet] crash writing nested array to parquet

2019-03-08 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4774:
--
Labels: pull-request-available  (was: )

> [C++][Parquet] crash writing nested array to parquet
> 
>
> Key: ARROW-4774
> URL: https://issues.apache.org/jira/browse/ARROW-4774
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C++
>Affects Versions: 0.11.1, 0.12.1
> Environment: Windows 10 16299.431, Python 3.6.4 64-bit, pyarrow 0.11.1
> Windows Linux (WSL) Ubuntu 18.04.1, Python 3.6.5 64-bit, pyarrow 0.12.1
>Reporter: Stephen Gallagher
>Assignee: Francois Saint-Jacques
>Priority: Blocker
>  Labels: pull-request-available
>
> When writing a table to a parquet file that contains both flat arrays and 
> nested array data it causes the Python to crash
> Reproducer:
> {code:python}
> import pyarrow as pa
> import pyarrow.parquet as pq
> import numpy as np
> array1 = np.array([0, 1, 2], dtype=np.uint8)
> array2 = np.array([[0,1,2], [3, 4, 5]], dtype=np.uint8).T
> t1 = pa.uint8()
> t2 = pa.list_(pa.uint8())
> fields = [
> pa.field('a1', t1),
> pa.field('a2', t2)
> ]
> myschema = pa.schema(fields)
> mytable = pa.Table.from_arrays([
> pa.array(array1, type=t1),
> pa.array([array2[:,0], array2[:,1]], type=t2)],
> schema=myschema)
> pq.write_table(mytable, 'example.parquet')
> {code}
> Windows 10 (Python 3.6.4 64-bit, pyarrow 0.11.1) crash code:
> {code:bash}
> Process finished with exit code -1073741819 (0xC005)
> {code}
> WSL (Python 3.6.5 64-bit, pyarrow 0.12.1) Crash code:
> {code:bash}
> Segmentation fault (core dumped)
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4810) [Format][C++] Add "LargeList" type with 64-bit offsets

2019-03-08 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4810:
--
Labels: pull-request-available  (was: )

> [Format][C++] Add "LargeList" type with 64-bit offsets
> --
>
> Key: ARROW-4810
> URL: https://issues.apache.org/jira/browse/ARROW-4810
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++, Format
>Reporter: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.14.0
>
>
> Mentioned in https://github.com/apache/arrow/issues/3845



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4681) [Rust] [DataFusion] Implement parallel query execution using threads

2019-03-09 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4681:
--
Labels: pull-request-available  (was: )

> [Rust] [DataFusion] Implement parallel query execution using threads
> 
>
> Key: ARROW-4681
> URL: https://issues.apache.org/jira/browse/ARROW-4681
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust, Rust - DataFusion
>Affects Versions: 0.12.0
>Reporter: Andy Grove
>Assignee: Andy Grove
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> I am planning on tackling this soon. The basic plan is to change the table 
> scan() method to return multiple scanners (one per partition) so that the 
> query execution can process each partition on a separate thread.
> This will involve changing some of the DataFusion APIs to use Arc instead of 
> Rc.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4486) [Python][CUDA] pyarrow.cuda.Context.foreign_buffer should have a `base=None` argument

2019-03-09 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4486?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4486:
--
Labels: pull-request-available  (was: )

> [Python][CUDA] pyarrow.cuda.Context.foreign_buffer should have a `base=None` 
> argument
> -
>
> Key: ARROW-4486
> URL: https://issues.apache.org/jira/browse/ARROW-4486
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Python
>Reporter: Pearu Peterson
>Assignee: Pearu Peterson
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>  Time Spent: 2h
>  Remaining Estimate: 2h
>
> Similar to `pyarrow.foreign_buffer`, we need to keep the owner of cuda memory 
> alive.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4466) [Rust] [DataFusion] Add support for Parquet data sources

2019-03-09 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4466:
--
Labels: pull-request-available  (was: )

> [Rust] [DataFusion] Add support for Parquet data sources
> 
>
> Key: ARROW-4466
> URL: https://issues.apache.org/jira/browse/ARROW-4466
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust, Rust - DataFusion
>Reporter: Andy Grove
>Assignee: Andy Grove
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.14.0
>
>
> As a user, I would like to be able to run SQL queries against Parquet files.
> For the initial implementation we can just support primitive types.
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-2409) [Rust] Test for build warnings, remove current warnings

2019-03-09 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-2409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-2409:
--
Labels: pull-request-available  (was: )

> [Rust] Test for build warnings, remove current warnings
> ---
>
> Key: ARROW-2409
> URL: https://issues.apache.org/jira/browse/ARROW-2409
> Project: Apache Arrow
>  Issue Type: Task
>  Components: Rust
>Reporter: Maximilian Roos
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> Test for build warnings, remove current warnings



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4813) [Ruby] Add tests for #== and #!=

2019-03-09 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4813?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4813:
--
Labels: pull-request-available  (was: )

> [Ruby] Add tests for #== and #!=
> 
>
> Key: ARROW-4813
> URL: https://issues.apache.org/jira/browse/ARROW-4813
> Project: Apache Arrow
>  Issue Type: Test
>  Components: Ruby
>Affects Versions: 0.12.1
>Reporter: Kouhei Sutou
>Assignee: Kouhei Sutou
>Priority: Minor
>  Labels: pull-request-available
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4664) [C++] DCHECK macro conditions are evaluated in release builds

2019-03-09 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4664:
--
Labels: pull-request-available  (was: )

> [C++] DCHECK macro conditions are evaluated in release builds
> -
>
> Key: ARROW-4664
> URL: https://issues.apache.org/jira/browse/ARROW-4664
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C++
>Reporter: Benjamin Kietzman
>Assignee: Wes McKinney
>Priority: Trivial
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> {{DCHECK(potentially_expensive())}} will evaluate the argument even in 
> release mode, and is used in several places with the assumption that it will 
> do so (which means removing the guarantee of evaluation causes numerous 
> failures). By contrast, most debug assertion macros elide their arguments 
> entirely ({{.assert}}, {{}}) in release mode



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-3954) [Rust] Add Slice to Array and ArrayData

2019-03-09 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-3954:
--
Labels: pull-request-available  (was: )

> [Rust] Add Slice to Array and ArrayData
> ---
>
> Key: ARROW-3954
> URL: https://issues.apache.org/jira/browse/ARROW-3954
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust
>Reporter: Chao Sun
>Assignee: Chao Sun
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> Similar to C++, we should be able to construct zero-copy slice from {{Array}} 
> and {{ArrayData}}.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4822) [C++/Python] pyarrow.Table.equals segmentation fault on None

2019-03-11 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4822:
--
Labels: pull-request-available  (was: )

> [C++/Python] pyarrow.Table.equals segmentation fault on None
> 
>
> Key: ARROW-4822
> URL: https://issues.apache.org/jira/browse/ARROW-4822
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C++, Python
>Affects Versions: 0.12.1
>Reporter: Andreas
>Assignee: Uwe L. Korn
>Priority: Trivial
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> Calling {{pyarrow.Table.equals}} with {{None}} causes a segmentation fault; 
> this should be caught.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-3735) [Python] Proper error handling in _ensure_type

2019-03-11 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-3735:
--
Labels: pull-request-available  (was: )

> [Python] Proper error handling in _ensure_type
> --
>
> Key: ARROW-3735
> URL: https://issues.apache.org/jira/browse/ARROW-3735
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Python
>Reporter: Krisztian Szucs
>Assignee: Uwe L. Korn
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> We have multiple _ensure_type like functions, the in defined in array.pxi 
> bypasses None which causes segfault in the following example:
> {code}
> pa.array([1, 2, 3]).cast(None)
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-2905) [C++] Investigate if the *_data_ pointers used in Builder classes improve performance on hot paths

2019-03-11 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-2905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-2905:
--
Labels: pull-request-available  (was: )

> [C++] Investigate if the *_data_ pointers used in Builder classes improve 
> performance on hot paths
> --
>
> Key: ARROW-2905
> URL: https://issues.apache.org/jira/browse/ARROW-2905
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.14.0
>
>
> See [~alendit] comment in 
> https://github.com/apache/arrow/pull/2315#discussion_r204668176



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4828) [Python] manylinux1 docker-compose context should be python/manylinux1

2019-03-11 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4828:
--
Labels: pull-request-available  (was: )

> [Python] manylinux1 docker-compose context should be python/manylinux1
> --
>
> Key: ARROW-4828
> URL: https://issues.apache.org/jira/browse/ARROW-4828
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Python
>Reporter: Uwe L. Korn
>Assignee: Uwe L. Korn
>Priority: Major
>  Labels: pull-request-available
>
> Currently it doesn't find the {{scripts}} folder on running {{docker-compose 
> build python-manylinux1}}.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4826) [Go] export Flush method for CSV writer

2019-03-11 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4826:
--
Labels: pull-request-available  (was: )

> [Go] export Flush method for CSV writer
> ---
>
> Key: ARROW-4826
> URL: https://issues.apache.org/jira/browse/ARROW-4826
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Go
>Reporter: Sebastien Binet
>Assignee: Anson Qian
>Priority: Major
>  Labels: pull-request-available
>
> it should be possible to flush out to the underlying io.Writer the data that 
> has been passed to the arrow/csv.Writer.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4734) [Go] Add option to write a header for CSV writer

2019-03-11 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4734?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4734:
--
Labels: pull-request-available  (was: )

> [Go] Add option to write a header for CSV writer
> 
>
> Key: ARROW-4734
> URL: https://issues.apache.org/jira/browse/ARROW-4734
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Go
>Reporter: Anson Qian
>Assignee: Anson Qian
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4776) [C++] DictionaryBuilder should support bootstrapping from an existing dict type

2019-03-11 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4776:
--
Labels: pull-request-available  (was: )

> [C++] DictionaryBuilder should support bootstrapping from an existing dict 
> type
> ---
>
> Key: ARROW-4776
> URL: https://issues.apache.org/jira/browse/ARROW-4776
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Francois Saint-Jacques
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 0.14.0
>
>
> This would mean adding a new DictionaryBuilder constructor that receives a 
> dictionary type and performs a lazy deep copy if there's any modification. 
> We'll have to investigate how this translate in API ergonomics.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-1639) [Python] More efficient serialization for RangeIndex in serialize_pandas

2019-03-11 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-1639?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-1639:
--
Labels: pull-request-available  (was: )

> [Python] More efficient serialization for RangeIndex in serialize_pandas
> 
>
> Key: ARROW-1639
> URL: https://issues.apache.org/jira/browse/ARROW-1639
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Python
>Reporter: Wes McKinney
>Assignee: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4724) [C++] Python not being built nor test under MinGW builds

2019-03-11 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4724?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4724:
--
Labels: pull-request-available  (was: )

> [C++] Python not being built nor test under MinGW builds
> 
>
> Key: ARROW-4724
> URL: https://issues.apache.org/jira/browse/ARROW-4724
> Project: Apache Arrow
>  Issue Type: Test
>  Components: C++
>Reporter: Javier Luraschi
>Priority: Major
>  Labels: pull-request-available
>
> Follow up to needed for 
> [arrow/pull/3693/files|https://github.com/apache/arrow/pull/3693/files].
> appveyor-cpp-build-mingw.bat has not yet enabled Python tests, need to revert,
> -DARROW_PYTHON=OFF
> Suggestion was to use,
> {code:java}
> diff --git a/ci/appveyor-cpp-build-mingw.bat b/ci/appveyor-cpp-build-mingw.bat
> index 06e8b7f7..3a853031 100644
> --- a/ci/appveyor-cpp-build-mingw.bat
> +++ b/ci/appveyor-cpp-build-mingw.bat
> @@ -24,6 +24,15 @@ set INSTALL_DIR=%HOMEDRIVE%%HOMEPATH%\install
> set PATH=%INSTALL_DIR%\bin;%PATH%
> set PKG_CONFIG_PATH=%INSTALL_DIR%\lib\pkgconfig
> +for /f "usebackq" %%v in (`python3 -c "import sys; print('.'.join(map(str, 
> sys.version_info[0:2])))"`) do (
> + set PYTHON_VERSION=%%v
> +)
> +
> +set PYTHONHOME=%MINGW_PREFIX%\lib\python%PYTHON_VERSION%
> +set PYTHONPATH=%PYTHONHOME%
> +set 
> PYTHONPATH=%PYTHONPATH%;%MINGW_PREFIX%\lib\python%PYTHON_VERSION%\lib-dynload
> +set 
> PYTHONPATH=%PYTHONPATH%;%MINGW_PREFIX%\lib\python%PYTHON_VERSION%\site-packages
> +
> {code}
> However, this suggestion currently trigger a built error in Travis,
> {code:java}
> [ 43%] Building CXX object 
> src/arrow/CMakeFiles/arrow_objlib.dir/ipc/json-simple.cc.obj
> [ 44%] Building CXX object 
> src/arrow/CMakeFiles/arrow_objlib.dir/ipc/message.cc.obj
> [ 44%] Building CXX object 
> src/arrow/CMakeFiles/arrow_objlib.dir/ipc/metadata-internal.cc.obj
> [ 45%] Building CXX object 
> src/arrow/CMakeFiles/arrow_objlib.dir/ipc/reader.cc.obj
> [ 45%] Building CXX object 
> src/arrow/CMakeFiles/arrow_objlib.dir/ipc/writer.cc.obj
> [ 45%] Built target arrow_objlib
> make: *** [Makefile:141: all] Error 2
> C:\projects\arrow\cpp\build>goto scriptexit{code}
> Therefore, additional investigation is needed.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4834) [R] Feature flag to disable parquet

2019-03-11 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4834?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4834:
--
Labels: pull-request-available  (was: )

> [R] Feature flag to disable parquet
> ---
>
> Key: ARROW-4834
> URL: https://issues.apache.org/jira/browse/ARROW-4834
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: R
>Reporter: Javier Luraschi
>Priority: Major
>  Labels: pull-request-available
>
> Support for a ARROW_R_PARQUET_OFF feature flag that will disable building the 
> R package with parquet support.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-2119) [C++][Java] Handle Arrow stream with zero record batch

2019-03-11 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-2119?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-2119:
--
Labels: pull-request-available  (was: )

> [C++][Java] Handle Arrow stream with zero record batch
> --
>
> Key: ARROW-2119
> URL: https://issues.apache.org/jira/browse/ARROW-2119
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C++, Java
>Reporter: Jingyuan Wang
>Assignee: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> It looks like currently many places of the code assume that there needs to be 
> at least one record batch for streaming format. Is zero-recordbatch not 
> supported by design?
> e.g. 
> [https://github.com/apache/arrow/blob/master/java/tools/src/main/java/org/apache/arrow/tools/StreamToFile.java#L45]
> {code:none}
>   public static void convert(InputStream in, OutputStream out) throws 
> IOException {
> BufferAllocator allocator = new RootAllocator(Integer.MAX_VALUE);
> try (ArrowStreamReader reader = new ArrowStreamReader(in, allocator)) {
>   VectorSchemaRoot root = reader.getVectorSchemaRoot();
>   // load the first batch before instantiating the writer so that we have 
> any dictionaries
>   if (!reader.loadNextBatch()) {
> throw new IOException("Unable to read first record batch");
>   }
>   ...
> {code}
> Pyarrow-0.8.0 does not load 0-recordbatch stream either. It would throw an 
> exception originated from 
> [https://github.com/apache/arrow/blob/a95465b8ce7a32feeaae3e13d0a64102ffa590d9/cpp/src/arrow/table.cc#L309:]
> {code:none}
> Status Table::FromRecordBatches(const 
> std::vector>& batches,
> std::shared_ptr* table) {
>   if (batches.size() == 0) {
> return Status::Invalid("Must pass at least one record batch");
>   }
>   ...{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4820) [Python] hadoop class path derived not correct

2019-03-11 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4820?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4820:
--
Labels: pull-request-available  (was: )

> [Python] hadoop class path derived not correct
> --
>
> Key: ARROW-4820
> URL: https://issues.apache.org/jira/browse/ARROW-4820
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Python
>Affects Versions: 0.12.0, 0.12.1
>Reporter: Tiger068
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
> Attachments: image-2019-03-11-16-39-25-195.png
>
>
> in hdfs.py, the method  _derive_hadoop_classpath add jar files under 
> $HADOOP_HOME into hadoop classpath,but the hadoop config directory is not 
> contained in  classpath.
>  
> when hadoop HA mode enabled,the hdfs uri like this: hdfs://ns
> when the HADOOP_CONF_DIR directory is not in the hadoop classpath,the libhdfs 
> can not locate the right  hdfs-site.xml, in the HA mode, hdfs service name 
> was parsed as host name ,it is not correct 
> !image-2019-03-11-16-39-25-195.png!  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4835) [GLib] Add boolean operations

2019-03-12 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4835?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4835:
--
Labels: pull-request-available  (was: )

> [GLib] Add boolean operations
> -
>
> Key: ARROW-4835
> URL: https://issues.apache.org/jira/browse/ARROW-4835
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: GLib
>Reporter: Kouhei Sutou
>Assignee: Kouhei Sutou
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4837) [C++] Support c++filt on a custom path in the run-test.sh script

2019-03-12 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4837:
--
Labels: pull-request-available  (was: )

> [C++] Support c++filt on a custom path in the run-test.sh script
> 
>
> Key: ARROW-4837
> URL: https://issues.apache.org/jira/browse/ARROW-4837
> Project: Apache Arrow
>  Issue Type: Improvement
>Reporter: Krisztian Szucs
>Priority: Major
>  Labels: pull-request-available
>
> On conda this is CXXFILT=/opt/conda/bin/x86_64-conda_cos6-linux-gnu-c++filt



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4673) [C++] Implement AssertDatumEquals

2019-03-12 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4673?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4673:
--
Labels: pull-request-available  (was: )

> [C++] Implement AssertDatumEquals
> -
>
> Key: ARROW-4673
> URL: https://issues.apache.org/jira/browse/ARROW-4673
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Francois Saint-Jacques
>Assignee: Francois Saint-Jacques
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 0.14.0
>
>
> Aggregate tests could benefit from this.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4825) [Python][C++] MemoryPool is destructed before deallocating its buffers leads to segfault

2019-03-12 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4825?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4825:
--
Labels: pull-request-available  (was: )

> [Python][C++] MemoryPool is destructed before deallocating its buffers leads 
> to segfault 
> -
>
> Key: ARROW-4825
> URL: https://issues.apache.org/jira/browse/ARROW-4825
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C++, Python
>Affects Versions: 0.13.0
>Reporter: Pearu Peterson
>Priority: Major
>  Labels: pull-request-available
>
> Consider the following test function:
> ```
> def test_memory_pool():
>     import pyarrow as pa
>     pool = pa.logging_memory_pool(pa.default_memory_pool())
>     buf = pa.allocate_buffer(10, memory_pool=pool)
> ```
> that will fail with segfault when `pool` is garbage collected before `buf`. 
> However, the following test function succeeds:
> ```
> def test_memory_pool():
>     import pyarrow as pa
>     pool = pa.logging_memory_pool(pa.default_memory_pool())
>     buf = pa.allocate_buffer(10, memory_pool=pool)
>     del buf
> ```
> because all buffers are freed before `pool` destruction.
> To fix this issue, the pool instance should be attached to buffer instances 
> that the pool is creating. This will ensure that `pool` will be alive until 
> all its buffers are destroyed.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-2501) [Java] Remove Jackson from compile-time dependencies for arrow-vector

2019-03-12 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-2501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-2501:
--
Labels: pull-request-available  (was: )

> [Java] Remove Jackson from compile-time dependencies for arrow-vector
> -
>
> Key: ARROW-2501
> URL: https://issues.apache.org/jira/browse/ARROW-2501
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Java
>Affects Versions: 0.9.0
>Reporter: Andy Grove
>Assignee: Andy Grove
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 0.14.0
>
>
> I would like to upgrade Jackson to the latest version (2.9.5). If there are 
> no objections I will create a PR (it is literally just changing the version 
> number in the pom - no code changes required).



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4421) [Flight][C++] Handle large Flight data messages

2019-03-12 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4421:
--
Labels: pull-request-available  (was: )

> [Flight][C++] Handle large Flight data messages
> ---
>
> Key: ARROW-4421
> URL: https://issues.apache.org/jira/browse/ARROW-4421
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C++, FlightRPC
>Reporter: Wes McKinney
>Assignee: David Li
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> I believe the message payloads are currently limited to 4MB by default, see 
> one developer's discussion here:
> https://nanxiao.me/en/message-length-setting-in-grpc/
> While it is a good idea to break large messages into smaller ones, we will 
> need to address how to gracefully send larger payloads that may be provided 
> by a user's server implementation. Either we can increase the limit or break 
> up the record batches into smaller chunks in the Flight server base (or both, 
> of course)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4735) [Go] Benchmark strconv.Format vs. fmt.Sprintf for CSV writer

2019-03-12 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4735:
--
Labels: pull-request-available  (was: )

> [Go] Benchmark strconv.Format vs. fmt.Sprintf for CSV writer
> 
>
> Key: ARROW-4735
> URL: https://issues.apache.org/jira/browse/ARROW-4735
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Go
>Reporter: Anson Qian
>Assignee: Sebastien Binet
>Priority: Major
>  Labels: pull-request-available
>
> Need test out strconv.Format\{Bool,Float,Int,Uint} instead of fmt.Sprintf and 
> see if we can improve write performance



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4846) [Java] Update Jackson to 2.9.8

2019-03-12 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4846:
--
Labels: pull-request-available  (was: )

> [Java] Update Jackson to 2.9.8
> --
>
> Key: ARROW-4846
> URL: https://issues.apache.org/jira/browse/ARROW-4846
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Java
>Reporter: Wes McKinney
>Assignee: Andy Grove
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> We are looking at removing Jackson from arrow-vector dependencies in 
> ARROW-2501



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-3364) [Doc] Document docker compose setup

2019-03-12 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-3364:
--
Labels: pull-request-available  (was: )

> [Doc] Document docker compose setup
> ---
>
> Key: ARROW-3364
> URL: https://issues.apache.org/jira/browse/ARROW-3364
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Documentation
>Reporter: Krisztian Szucs
>Assignee: Francois Saint-Jacques
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> Introduced by https://github.com/apache/arrow/pull/2572



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4811) [C++] An incorrect dependency leads "ninja" to re-evaluate steps unnecessarily on subsequent calls

2019-03-12 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4811?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4811:
--
Labels: pull-request-available  (was: )

> [C++] An incorrect dependency leads "ninja" to re-evaluate steps 
> unnecessarily on subsequent calls
> --
>
> Key: ARROW-4811
> URL: https://issues.apache.org/jira/browse/ARROW-4811
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> Not sure about the root cause yet but here are the 5 steps that are 
> re-executing
> {code}
> $ ninja -v
> [1/5] /usr/bin/ccache /usr/bin/g++  -DARROW_EXTRA_ERROR_CONTEXT 
> -DARROW_JEMALLOC 
> -DARROW_JEMALLOC_INCLUDE_DIR=/home/wesm/code/arrow/cpp/build/jemalloc_ep-prefix/src/jemalloc_ep/dist//include
>  -DARROW_NO_DEPRECATED_API -DARROW_PYTHON_EXPORTING -DARROW_USE_GLOG 
> -DARROW_USE_SIMD -DARROW_WITH_BROTLI -DARROW_WITH_BZ2 -DARROW_WITH_LZ4 
> -DARROW_WITH_SNAPPY -DARROW_WITH_ZLIB -DARROW_WITH_ZSTD -Isrc -I../src 
> -isystem /home/wesm/cpp-toolchain/include -isystem 
> gbenchmark_ep/src/gbenchmark_ep-install/include -isystem 
> jemalloc_ep-prefix/src -isystem ../thirdparty/hadoop/include -isystem 
> orc_ep-install/include -isystem /home/wesm/cpp-toolchain/include/thrift 
> -isystem 
> /home/wesm/miniconda/envs/arrow-3.7/lib/python3.7/site-packages/numpy/core/include
>  -isystem /home/wesm/miniconda/envs/arrow-3.7/include/python3.7m 
> -Wno-noexcept-type  -fdiagnostics-color=always -O3 -DNDEBUG  -Wall 
> -Wno-unused-variable -msse4.2 -fno-omit-frame-pointer -O3 -DNDEBUG -fPIC   
> -std=gnu++11 -MD -MT 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/flight.cc.o -MF 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/flight.cc.o.d -o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/flight.cc.o -c 
> ../src/arrow/python/flight.cc
> [2/5] : && /usr/bin/ccache /home/wesm/miniconda/envs/arrow-3.7/bin/cmake -E 
> remove release/libarrow_python.a && /usr/bin/ccache /usr/bin/ar qc 
> release/libarrow_python.a  
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/arrow_to_pandas.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/benchmark.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/common.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/config.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/decimal.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/deserialize.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/helpers.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/inference.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/init.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/io.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/numpy_convert.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/numpy_to_arrow.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/python_to_arrow.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/pyarrow.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/serialize.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/flight.cc.o && 
> /usr/bin/ccache /usr/bin/ranlib release/libarrow_python.a && :
> [3/5] : && /usr/bin/ccache /usr/bin/g++ -fPIC -Wno-noexcept-type  
> -fdiagnostics-color=always -O3 -DNDEBUG  -Wall -Wno-unused-variable -msse4.2 
> -fno-omit-frame-pointer -O3 -DNDEBUG   -shared 
> -Wl,-soname,libarrow_python.so.13 -o release/libarrow_python.so.13.0.0 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/arrow_to_pandas.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/benchmark.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/common.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/config.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/decimal.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/deserialize.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/helpers.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/inference.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/init.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/io.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/numpy_convert.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/numpy_to_arrow.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/python_to_arrow.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/pyarrow.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/serialize.cc.o 
> src/arrow/python/CMakeFiles/arrow_python_objlib.dir/flight.cc.o  
> -Wl,-rpath,/home/wesm/code/arrow/cpp/build/re

[jira] [Updated] (ARROW-4850) [CI] Integration test failures do not fail the Travis CI build

2019-03-13 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4850:
--
Labels: pull-request-available  (was: )

> [CI] Integration test failures do not fail the Travis CI build
> --
>
> Key: ARROW-4850
> URL: https://issues.apache.org/jira/browse/ARROW-4850
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Continuous Integration
>Reporter: Wes McKinney
>Assignee: Francois Saint-Jacques
>Priority: Blocker
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> See https://github.com/apache/arrow/pull/3871
> These changes fail the build, but it is reported as success
> The errors can be seen in https://travis-ci.org/apache/arrow/jobs/505028161



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4831) [C++] CMAKE_AR is not passed to ZSTD thirdparty dependency

2019-03-13 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4831:
--
Labels: pull-request-available  (was: )

> [C++] CMAKE_AR is not passed to ZSTD thirdparty dependency 
> ---
>
> Key: ARROW-4831
> URL: https://issues.apache.org/jira/browse/ARROW-4831
> Project: Apache Arrow
>  Issue Type: Improvement
>Reporter: Krisztian Szucs
>Priority: Major
>  Labels: pull-request-available
>
> ZSTD_CMAKE_ARGS should utilize 
> https://github.com/apache/arrow/blob/master/cpp/cmake_modules/ThirdpartyToolchain.cmake#L359



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4855) [Packaging] Generate default package version based on cpp tags in crossbow.py

2019-03-13 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4855:
--
Labels: pull-request-available  (was: )

> [Packaging] Generate default package version based on cpp tags in crossbow.py
> -
>
> Key: ARROW-4855
> URL: https://issues.apache.org/jira/browse/ARROW-4855
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Packaging
>Reporter: Krisztian Szucs
>Assignee: Krisztian Szucs
>Priority: Major
>  Labels: pull-request-available
>
> It picked up wrong version because of the recent JS release.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4859) [GLib] Add garrow_numeric_array_mean()

2019-03-13 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4859:
--
Labels: pull-request-available  (was: )

> [GLib] Add garrow_numeric_array_mean()
> --
>
> Key: ARROW-4859
> URL: https://issues.apache.org/jira/browse/ARROW-4859
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: GLib
>Reporter: Yosuke Shiro
>Assignee: Yosuke Shiro
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4858) [Flight][Python] Enable custom FlightDataStream in Python

2019-03-13 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4858?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4858:
--
Labels: flight pull-request-available  (was: flight)

> [Flight][Python] Enable custom FlightDataStream in Python
> -
>
> Key: ARROW-4858
> URL: https://issues.apache.org/jira/browse/ARROW-4858
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: FlightRPC, Python
>Reporter: David Li
>Assignee: David Li
>Priority: Minor
>  Labels: flight, pull-request-available
> Fix For: 0.14.0
>
>
> We should be able to provide a custom data stream as the result of Flight 
> do_get in Python. In particular, when returning data produced on the fly, or 
> when returning a large Pandas DataFrame, it'd be nice to provide data in 
> chunks as it becomes available, rather than having to copy everything into a 
> Table first.
> On the Python side, a FlightDataStream wrapper that accepts RecordBatches 
> from a Python generator should suffice.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4839) [C#] Add NuGet support

2019-03-13 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4839:
--
Labels: pull-request-available  (was: )

> [C#] Add NuGet support
> --
>
> Key: ARROW-4839
> URL: https://issues.apache.org/jira/browse/ARROW-4839
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C#
>Reporter: Eric Erhardt
>Assignee: Eric Erhardt
>Priority: Major
>  Labels: pull-request-available
>
> We should add the metadata to the .csproj so we can create a NuGet package 
> without changing any source code.
> Also, we should add any scripts and documentation on how to create the NuGet 
> package to allow ease of creation at release time.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4012) [Documentation][C++] Document how to install Apache Arrow on MSYS2

2019-03-13 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4012:
--
Labels: pull-request-available  (was: )

> [Documentation][C++] Document how to install Apache Arrow on MSYS2
> --
>
> Key: ARROW-4012
> URL: https://issues.apache.org/jira/browse/ARROW-4012
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++, Documentation
>Reporter: Kouhei Sutou
>Assignee: Kouhei Sutou
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4637) [Python] Avoid importing Pandas unless necessary

2019-03-13 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4637?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4637:
--
Labels: pull-request-available  (was: )

> [Python] Avoid importing Pandas unless necessary
> 
>
> Key: ARROW-4637
> URL: https://issues.apache.org/jira/browse/ARROW-4637
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Python
>Affects Versions: 0.12.0
>Reporter: Antoine Pitrou
>Assignee: Wes McKinney
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> Importing PyArrow is more than twice slower when Pandas is installed:
> {code}
> $ time python -c "import pyarrow"
> real  0m0,360s
> user  0m0,305s
> sys   0m0,037s
> $ time python -c "import sys; sys.modules['pandas'] = None; import pyarrow"
> real  0m0,144s
> user  0m0,124s
> sys   0m0,020s
> {code}
> We should only import Pandas when necessary, e.g. when asked to ingest or 
> create Pandas data.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4862) [GLib] Add GArrowCastOptions::allow-invalid-utf8 property

2019-03-13 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4862:
--
Labels: pull-request-available  (was: )

> [GLib] Add GArrowCastOptions::allow-invalid-utf8 property
> -
>
> Key: ARROW-4862
> URL: https://issues.apache.org/jira/browse/ARROW-4862
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: GLib
>Reporter: Kouhei Sutou
>Assignee: Kouhei Sutou
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4705) [Rust] CSV reader should show line number and error message when failing to parse a line

2019-03-13 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4705:
--
Labels: newbie pull-request-available  (was: newbie)

> [Rust] CSV reader should show line number and error message when failing to 
> parse a line
> 
>
> Key: ARROW-4705
> URL: https://issues.apache.org/jira/browse/ARROW-4705
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust
>Affects Versions: 0.12.0
>Reporter: Andy Grove
>Assignee: Andy Grove
>Priority: Major
>  Labels: newbie, pull-request-available
> Fix For: 0.13.0
>
>
> We currently throw away the original error and do not report line number, 
> making it very difficult to debug.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4705) [Rust] CSV reader should show line number and error message when failing to parse a line

2019-03-13 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4705:
--
Labels: newbie pull-request-available  (was: newbie)

> [Rust] CSV reader should show line number and error message when failing to 
> parse a line
> 
>
> Key: ARROW-4705
> URL: https://issues.apache.org/jira/browse/ARROW-4705
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust
>Affects Versions: 0.12.0
>Reporter: Andy Grove
>Assignee: Andy Grove
>Priority: Major
>  Labels: newbie, pull-request-available
> Fix For: 0.13.0
>
>
> We currently throw away the original error and do not report line number, 
> making it very difficult to debug.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4865) [Rust] Support casting lists and primitives to lists

2019-03-13 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4865:
--
Labels: pull-request-available  (was: )

> [Rust] Support casting lists and primitives to lists
> 
>
> Key: ARROW-4865
> URL: https://issues.apache.org/jira/browse/ARROW-4865
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust
>Affects Versions: 0.12.0
>Reporter: Neville Dipale
>Assignee: Neville Dipale
>Priority: Minor
>  Labels: pull-request-available
>
> This adds support for casting between list arrays and from primitive arrays 
> to single-value list arrays



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4871) [Flight][Java] Handle large Flight messages

2019-03-14 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4871:
--
Labels: flight pull-request-available  (was: flight)

> [Flight][Java] Handle large Flight messages
> ---
>
> Key: ARROW-4871
> URL: https://issues.apache.org/jira/browse/ARROW-4871
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: FlightRPC, Java
>Reporter: David Li
>Assignee: David Li
>Priority: Major
>  Labels: flight, pull-request-available
> Fix For: 0.14.0
>
>
> Similarly to ARROW-4421, Java/gRPC needs to be configured to allow large 
> messages. The integration tests should also be updated to cover this.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4751) [C++] Add pkg-config to conda_env_cpp.yml

2019-03-14 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4751:
--
Labels: pull-request-available  (was: )

> [C++] Add pkg-config to conda_env_cpp.yml
> -
>
> Key: ARROW-4751
> URL: https://issues.apache.org/jira/browse/ARROW-4751
> Project: Apache Arrow
>  Issue Type: Task
>  Components: C++, Continuous Integration
>Reporter: Uwe L. Korn
>Assignee: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> Once the CMake refactor has been merged, we should add {{pkg-config}} to the 
> dependencies as it should be also available for Windows now: 
> https://github.com/conda-forge/pkg-config-feedstock/pull/27 This will 
> simplify some packaging.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4873) [C++] ARROW_DEPENDENCY_SOURCE should not be overridden to CONDA if ARROW_PACKAGE_PREFIX is set by user

2019-03-14 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4873:
--
Labels: pull-request-available  (was: )

> [C++] ARROW_DEPENDENCY_SOURCE should not be overridden to CONDA if 
> ARROW_PACKAGE_PREFIX is set by user
> --
>
> Key: ARROW-4873
> URL: https://issues.apache.org/jira/browse/ARROW-4873
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Uwe L. Korn
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> I use conda to manage Python dependencies but keep my C++ toolchain in a 
> separate directory. This organizational scheme is incompatible with the new 
> options after the CMake refactor
> I think if you pass {{-DARROW_PREFIX_PATH=$MY_CPP_TOOLCHAIN}} then this 
> should not be overridden with {{$CONDA_PREFIX}}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4251) [C++] Add option to use vendored Boost in verify-release-candidate.sh

2019-03-14 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4251:
--
Labels: pull-request-available  (was: )

> [C++] Add option to use vendored Boost in verify-release-candidate.sh
> -
>
> Key: ARROW-4251
> URL: https://issues.apache.org/jira/browse/ARROW-4251
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> Got burned by this while verifying 0.12 RC2. It is useful to have the option 
> to use the system Boost during verification, but it would also be nice to opt 
> in to using the vendored Boost build



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4866) [C++] zstd ExternalProject failing on Windows

2019-03-14 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4866:
--
Labels: ci-failure pull-request-available  (was: ci-failure)

> [C++] zstd ExternalProject failing on Windows
> -
>
> Key: ARROW-4866
> URL: https://issues.apache.org/jira/browse/ARROW-4866
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C++, Packaging
>Reporter: Uwe L. Korn
>Assignee: Wes McKinney
>Priority: Major
>  Labels: ci-failure, pull-request-available
> Fix For: 0.13.0
>
>
> After 
> [https://github.com/apache/arrow/pull/3885|https://github.com/apache/arrow/pull/3885,]
>  the zstd ExternalProject is failing in the Windows builds, see 
> [https://ci.appveyor.com/project/ApacheSoftwareFoundation/arrow/builds/23063072/job/bd0gom16atlkddtx]
>  
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4876) Port MutableBuffer to csharp

2019-03-14 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4876:
--
Labels: pull-request-available  (was: )

> Port MutableBuffer to csharp
> 
>
> Key: ARROW-4876
> URL: https://issues.apache.org/jira/browse/ARROW-4876
> Project: Apache Arrow
>  Issue Type: Task
>  Components: C#
>Reporter: Prashanth Govindarajan
>Priority: Major
>  Labels: pull-request-available
>
> C++ has a "MutableBuffer" that exposes the underlying T*. Port it to csharp. 
> It's an easy port. ArrowBuffer at the moment is exposed as ReadOnlyMemory. 
> The builder actually hands it a "Memory" object, so it ought to be a simple 
> change



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4056) [C++] Upgrade to boost-cpp 1.69.0 again

2019-03-14 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4056?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4056:
--
Labels: pull-request-available  (was: )

> [C++] Upgrade to boost-cpp 1.69.0 again
> ---
>
> Key: ARROW-4056
> URL: https://issues.apache.org/jira/browse/ARROW-4056
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> EDIT: the issue has been present for a large portion of 2018. I found this 
> when merging the macOS C++ builds and changed the build type to Xcode 8.3:
> https://travis-ci.org/wesm/arrow/jobs/469297420#L2856
> I reported the issue into conda-forge at 
> https://github.com/conda-forge/boost-cpp-feedstock/issues/40
> It seems that the Ray project worked around this earlier this year: 
> https://github.com/ray-project/ray/pull/1688



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4878) [C++] ARROW_DEPENDENCY_SOURCE=CONDA does not work properly with MSVC

2019-03-14 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4878?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4878:
--
Labels: pull-request-available  (was: )

> [C++] ARROW_DEPENDENCY_SOURCE=CONDA does not work properly with MSVC
> 
>
> Key: ARROW-4878
> URL: https://issues.apache.org/jira/browse/ARROW-4878
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Wes McKinney
>Priority: Blocker
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> The prefix must have {{\Library}} added to it



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4870) [Ruby] gemspec has wrong msys2 dependency listed

2019-03-14 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4870:
--
Labels: pull-request-available  (was: )

> [Ruby] gemspec has wrong msys2 dependency listed
> 
>
> Key: ARROW-4870
> URL: https://issues.apache.org/jira/browse/ARROW-4870
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Ruby
>Affects Versions: 0.12.1
>Reporter: Dominic Sisneros
>Assignee: Dominic Sisneros
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
>  ruby gemspec has wrong msys2 dependency listed
> change mys2_mingw_dependencies to correct package
> pacman -Ss arrow
> mingw32/mingw-w64-i686-arrow 0.11.1-1
> Apache Arrow is a cross-language development platform for in-memory data 
> (mingw-w64)
> mingw64/mingw-w64-x86_64-arrow 0.11.1-1 [installed]
> Apache Arrow is a cross-language development platform for in-memory data 
> (mingw-w64)



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-3822) [C++] parquet::arrow::FileReader::GetRecordBatchReader has logical error on row groups with chunked columns

2019-03-14 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-3822:
--
Labels: pull-request-available  (was: )

> [C++] parquet::arrow::FileReader::GetRecordBatchReader has logical error on 
> row groups with chunked columns
> ---
>
> Key: ARROW-3822
> URL: https://issues.apache.org/jira/browse/ARROW-3822
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Benjamin Kietzman
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> If a BinaryArray / StringArray overflows a single column when reading a row 
> group, the resulting table will have a ChunkedArray. Using TableBatchReader 
> in 
> https://github.com/apache/arrow/blob/master/cpp/src/parquet/arrow/reader.cc#L176
> will therefore only return a part of the row group, discarding the rest



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4887) [GLib] Add garrow_array_count()

2019-03-14 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4887:
--
Labels: pull-request-available  (was: )

> [GLib] Add garrow_array_count()
> ---
>
> Key: ARROW-4887
> URL: https://issues.apache.org/jira/browse/ARROW-4887
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: GLib
>Reporter: Kouhei Sutou
>Assignee: Kouhei Sutou
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4889) [C++] Add STATUS messages for Protobuf in CMake

2019-03-15 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4889:
--
Labels: pull-request-available  (was: )

> [C++] Add STATUS messages for Protobuf in CMake
> ---
>
> Key: ARROW-4889
> URL: https://issues.apache.org/jira/browse/ARROW-4889
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Uwe L. Korn
>Assignee: Uwe L. Korn
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> With Protobuf it can easily happen that {{protoc}} and {{libprotobuf}} 
> mismatch. We should have some output about this in CMake to better debug this 
> when users report issues.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4891) [C++] ZLIB include directories not added

2019-03-15 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4891:
--
Labels: pull-request-available  (was: )

> [C++] ZLIB include directories not added
> 
>
> Key: ARROW-4891
> URL: https://issues.apache.org/jira/browse/ARROW-4891
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Uwe L. Korn
>Assignee: Uwe L. Korn
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> This causes a failing centos-7 build.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4892) Move SQL parser and planner into sql package

2019-03-15 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4892:
--
Labels: pull-request-available  (was: )

> Move SQL parser and planner into sql package
> 
>
> Key: ARROW-4892
> URL: https://issues.apache.org/jira/browse/ARROW-4892
> Project: Apache Arrow
>  Issue Type: Sub-task
>  Components: Rust, Rust - DataFusion
>Affects Versions: 0.12.0
>Reporter: Andy Grove
>Assignee: Andy Grove
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4893) [C++] conda packages should use $PREFIX inside of conda-build

2019-03-15 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4893:
--
Labels: pull-request-available  (was: )

> [C++] conda packages should use $PREFIX inside of conda-build
> -
>
> Key: ARROW-4893
> URL: https://issues.apache.org/jira/browse/ARROW-4893
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++, Packaging
>Reporter: Uwe L. Korn
>Assignee: Uwe L. Korn
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4896) [Rust] [DataFusion] Remove all uses of panic! from tests

2019-03-15 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4896:
--
Labels: pull-request-available  (was: )

> [Rust] [DataFusion] Remove all uses of panic! from tests
> 
>
> Key: ARROW-4896
> URL: https://issues.apache.org/jira/browse/ARROW-4896
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust, Rust - DataFusion
>Affects Versions: 0.12.0
>Reporter: Andy Grove
>Assignee: Andy Grove
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> Tests should use assert!(false) rather than panic!()



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4898) [C++] Old versions of FindProtobuf.cmake use ALL-CAPS for variables

2019-03-15 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4898:
--
Labels: pull-request-available  (was: )

> [C++] Old versions of FindProtobuf.cmake use ALL-CAPS for variables
> ---
>
> Key: ARROW-4898
> URL: https://issues.apache.org/jira/browse/ARROW-4898
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Uwe L. Korn
>Assignee: Uwe L. Korn
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> We only need to handle {{PROTOBUF_PROTOC_LIBRARY}} vs 
> {{Protobuf_PROTOC_LIBRARY}}.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4899) [Rust] [DataFusion] Remove all uses of panic! from expression.rs

2019-03-15 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4899?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4899:
--
Labels: pull-request-available  (was: )

> [Rust] [DataFusion] Remove all uses of panic! from expression.rs
> 
>
> Key: ARROW-4899
> URL: https://issues.apache.org/jira/browse/ARROW-4899
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust, Rust - DataFusion
>Affects Versions: 0.12.0
>Reporter: Andy Grove
>Assignee: Andy Grove
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4894) [Rust] [DataFusion] Remove all uses of panic! from aggregate.rs

2019-03-15 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4894:
--
Labels: pull-request-available  (was: )

> [Rust] [DataFusion] Remove all uses of panic! from aggregate.rs
> ---
>
> Key: ARROW-4894
> URL: https://issues.apache.org/jira/browse/ARROW-4894
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust, Rust - DataFusion
>Affects Versions: 0.12.0
>Reporter: Andy Grove
>Assignee: Andy Grove
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>
> Code cleanup



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


  1   2   3   4   5   6   7   8   9   10   >