[jira] [Updated] (ARROW-5251) [C++][Parquet] Bad initialization in statistics computation

2019-05-02 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5251?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-5251: -- Component/s: parquet C++ > [C++][Parquet] Bad initialization

[jira] [Created] (ARROW-5251) [C++][Parquet] Bad initialization in statistics computation

2019-05-02 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-5251: - Summary: [C++][Parquet] Bad initialization in statistics computation Key: ARROW-5251 URL: https://issues.apache.org/jira/browse/ARROW-5251 Project:

[jira] [Created] (ARROW-5253) [C++] external Snappy fails on Alpine

2019-05-03 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-5253: - Summary: [C++] external Snappy fails on Alpine Key: ARROW-5253 URL: https://issues.apache.org/jira/browse/ARROW-5253 Project: Apache Arrow

[jira] [Commented] (ARROW-5130) [Python] Segfault when importing TensorFlow after Pyarrow

2019-04-26 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16826967#comment-16826967 ] Francois Saint-Jacques commented on ARROW-5130: --- It's a component called crossbow, the gist

[jira] [Commented] (ARROW-5130) [Python] Segfault when importing TensorFlow after Pyarrow

2019-04-26 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16826968#comment-16826968 ] Francois Saint-Jacques commented on ARROW-5130: --- You'll have to replicate

[jira] [Commented] (ARROW-5214) [C++] Offline dependency downloader misses some libraries

2019-04-26 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16826979#comment-16826979 ] Francois Saint-Jacques commented on ARROW-5214: --- The script is exiting silently, but with a

[jira] [Comment Edited] (ARROW-5214) [C++] Offline dependency downloader misses some libraries

2019-04-26 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16826979#comment-16826979 ] Francois Saint-Jacques edited comment on ARROW-5214 at 4/26/19 1:59 PM:

[jira] [Resolved] (ARROW-4187) [C++] file-benchmark uses

2019-07-05 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-4187?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-4187. --- Resolution: Fixed Fix Version/s: 1.0.0 Issue resolved by pull request

[jira] [Resolved] (ARROW-5849) Compiler warnings on mingw-w64

2019-07-05 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5849?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-5849. --- Resolution: Fixed Fix Version/s: 1.0.0 Issue resolved by pull request

[jira] [Resolved] (ARROW-5851) [C++] Compilation of reference benchmarks fails

2019-07-05 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5851?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-5851. --- Resolution: Fixed Fix Version/s: 1.0.0 Issue resolved by pull request

[jira] [Commented] (ARROW-5759) Suspend CI builds for draft pull requests on GitHub

2019-06-27 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16874302#comment-16874302 ] Francois Saint-Jacques commented on ARROW-5759: --- I don't agree with this one, often the CI

[jira] [Resolved] (ARROW-5718) [R] auto splice data frames in record_batch() and table()

2019-06-27 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-5718. --- Resolution: Fixed > [R] auto splice data frames in record_batch() and

[jira] [Resolved] (ARROW-3732) [R] Add functions to write RecordBatch or Schema to Message value, then read back

2019-06-27 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-3732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-3732. --- Resolution: Fixed Fix Version/s: 0.14.0 > [R] Add functions to write

[jira] [Resolved] (ARROW-5749) [Python] Add Python binding for Table::CombineChunks()

2019-06-27 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-5749. --- Resolution: Fixed Issue resolved by pull request 4712

[jira] [Created] (ARROW-5739) [CI] Fix docker python build

2019-06-26 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-5739: - Summary: [CI] Fix docker python build Key: ARROW-5739 URL: https://issues.apache.org/jira/browse/ARROW-5739 Project: Apache Arrow Issue

[jira] [Resolved] (ARROW-5045) [Rust] Code coverage silently failing in CI

2019-06-26 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-5045. --- Resolution: Fixed Fix Version/s: 0.14.0 Issue resolved by pull

[jira] [Commented] (ARROW-5730) [CI] Dask integration tests are failing

2019-06-26 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5730?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16873369#comment-16873369 ] Francois Saint-Jacques commented on ARROW-5730: --- Note that the local error I got is fixed

[jira] [Commented] (ARROW-5745) [C++] properties of Map(Array|Type) are confusingly named

2019-06-26 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5745?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16873499#comment-16873499 ] Francois Saint-Jacques commented on ARROW-5745: --- Just note that for PrimitiveArray, it

[jira] [Created] (ARROW-5779) [R][CI] R's docker image fails due to incompatibility

2019-06-28 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-5779: - Summary: [R][CI] R's docker image fails due to incompatibility Key: ARROW-5779 URL: https://issues.apache.org/jira/browse/ARROW-5779 Project: Apache

[jira] [Created] (ARROW-5914) [CI] Build bundled dependencies in docker build step

2019-07-11 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-5914: - Summary: [CI] Build bundled dependencies in docker build step Key: ARROW-5914 URL: https://issues.apache.org/jira/browse/ARROW-5914 Project: Apache

[jira] [Resolved] (ARROW-5923) [C++] Fix int96 comment

2019-07-12 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5923?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-5923. --- Resolution: Fixed Fix Version/s: 1.0.0 Issue resolved by pull request

[jira] [Created] (ARROW-5923) [C++] Fix int96 comment

2019-07-12 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-5923: - Summary: [C++] Fix int96 comment Key: ARROW-5923 URL: https://issues.apache.org/jira/browse/ARROW-5923 Project: Apache Arrow Issue Type:

[jira] [Resolved] (ARROW-5588) [C++] Better support for building UnionArrays

2019-07-12 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-5588. --- Resolution: Fixed Fix Version/s: 1.0.0 Issue resolved by pull request

[jira] [Assigned] (ARROW-5588) [C++] Better support for building UnionArrays

2019-07-12 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-5588: - Assignee: Benjamin Kietzman > [C++] Better support for building

[jira] [Updated] (ARROW-5921) [C++][Fuzzing] Missing nullptr checks in IPC

2019-07-12 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-5921: -- Fix Version/s: 0.14.1 > [C++][Fuzzing] Missing nullptr checks in IPC >

[jira] [Updated] (ARROW-5921) [C++][Fuzzing] Missing nullptr checks in IPC

2019-07-12 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-5921: -- Fix Version/s: 1.0.0 > [C++][Fuzzing] Missing nullptr checks in IPC >

[jira] [Resolved] (ARROW-5781) [Archery] Ensure benchmark clone accepts remotes in revision

2019-06-28 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-5781. --- Resolution: Fixed Fix Version/s: 0.14.0 Issue resolved by pull

[jira] [Assigned] (ARROW-5781) [Archery] Ensure benchmark clone accepts remotes in revision

2019-06-28 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-5781: - Assignee: Francois Saint-Jacques > [Archery] Ensure benchmark clone

[jira] [Updated] (ARROW-5527) [C++] HashTable/MemoTable should use Buffer(s)/Builder(s) for heap data

2019-07-08 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-5527: -- Description: The current implementation uses `std::vector` and `std::string`

[jira] [Updated] (ARROW-5527) [C++] HashTable/MemoTable should use Buffer(s)/Builder(s) for heap data

2019-07-08 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5527?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-5527: -- Description: The current implementation uses `std::vector` and `std::string`

[jira] [Updated] (ARROW-5202) [C++] Test and benchmark libraries library search path subtly affected by installation

2019-04-23 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-5202: -- Description: Test and benchmark binaries should always favor the local

[jira] [Updated] (ARROW-5202) [C++] Test and benchmark libraries library search path subtly affected by installation

2019-04-23 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-5202: -- Priority: Minor (was: Major) > [C++] Test and benchmark libraries library

[jira] [Updated] (ARROW-5202) [C++] Test and benchmark libraries library search path subtly affected by installation

2019-04-23 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-5202: -- Fix Version/s: 0.14.0 > [C++] Test and benchmark libraries library search path

[jira] [Updated] (ARROW-5196) [C++] Uniform usage of Google cpu_features library accross the codebase

2019-04-23 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5196?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-5196: -- Summary: [C++] Uniform usage of Google cpu_features library accross the

[jira] [Created] (ARROW-5202) [C++

2019-04-23 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-5202: - Summary: [C++ Key: ARROW-5202 URL: https://issues.apache.org/jira/browse/ARROW-5202 Project: Apache Arrow Issue Type: Improvement

[jira] [Updated] (ARROW-5202) [C++] Test and benchmark libraries library search path subtly affected by installation

2019-04-23 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-5202: -- Description: Test and benchmark binaries should always favor the local

[jira] [Updated] (ARROW-5202) [C++] Test and benchmark libraries library search path subtly affected by installation

2019-04-23 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-5202: -- Description: Test and benchmark binaries should always favor the local

[jira] [Updated] (ARROW-5202) [C++] Test and benchmark libraries library search path subtly affected by installation

2019-04-23 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5202?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-5202: -- Description: Test and benchmark binaries should always favor the local

[jira] [Updated] (ARROW-5071) [Benchmarking] Performs a benchmark run with archery

2019-04-24 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-5071: -- Description: Run all regression benchmarks, consume output and re-format

[jira] [Assigned] (ARROW-5214) [C++] Offline dependency downloader misses some libraries

2019-04-25 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-5214: - Assignee: Francois Saint-Jacques > [C++] Offline dependency downloader

[jira] [Commented] (ARROW-5130) [Python] Segfault when importing TensorFlow after Pyarrow

2019-04-25 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16826333#comment-16826333 ] Francois Saint-Jacques commented on ARROW-5130: --- This is not fixed with in master, I

[jira] [Commented] (ARROW-5130) [Python] Segfault when importing TensorFlow after Pyarrow

2019-04-25 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16826335#comment-16826335 ] Francois Saint-Jacques commented on ARROW-5130: --- Also note that I can't trigger this from a

[jira] [Updated] (ARROW-5071) [Benchmarking] Performs a benchmark run with archery

2019-04-23 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-5071: -- Description: Run all regression benchmarks, consume output and re-format

[jira] [Created] (ARROW-5781) [Archery] Ensure benchmark clone accepts remotes in revision

2019-06-28 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-5781: - Summary: [Archery] Ensure benchmark clone accepts remotes in revision Key: ARROW-5781 URL: https://issues.apache.org/jira/browse/ARROW-5781

[jira] [Resolved] (ARROW-5780) [C++] Add benchmark for Decimal128 operations

2019-06-28 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-5780. --- Resolution: Fixed Fix Version/s: 0.14.0 Issue resolved by pull

[jira] [Assigned] (ARROW-5803) [C++] Dockerize C++ with clang 7 Travis CI unit test logic

2019-07-02 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5803?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-5803: - Assignee: Francois Saint-Jacques > [C++] Dockerize C++ with clang 7

[jira] [Commented] (ARROW-6004) [C++] CSV reader ignore_empty_lines option doesn't handle empty lines

2019-07-31 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16897238#comment-16897238 ] Francois Saint-Jacques commented on ARROW-6004: --- I'd expect the empty lines to be skipped,

[jira] [Commented] (ARROW-6004) [C++] CSV reader ignore_empty_lines option doesn't handle empty lines

2019-07-31 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6004?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16897264#comment-16897264 ] Francois Saint-Jacques commented on ARROW-6004: --- I was agreeing with the (current) default

[jira] [Updated] (ARROW-6123) [C++] IsIn kernel should not materialize the output internal

2019-08-02 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-6123: -- Affects Version/s: 0.15.0 > [C++] IsIn kernel should not materialize the

[jira] [Created] (ARROW-6123) [C++] IsIn kernel should not materialize the output internal

2019-08-02 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-6123: - Summary: [C++] IsIn kernel should not materialize the output internal Key: ARROW-6123 URL: https://issues.apache.org/jira/browse/ARROW-6123

[jira] [Updated] (ARROW-6123) [C++] IsIn kernel should not materialize the output internal

2019-08-02 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-6123: -- Labels: ana (was: ) > [C++] IsIn kernel should not materialize the output

[jira] [Updated] (ARROW-6123) [C++] IsIn kernel should not materialize the output internal

2019-08-02 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-6123: -- Component/s: C++ > [C++] IsIn kernel should not materialize the output

[jira] [Created] (ARROW-6121) [Tools] Improve merge tool cli ergonomic

2019-08-02 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-6121: - Summary: [Tools] Improve merge tool cli ergonomic Key: ARROW-6121 URL: https://issues.apache.org/jira/browse/ARROW-6121 Project: Apache Arrow

[jira] [Created] (ARROW-6122) [C++] IsIn kernel must support FixedSizeBinary

2019-08-02 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-6122: - Summary: [C++] IsIn kernel must support FixedSizeBinary Key: ARROW-6122 URL: https://issues.apache.org/jira/browse/ARROW-6122 Project: Apache Arrow

[jira] [Commented] (ARROW-5932) undefined reference to `__cxa_init_primary_exception@CXXABI_1.3.11'

2019-08-02 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-5932?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16899167#comment-16899167 ] Francois Saint-Jacques commented on ARROW-5932: --- How did you install arrow, from sources?

[jira] [Created] (ARROW-6124) [C++] IsIn kernel should sort in a single pass (with nulls)

2019-08-02 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-6124: - Summary: [C++] IsIn kernel should sort in a single pass (with nulls) Key: ARROW-6124 URL: https://issues.apache.org/jira/browse/ARROW-6124 Project:

[jira] [Updated] (ARROW-6123) [C++] IsIn kernel should not materialize the output internal

2019-08-02 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-6123: -- Labels: (was: ana) > [C++] IsIn kernel should not materialize the output

[jira] [Updated] (ARROW-6122) [C++] ArgSort kernel must support FixedSizeBinary

2019-08-02 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-6122: -- Summary: [C++] ArgSort kernel must support FixedSizeBinary (was: [C++] IsIn

[jira] [Updated] (ARROW-6123) [C++] ArgSort kernel should not materialize the output internal

2019-08-02 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-6123: -- Summary: [C++] ArgSort kernel should not materialize the output internal

[jira] [Resolved] (ARROW-1566) [C++] Implement non-materializing sort kernels

2019-08-02 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-1566. --- Resolution: Fixed Fix Version/s: 0.15.0 Issue resolved by pull

[jira] [Assigned] (ARROW-1566) [C++] Implement non-materializing sort kernels

2019-08-02 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-1566: - Assignee: Artem Alekseev > [C++] Implement non-materializing sort

[jira] [Created] (ARROW-6244) [C++] Implement Partition DataSource

2019-08-14 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-6244: - Summary: [C++] Implement Partition DataSource Key: ARROW-6244 URL: https://issues.apache.org/jira/browse/ARROW-6244 Project: Apache Arrow

[jira] [Assigned] (ARROW-6242) [C++] Implements basic Dataset/Scanner/ScannerBuilder

2019-08-14 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-6242: - Assignee: Francois Saint-Jacques > [C++] Implements basic

[jira] [Created] (ARROW-6243) [C++] Implement basic Filter expression classes

2019-08-14 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-6243: - Summary: [C++] Implement basic Filter expression classes Key: ARROW-6243 URL: https://issues.apache.org/jira/browse/ARROW-6243 Project: Apache Arrow

[jira] [Created] (ARROW-6242) [C++] Implements basic Dataset/Scanner/ScannerBuilder

2019-08-14 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-6242: - Summary: [C++] Implements basic Dataset/Scanner/ScannerBuilder Key: ARROW-6242 URL: https://issues.apache.org/jira/browse/ARROW-6242 Project: Apache

[jira] [Comment Edited] (ARROW-6278) [R] Handle raw vector from read_parquet

2019-08-16 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16909248#comment-16909248 ] Francois Saint-Jacques edited comment on ARROW-6278 at 8/16/19 5:29 PM:

[jira] [Commented] (ARROW-6278) [R] Handle raw vector from read_parquet

2019-08-16 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16909248#comment-16909248 ] Francois Saint-Jacques commented on ARROW-6278: --- There's the BufferReader in C++

[jira] [Resolved] (ARROW-6258) [R] Add macOS build scripts

2019-08-19 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6258?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-6258. --- Resolution: Fixed Issue resolved by pull request 5095

[jira] [Created] (ARROW-6238) [C++] Implement SimpleDataSource

2019-08-14 Thread Francois Saint-Jacques (JIRA)
Francois Saint-Jacques created ARROW-6238: - Summary: [C++] Implement SimpleDataSource Key: ARROW-6238 URL: https://issues.apache.org/jira/browse/ARROW-6238 Project: Apache Arrow

[jira] [Updated] (ARROW-6238) [C++] Implement SimpleDataSource/SimpleDataFragment

2019-08-14 Thread Francois Saint-Jacques (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-6238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-6238: -- Summary: [C++] Implement SimpleDataSource/SimpleDataFragment (was: [C++]

[jira] [Updated] (ARROW-3705) [Python] Add "nrows" argument to parquet.read_table read indicated number of rows from file instead of whole file

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-3705: -- Labels: dataset datasets parquet (was: datasets parquet) > [Python] Add

[jira] [Updated] (ARROW-3379) [C++] Implement regex/multichar delimiter tokenizer

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-3379: -- Labels: csv dataset datasets (was: csv datasets) > [C++] Implement

[jira] [Updated] (ARROW-2801) [Python] Implement splt_row_groups for ParquetDataset

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2801?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-2801: -- Labels: dataset datasets parquet pull-request-available (was: datasets

[jira] [Updated] (ARROW-3538) [Python] ability to override the automated assignment of uuid for filenames when writing datasets

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3538?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-3538: -- Labels: dataset datasets features parquet pull-request-available (was:

[jira] [Updated] (ARROW-6238) [C++] Implement SimpleDataSource/SimpleDataFragment

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-6238: -- Labels: dataset datasets pull-request-available (was: datasets

[jira] [Updated] (ARROW-6242) [C++] Implements basic Dataset/Scanner/ScannerBuilder

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-6242: -- Labels: dataset datasets (was: datasets) > [C++] Implements basic

[jira] [Updated] (ARROW-3764) [C++] Port Python "ParquetDataset" business logic to C++

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-3764: -- Labels: dataset datasets parquet (was: datasets parquet) > [C++] Port Python

[jira] [Updated] (ARROW-6161) [C++] Implements dataset::ParquetFile and associated Scan structures

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-6161: -- Labels: dataset datasets pull-request-available (was: datasets

[jira] [Updated] (ARROW-3408) [C++] Add option to CSV reader to dictionary encode individual columns or all string / binary columns

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3408?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-3408: -- Labels: csv dataset datasets (was: csv datasets) > [C++] Add option to CSV

[jira] [Updated] (ARROW-2882) [C++][Python] Support AWS Firehose partition_scheme implementation for Parquet datasets

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-2882: -- Labels: dataset datasets parquet (was: datasets parquet) > [C++][Python]

[jira] [Updated] (ARROW-6244) [C++] Implement Partition DataSource

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-6244: -- Labels: dataset datasets (was: datasets) > [C++] Implement Partition

[jira] [Updated] (ARROW-4076) [Python] schema validation and filters

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4076?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-4076: -- Labels: dataset datasets easyfix parquet pull-request-available (was:

[jira] [Assigned] (ARROW-6214) [R] Sanitizer errors triggered via R bindings

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-6214: - Assignee: Francois Saint-Jacques > [R] Sanitizer errors triggered via R

[jira] [Updated] (ARROW-4470) [Python] Pyarrow using considerable more memory when reading partitioned Parquet file

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-4470: -- Labels: dataset datasets parquet (was: datasets parquet) > [Python] Pyarrow

[jira] [Updated] (ARROW-1036) [C++] Define abstract API for filtering Arrow streams (e.g. predicate evaluation)

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-1036: -- Labels: dataset datasets (was: datasets) > [C++] Define abstract API for

[jira] [Updated] (ARROW-1089) [C++/Python] Add API to write an Arrow stream into either the stream or file formats on disk

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-1089: -- Labels: dataset datasets (was: datasets) > [C++/Python] Add API to write an

[jira] [Updated] (ARROW-6243) [C++] Implement basic Filter expression classes

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-6243: -- Labels: dataset datasets (was: datasets) > [C++] Implement basic Filter

[jira] [Updated] (ARROW-2366) [Python] Support reading Parquet files having a permutation of column order

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-2366: -- Labels: dataset datasets parquet (was: datasets parquet) > [Python] Support

[jira] [Updated] (ARROW-3424) [Python] Improved workflow for loading an arbitrary collection of Parquet files

2019-08-21 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3424?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-3424: -- Labels: dataset datasets parquet (was: datasets parquet) > [Python] Improved

[jira] [Resolved] (ARROW-5992) [C++] Array::View fails for string/utf8 as binary

2019-08-20 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-5992. --- Resolution: Fixed Issue resolved by pull request 5125

[jira] [Resolved] (ARROW-6183) [R] Document that you don't have to use tidyselect if you don't want

2019-08-22 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-6183. --- Fix Version/s: 0.15.0 Resolution: Fixed Issue resolved by pull

[jira] [Commented] (ARROW-6214) [R] Sanitizer errors triggered via R bindings

2019-08-22 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16913446#comment-16913446 ] Francois Saint-Jacques commented on ARROW-6214: --- See attached files for full stack traces

[jira] [Updated] (ARROW-6214) [R] Sanitizer errors triggered via R bindings

2019-08-22 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-6214: -- Attachment: RDsan.failures RDcsan.failures > [R] Sanitizer

[jira] [Resolved] (ARROW-5966) [Python] Capacity error when converting large UTF32 numpy array to arrow array

2019-08-20 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-5966. --- Resolution: Fixed Issue resolved by pull request 5122

[jira] [Resolved] (ARROW-6048) [C++] Add ChunkedArray::View which calls to Array::View

2019-08-20 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6048?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-6048. --- Resolution: Fixed Issue resolved by pull request 5127

[jira] [Assigned] (ARROW-5141) [C++] Share more of the IPC testing utils with the rest of Arrow

2019-08-20 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-5141: - Assignee: (was: Francois Saint-Jacques) > [C++] Share more of the

[jira] [Assigned] (ARROW-5082) [Python][Packaging] Reduce size of macOS and manylinux1 wheels

2019-08-20 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-5082: - Assignee: (was: Francois Saint-Jacques) > [Python][Packaging]

[jira] [Assigned] (ARROW-5630) [Python] Table of nested arrays doesn't round trip

2019-08-20 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-5630: - Assignee: (was: Francois Saint-Jacques) > [Python] Table of nested

[jira] [Resolved] (ARROW-6046) [C++] Slice RecordBatch of String array with offset 0 returns whole batch

2019-08-20 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6046?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-6046. --- Resolution: Fixed Issue resolved by pull request 5126

[jira] [Commented] (ARROW-6362) [C++] S3: more flexible credential options

2019-08-26 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6362?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16916011#comment-16916011 ] Francois Saint-Jacques commented on ARROW-6362: --- I think the exposed (high level) interface

<    1   2   3   4   5   6   7   8   9   10   >