[jira] [Updated] (ARROW-3519) [Gandiva] Add support for functions that can return variable len output
[ https://issues.apache.org/jira/browse/ARROW-3519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-3519: -- Labels: pull-request-available (was: ) > [Gandiva] Add support for functions that can return variable len output > --- > > Key: ARROW-3519 > URL: https://issues.apache.org/jira/browse/ARROW-3519 > Project: Apache Arrow > Issue Type: Task > Components: Gandiva >Reporter: Pindikura Ravindra >Assignee: Pindikura Ravindra >Priority: Major > Labels: pull-request-available > > This is a pre-requisite for ARROW-3459. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3625) [Go] add examples for Table, Record and {Table,Record}Reader
Sebastien Binet created ARROW-3625: -- Summary: [Go] add examples for Table, Record and {Table,Record}Reader Key: ARROW-3625 URL: https://issues.apache.org/jira/browse/ARROW-3625 Project: Apache Arrow Issue Type: Improvement Components: Go Reporter: Sebastien Binet -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3626) [Go] add a CSV TableReader
Sebastien Binet created ARROW-3626: -- Summary: [Go] add a CSV TableReader Key: ARROW-3626 URL: https://issues.apache.org/jira/browse/ARROW-3626 Project: Apache Arrow Issue Type: Improvement Components: Go Reporter: Sebastien Binet assuming ARROW-3621 goes in, it should be relatively straightforward to implement a TableReader off a CSV file, using `encoding/csv`. also drawing inspiration from: - https://github.com/apache/arrow/blob/master/cpp/src/arrow/csv/reader.h - https://github.com/apache/arrow/blob/master/cpp/src/arrow/csv/reader.cc -- This message was sent by Atlassian JIRA (v7.6.3#76005)
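As a rough illustration of the idea in ARROW-3626 (read CSV records and accumulate them into column-oriented batches), here is a minimal sketch. It uses Python's standard `csv` module rather than Go's `encoding/csv`, and it does not reflect the eventual Go API: `read_csv_batches` and `chunk_size` are hypothetical names, the first row is assumed to be a header, and every value is kept as a string.

```python
import csv
import io
from typing import Dict, Iterator, List, TextIO


def read_csv_batches(src: TextIO, chunk_size: int = 2) -> Iterator[Dict[str, List[str]]]:
    """Yield column-oriented batches holding at most chunk_size rows each.

    Hypothetical helper: the first CSV row is treated as the header and
    all values stay strings (no type inference is attempted).
    """
    reader = csv.reader(src)
    try:
        header = next(reader)
    except StopIteration:
        return  # empty input: no batches at all
    batch: Dict[str, List[str]] = {name: [] for name in header}
    rows = 0
    for record in reader:
        if len(record) != len(header):
            raise ValueError(f"expected {len(header)} fields, got {len(record)}")
        # Append each field to its column instead of storing the row.
        for name, value in zip(header, record):
            batch[name].append(value)
        rows += 1
        if rows == chunk_size:
            yield batch
            batch = {name: [] for name in header}
            rows = 0
    if rows:
        yield batch  # final, possibly shorter, batch


if __name__ == "__main__":
    for b in read_csv_batches(io.StringIO("id,name\n1,a\n2,b\n3,c\n")):
        print(b)
```

Yielding batches lazily keeps memory bounded by `chunk_size`, which is the main reason to expose a reader rather than load the whole file into one table.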
[jira] [Created] (ARROW-3624) [Python/C++] Support for zero-sized device buffers
Pearu Peterson created ARROW-3624: - Summary: [Python/C++] Support for zero-sized device buffers Key: ARROW-3624 URL: https://issues.apache.org/jira/browse/ARROW-3624 Project: Apache Arrow Issue Type: Improvement Components: C++, Python Reporter: Pearu Peterson CUDA cuMemAlloc raises an error when trying to allocate device memory with zero size. On the other hand, using zero-sized buffers or arrays is important for generic programming. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
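One common way to reconcile an allocator that rejects zero-size requests with generic code that needs empty buffers is to special-case size 0 before the allocator is ever called. The sketch below shows that pattern in plain Python. It is an assumption about the general approach, not the change made for ARROW-3624: `DeviceBuffer`, `strict_alloc` and `new_buffer` are hypothetical names, and the "device address" is a fake integer.

```python
from typing import Callable, Optional


class DeviceBuffer:
    """Hypothetical stand-in for a device buffer: an address plus a size."""

    def __init__(self, address: Optional[int], size: int) -> None:
        self.address = address
        self.size = size


def strict_alloc(size: int) -> int:
    """Stand-in for an allocator that, like the one described, rejects size 0."""
    if size <= 0:
        raise ValueError("invalid allocation size")
    return 0x1000  # fake device address, for illustration only


def new_buffer(size: int, alloc: Callable[[int], int] = strict_alloc) -> DeviceBuffer:
    """Return a buffer of `size` bytes without calling the allocator for size 0."""
    if size < 0:
        raise ValueError("size must be non-negative")
    if size == 0:
        # Empty buffer: no device memory is needed, so no address is held.
        return DeviceBuffer(address=None, size=0)
    return DeviceBuffer(address=alloc(size), size=size)
```

With this shape, callers can treat `new_buffer(0)` like any other buffer as long as they only rely on `size` and never dereference the address of an empty buffer.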
[jira] [Assigned] (ARROW-3622) [Go] implement Schema.Equal
[ https://issues.apache.org/jira/browse/ARROW-3622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastien Binet reassigned ARROW-3622: -- Assignee: Sebastien Binet > [Go] implement Schema.Equal > --- > > Key: ARROW-3622 > URL: https://issues.apache.org/jira/browse/ARROW-3622 > Project: Apache Arrow > Issue Type: Improvement > Components: Go >Reporter: Sebastien Binet >Assignee: Sebastien Binet >Priority: Major > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Assigned] (ARROW-3612) [Go] implement RecordBatch and RecordBatchReader
[ https://issues.apache.org/jira/browse/ARROW-3612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastien Binet reassigned ARROW-3612: -- Assignee: Sebastien Binet > [Go] implement RecordBatch and RecordBatchReader > > > Key: ARROW-3612 > URL: https://issues.apache.org/jira/browse/ARROW-3612 > Project: Apache Arrow > Issue Type: Improvement > Components: Go >Reporter: Sebastien Binet >Assignee: Sebastien Binet >Priority: Major > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Assigned] (ARROW-3584) [Go] add support for Table
[ https://issues.apache.org/jira/browse/ARROW-3584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastien Binet reassigned ARROW-3584: -- Assignee: Sebastien Binet > [Go] add support for Table > -- > > Key: ARROW-3584 > URL: https://issues.apache.org/jira/browse/ARROW-3584 > Project: Apache Arrow > Issue Type: Improvement > Components: Go >Reporter: Sebastien Binet >Assignee: Sebastien Binet >Priority: Major > Labels: pull-request-available > Time Spent: 1.5h > Remaining Estimate: 0h > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3621) [Go] implement TableBatchReader
Sebastien Binet created ARROW-3621: -- Summary: [Go] implement TableBatchReader Key: ARROW-3621 URL: https://issues.apache.org/jira/browse/ARROW-3621 Project: Apache Arrow Issue Type: Improvement Components: Go Reporter: Sebastien Binet -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (ARROW-3624) [Python/C++] Support for zero-sized device buffers
[ https://issues.apache.org/jira/browse/ARROW-3624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-3624: -- Labels: pull-request-available (was: ) > [Python/C++] Support for zero-sized device buffers > -- > > Key: ARROW-3624 > URL: https://issues.apache.org/jira/browse/ARROW-3624 > Project: Apache Arrow > Issue Type: Improvement > Components: C++, Python >Reporter: Pearu Peterson >Priority: Major > Labels: pull-request-available > > CUDA cuMemAlloc raises an error when trying to allocate device memory with > zero size. > On the other hand, using zero-sized buffers or arrays is important for generic > programming. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Assigned] (ARROW-3621) [Go] implement TableBatchReader
[ https://issues.apache.org/jira/browse/ARROW-3621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastien Binet reassigned ARROW-3621: -- Assignee: Sebastien Binet > [Go] implement TableBatchReader > --- > > Key: ARROW-3621 > URL: https://issues.apache.org/jira/browse/ARROW-3621 > Project: Apache Arrow > Issue Type: Improvement > Components: Go >Reporter: Sebastien Binet >Assignee: Sebastien Binet >Priority: Major > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Assigned] (ARROW-3623) [Go] implement Field.Equal
[ https://issues.apache.org/jira/browse/ARROW-3623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastien Binet reassigned ARROW-3623: -- Assignee: Sebastien Binet > [Go] implement Field.Equal > -- > > Key: ARROW-3623 > URL: https://issues.apache.org/jira/browse/ARROW-3623 > Project: Apache Arrow > Issue Type: Improvement > Components: Go >Reporter: Sebastien Binet >Assignee: Sebastien Binet >Priority: Major > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3622) [Go] implement Schema.Equal
Sebastien Binet created ARROW-3622: -- Summary: [Go] implement Schema.Equal Key: ARROW-3622 URL: https://issues.apache.org/jira/browse/ARROW-3622 Project: Apache Arrow Issue Type: Improvement Components: Go Reporter: Sebastien Binet -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Assigned] (ARROW-3624) [Python/C++] Support for zero-sized device buffers
[ https://issues.apache.org/jira/browse/ARROW-3624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Pearu Peterson reassigned ARROW-3624: - Assignee: Pearu Peterson > [Python/C++] Support for zero-sized device buffers > -- > > Key: ARROW-3624 > URL: https://issues.apache.org/jira/browse/ARROW-3624 > Project: Apache Arrow > Issue Type: Improvement > Components: C++, Python >Reporter: Pearu Peterson >Assignee: Pearu Peterson >Priority: Major > Labels: pull-request-available > Time Spent: 10m > Remaining Estimate: 0h > > CUDA cuMemAlloc raises an error when trying to allocate device memory with > zero size. > On the other hand, using zero-sized buffers or arrays is important for generic > programming. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3627) [Go] add RecordBatchBuilder
Sebastien Binet created ARROW-3627: -- Summary: [Go] add RecordBatchBuilder Key: ARROW-3627 URL: https://issues.apache.org/jira/browse/ARROW-3627 Project: Apache Arrow Issue Type: Improvement Components: Go Reporter: Sebastien Binet -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3628) [R] Make decimal128 data available in R without requiring lossy conversions
Wes McKinney created ARROW-3628: --- Summary: [R] Make decimal128 data available in R without requiring lossy conversions Key: ARROW-3628 URL: https://issues.apache.org/jira/browse/ARROW-3628 Project: Apache Arrow Issue Type: New Feature Components: R Reporter: Wes McKinney Assignee: Romain François Fix For: 0.12.0 -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (ARROW-3593) [R] CI builds failing due to GitHub API rate limits
[ https://issues.apache.org/jira/browse/ARROW-3593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16665486#comment-16665486 ]

Wes McKinney commented on ARROW-3593:
-------------------------------------

This is still ongoing

{code}
* DONE (devtools)
The downloaded source packages are in
  ‘/tmp/Rtmp4MyGYN/downloaded_packages’
2.85s$ Rscript -e 'deps <- devtools::dev_package_deps(dependencies = NA);devtools::install_deps(dependencies = TRUE);if (!all(deps$package %in% installed.packages())) { message("missing: ", paste(setdiff(deps$package, installed.packages()), collapse=", ")); q(status = 1, save = "no")}'
Error: HTTP error 403.
  API rate limit exceeded for 35.193.7.13. (But here's the good news: Authenticated requests get a higher rate limit. Check out the documentation for more details.)
  Rate limit remaining: 0
  Rate limit reset at: 2018-10-26 17:17:46 UTC
Execution halted
The command "Rscript -e 'deps <- devtools::dev_package_deps(dependencies = NA);devtools::install_deps(dependencies = TRUE);if (!all(deps$package %in% installed.packages())) { message("missing: ", paste(setdiff(deps$package, installed.packages()), collapse=", ")); q(status = 1, save = "no")}'" failed and exited with 1 during .
{code}

I suggest moving R to the "allowed failures" section until this resolves

> [R] CI builds failing due to GitHub API rate limits
> ---------------------------------------------------
>
> Key: ARROW-3593
> URL: https://issues.apache.org/jira/browse/ARROW-3593
> Project: Apache Arrow
> Issue Type: Bug
> Components: R
> Reporter: Wes McKinney
> Priority: Major
> Fix For: 0.12.0
>
> Could be due to other GitHub issues of late. [~romainfrancois] [~javierluraschi] could you have a look?
> https://travis-ci.org/apache/arrow/jobs/445003873#L2325

--
This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3623) [Go] implement Field.Equal
Sebastien Binet created ARROW-3623: -- Summary: [Go] implement Field.Equal Key: ARROW-3623 URL: https://issues.apache.org/jira/browse/ARROW-3623 Project: Apache Arrow Issue Type: Improvement Components: Go Reporter: Sebastien Binet -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3629) [Python] Add write_to_dataset to Python Sphinx API listing
Wes McKinney created ARROW-3629: --- Summary: [Python] Add write_to_dataset to Python Sphinx API listing Key: ARROW-3629 URL: https://issues.apache.org/jira/browse/ARROW-3629 Project: Apache Arrow Issue Type: Improvement Components: Python Reporter: Wes McKinney Fix For: 0.12.0 -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (ARROW-3593) [R] CI builds failing due to GitHub API rate limits
[ https://issues.apache.org/jira/browse/ARROW-3593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16665493#comment-16665493 ] Wes McKinney commented on ARROW-3593: - See https://github.com/apache/arrow/pull/2846 > [R] CI builds failing due to GitHub API rate limits > --- > > Key: ARROW-3593 > URL: https://issues.apache.org/jira/browse/ARROW-3593 > Project: Apache Arrow > Issue Type: Bug > Components: R >Reporter: Wes McKinney >Priority: Major > Labels: pull-request-available > Fix For: 0.12.0 > > Time Spent: 10m > Remaining Estimate: 0h > > Could be due to other GitHub issues of late. [~romainfrancois] > [~javierluraschi] could you have a look? > https://travis-ci.org/apache/arrow/jobs/445003873#L2325 -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (ARROW-3593) [R] CI builds failing due to GitHub API rate limits
[ https://issues.apache.org/jira/browse/ARROW-3593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-3593: -- Labels: pull-request-available (was: ) > [R] CI builds failing due to GitHub API rate limits > --- > > Key: ARROW-3593 > URL: https://issues.apache.org/jira/browse/ARROW-3593 > Project: Apache Arrow > Issue Type: Bug > Components: R >Reporter: Wes McKinney >Priority: Major > Labels: pull-request-available > Fix For: 0.12.0 > > > Could be due to other GitHub issues of late. [~romainfrancois] > [~javierluraschi] could you have a look? > https://travis-ci.org/apache/arrow/jobs/445003873#L2325 -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (ARROW-3628) [R] Make decimal128 data available in R without requiring lossy conversions
[ https://issues.apache.org/jira/browse/ARROW-3628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-3628: -- Labels: pull-request-available (was: ) > [R] Make decimal128 data available in R without requiring lossy conversions > --- > > Key: ARROW-3628 > URL: https://issues.apache.org/jira/browse/ARROW-3628 > Project: Apache Arrow > Issue Type: New Feature > Components: R >Reporter: Wes McKinney >Assignee: Romain François >Priority: Major > Labels: pull-request-available > Fix For: 0.12.0 > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3630) [Plasma] [GLib] Add GLib bindings of Plasma
Yosuke Shiro created ARROW-3630: --- Summary: [Plasma] [GLib] Add GLib bindings of Plasma Key: ARROW-3630 URL: https://issues.apache.org/jira/browse/ARROW-3630 Project: Apache Arrow Issue Type: New Feature Components: GLib, Plasma (C++) Reporter: Yosuke Shiro Assignee: Yosuke Shiro Fix For: 0.12.0 -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (ARROW-2592) [Python] AssertionError in to_pandas()
[ https://issues.apache.org/jira/browse/ARROW-2592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16665710#comment-16665710 ]

Wes McKinney commented on ARROW-2592:
-------------------------------------

I'm not quite sure what can be done here. We might have to add an option to ignore the pandas metadata, if any

> [Python] AssertionError in to_pandas()
> --------------------------------------
>
> Key: ARROW-2592
> URL: https://issues.apache.org/jira/browse/ARROW-2592
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 0.8.0, 0.9.0, 0.10.0, 0.11.0, 0.11.1
> Reporter: Dima Ryazanov
> Priority: Major
> Fix For: 0.12.0
>
> Pyarrow 0.8 and 0.9 raise an AssertionError for one of the datasets I have (created using an older version of pyarrow). Repro steps:
> {{In [1]: from pyarrow.parquet import ParquetDataset}}
> {{In [2]: d = ParquetDataset(['bug.parq'])}}
> {{In [3]: t = d.read()}}
> {{In [4]: t.to_pandas()}}
> {{---}}
> {{AssertionError Traceback (most recent call last)}}
> {{ in ()}}
> {{> 1 t.to_pandas()}}
> {{table.pxi in pyarrow.lib.Table.to_pandas()}}
> {{~/envs/cli3/lib/python3.6/site-packages/pyarrow/pandas_compat.py in table_to_blockmanager(options, table, memory_pool, nthreads, categories)}}
> {{ 529 # There must be the same number of field names and physical names}}
> {{ 530 # (fields in the arrow Table)}}
> {{--> 531 assert len(logical_index_names) == len(index_columns_set)}}
> {{ 532 }}
> {{ 533 # It can never be the case in a released version of pyarrow that}}
> {{AssertionError: }}
>
> Here's the file: [https://www.dropbox.com/s/oja3khjsc5tycfh/bug.parq]
> (I was not able to attach it here due to a "missing token", whatever that means.)

--
This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (ARROW-2592) [Python] AssertionError in to_pandas()
[ https://issues.apache.org/jira/browse/ARROW-2592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wes McKinney updated ARROW-2592:
--------------------------------
Fix Version/s: 0.12.0

> [Python] AssertionError in to_pandas()
> --------------------------------------
>
> Key: ARROW-2592
> URL: https://issues.apache.org/jira/browse/ARROW-2592
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 0.8.0, 0.9.0, 0.10.0, 0.11.0, 0.11.1
> Reporter: Dima Ryazanov
> Priority: Major
> Fix For: 0.12.0
>
> Pyarrow 0.8 and 0.9 raise an AssertionError for one of the datasets I have (created using an older version of pyarrow). Repro steps:
> {{In [1]: from pyarrow.parquet import ParquetDataset}}
> {{In [2]: d = ParquetDataset(['bug.parq'])}}
> {{In [3]: t = d.read()}}
> {{In [4]: t.to_pandas()}}
> {{---}}
> {{AssertionError Traceback (most recent call last)}}
> {{ in ()}}
> {{> 1 t.to_pandas()}}
> {{table.pxi in pyarrow.lib.Table.to_pandas()}}
> {{~/envs/cli3/lib/python3.6/site-packages/pyarrow/pandas_compat.py in table_to_blockmanager(options, table, memory_pool, nthreads, categories)}}
> {{ 529 # There must be the same number of field names and physical names}}
> {{ 530 # (fields in the arrow Table)}}
> {{--> 531 assert len(logical_index_names) == len(index_columns_set)}}
> {{ 532 }}
> {{ 533 # It can never be the case in a released version of pyarrow that}}
> {{AssertionError: }}
>
> Here's the file: [https://www.dropbox.com/s/oja3khjsc5tycfh/bug.parq]
> (I was not able to attach it here due to a "missing token", whatever that means.)

--
This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (ARROW-3324) [Python] Users reporting memory leaks using pa.pq.ParquetDataset
[ https://issues.apache.org/jira/browse/ARROW-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16665729#comment-16665729 ]

Wes McKinney commented on ARROW-3324:
-------------------------------------

Here's another memory leak report

{code}
import resource
import random
import string
import pyarrow as pa
import pyarrow.parquet as pq
import pandas as pd

def id_generator(size=6, chars=string.ascii_uppercase + string.digits):
    return ''.join(random.choice(chars) for _ in range(size))

schema = pa.schema([
    pa.field('test', pa.string()),
])

resource.setrlimit(resource.RLIMIT_NOFILE, (100, 100))

number_files = 1
number_rows_increment = 1000
number_iterations = 100

writers = [pq.ParquetWriter('test_'+id_generator()+'.parquet', schema) for i in range(number_files)]

for i in range(number_iterations):
    for writer in writers:
        table_to_write = pa.Table.from_pandas(
            pd.DataFrame({'test': [id_generator() for i in range(number_rows_increment)]}),
            preserve_index=False,
            schema = schema,
            nthreads = 1)
        table_to_write = table_to_write.replace_schema_metadata(None)
        writer.write_table(table_to_write)
    print(i)

for writer in writers:
    writer.close()
{code}

https://stackoverflow.com/questions/53016802/memory-leak-from-pyarrow

> [Python] Users reporting memory leaks using pa.pq.ParquetDataset
> ----------------------------------------------------------------
>
> Key: ARROW-3324
> URL: https://issues.apache.org/jira/browse/ARROW-3324
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Reporter: Wes McKinney
> Priority: Major
> Fix For: 0.12.0
>
> See:
> * https://github.com/apache/arrow/issues/2614
> * https://github.com/apache/arrow/issues/2624

--
This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3617) [R] support for decimal128
Romain François created ARROW-3617: -- Summary: [R] support for decimal128 Key: ARROW-3617 URL: https://issues.apache.org/jira/browse/ARROW-3617 Project: Apache Arrow Issue Type: New Feature Components: R Reporter: Romain François implemented as a vctrs record with one complex field so that it has the same layout -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Resolved] (ARROW-3307) [R] Convert chunked arrow::Column to R vector
[ https://issues.apache.org/jira/browse/ARROW-3307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Romain François resolved ARROW-3307. Resolution: Fixed > [R] Convert chunked arrow::Column to R vector > - > > Key: ARROW-3307 > URL: https://issues.apache.org/jira/browse/ARROW-3307 > Project: Apache Arrow > Issue Type: New Feature > Components: R >Reporter: Wes McKinney >Priority: Major > Fix For: 0.12.0 > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (ARROW-3616) [Java] checkstyle - fix remaining coding checks
[ https://issues.apache.org/jira/browse/ARROW-3616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-3616: -- Labels: pull-request-available (was: ) > [Java] checkstyle - fix remaining coding checks > --- > > Key: ARROW-3616 > URL: https://issues.apache.org/jira/browse/ARROW-3616 > Project: Apache Arrow > Issue Type: Sub-task > Components: Java >Reporter: Bryan Cutler >Assignee: Bryan Cutler >Priority: Major > Labels: pull-request-available > > enable the remaining style checks -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Closed] (ARROW-3307) [R] Convert chunked arrow::Column to R vector
[ https://issues.apache.org/jira/browse/ARROW-3307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Romain François closed ARROW-3307. -- > [R] Convert chunked arrow::Column to R vector > - > > Key: ARROW-3307 > URL: https://issues.apache.org/jira/browse/ARROW-3307 > Project: Apache Arrow > Issue Type: New Feature > Components: R >Reporter: Wes McKinney >Priority: Major > Fix For: 0.12.0 > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Resolved] (ARROW-3487) [Gandiva] simplify NULL_IF_NULL functions that can return errors
[ https://issues.apache.org/jira/browse/ARROW-3487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wes McKinney resolved ARROW-3487.
---------------------------------
Resolution: Fixed
Fix Version/s: 0.12.0

Issue resolved by pull request 2840
[https://github.com/apache/arrow/pull/2840]

> [Gandiva] simplify NULL_IF_NULL functions that can return errors
> ----------------------------------------------------------------
>
> Key: ARROW-3487
> URL: https://issues.apache.org/jira/browse/ARROW-3487
> Project: Apache Arrow
> Issue Type: Task
> Components: Gandiva
> Reporter: Pindikura Ravindra
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.12.0
>
> Time Spent: 1h 10m
> Remaining Estimate: 0h
>
> NULL_IF_NULL functions that can return errors (e.g. divide) currently check the validity bits inside each function, to avoid returning spurious errors.
>
> {code:java}
> divide(TYPE in1, boolean is_valid1, TYPE in2, boolean is_valid2, ..) {
>   if (!is_valid1 || !is_valid2) {
>     return 0;
>   }
>   if (in2 == 0) { /* set error */ }
> }
> {code}
>
> This validity check is duplicated across multiple functions and should be moved to the common layer (for all NULL_IF_NULL functions that can return an error).

--
This message was sent by Atlassian JIRA (v7.6.3#76005)
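The refactoring described in ARROW-3487 (hoisting the validity check out of each function into a shared layer) can be sketched as a wrapper that short-circuits on null inputs. This is an illustration in Python, not Gandiva's actual implementation: `null_if_null` is a hypothetical decorator, nulls are modelled as `None` instead of separate validity bits, and the error is raised as an exception rather than set on a context.

```python
from functools import wraps
from typing import Callable, Optional


def null_if_null(fn: Callable[..., int]) -> Callable[..., Optional[int]]:
    """Hypothetical common layer: return null if any input is null."""

    @wraps(fn)
    def wrapper(*args: Optional[int]) -> Optional[int]:
        if any(a is None for a in args):
            # Null in, null out: fn is never called with invalid inputs,
            # so it cannot report a spurious error for them.
            return None
        return fn(*args)

    return wrapper


@null_if_null
def divide(in1: int, in2: int) -> int:
    # No validity checks here; only the genuine error condition remains.
    if in2 == 0:
        raise ZeroDivisionError("divide by zero")
    return in1 // in2
```

For example, `divide(None, 0)` yields `None` without raising, because the wrapper returns before the divide-by-zero check runs, while `divide(1, 0)` still reports the error.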
[jira] [Resolved] (ARROW-3618) [Packaging/Documentation] Add `-c conda-forge` option to avoid PackagesNotFoundError
[ https://issues.apache.org/jira/browse/ARROW-3618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-3618. Resolution: Fixed Fix Version/s: 0.12.0 Issue resolved by pull request 2841 [https://github.com/apache/arrow/pull/2841] > [Packaging/Documentation] Add `-c conda-forge` option to avoid > PackagesNotFoundError > > > Key: ARROW-3618 > URL: https://issues.apache.org/jira/browse/ARROW-3618 > Project: Apache Arrow > Issue Type: Improvement > Components: Documentation, Packaging >Reporter: Krisztian Szucs >Assignee: Yasuo Honda >Priority: Minor > Labels: pull-request-available > Fix For: 0.12.0 > > Time Spent: 10m > Remaining Estimate: 0h > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (ARROW-3618) [Packaging/Documentation] Add `-c conda-forge` option to avoid PackagesNotFoundError
[ https://issues.apache.org/jira/browse/ARROW-3618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-3618: -- Labels: pull-request-available (was: ) > [Packaging/Documentation] Add `-c conda-forge` option to avoid > PackagesNotFoundError > > > Key: ARROW-3618 > URL: https://issues.apache.org/jira/browse/ARROW-3618 > Project: Apache Arrow > Issue Type: Improvement > Components: Documentation, Packaging >Reporter: Krisztian Szucs >Assignee: Yasuo Honda >Priority: Minor > Labels: pull-request-available > Fix For: 0.12.0 > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3619) [R] Expose global thread pool options
Wes McKinney created ARROW-3619: --- Summary: [R] Expose global thread pool options Key: ARROW-3619 URL: https://issues.apache.org/jira/browse/ARROW-3619 Project: Apache Arrow Issue Type: Improvement Components: R Reporter: Wes McKinney Fix For: 0.12.0 This will permit users to configure multithreading options, e.g. for conversions. See https://github.com/apache/arrow/blob/master/python/pyarrow/lib.pyx#L40 in Python -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3620) [Python] Document multithreading options in Sphinx and add to api.rst
Wes McKinney created ARROW-3620: --- Summary: [Python] Document multithreading options in Sphinx and add to api.rst Key: ARROW-3620 URL: https://issues.apache.org/jira/browse/ARROW-3620 Project: Apache Arrow Issue Type: Improvement Components: Python Reporter: Wes McKinney Fix For: 0.12.0 I was looking for these functions (like {{cpu_count}}) in api.rst and did not find them -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (ARROW-2969) [R] Convert between StructArray and "nested" data.frame column containing data frame in each cell
[ https://issues.apache.org/jira/browse/ARROW-2969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16664993#comment-16664993 ] Wes McKinney commented on ARROW-2969: - Having inner cells as lists seems like a reasonable start to me > [R] Convert between StructArray and "nested" data.frame column containing > data frame in each cell > - > > Key: ARROW-2969 > URL: https://issues.apache.org/jira/browse/ARROW-2969 > Project: Apache Arrow > Issue Type: New Feature > Components: R >Reporter: Wes McKinney >Priority: Major > > These are starting to be more widely used with the tidyr package and > elsewhere cc [~hadley] -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3618) [Packaging/Documentation] Add `-c conda-forge` option to avoid PackagesNotFoundError
Krisztian Szucs created ARROW-3618: -- Summary: [Packaging/Documentation] Add `-c conda-forge` option to avoid PackagesNotFoundError Key: ARROW-3618 URL: https://issues.apache.org/jira/browse/ARROW-3618 Project: Apache Arrow Issue Type: Improvement Components: Documentation, Packaging Reporter: Krisztian Szucs Assignee: Yasuo Honda -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Assigned] (ARROW-3487) [Gandiva] simplify NULL_IF_NULL functions that can return errors
[ https://issues.apache.org/jira/browse/ARROW-3487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wes McKinney reassigned ARROW-3487:
-----------------------------------
Assignee: Pindikura Ravindra

> [Gandiva] simplify NULL_IF_NULL functions that can return errors
> ----------------------------------------------------------------
>
> Key: ARROW-3487
> URL: https://issues.apache.org/jira/browse/ARROW-3487
> Project: Apache Arrow
> Issue Type: Task
> Components: Gandiva
> Reporter: Pindikura Ravindra
> Assignee: Pindikura Ravindra
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.12.0
>
> Time Spent: 1h 10m
> Remaining Estimate: 0h
>
> NULL_IF_NULL functions that can return errors (e.g. divide) currently check the validity bits inside each function, to avoid returning spurious errors.
>
> {code:java}
> divide(TYPE in1, boolean is_valid1, TYPE in2, boolean is_valid2, ..) {
>   if (!is_valid1 || !is_valid2) {
>     return 0;
>   }
>   if (in2 == 0) { /* set error */ }
> }
> {code}
>
> This validity check is duplicated across multiple functions and should be moved to the common layer (for all NULL_IF_NULL functions that can return an error).

--
This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Reopened] (ARROW-3307) [R] Convert chunked arrow::Column to R vector
[ https://issues.apache.org/jira/browse/ARROW-3307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reopened ARROW-3307: - Assignee: Romain François > [R] Convert chunked arrow::Column to R vector > - > > Key: ARROW-3307 > URL: https://issues.apache.org/jira/browse/ARROW-3307 > Project: Apache Arrow > Issue Type: New Feature > Components: R >Reporter: Wes McKinney >Assignee: Romain François >Priority: Major > Fix For: 0.12.0 > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Resolved] (ARROW-3307) [R] Convert chunked arrow::Column to R vector
[ https://issues.apache.org/jira/browse/ARROW-3307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-3307. - Resolution: Fixed > [R] Convert chunked arrow::Column to R vector > - > > Key: ARROW-3307 > URL: https://issues.apache.org/jira/browse/ARROW-3307 > Project: Apache Arrow > Issue Type: New Feature > Components: R >Reporter: Wes McKinney >Assignee: Romain François >Priority: Major > Fix For: 0.12.0 > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (ARROW-3307) [R] Convert chunked arrow::Column to R vector
[ https://issues.apache.org/jira/browse/ARROW-3307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16664974#comment-16664974 ] Wes McKinney commented on ARROW-3307: - Leaving in resolved status > [R] Convert chunked arrow::Column to R vector > - > > Key: ARROW-3307 > URL: https://issues.apache.org/jira/browse/ARROW-3307 > Project: Apache Arrow > Issue Type: New Feature > Components: R >Reporter: Wes McKinney >Assignee: Romain François >Priority: Major > Fix For: 0.12.0 > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (ARROW-2767) [JS] Add generic to Table for column names
[ https://issues.apache.org/jira/browse/ARROW-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wes McKinney updated ARROW-2767:
--------------------------------
Fix Version/s: (was: 0.12.0)
               JS-0.4.0

> [JS] Add generic to Table for column names
> ------------------------------------------
>
> Key: ARROW-2767
> URL: https://issues.apache.org/jira/browse/ARROW-2767
> Project: Apache Arrow
> Issue Type: Improvement
> Components: JavaScript
> Reporter: Brian Hulette
> Assignee: Brian Hulette
> Priority: Major
> Labels: pull-request-available
> Fix For: JS-0.4.0
>
> Time Spent: 1.5h
> Remaining Estimate: 0h
>
> Requested by [~domoritz]
> Something like:
> {code:javascript}
> class Table {
>   ...
>   getColumn(name: ColName): Vector {
>   }
>   ...
> }
> {code}
> It would be even better if we could find a way to map the column names to the actual vector data types, but one thing at a time.

--
This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Assigned] (ARROW-2767) [JS] Add generic to Table for column names
[ https://issues.apache.org/jira/browse/ARROW-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wes McKinney reassigned ARROW-2767:
-----------------------------------
Assignee: Brian Hulette (was: Paul Taylor)

> [JS] Add generic to Table for column names
> ------------------------------------------
>
> Key: ARROW-2767
> URL: https://issues.apache.org/jira/browse/ARROW-2767
> Project: Apache Arrow
> Issue Type: Improvement
> Components: JavaScript
> Reporter: Brian Hulette
> Assignee: Brian Hulette
> Priority: Major
> Labels: pull-request-available
> Fix For: JS-0.4.0
>
> Time Spent: 1.5h
> Remaining Estimate: 0h
>
> Requested by [~domoritz]
> Something like:
> {code:javascript}
> class Table {
>   ...
>   getColumn(name: ColName): Vector {
>   }
>   ...
> }
> {code}
> It would be even better if we could find a way to map the column names to the actual vector data types, but one thing at a time.

--
This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Resolved] (ARROW-2767) [JS] Add generic to Table for column names
[ https://issues.apache.org/jira/browse/ARROW-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Wes McKinney resolved ARROW-2767.
---------------------------------
Resolution: Fixed
Fix Version/s: (was: JS-0.4.0)
               0.12.0

Issue resolved by pull request 2256
[https://github.com/apache/arrow/pull/2256]

> [JS] Add generic to Table for column names
> ------------------------------------------
>
> Key: ARROW-2767
> URL: https://issues.apache.org/jira/browse/ARROW-2767
> Project: Apache Arrow
> Issue Type: Improvement
> Components: JavaScript
> Reporter: Brian Hulette
> Assignee: Paul Taylor
> Priority: Major
> Labels: pull-request-available
> Fix For: 0.12.0
>
> Time Spent: 1h 20m
> Remaining Estimate: 0h
>
> Requested by [~domoritz]
> Something like:
> {code:javascript}
> class Table {
>   ...
>   getColumn(name: ColName): Vector {
>   }
>   ...
> }
> {code}
> It would be even better if we could find a way to map the column names to the actual vector data types, but one thing at a time.

--
This message was sent by Atlassian JIRA (v7.6.3#76005)