[jira] [Updated] (ARROW-3519) [Gandiva] Add support for functions that can return variable len output

2018-10-26 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-3519:
--
Labels: pull-request-available  (was: )

> [Gandiva] Add support for functions that can return variable len output
> ---
>
> Key: ARROW-3519
> URL: https://issues.apache.org/jira/browse/ARROW-3519
> Project: Apache Arrow
>  Issue Type: Task
>  Components: Gandiva
>Reporter: Pindikura Ravindra
>Assignee: Pindikura Ravindra
>Priority: Major
>  Labels: pull-request-available
>
> This is a pre-requisite for ARROW-3459.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (ARROW-3625) [Go] add examples for Table, Record and {Table,Record}Reader

2018-10-26 Thread Sebastien Binet (JIRA)
Sebastien Binet created ARROW-3625:
--

 Summary: [Go] add examples for Table, Record and 
{Table,Record}Reader
 Key: ARROW-3625
 URL: https://issues.apache.org/jira/browse/ARROW-3625
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Go
Reporter: Sebastien Binet








[jira] [Created] (ARROW-3626) [Go] add a CSV TableReader

2018-10-26 Thread Sebastien Binet (JIRA)
Sebastien Binet created ARROW-3626:
--

 Summary: [Go] add a CSV TableReader
 Key: ARROW-3626
 URL: https://issues.apache.org/jira/browse/ARROW-3626
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Go
Reporter: Sebastien Binet


Assuming ARROW-3621 goes in, it should be relatively straightforward to 
implement a TableReader on top of a CSV file, using Go's `encoding/csv` package.

The C++ CSV reader can also serve as inspiration:
- https://github.com/apache/arrow/blob/master/cpp/src/arrow/csv/reader.h
- https://github.com/apache/arrow/blob/master/cpp/src/arrow/csv/reader.cc
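
The idea of reading a CSV file into column-oriented batches can be sketched as follows. This is only an illustration in Python using the standard `csv` module, not the proposed Go implementation; the function name `read_batches` and the `chunk_size` parameter are hypothetical, and values are kept as strings rather than converted to typed Arrow arrays.

```python
import csv
import io
from typing import Dict, Iterator, List


def read_batches(stream, chunk_size: int) -> Iterator[Dict[str, List[str]]]:
    """Yield column-oriented batches of at most chunk_size rows (hypothetical helper)."""
    reader = csv.reader(stream)
    header = next(reader, None)
    if header is None:
        return  # empty input: no header, so no batches
    columns = {name: [] for name in header}
    count = 0
    for row in reader:
        # Append each cell to the column it belongs to.
        for name, value in zip(header, row):
            columns[name].append(value)
        count += 1
        if count == chunk_size:
            yield columns
            columns = {name: [] for name in header}
            count = 0
    if count:
        yield columns  # final, possibly shorter, batch


batches = list(read_batches(io.StringIO("a,b\n1,x\n2,y\n3,z\n"), chunk_size=2))
```

A typed reader would additionally need a schema (or type inference) to turn each string column into an array of the right type.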





[jira] [Created] (ARROW-3624) [Python/C++] Support for zero-sized device buffers

2018-10-26 Thread Pearu Peterson (JIRA)
Pearu Peterson created ARROW-3624:
-

 Summary: [Python/C++] Support for zero-sized device buffers
 Key: ARROW-3624
 URL: https://issues.apache.org/jira/browse/ARROW-3624
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C++, Python
Reporter: Pearu Peterson


CUDA cuMemAlloc raises an error when asked to allocate zero bytes of device 
memory.
On the other hand, zero-sized buffers and arrays are important for generic 
programming.
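
One possible way to support this is to special-case the zero-size request before it reaches the driver. The sketch below is a plain-Python illustration under stated assumptions: `raw_alloc` is a hypothetical stand-in for a driver call that, like cuMemAlloc, rejects zero-byte requests, and `DeviceBuffer` and `allocate` are illustrative names, not Arrow API.

```python
from dataclasses import dataclass
from typing import Optional


def raw_alloc(nbytes: int) -> int:
    """Hypothetical stand-in for a driver call that rejects zero-byte requests."""
    if nbytes <= 0:
        raise ValueError("invalid allocation size: %d" % nbytes)
    return 0x1000  # placeholder address; a real driver returns a device pointer


@dataclass
class DeviceBuffer:
    address: Optional[int]  # None marks a buffer that owns no device memory
    size: int


def allocate(nbytes: int) -> DeviceBuffer:
    """Return a buffer of nbytes, skipping the driver call when nbytes is 0."""
    if nbytes < 0:
        raise ValueError("size must be non-negative")
    if nbytes == 0:
        return DeviceBuffer(address=None, size=0)
    return DeviceBuffer(address=raw_alloc(nbytes), size=nbytes)
```

With this shape, generic code can call `allocate(n)` for any `n >= 0` and treat the empty buffer like any other.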





[jira] [Assigned] (ARROW-3622) [Go] implement Schema.Equal

2018-10-26 Thread Sebastien Binet (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastien Binet reassigned ARROW-3622:
--

Assignee: Sebastien Binet

> [Go] implement Schema.Equal
> ---
>
> Key: ARROW-3622
> URL: https://issues.apache.org/jira/browse/ARROW-3622
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Go
>Reporter: Sebastien Binet
>Assignee: Sebastien Binet
>Priority: Major
>






[jira] [Assigned] (ARROW-3612) [Go] implement RecordBatch and RecordBatchReader

2018-10-26 Thread Sebastien Binet (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastien Binet reassigned ARROW-3612:
--

Assignee: Sebastien Binet

> [Go] implement RecordBatch and RecordBatchReader
> 
>
> Key: ARROW-3612
> URL: https://issues.apache.org/jira/browse/ARROW-3612
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Go
>Reporter: Sebastien Binet
>Assignee: Sebastien Binet
>Priority: Major
>






[jira] [Assigned] (ARROW-3584) [Go] add support for Table

2018-10-26 Thread Sebastien Binet (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3584?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastien Binet reassigned ARROW-3584:
--

Assignee: Sebastien Binet

> [Go] add support for Table
> --
>
> Key: ARROW-3584
> URL: https://issues.apache.org/jira/browse/ARROW-3584
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Go
>Reporter: Sebastien Binet
>Assignee: Sebastien Binet
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>






[jira] [Created] (ARROW-3621) [Go] implement TableBatchReader

2018-10-26 Thread Sebastien Binet (JIRA)
Sebastien Binet created ARROW-3621:
--

 Summary: [Go] implement TableBatchReader
 Key: ARROW-3621
 URL: https://issues.apache.org/jira/browse/ARROW-3621
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Go
Reporter: Sebastien Binet








[jira] [Updated] (ARROW-3624) [Python/C++] Support for zero-sized device buffers

2018-10-26 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-3624:
--
Labels: pull-request-available  (was: )

> [Python/C++] Support for zero-sized device buffers
> --
>
> Key: ARROW-3624
> URL: https://issues.apache.org/jira/browse/ARROW-3624
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++, Python
>Reporter: Pearu Peterson
>Priority: Major
>  Labels: pull-request-available
>
> CUDA cuMemAlloc raises an error when asked to allocate zero bytes of device 
> memory.
> On the other hand, zero-sized buffers and arrays are important for generic 
> programming.





[jira] [Assigned] (ARROW-3621) [Go] implement TableBatchReader

2018-10-26 Thread Sebastien Binet (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastien Binet reassigned ARROW-3621:
--

Assignee: Sebastien Binet

> [Go] implement TableBatchReader
> ---
>
> Key: ARROW-3621
> URL: https://issues.apache.org/jira/browse/ARROW-3621
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Go
>Reporter: Sebastien Binet
>Assignee: Sebastien Binet
>Priority: Major
>






[jira] [Assigned] (ARROW-3623) [Go] implement Field.Equal

2018-10-26 Thread Sebastien Binet (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sebastien Binet reassigned ARROW-3623:
--

Assignee: Sebastien Binet

> [Go] implement Field.Equal
> --
>
> Key: ARROW-3623
> URL: https://issues.apache.org/jira/browse/ARROW-3623
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Go
>Reporter: Sebastien Binet
>Assignee: Sebastien Binet
>Priority: Major
>






[jira] [Created] (ARROW-3622) [Go] implement Schema.Equal

2018-10-26 Thread Sebastien Binet (JIRA)
Sebastien Binet created ARROW-3622:
--

 Summary: [Go] implement Schema.Equal
 Key: ARROW-3622
 URL: https://issues.apache.org/jira/browse/ARROW-3622
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Go
Reporter: Sebastien Binet








[jira] [Assigned] (ARROW-3624) [Python/C++] Support for zero-sized device buffers

2018-10-26 Thread Pearu Peterson (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pearu Peterson reassigned ARROW-3624:
-

Assignee: Pearu Peterson

> [Python/C++] Support for zero-sized device buffers
> --
>
> Key: ARROW-3624
> URL: https://issues.apache.org/jira/browse/ARROW-3624
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++, Python
>Reporter: Pearu Peterson
>Assignee: Pearu Peterson
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> CUDA cuMemAlloc raises an error when asked to allocate zero bytes of device 
> memory.
> On the other hand, zero-sized buffers and arrays are important for generic 
> programming.





[jira] [Created] (ARROW-3627) [Go] add RecordBatchBuilder

2018-10-26 Thread Sebastien Binet (JIRA)
Sebastien Binet created ARROW-3627:
--

 Summary: [Go] add RecordBatchBuilder
 Key: ARROW-3627
 URL: https://issues.apache.org/jira/browse/ARROW-3627
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Go
Reporter: Sebastien Binet








[jira] [Created] (ARROW-3628) [R] Make decimal128 data available in R without requiring lossy conversions

2018-10-26 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-3628:
---

 Summary: [R] Make decimal128 data available in R without requiring 
lossy conversions
 Key: ARROW-3628
 URL: https://issues.apache.org/jira/browse/ARROW-3628
 Project: Apache Arrow
  Issue Type: New Feature
  Components: R
Reporter: Wes McKinney
Assignee: Romain François
 Fix For: 0.12.0








[jira] [Commented] (ARROW-3593) [R] CI builds failing due to GitHub API rate limits

2018-10-26 Thread Wes McKinney (JIRA)


[ 
https://issues.apache.org/jira/browse/ARROW-3593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16665486#comment-16665486
 ] 

Wes McKinney commented on ARROW-3593:
-

This is still ongoing

{code}
* DONE (devtools)
The downloaded source packages are in
‘/tmp/Rtmp4MyGYN/downloaded_packages’
2.85s$ Rscript -e 'deps <- devtools::dev_package_deps(dependencies = 
NA);devtools::install_deps(dependencies = TRUE);if (!all(deps$package %in% 
installed.packages())) { message("missing: ", paste(setdiff(deps$package, 
installed.packages()), collapse=", ")); q(status = 1, save = "no")}'
Error: HTTP error 403.
  API rate limit exceeded for 35.193.7.13. (But here's the good news: 
Authenticated requests get a higher rate limit. Check out the documentation for 
more details.)
  Rate limit remaining: 0
  Rate limit reset at: 2018-10-26 17:17:46 UTC
Execution halted
The command "Rscript -e 'deps <- devtools::dev_package_deps(dependencies = 
NA);devtools::install_deps(dependencies = TRUE);if (!all(deps$package %in% 
installed.packages())) { message("missing: ", paste(setdiff(deps$package, 
installed.packages()), collapse=", ")); q(status = 1, save = "no")}'" failed 
and exited with 1 during .
{code}

I suggest moving R to the "allowed failures" section until this resolves

> [R] CI builds failing due to GitHub API rate limits
> ---
>
> Key: ARROW-3593
> URL: https://issues.apache.org/jira/browse/ARROW-3593
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: R
>Reporter: Wes McKinney
>Priority: Major
> Fix For: 0.12.0
>
>
> Could be due to other GitHub issues of late. [~romainfrancois] 
> [~javierluraschi] could you have a look?
> https://travis-ci.org/apache/arrow/jobs/445003873#L2325





[jira] [Created] (ARROW-3623) [Go] implement Field.Equal

2018-10-26 Thread Sebastien Binet (JIRA)
Sebastien Binet created ARROW-3623:
--

 Summary: [Go] implement Field.Equal
 Key: ARROW-3623
 URL: https://issues.apache.org/jira/browse/ARROW-3623
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Go
Reporter: Sebastien Binet








[jira] [Created] (ARROW-3629) [Python] Add write_to_dataset to Python Sphinx API listing

2018-10-26 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-3629:
---

 Summary: [Python] Add write_to_dataset to Python Sphinx API listing
 Key: ARROW-3629
 URL: https://issues.apache.org/jira/browse/ARROW-3629
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Python
Reporter: Wes McKinney
 Fix For: 0.12.0








[jira] [Commented] (ARROW-3593) [R] CI builds failing due to GitHub API rate limits

2018-10-26 Thread Wes McKinney (JIRA)


[ 
https://issues.apache.org/jira/browse/ARROW-3593?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16665493#comment-16665493
 ] 

Wes McKinney commented on ARROW-3593:
-

See https://github.com/apache/arrow/pull/2846

> [R] CI builds failing due to GitHub API rate limits
> ---
>
> Key: ARROW-3593
> URL: https://issues.apache.org/jira/browse/ARROW-3593
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: R
>Reporter: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.12.0
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Could be due to other GitHub issues of late. [~romainfrancois] 
> [~javierluraschi] could you have a look?
> https://travis-ci.org/apache/arrow/jobs/445003873#L2325





[jira] [Updated] (ARROW-3593) [R] CI builds failing due to GitHub API rate limits

2018-10-26 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3593?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-3593:
--
Labels: pull-request-available  (was: )

> [R] CI builds failing due to GitHub API rate limits
> ---
>
> Key: ARROW-3593
> URL: https://issues.apache.org/jira/browse/ARROW-3593
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: R
>Reporter: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.12.0
>
>
> Could be due to other GitHub issues of late. [~romainfrancois] 
> [~javierluraschi] could you have a look?
> https://travis-ci.org/apache/arrow/jobs/445003873#L2325





[jira] [Updated] (ARROW-3628) [R] Make decimal128 data available in R without requiring lossy conversions

2018-10-26 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3628?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-3628:
--
Labels: pull-request-available  (was: )

> [R] Make decimal128 data available in R without requiring lossy conversions
> ---
>
> Key: ARROW-3628
> URL: https://issues.apache.org/jira/browse/ARROW-3628
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: R
>Reporter: Wes McKinney
>Assignee: Romain François
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.12.0
>
>






[jira] [Created] (ARROW-3630) [Plasma] [GLib] Add GLib bindings of Plasma

2018-10-26 Thread Yosuke Shiro (JIRA)
Yosuke Shiro created ARROW-3630:
---

 Summary: [Plasma] [GLib] Add GLib bindings of Plasma
 Key: ARROW-3630
 URL: https://issues.apache.org/jira/browse/ARROW-3630
 Project: Apache Arrow
  Issue Type: New Feature
  Components: GLib, Plasma (C++)
Reporter: Yosuke Shiro
Assignee: Yosuke Shiro
 Fix For: 0.12.0








[jira] [Commented] (ARROW-2592) [Python] AssertionError in to_pandas()

2018-10-26 Thread Wes McKinney (JIRA)


[ 
https://issues.apache.org/jira/browse/ARROW-2592?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16665710#comment-16665710
 ] 

Wes McKinney commented on ARROW-2592:
-

I'm not quite sure what can be done here. We might have to add an option to 
ignore the pandas metadata, if any

> [Python] AssertionError in to_pandas()
> --
>
> Key: ARROW-2592
> URL: https://issues.apache.org/jira/browse/ARROW-2592
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Python
>Affects Versions: 0.8.0, 0.9.0, 0.10.0, 0.11.0, 0.11.1
>Reporter: Dima Ryazanov
>Priority: Major
> Fix For: 0.12.0
>
>
> Pyarrow 0.8 and 0.9 raise an AssertionError for one of the datasets I have 
> (created using an older version of pyarrow). Repro steps:
> {{In [1]: from pyarrow.parquet import ParquetDataset}}
> {{In [2]: d = ParquetDataset(['bug.parq'])}}
> {{In [3]: t = d.read()}}
> {{In [4]: t.to_pandas()}}
> {{---}}
> {{AssertionError    Traceback (most recent call 
> last)}}
> {{ in ()}}
> {{> 1 t.to_pandas()}}
> {{table.pxi in pyarrow.lib.Table.to_pandas()}}
> {{~/envs/cli3/lib/python3.6/site-packages/pyarrow/pandas_compat.py in 
> table_to_blockmanager(options, table, memory_pool, nthreads, categories)}}
> {{    529 # There must be the same number of field names and physical 
> names}}
> {{    530 # (fields in the arrow Table)}}
> {{--> 531 assert len(logical_index_names) == len(index_columns_set)}}
> {{    532 }}
> {{    533 # It can never be the case in a released version of pyarrow 
> that}}
> {{AssertionError: }}
>  
> Here's the file: [https://www.dropbox.com/s/oja3khjsc5tycfh/bug.parq]
> (I was not able to attach it here due to a "missing token", whatever that 
> means.)





[jira] [Updated] (ARROW-2592) [Python] AssertionError in to_pandas()

2018-10-26 Thread Wes McKinney (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-2592?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney updated ARROW-2592:

Fix Version/s: 0.12.0

> [Python] AssertionError in to_pandas()
> --
>
> Key: ARROW-2592
> URL: https://issues.apache.org/jira/browse/ARROW-2592
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Python
>Affects Versions: 0.8.0, 0.9.0, 0.10.0, 0.11.0, 0.11.1
>Reporter: Dima Ryazanov
>Priority: Major
> Fix For: 0.12.0
>
>
> Pyarrow 0.8 and 0.9 raise an AssertionError for one of the datasets I have 
> (created using an older version of pyarrow). Repro steps:
> {{In [1]: from pyarrow.parquet import ParquetDataset}}
> {{In [2]: d = ParquetDataset(['bug.parq'])}}
> {{In [3]: t = d.read()}}
> {{In [4]: t.to_pandas()}}
> {{---}}
> {{AssertionError    Traceback (most recent call 
> last)}}
> {{ in ()}}
> {{> 1 t.to_pandas()}}
> {{table.pxi in pyarrow.lib.Table.to_pandas()}}
> {{~/envs/cli3/lib/python3.6/site-packages/pyarrow/pandas_compat.py in 
> table_to_blockmanager(options, table, memory_pool, nthreads, categories)}}
> {{    529 # There must be the same number of field names and physical 
> names}}
> {{    530 # (fields in the arrow Table)}}
> {{--> 531 assert len(logical_index_names) == len(index_columns_set)}}
> {{    532 }}
> {{    533 # It can never be the case in a released version of pyarrow 
> that}}
> {{AssertionError: }}
>  
> Here's the file: [https://www.dropbox.com/s/oja3khjsc5tycfh/bug.parq]
> (I was not able to attach it here due to a "missing token", whatever that 
> means.)





[jira] [Commented] (ARROW-3324) [Python] Users reporting memory leaks using pa.pq.ParquetDataset

2018-10-26 Thread Wes McKinney (JIRA)


[ 
https://issues.apache.org/jira/browse/ARROW-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16665729#comment-16665729
 ] 

Wes McKinney commented on ARROW-3324:
-

Here's another memory leak report

{code}
import resource
import random
import string
import pyarrow as pa
import pyarrow.parquet as pq
import pandas as pd


def id_generator(size=6, chars=string.ascii_uppercase + string.digits):
    return ''.join(random.choice(chars) for _ in range(size))

schema = pa.schema([
    pa.field('test', pa.string()),
])

resource.setrlimit(resource.RLIMIT_NOFILE, (100, 100))
number_files = 1
number_rows_increment = 1000
number_iterations = 100

writers = [pq.ParquetWriter('test_'+id_generator()+'.parquet', schema)
           for i in range(number_files)]

for i in range(number_iterations):
    for writer in writers:
        table_to_write = pa.Table.from_pandas(
            pd.DataFrame({'test': [id_generator()
                                   for i in range(number_rows_increment)]}),
            preserve_index=False,
            schema = schema,
            nthreads = 1)
        table_to_write = table_to_write.replace_schema_metadata(None)
        writer.write_table(table_to_write)
    print(i)

for writer in writers:
    writer.close()
{code}

https://stackoverflow.com/questions/53016802/memory-leak-from-pyarrow

> [Python] Users reporting memory leaks using pa.pq.ParquetDataset
> 
>
> Key: ARROW-3324
> URL: https://issues.apache.org/jira/browse/ARROW-3324
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Python
>Reporter: Wes McKinney
>Priority: Major
> Fix For: 0.12.0
>
>
> See:
> * https://github.com/apache/arrow/issues/2614
> * https://github.com/apache/arrow/issues/2624





[jira] [Created] (ARROW-3617) [R] support for decimal128

2018-10-26 Thread JIRA
Romain François created ARROW-3617:
--

 Summary: [R] support for decimal128
 Key: ARROW-3617
 URL: https://issues.apache.org/jira/browse/ARROW-3617
 Project: Apache Arrow
  Issue Type: New Feature
  Components: R
Reporter: Romain François


implemented as a vctrs record with one complex field so that it has the same 
layout





[jira] [Resolved] (ARROW-3307) [R] Convert chunked arrow::Column to R vector

2018-10-26 Thread JIRA


 [ 
https://issues.apache.org/jira/browse/ARROW-3307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Romain François resolved ARROW-3307.

Resolution: Fixed

> [R] Convert chunked arrow::Column to R vector
> -
>
> Key: ARROW-3307
> URL: https://issues.apache.org/jira/browse/ARROW-3307
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: R
>Reporter: Wes McKinney
>Priority: Major
> Fix For: 0.12.0
>
>






[jira] [Updated] (ARROW-3616) [Java] checkstyle - fix remaining coding checks

2018-10-26 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-3616:
--
Labels: pull-request-available  (was: )

> [Java] checkstyle - fix remaining coding checks
> ---
>
> Key: ARROW-3616
> URL: https://issues.apache.org/jira/browse/ARROW-3616
> Project: Apache Arrow
>  Issue Type: Sub-task
>  Components: Java
>Reporter: Bryan Cutler
>Assignee: Bryan Cutler
>Priority: Major
>  Labels: pull-request-available
>
> enable the remaining style checks





[jira] [Closed] (ARROW-3307) [R] Convert chunked arrow::Column to R vector

2018-10-26 Thread JIRA


 [ 
https://issues.apache.org/jira/browse/ARROW-3307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Romain François closed ARROW-3307.
--

> [R] Convert chunked arrow::Column to R vector
> -
>
> Key: ARROW-3307
> URL: https://issues.apache.org/jira/browse/ARROW-3307
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: R
>Reporter: Wes McKinney
>Priority: Major
> Fix For: 0.12.0
>
>






[jira] [Resolved] (ARROW-3487) [Gandiva] simplify NULL_IF_NULL functions that can return errors

2018-10-26 Thread Wes McKinney (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney resolved ARROW-3487.
-
   Resolution: Fixed
Fix Version/s: 0.12.0

Issue resolved by pull request 2840
[https://github.com/apache/arrow/pull/2840]

> [Gandiva] simplify NULL_IF_NULL functions that can return errors
> 
>
> Key: ARROW-3487
> URL: https://issues.apache.org/jira/browse/ARROW-3487
> Project: Apache Arrow
>  Issue Type: Task
>  Components: Gandiva
>Reporter: Pindikura Ravindra
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.12.0
>
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> NULL_IF_NULL functions that can return errors (e.g. divide) currently check 
> the validity bits inside each function, to avoid returning spurious errors.
>  
> {code:java}
> divide(TYPE in1, boolean is_valid1, TYPE in2, boolean is_valid2, ..) {
>   if (!is_valid1 || !is_valid2) {
>     return 0;
>   }
>   if (in2 == 0) { /* set error */ }
> }
> {code}
>  
> This validity check is duplicated across multiple functions and should be 
> moved to the common layer (for all NULL_IF_NULL functions that can return errors).
>  
>  
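
The refactoring described in the issue, moving the validity check out of each function and into a shared layer, can be illustrated with a small wrapper. This is a hedged Python sketch of the idea only, not Gandiva's implementation: `None` stands in for an invalid (null) input, and the decorator name `null_if_null` is illustrative.

```python
from typing import Callable, Optional


def null_if_null(fn: Callable[..., float]) -> Callable[..., Optional[float]]:
    """Wrap fn so it is only called when every argument is valid (not None)."""
    def wrapper(*args: Optional[float]) -> Optional[float]:
        # Common layer: a null input yields a null output, and fn never
        # runs, so it cannot raise a spurious error on garbage values.
        if any(arg is None for arg in args):
            return None
        return fn(*args)
    return wrapper


@null_if_null
def divide(in1: float, in2: float) -> float:
    # The function body now only handles its own error condition.
    if in2 == 0:
        raise ZeroDivisionError("divide by zero")
    return in1 / in2
```

Here `divide(None, 0)` returns `None` instead of raising, because the shared check runs before the division-by-zero test.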





[jira] [Resolved] (ARROW-3618) [Packaging/Documentation] Add `-c conda-forge` option to avoid PackagesNotFoundError

2018-10-26 Thread Krisztian Szucs (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Krisztian Szucs resolved ARROW-3618.

   Resolution: Fixed
Fix Version/s: 0.12.0

Issue resolved by pull request 2841
[https://github.com/apache/arrow/pull/2841]

> [Packaging/Documentation] Add `-c conda-forge` option to avoid 
> PackagesNotFoundError
> 
>
> Key: ARROW-3618
> URL: https://issues.apache.org/jira/browse/ARROW-3618
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Documentation, Packaging
>Reporter: Krisztian Szucs
>Assignee: Yasuo Honda
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 0.12.0
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>






[jira] [Updated] (ARROW-3618) [Packaging/Documentation] Add `-c conda-forge` option to avoid PackagesNotFoundError

2018-10-26 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-3618:
--
Labels: pull-request-available  (was: )

> [Packaging/Documentation] Add `-c conda-forge` option to avoid 
> PackagesNotFoundError
> 
>
> Key: ARROW-3618
> URL: https://issues.apache.org/jira/browse/ARROW-3618
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Documentation, Packaging
>Reporter: Krisztian Szucs
>Assignee: Yasuo Honda
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 0.12.0
>
>






[jira] [Created] (ARROW-3619) [R] Expose global thread pool options

2018-10-26 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-3619:
---

 Summary: [R] Expose global thread pool options
 Key: ARROW-3619
 URL: https://issues.apache.org/jira/browse/ARROW-3619
 Project: Apache Arrow
  Issue Type: Improvement
  Components: R
Reporter: Wes McKinney
 Fix For: 0.12.0


This will permit users to configure multithreading options e.g. for 
conversions.  See 
https://github.com/apache/arrow/blob/master/python/pyarrow/lib.pyx#L40 in Python





[jira] [Created] (ARROW-3620) [Python] Document multithreading options in Sphinx and add to api.rst

2018-10-26 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-3620:
---

 Summary: [Python] Document multithreading options in Sphinx and 
add to api.rst
 Key: ARROW-3620
 URL: https://issues.apache.org/jira/browse/ARROW-3620
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Python
Reporter: Wes McKinney
 Fix For: 0.12.0


I was looking for these functions (like {{cpu_count}}) in api.rst and did not 
find them





[jira] [Commented] (ARROW-2969) [R] Convert between StructArray and "nested" data.frame column containing data frame in each cell

2018-10-26 Thread Wes McKinney (JIRA)


[ 
https://issues.apache.org/jira/browse/ARROW-2969?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16664993#comment-16664993
 ] 

Wes McKinney commented on ARROW-2969:
-

Having inner cells as lists seems like a reasonable start to me

> [R] Convert between StructArray and "nested" data.frame column containing 
> data frame in each cell
> -
>
> Key: ARROW-2969
> URL: https://issues.apache.org/jira/browse/ARROW-2969
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: R
>Reporter: Wes McKinney
>Priority: Major
>
> These are starting to be more widely used with the tidyr package and 
> elsewhere cc [~hadley]



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (ARROW-3618) [Packaging/Documentation] Add `-c conda-forge` option to avoid PackagesNotFoundError

2018-10-26 Thread Krisztian Szucs (JIRA)
Krisztian Szucs created ARROW-3618:
--

 Summary: [Packaging/Documentation] Add `-c conda-forge` option to 
avoid PackagesNotFoundError
 Key: ARROW-3618
 URL: https://issues.apache.org/jira/browse/ARROW-3618
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Documentation, Packaging
Reporter: Krisztian Szucs
Assignee: Yasuo Honda








[jira] [Assigned] (ARROW-3487) [Gandiva] simplify NULL_IF_NULL functions that can return errors

2018-10-26 Thread Wes McKinney (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney reassigned ARROW-3487:
---

Assignee: Pindikura Ravindra

> [Gandiva] simplify NULL_IF_NULL functions that can return errors
> 
>
> Key: ARROW-3487
> URL: https://issues.apache.org/jira/browse/ARROW-3487
> Project: Apache Arrow
>  Issue Type: Task
>  Components: Gandiva
>Reporter: Pindikura Ravindra
>Assignee: Pindikura Ravindra
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.12.0
>
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> NULL_IF_NULL functions that can return errors (e.g. divide) currently check 
> the validity bits inside each function, to avoid returning spurious errors.
>  
> {code:java}
> divide(TYPE in1, boolean is_valid1, TYPE in2, boolean is_valid2, ..) {
>   if (!is_valid1 || !is_valid2) {
>     return 0;
>   }
>   if (in2 == 0) { /* set error */ }
> }
> {code}
>  
> This validity check is duplicated across multiple functions and should be 
> moved to the common layer (for all NULL_IF_NULL functions that can return errors).
>  
>  





[jira] [Reopened] (ARROW-3307) [R] Convert chunked arrow::Column to R vector

2018-10-26 Thread Wes McKinney (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney reopened ARROW-3307:
-
  Assignee: Romain François

> [R] Convert chunked arrow::Column to R vector
> -
>
> Key: ARROW-3307
> URL: https://issues.apache.org/jira/browse/ARROW-3307
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: R
>Reporter: Wes McKinney
>Assignee: Romain François
>Priority: Major
> Fix For: 0.12.0
>
>






[jira] [Resolved] (ARROW-3307) [R] Convert chunked arrow::Column to R vector

2018-10-26 Thread Wes McKinney (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney resolved ARROW-3307.
-
Resolution: Fixed

> [R] Convert chunked arrow::Column to R vector
> -
>
> Key: ARROW-3307
> URL: https://issues.apache.org/jira/browse/ARROW-3307
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: R
>Reporter: Wes McKinney
>Assignee: Romain François
>Priority: Major
> Fix For: 0.12.0
>
>






[jira] [Commented] (ARROW-3307) [R] Convert chunked arrow::Column to R vector

2018-10-26 Thread Wes McKinney (JIRA)


[ 
https://issues.apache.org/jira/browse/ARROW-3307?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16664974#comment-16664974
 ] 

Wes McKinney commented on ARROW-3307:
-

Leaving in resolved status

> [R] Convert chunked arrow::Column to R vector
> -
>
> Key: ARROW-3307
> URL: https://issues.apache.org/jira/browse/ARROW-3307
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: R
>Reporter: Wes McKinney
>Assignee: Romain François
>Priority: Major
> Fix For: 0.12.0
>
>






[jira] [Updated] (ARROW-2767) [JS] Add generic to Table for column names

2018-10-26 Thread Wes McKinney (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney updated ARROW-2767:

Fix Version/s: (was: 0.12.0)
   JS-0.4.0

> [JS] Add generic to Table for column names
> --
>
> Key: ARROW-2767
> URL: https://issues.apache.org/jira/browse/ARROW-2767
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: JavaScript
>Reporter: Brian Hulette
>Assignee: Brian Hulette
>Priority: Major
>  Labels: pull-request-available
> Fix For: JS-0.4.0
>
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> Requested by [~domoritz]
> Something like:
> {code:javascript}
> class Table {
> ...
> getColumn(name: ColName): Vector {
> }
> ...
> }
> {code}
> It would be even better if we could find a way to map the column names to the 
> actual vector data types, but one thing at a time.





[jira] [Assigned] (ARROW-2767) [JS] Add generic to Table for column names

2018-10-26 Thread Wes McKinney (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney reassigned ARROW-2767:
---

Assignee: Brian Hulette  (was: Paul Taylor)

> [JS] Add generic to Table for column names
> --
>
> Key: ARROW-2767
> URL: https://issues.apache.org/jira/browse/ARROW-2767
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: JavaScript
>Reporter: Brian Hulette
>Assignee: Brian Hulette
>Priority: Major
>  Labels: pull-request-available
> Fix For: JS-0.4.0
>
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> Requested by [~domoritz]
> Something like:
> {code:javascript}
> class Table {
> ...
> getColumn(name: ColName): Vector {
> }
> ...
> }
> {code}
> It would be even better if we could find a way to map the column names to the 
> actual vector data types, but one thing at a time.





[jira] [Resolved] (ARROW-2767) [JS] Add generic to Table for column names

2018-10-26 Thread Wes McKinney (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-2767?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney resolved ARROW-2767.
-
   Resolution: Fixed
Fix Version/s: (was: JS-0.4.0)
   0.12.0

Issue resolved by pull request 2256
[https://github.com/apache/arrow/pull/2256]

> [JS] Add generic to Table for column names
> --
>
> Key: ARROW-2767
> URL: https://issues.apache.org/jira/browse/ARROW-2767
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: JavaScript
>Reporter: Brian Hulette
>Assignee: Paul Taylor
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.12.0
>
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> Requested by [~domoritz]
> Something like:
> {code:javascript}
> class Table {
> ...
> getColumn(name: ColName): Vector {
> }
> ...
> }
> {code}
> It would be even better if we could find a way to map the column names to the 
> actual vector data types, but one thing at a time.


