[jira] [Updated] (ARROW-3133) [C++] Logical boolean kernels in kernels/boolean.cc cannot write into preallocated memory

2018-12-29 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-3133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-3133:
--
Labels: pull-request-available  (was: )

> [C++] Logical boolean kernels in kernels/boolean.cc cannot write into 
> preallocated memory
> -
>
> Key: ARROW-3133
> URL: https://issues.apache.org/jira/browse/ARROW-3133
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Micah Kornfield
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.13.0
>
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-2038) [Python] Follow-up bug fixes for s3fs Parquet support

2018-12-29 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-2038:
--
Labels: aws parquet pull-request-available  (was: aws parquet)

> [Python] Follow-up bug fixes for s3fs Parquet support
> -
>
> Key: ARROW-2038
> URL: https://issues.apache.org/jira/browse/ARROW-2038
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Python
>Reporter: Wes McKinney
>Priority: Major
>  Labels: aws, parquet, pull-request-available
> Fix For: 0.13.0
>
>
> see discussion in 
> https://github.com/apache/arrow/pull/916#issuecomment-360558248



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Resolved] (ARROW-4132) [GLib] Add more GArrowTable constructors

2018-12-29 Thread Yosuke Shiro (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yosuke Shiro resolved ARROW-4132.
-
   Resolution: Fixed
Fix Version/s: 0.12.0

Issue resolved by pull request 3285
[https://github.com/apache/arrow/pull/3285]

> [GLib] Add more GArrowTable constructors
> 
>
> Key: ARROW-4132
> URL: https://issues.apache.org/jira/browse/ARROW-4132
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: GLib
>Reporter: Kouhei Sutou
>Assignee: Kouhei Sutou
>Priority: Major
>  Labels: pull-request-available
> Fix For: 0.12.0
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Commented] (ARROW-1983) [Python] Add ability to write parquet `_metadata` file

2018-12-29 Thread Matthew Rocklin (JIRA)


[ 
https://issues.apache.org/jira/browse/ARROW-1983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16730844#comment-16730844
 ] 

Matthew Rocklin commented on ARROW-1983:


> If I understand correctly, we need to combine all of the row group metadata 
> for all files in a directory.

Yes.  Ideally when writing a row group we would get some metadata object in 
memory. We would then collect all of those objects and hand them to some 
`write_metadata` function afterwards.

> When a new file is written, does this file have to be updated?
 
Yes, or it can be removed/invalidated.
 
As a side note, this is probably one of a small number of issues that stop Dask 
Dataframe from using PyArrow by default.  Metadata files with full row group 
information are especially valuable for us, particularly with remote/cloud 
storage.  (I'm going through Dask's parquet handling now)

> [Python] Add ability to write parquet `_metadata` file
> --
>
> Key: ARROW-1983
> URL: https://issues.apache.org/jira/browse/ARROW-1983
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Python
>Reporter: Jim Crist
>Assignee: Robert Gruener
>Priority: Major
>  Labels: beginner, parquet
> Fix For: 0.13.0
>
>
> Currently {{pyarrow.parquet}} can only write the {{_common_metadata}} file 
> (mostly just schema information). It would be useful to add the ability to 
> write a {{_metadata}} file as well. This should include information about 
> each row group in the dataset, including summary statistics. Having this 
> summary file would allow filtering of row groups without needing to access 
> each file beforehand.
> This would require that the user is able to get the written RowGroups out of 
> a {{pyarrow.parquet.write_table}} call and then give these objects as a list 
> to new function that then passes them on as C++ objects to {{parquet-cpp}} 
> that generates the respective {{_metadata}} file.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4128) [C++][DOCUMENTATION] Update style guide to reflect some more exceptions

2018-12-29 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4128:
--
Labels: pull-request-available  (was: )

> [C++][DOCUMENTATION] Update style guide to reflect some more exceptions
> ---
>
> Key: ARROW-4128
> URL: https://issues.apache.org/jira/browse/ARROW-4128
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: C++, Documentation
>Reporter: Micah Kornfield
>Assignee: Micah Kornfield
>Priority: Minor
>  Labels: pull-request-available
>
> * It seems we are using doxygen style comments for method/class declaration 
> ("///")
>  * Use the macro NULLPTR ([https://arrow.apache.org/docs/cpp/macros_8h.html] 



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Updated] (ARROW-4132) [GLib] Add more GArrowTable constructors

2018-12-29 Thread ASF GitHub Bot (JIRA)


 [ 
https://issues.apache.org/jira/browse/ARROW-4132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-4132:
--
Labels: pull-request-available  (was: )

> [GLib] Add more GArrowTable constructors
> 
>
> Key: ARROW-4132
> URL: https://issues.apache.org/jira/browse/ARROW-4132
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: GLib
>Reporter: Kouhei Sutou
>Assignee: Kouhei Sutou
>Priority: Major
>  Labels: pull-request-available
>




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)


[jira] [Created] (ARROW-4132) [GLib] Add more GArrowTable constructors

2018-12-29 Thread Kouhei Sutou (JIRA)
Kouhei Sutou created ARROW-4132:
---

 Summary: [GLib] Add more GArrowTable constructors
 Key: ARROW-4132
 URL: https://issues.apache.org/jira/browse/ARROW-4132
 Project: Apache Arrow
  Issue Type: New Feature
  Components: GLib
Reporter: Kouhei Sutou
Assignee: Kouhei Sutou






--
This message was sent by Atlassian JIRA
(v7.6.3#76005)