[jira] [Commented] (ARROW-9215) pyarrow parquet writer converts uint32 columns to int64

2020-06-25 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17146023#comment-17146023 ] Micah Kornfield commented on ARROW-9215: I think this goes back to Uwe's first po

[jira] [Commented] (ARROW-9215) pyarrow parquet writer converts uint32 columns to int64

2020-06-25 Thread Devavret Makkar (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17146007#comment-17146007 ] Devavret Makkar commented on ARROW-9215: The physical size requirements seem unre

[jira] [Assigned] (ARROW-8493) [C++] Create unified schema resolution code for Array reconstruction.

2020-06-25 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8493?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Micah Kornfield reassigned ARROW-8493: -- Assignee: Micah Kornfield > [C++] Create unified schema resolution code for Array reco

[jira] [Assigned] (ARROW-9132) [C++] Support unique kernel for dictionary type

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-9132: --- Assignee: (was: Wes McKinney) > [C++] Support unique kernel for dictionary type > --

[jira] [Resolved] (ARROW-8733) [C++][Dataset][Python] ParquetFileFragment should provide access to parquet FileMetadata

2020-06-25 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman resolved ARROW-8733. - Resolution: Fixed Issue resolved by pull request 7546 [https://github.com/apache/arrow/pull/7546]

[jira] [Assigned] (ARROW-8733) [C++][Dataset][Python] ParquetFileFragment should provide access to parquet FileMetadata

2020-06-25 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman reassigned ARROW-8733: --- Assignee: Ben Kietzman (was: Joris Van den Bossche) > [C++][Dataset][Python] ParquetFileFra

[jira] [Commented] (ARROW-9132) [C++] Support unique kernel for dictionary type

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145984#comment-17145984 ] Wes McKinney commented on ARROW-9132: - This is actually a good deal more complicated

[jira] [Updated] (ARROW-9132) [C++] Support unique kernel for dictionary type

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9132: Fix Version/s: (was: 1.0.0) 2.0.0 > [C++] Support unique kernel for dictiona

[jira] [Comment Edited] (ARROW-9229) [Python] Pyarrow.Parquet.read_table Silently Crashes Python

2020-06-25 Thread Josh Dimarsky (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145945#comment-17145945 ] Josh Dimarsky edited comment on ARROW-9229 at 6/26/20, 12:18 AM: --

[jira] [Comment Edited] (ARROW-9229) [Python] Pyarrow.Parquet.read_table Silently Crashes Python

2020-06-25 Thread Josh Dimarsky (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145945#comment-17145945 ] Josh Dimarsky edited comment on ARROW-9229 at 6/26/20, 12:15 AM: --

[jira] [Commented] (ARROW-9229) [Python] Pyarrow.Parquet.read_table Silently Crashes Python

2020-06-25 Thread Josh Dimarsky (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145945#comment-17145945 ] Josh Dimarsky commented on ARROW-9229: -- Weird... I can't reproduce it on my laptop a

[jira] [Resolved] (ARROW-9216) [C++][Parquet] Use BitBlockCounter for plain spaced encoding/decoding

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-9216. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7531 [https://github

[jira] [Updated] (ARROW-9216) [C++] Use BitBlockCounter for plain spaced encoding/decoding

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9216: Component/s: C++ > [C++] Use BitBlockCounter for plain spaced encoding/decoding > -

[jira] [Updated] (ARROW-9216) [C++][Parquet] Use BitBlockCounter for plain spaced encoding/decoding

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9216: Summary: [C++][Parquet] Use BitBlockCounter for plain spaced encoding/decoding (was: [C++] Use Bit

[jira] [Commented] (ARROW-9229) [Python] Pyarrow.Parquet.read_table Silently Crashes Python

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145890#comment-17145890 ] Wes McKinney commented on ARROW-9229: - Any information you can provide about the hard

[jira] [Updated] (ARROW-9229) [Python] Pyarrow.Parquet.read_table Silently Crashes Python

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-9229: Summary: [Python] Pyarrow.Parquet.read_table Silently Crashes Python (was: Pyarrow.Parquet.read_ta

[jira] [Commented] (ARROW-9228) [Python][CI] Always run pytest verbosely

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145888#comment-17145888 ] Wes McKinney commented on ARROW-9228: - I'm not fond of it either. We could turn it on

[jira] [Commented] (ARROW-8301) [C++][Python][R] Handle ChunkedArray and Table in C data interface

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145887#comment-17145887 ] Wes McKinney commented on ARROW-8301: - [~apitrou] I agree that it would need error si

[jira] [Updated] (ARROW-9219) [R] coerce_timestamps in Parquet write options does not work

2020-06-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9219: -- Labels: pull-request-available (was: ) > [R] coerce_timestamps in Parquet write options does n

[jira] [Updated] (ARROW-9230) [FlightRPC][Python] flight.connect() doesn't pass through all arguments

2020-06-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9230: -- Labels: pull-request-available (was: ) > [FlightRPC][Python] flight.connect() doesn't pass thr

[jira] [Commented] (ARROW-9229) Pyarrow.Parquet.read_table Silently Crashes Python

2020-06-25 Thread Josh Dimarsky (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145705#comment-17145705 ] Josh Dimarsky commented on ARROW-9229: -- Thanks, you are correct, my bad. > Pyarrow.

[jira] [Commented] (ARROW-7939) [Python] crashes when reading parquet file compressed with snappy

2020-06-25 Thread Josh Dimarsky (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7939?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145704#comment-17145704 ] Josh Dimarsky commented on ARROW-7939: -- As [~uwe] noted, I have the same issue, acci

[jira] [Commented] (ARROW-9229) Pyarrow.Parquet.read_table Silently Crashes Python

2020-06-25 Thread Uwe Korn (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145702#comment-17145702 ] Uwe Korn commented on ARROW-9229: - This is probably a duplicate of https://issues.apache.

[jira] [Commented] (ARROW-9215) pyarrow parquet writer converts uint32 columns to int64

2020-06-25 Thread Uwe Korn (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145701#comment-17145701 ] Uwe Korn commented on ARROW-9215: - No, for uint8 and uint16 it doesn't make a difference

[jira] [Created] (ARROW-9230) [FlightRPC][Python] flight.connect() doesn't pass through all arguments

2020-06-25 Thread David Li (Jira)
David Li created ARROW-9230: --- Summary: [FlightRPC][Python] flight.connect() doesn't pass through all arguments Key: ARROW-9230 URL: https://issues.apache.org/jira/browse/ARROW-9230 Project: Apache Arrow

[jira] [Updated] (ARROW-9229) Pyarrow.Parquet.read_table Silently Crashes Python

2020-06-25 Thread Josh Dimarsky (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Dimarsky updated ARROW-9229: - Environment: Windows 10 Pro 1909 (was: Windows 10 1909) > Pyarrow.Parquet.read_table Silently Cr

[jira] [Updated] (ARROW-9229) Pyarrow.Parquet.read_table Silently Crashes Python

2020-06-25 Thread Josh Dimarsky (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Dimarsky updated ARROW-9229: - Affects Version/s: (was: 0.17.1) > Pyarrow.Parquet.read_table Silently Crashes Python > -

[jira] [Updated] (ARROW-9229) Pyarrow.Parquet.read_table Silently Crashes Python

2020-06-25 Thread Josh Dimarsky (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Dimarsky updated ARROW-9229: - Environment: Windows 10 1909 (was: Windows 10 1903) > Pyarrow.Parquet.read_table Silently Crashe

[jira] [Updated] (ARROW-9229) Pyarrow.Parquet.read_table Silently Crashes Python

2020-06-25 Thread Josh Dimarsky (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Dimarsky updated ARROW-9229: - Description: A simple use of reading a Parquet file using PyArrow crashes Python silently with n

[jira] [Updated] (ARROW-9229) Pyarrow.Parquet.read_table Silently Crashes Python

2020-06-25 Thread Josh Dimarsky (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Dimarsky updated ARROW-9229: - Description: A simple use of reading a Parquet file using PyArrow crashes Python silently with n

[jira] [Updated] (ARROW-9229) Pyarrow.Parquet.read_table Silently Crashes Python

2020-06-25 Thread Josh Dimarsky (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Dimarsky updated ARROW-9229: - Description: A simple use of reading a Parquet file using PyArrow crashes Python silently with n

[jira] [Updated] (ARROW-9229) Pyarrow.Parquet.read_table Silently Crashes Python

2020-06-25 Thread Josh Dimarsky (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Josh Dimarsky updated ARROW-9229: - Description: A simple use of reading a Parquet file using PyArrow crashes Python silently with n

[jira] [Created] (ARROW-9229) Pyarrow.Parquet.read_table Silently Crashes Python

2020-06-25 Thread Josh Dimarsky (Jira)
Josh Dimarsky created ARROW-9229: Summary: Pyarrow.Parquet.read_table Silently Crashes Python Key: ARROW-9229 URL: https://issues.apache.org/jira/browse/ARROW-9229 Project: Apache Arrow Issue

[jira] [Resolved] (ARROW-1682) [Python] Add documentation / example for reading a directory of Parquet files on S3

2020-06-25 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-1682. --- Resolution: Fixed Issue resolved by pull request 7517 [https://github.com/apa

[jira] [Commented] (ARROW-9226) [Python] pyarrow.fs.HadoopFileSystem - retrieve options from core-site.xml or hdfs-site.xml if available

2020-06-25 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145070#comment-17145070 ] Antoine Pitrou commented on ARROW-9226: --- > 'Legacy' pyarrow.hdfs.connect was someho

[jira] [Updated] (ARROW-9226) [Python] pyarrow.fs.HadoopFileSystem - retrieve options from core-site.xml or hdfs-site.xml if available

2020-06-25 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9226: -- Component/s: C++ > [Python] pyarrow.fs.HadoopFileSystem - retrieve options from core-site.xml o

[jira] [Updated] (ARROW-1682) [Python] Add documentation / example for reading a directory of Parquet files on S3

2020-06-25 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-1682: -- Component/s: (was: C++) > [Python] Add documentation / example for reading

[jira] [Updated] (ARROW-9228) [Python][CI] Always run pytest verbosely

2020-06-25 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ben Kietzman updated ARROW-9228: Fix Version/s: 2.0.0 > [Python][CI] Always run pytest verbosely > -

[jira] [Commented] (ARROW-9228) [Python][CI] Always run pytest verbosely

2020-06-25 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145064#comment-17145064 ] Antoine Pitrou commented on ARROW-9228: --- Not fond of this. This also usually makes

[jira] [Resolved] (ARROW-842) [Python] Handle more kinds of null sentinel objects from pandas 0.x

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-842. Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7537 [https://github.co

[jira] [Assigned] (ARROW-842) [Python] Handle more kinds of null sentinel objects from pandas 0.x

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-842: -- Assignee: Wes McKinney > [Python] Handle more kinds of null sentinel objects from pandas 0.x >

[jira] [Created] (ARROW-9228) [Python][CI] Always run pytest verbosely

2020-06-25 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-9228: --- Summary: [Python][CI] Always run pytest verbosely Key: ARROW-9228 URL: https://issues.apache.org/jira/browse/ARROW-9228 Project: Apache Arrow Issue Type: Impro

[jira] [Commented] (ARROW-8301) [C++][Python][R] Handle ChunkedArray and Table in C data interface

2020-06-25 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145055#comment-17145055 ] Antoine Pitrou commented on ARROW-8301: --- [~wesm] I wonder if an iteration API would

[jira] [Created] (ARROW-9227) [Python][Dataset] Write a custom field to _metadata caching file size

2020-06-25 Thread Ben Kietzman (Jira)
Ben Kietzman created ARROW-9227: --- Summary: [Python][Dataset] Write a custom field to _metadata caching file size Key: ARROW-9227 URL: https://issues.apache.org/jira/browse/ARROW-9227 Project: Apache Arr

[jira] [Comment Edited] (ARROW-8301) [C++][Python][R] Handle ChunkedArray and Table in C data interface

2020-06-25 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145055#comment-17145055 ] Antoine Pitrou edited comment on ARROW-8301 at 6/25/20, 4:08 PM: --

[jira] [Updated] (ARROW-8950) [C++] Make head optional in s3fs

2020-06-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-8950: -- Labels: pull-request-available (was: ) > [C++] Make head optional in s3fs > --

[jira] [Updated] (ARROW-8950) [C++] Make head optional in s3fs

2020-06-25 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8950?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-8950: -- Fix Version/s: 1.0.0 > [C++] Make head optional in s3fs > > >

[jira] [Updated] (ARROW-9139) [Python] parquet read_table should not use_legacy_dataset

2020-06-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9139?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9139: -- Labels: dataset-parquet-read parquet pull-request-available (was: dataset-parquet-read parquet

[jira] [Commented] (ARROW-9219) [R] coerce_timestamps in Parquet write options does not work

2020-06-25 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145019#comment-17145019 ] Neal Richardson commented on ARROW-9219: Have you reported that bug to Dremio? I

[jira] [Comment Edited] (ARROW-8301) [C++][Python][R] Handle ChunkedArray and Table in C data interface

2020-06-25 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145016#comment-17145016 ] Neal Richardson edited comment on ARROW-8301 at 6/25/20, 3:23 PM: -

[jira] [Commented] (ARROW-8301) [C++][Python][R] Handle ChunkedArray and Table in C data interface

2020-06-25 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17145016#comment-17145016 ] Neal Richardson commented on ARROW-8301: The use case I'm thinking of is: the Pyt

[jira] [Updated] (ARROW-7285) [C++] ensure C++ implementation meets clarified dictionary spec

2020-06-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7285?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7285: -- Labels: pull-request-available (was: ) > [C++] ensure C++ implementation meets clarified dicti

[jira] [Resolved] (ARROW-9225) [C++][Compute] Improve counting sort

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-9225. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7542 [https://github

[jira] [Assigned] (ARROW-8733) [C++][Dataset][Python] ParquetFileFragment should provide access to parquet FileMetadata

2020-06-25 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-8733: - Assignee: Joris Van den Bossche (was: Ben Kietzman) > [C++][Dataset][Py

[jira] [Commented] (ARROW-9147) [C++][Dataset] Support null -> other type promotion in Dataset scanning

2020-06-25 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17144972#comment-17144972 ] Francois Saint-Jacques commented on ARROW-9147: --- This one will require more

[jira] [Assigned] (ARROW-9147) [C++][Dataset] Support null -> other type promotion in Dataset scanning

2020-06-25 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-9147: - Assignee: (was: Francois Saint-Jacques) > [C++][Dataset] Support nul

[jira] [Updated] (ARROW-9147) [C++][Dataset] Support null -> other type promotion in Dataset scanning

2020-06-25 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques updated ARROW-9147: -- Fix Version/s: (was: 1.0.0) 2.0.0 > [C++][Dataset] Suppo

[jira] [Resolved] (ARROW-8504) [C++] Add Run Length Reader

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8504?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-8504. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7143 [https://github

[jira] [Resolved] (ARROW-9106) [C++] Add C++ foundation to ease file transcoding

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-9106. - Resolution: Fixed Issue resolved by pull request 7456 [https://github.com/apache/arrow/pull/7456]

[jira] [Comment Edited] (ARROW-9219) [R] coerce_timestamps in Parquet write options does not work

2020-06-25 Thread Slim Bentami (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17144950#comment-17144950 ] Slim Bentami edited comment on ARROW-9219 at 6/25/20, 2:06 PM:

[jira] [Commented] (ARROW-9219) [R] coerce_timestamps in Parquet write options does not work

2020-06-25 Thread Slim Bentami (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17144950#comment-17144950 ] Slim Bentami commented on ARROW-9219: - or perhaps there is a way around it in the mea

[jira] [Updated] (ARROW-9017) [Python] Refactor the Scalar classes

2020-06-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9017: -- Labels: pull-request-available (was: ) > [Python] Refactor the Scalar classes > --

[jira] [Assigned] (ARROW-9017) [Python] Refactor the Scalar classes

2020-06-25 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-9017: Assignee: Joris Van den Bossche > [Python] Refactor the Scalar classes > -

[jira] [Assigned] (ARROW-9017) [Python] Refactor the Scalar classes

2020-06-25 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-9017: Assignee: Krisztian Szucs (was: Joris Van den Bossche) > [Python] Refacto

[jira] [Updated] (ARROW-9017) [Python] Refactor the Scalar classes

2020-06-25 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9017: - Fix Version/s: 1.0.0 > [Python] Refactor the Scalar classes > ---

[jira] [Assigned] (ARROW-8733) [C++][Dataset][Python] ParquetFileFragment should provide access to parquet FileMetadata

2020-06-25 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8733?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-8733: Assignee: Ben Kietzman > [C++][Dataset][Python] ParquetFileFragment should

[jira] [Commented] (ARROW-8301) [C++][Python][R] Handle ChunkedArray and Table in C data interface

2020-06-25 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17144917#comment-17144917 ] Wes McKinney commented on ARROW-8301: - There's actually a more fundamental issue at p

[jira] [Updated] (ARROW-9221) ArrowBuf#setBytes(int, ByteBuffer) doesn't check the byte buffer's endianness

2020-06-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9221: -- Labels: pull-request-available (was: ) > ArrowBuf#setBytes(int, ByteBuffer) doesn't check the

[jira] [Commented] (ARROW-8301) [C++][Python][R] Handle ChunkedArray and Table in C data interface

2020-06-25 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17144890#comment-17144890 ] Antoine Pitrou commented on ARROW-8301: --- Could you elaborate on the use case? Is it

[jira] [Created] (ARROW-9226) [Python] pyarrow.fs.HadoopFileSystem - retrieve options from core-site.xml or hdfs-site.xml if available

2020-06-25 Thread Bruno Quinart (Jira)
Bruno Quinart created ARROW-9226: Summary: [Python] pyarrow.fs.HadoopFileSystem - retrieve options from core-site.xml or hdfs-site.xml if available Key: ARROW-9226 URL: https://issues.apache.org/jira/browse/ARROW-

[jira] [Closed] (ARROW-9213) [Archery] "archery benchmark list" fails cloning repo

2020-06-25 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou closed ARROW-9213. - Resolution: Duplicate > [Archery] "archery benchmark list" fails cloning repo > -

[jira] [Assigned] (ARROW-8927) [C++] Support dictionary memos when reading/writing record batches using cuda IPC

2020-06-25 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-8927: - Assignee: Alex Baden > [C++] Support dictionary memos when reading/writing record batche

[jira] [Resolved] (ARROW-8927) [C++] Support dictionary memos when reading/writing record batches using cuda IPC

2020-06-25 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-8927. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 7263 [https://gi

[jira] [Updated] (ARROW-9225) [C++][Compute] Improve counting sort

2020-06-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9225: -- Labels: pull-request-available (was: ) > [C++][Compute] Improve counting sort > --

[jira] [Resolved] (ARROW-9089) [Python] A PyFileSystem handler for fsspec-based filesystems

2020-06-25 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche resolved ARROW-9089. -- Resolution: Fixed Issue resolved by pull request 7395 [https://github.com/apach

[jira] [Created] (ARROW-9225) [C++][Compute] Improve counting sort

2020-06-25 Thread Yibo Cai (Jira)
Yibo Cai created ARROW-9225: --- Summary: [C++][Compute] Improve counting sort Key: ARROW-9225 URL: https://issues.apache.org/jira/browse/ARROW-9225 Project: Apache Arrow Issue Type: Improvement

[jira] [Updated] (ARROW-9224) [Dev][Archery] Copy local repo on clone failure

2020-06-25 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9224: -- Labels: pull-request-available (was: ) > [Dev][Archery] Copy local repo on clone failure > ---

[jira] [Created] (ARROW-9224) [Dev][Archery] Copy local repo on clone failure

2020-06-25 Thread Yibo Cai (Jira)
Yibo Cai created ARROW-9224: --- Summary: [Dev][Archery] Copy local repo on clone failure Key: ARROW-9224 URL: https://issues.apache.org/jira/browse/ARROW-9224 Project: Apache Arrow Issue Type: Improv

[jira] [Updated] (ARROW-9223) [Python] Fix to_pandas() export for timestamps within structs

2020-06-25 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9223: - Component/s: Python > [Python] Fix to_pandas() export for timestamps within struc

[jira] [Updated] (ARROW-9223) [Python] Fix to_pandas() export for timestamps within structs

2020-06-25 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-9223: - Summary: [Python] Fix to_pandas() export for timestamps within structs (was: Fix