[jira] [Commented] (ARROW-6301) [Python] atexit: pyarrow.lib.ArrowKeyError: 'No type extension with name arrow.py_extension_type found'

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911943#comment-16911943 ] Wes McKinney commented on ARROW-6301: - It looks like a race condition at teardown, some additional

[jira] [Updated] (ARROW-6301) [Python] atexit: pyarrow.lib.ArrowKeyError: 'No type extension with name arrow.py_extension_type found'

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6301: Fix Version/s: 0.15.0 > [Python] atexit: pyarrow.lib.ArrowKeyError: 'No type extension with name

[jira] [Updated] (ARROW-6276) [C++] Add operator[] to some concrete Array implementations?

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6276: Fix Version/s: (was: 0.15.0) > [C++] Add operator[] to some concrete Array implementations? >

[jira] [Updated] (ARROW-6275) [C++] Deprecate RecordBatchReader::ReadNext

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6275?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6275: Fix Version/s: 0.15.0 > [C++] Deprecate RecordBatchReader::ReadNext >

[jira] [Updated] (ARROW-6227) [Python] pyarrow.array() shouldn't coerce np.nan to string

2019-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6227: -- Labels: pull-request-available (was: ) > [Python] pyarrow.array() shouldn't coerce np.nan to

[jira] [Commented] (ARROW-6300) [C++] Add io::OutputStream::Abort()

2019-08-20 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911928#comment-16911928 ] Micah Kornfield commented on ARROW-6300: Two questions: 1.  What is the use-case for such a

[jira] [Commented] (ARROW-6206) [Java][Docs] Document environment variables/java properties

2019-08-20 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911922#comment-16911922 ] Micah Kornfield commented on ARROW-6206: "iiuc arrow is a team that picked up netty derived

[jira] [Assigned] (ARROW-6227) [Python] pyarrow.array() shouldn't coerce np.nan to string

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6227?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6227: --- Assignee: Wes McKinney > [Python] pyarrow.array() shouldn't coerce np.nan to string >

[jira] [Closed] (ARROW-6223) [C++] Configuration error with Anaconda Python 3.7.4

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-6223. --- Resolution: Fixed Looks like this was fixed quickly in defaults > [C++] Configuration error with

[jira] [Updated] (ARROW-6222) [Python] Serialising numpy array yields `pyarrow.lib.ArrowNotImplementedError: list`

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6222: Summary: [Python] Serialising numpy array yields `pyarrow.lib.ArrowNotImplementedError: list`

[jira] [Updated] (ARROW-6178) [Developer] Don't fail in merge script on bad primary author input in multi-author PRs

2019-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6178: -- Labels: pull-request-available (was: ) > [Developer] Don't fail in merge script on bad

[jira] [Assigned] (ARROW-6178) [Developer] Don't fail in merge script on bad primary author input in multi-author PRs

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6178: --- Assignee: Wes McKinney > [Developer] Don't fail in merge script on bad primary author input

[jira] [Updated] (ARROW-6178) [Developer] Don't fail in merge script on bad primary author input in multi-author PRs

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6178: Fix Version/s: 0.15.0 > [Developer] Don't fail in merge script on bad primary author input in >

[jira] [Resolved] (ARROW-5985) [Developer] Do not suggest setting Fix Version for point releases in dev/merge_arrow_pr.py

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-5985. - Fix Version/s: (was: 1.0.0) 0.15.0 Resolution: Fixed Issue

[jira] [Updated] (ARROW-6174) [C++] Validate chunks in ChunkedArray::Validate

2019-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6174: -- Labels: parquet pull-request-available (was: parquet) > [C++] Validate chunks in

[jira] [Updated] (ARROW-6174) [C++] Validate chunks in ChunkedArray::Validate

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6174: Summary: [C++] Validate chunks in ChunkedArray::Validate (was: [C++] Parquet tests produce

[jira] [Assigned] (ARROW-6174) [C++] Parquet tests produce invalid array

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6174?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6174: --- Assignee: Wes McKinney > [C++] Parquet tests produce invalid array >

[jira] [Closed] (ARROW-6163) [C++] Misnamed test

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-6163. --- Resolution: Not A Problem This was handled in

[jira] [Updated] (ARROW-6159) [C++] PrettyPrint of arrow::Schema missing identation for first line

2019-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6159: -- Labels: beginner pull-request-available (was: beginner) > [C++] PrettyPrint of arrow::Schema

[jira] [Assigned] (ARROW-6159) [C++] PrettyPrint of arrow::Schema missing identation for first line

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6159: --- Assignee: Wes McKinney > [C++] PrettyPrint of arrow::Schema missing identation for first

[jira] [Resolved] (ARROW-6182) [R] Add note to README about r-arrow conda installation

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6182. - Resolution: Fixed Issue resolved by pull request 5142

[jira] [Resolved] (ARROW-6288) [Java] Implement TypeEqualsVisitor comparing vector type equals considering names and metadata

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6288?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6288. - Fix Version/s: 0.15.0 Resolution: Fixed Issue resolved by pull request 5119

[jira] [Resolved] (ARROW-5134) [R][CI] Run nightly tests against multiple R versions

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5134?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-5134. - Resolution: Fixed Issue resolved by pull request 5121

[jira] [Updated] (ARROW-6126) [C++] IPC stream reader handling of empty streams potentially not robust

2019-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6126: -- Labels: pull-request-available (was: ) > [C++] IPC stream reader handling of empty streams

[jira] [Assigned] (ARROW-6126) [C++] IPC stream reader handling of empty streams potentially not robust

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6126: --- Assignee: Wes McKinney > [C++] IPC stream reader handling of empty streams potentially not

[jira] [Updated] (ARROW-6031) [Java] Support iterating a vector by ArrowBufPointer

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6031: --- Component/s: Java > [Java] Support iterating a vector by ArrowBufPointer >

[jira] [Updated] (ARROW-6101) [Rust] [DataFusion] Create physical plan from logical plan

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6101?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6101: --- Component/s: Rust - DataFusion > [Rust] [DataFusion] Create physical plan from logical plan

[jira] [Updated] (ARROW-6216) [C++] Allow user to select the ZSTD compression level

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6216: --- Summary: [C++] Allow user to select the ZSTD compression level (was: Allow user to select

[jira] [Updated] (ARROW-6112) [Java] Update APIs to support 64-bit address space

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6112?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6112: --- Component/s: Java > [Java] Update APIs to support 64-bit address space >

[jira] [Resolved] (ARROW-6105) [C++][Parquet][Python] Add test case showing dictionary-encoded subfields in nested type

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6105. - Assignee: Wes McKinney Resolution: Fixed This was done in

[jira] [Updated] (ARROW-6078) [Java] Implement dictionary-encoded subfields for List type

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6078?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6078: --- Component/s: Java > [Java] Implement dictionary-encoded subfields for List type >

[jira] [Resolved] (ARROW-6095) [C++] Python subproject ignores ARROW_TEST_LINKAGE

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6095. - Resolution: Fixed Issue resolved by pull request 4982

[jira] [Updated] (ARROW-6092) [C++] Python 2.7: arrow_python_test failure

2019-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6092: -- Labels: pull-request-available test (was: test) > [C++] Python 2.7: arrow_python_test failure

[jira] [Assigned] (ARROW-6092) [C++] Python 2.7: arrow_python_test failure

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6092: --- Assignee: Wes McKinney > [C++] Python 2.7: arrow_python_test failure >

[jira] [Updated] (ARROW-6092) [C++] Python 2.7: arrow_python_test failure

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6092?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6092: Fix Version/s: 0.15.0 > [C++] Python 2.7: arrow_python_test failure >

[jira] [Updated] (ARROW-6225) [Website] Update arrow-site/README and any other places to point website contributors in right direction

2019-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6225: -- Labels: pull-request-available (was: ) > [Website] Update arrow-site/README and any other

[jira] [Updated] (ARROW-6072) [C++] Implement casting List <-> LargeList

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6072: Fix Version/s: 1.0.0 > [C++] Implement casting List <-> LargeList >

[jira] [Updated] (ARROW-6071) [C++] Implement casting Binary <-> LargeBinary

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6071?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6071: Fix Version/s: 1.0.0 > [C++] Implement casting Binary <-> LargeBinary >

[jira] [Updated] (ARROW-6301) [Python] atexit: pyarrow.lib.ArrowKeyError: 'No type extension with name arrow.py_extension_type found'

2019-08-20 Thread Bryan Cutler (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6301?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryan Cutler updated ARROW-6301: Summary: [Python] atexit: pyarrow.lib.ArrowKeyError: 'No type extension with name

[jira] [Resolved] (ARROW-5444) [Release][Website] After 0.14 release, update what is an "official" release

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-5444. Fix Version/s: (was: 1.0.0) 0.15.0 Resolution: Fixed This

[jira] [Updated] (ARROW-6183) [R] Document that you don't have to use tidyselect if you don't want

2019-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6183: -- Labels: pull-request-available (was: ) > [R] Document that you don't have to use tidyselect

[jira] [Assigned] (ARROW-6183) [R] Document that you don't have to use tidyselect if you don't want

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-6183: -- Assignee: Neal Richardson > [R] Document that you don't have to use tidyselect if you

[jira] [Updated] (ARROW-6183) [R] Document that you don't have to use tidyselect if you don't want

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6183?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6183: --- Summary: [R] Document that you don't have to use tidyselect if you don't want (was: [R]

[jira] [Assigned] (ARROW-6049) [C++] Support using Array::View from compatible dictionary type to another

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6049: --- Assignee: Wes McKinney > [C++] Support using Array::View from compatible dictionary type to

[jira] [Resolved] (ARROW-6049) [C++] Support using Array::View from compatible dictionary type to another

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6049?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6049. - Fix Version/s: (was: 1.0.0) 0.15.0 Resolution: Fixed Issue

[jira] [Updated] (ARROW-6302) [Python] parquet categorical support doesn't preserve order

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6302: Fix Version/s: 0.15.0 > [Python] parquet categorical support doesn't preserve order >

[jira] [Commented] (ARROW-6302) [Python] parquet categorical support doesn't preserve order

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911760#comment-16911760 ] Wes McKinney commented on ARROW-6302: - Yeah this should be easy to fix. The changes need to be based

[jira] [Updated] (ARROW-6302) [Python][Parquet] Reading dictionary type with serialized Arrow schema does not restore "ordered" type property

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6302: Summary: [Python][Parquet] Reading dictionary type with serialized Arrow schema does not restore

[jira] [Commented] (ARROW-1644) [C++][Parquet] Read and write nested Parquet data with a mix of struct and list nesting levels

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911731#comment-16911731 ] Wes McKinney commented on ARROW-1644: - Note that contributing to other parts of the project helps

[jira] [Resolved] (ARROW-6125) [Python] Remove any APIs deprecated prior to 0.14.x

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6125. - Resolution: Fixed Issue resolved by pull request 5130

[jira] [Closed] (ARROW-6280) Add gandiva as a module to modules in java root pom

2019-08-20 Thread Rui Wang (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rui Wang closed ARROW-6280. --- Resolution: Not A Problem > Add gandiva as a module to modules in java root pom >

[jira] [Resolved] (ARROW-6290) [Rust] [DataFusion] sql_csv example errors when running

2019-08-20 Thread Paddy Horan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Paddy Horan resolved ARROW-6290. Resolution: Fixed Issue resolved by pull request 5136 [https://github.com/apache/arrow/pull/5136]

[jira] [Created] (ARROW-6303) [Rust] Add a feature to disable SIMD

2019-08-20 Thread Paddy Horan (Jira)
Paddy Horan created ARROW-6303: -- Summary: [Rust] Add a feature to disable SIMD Key: ARROW-6303 URL: https://issues.apache.org/jira/browse/ARROW-6303 Project: Apache Arrow Issue Type:

[jira] [Updated] (ARROW-4752) [Rust] Add explicit SIMD vectorization for the divide kernel

2019-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4752?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-4752: -- Labels: pull-request-available (was: ) > [Rust] Add explicit SIMD vectorization for the

[jira] [Updated] (ARROW-4390) [R] Serialize "labeled" metadata in Feather files, IPC messages

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-4390: --- Fix Version/s: (was: 0.15.0) > [R] Serialize "labeled" metadata in Feather files, IPC

[jira] [Commented] (ARROW-4390) [R] Serialize "labeled" metadata in Feather files, IPC messages

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911685#comment-16911685 ] Neal Richardson commented on ARROW-4390: See 

[jira] [Assigned] (ARROW-4390) [R] Serialize "labeled" metadata in Feather files, IPC messages

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4390?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-4390: -- Assignee: (was: Neal Richardson) > [R] Serialize "labeled" metadata in Feather

[jira] [Updated] (ARROW-6302) [Python] parquet categorical support doesn't preserve order

2019-08-20 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6302?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-6302: - Labels: parquet (was: ) > [Python] parquet categorical support doesn't preserve

[jira] [Commented] (ARROW-6293) [Rust] datafusion 0.15.0-SNAPSHOT error

2019-08-20 Thread Paddy Horan (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6293?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911677#comment-16911677 ] Paddy Horan commented on ARROW-6293: Hi 0.14.1 is the most recent version.  The README is misleading

[jira] [Commented] (ARROW-6302) [Python] parquet categorical support doesn't preserve order

2019-08-20 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911662#comment-16911662 ] Joris Van den Bossche commented on ARROW-6302: -- cc [~wesmckinn] This was catched from adding

[jira] [Created] (ARROW-6302) [Python] parquet categorical support doesn't preserve order

2019-08-20 Thread Galuh Sahid (Jira)
Galuh Sahid created ARROW-6302: -- Summary: [Python] parquet categorical support doesn't preserve order Key: ARROW-6302 URL: https://issues.apache.org/jira/browse/ARROW-6302 Project: Apache Arrow

[jira] [Updated] (ARROW-6182) [R] Add note to README about r-arrow conda installation

2019-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6182: -- Labels: pull-request-available (was: ) > [R] Add note to README about r-arrow conda

[jira] [Updated] (ARROW-6278) [R] Read parquet files from raw vector

2019-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6278: -- Labels: pull-request-available (was: ) > [R] Read parquet files from raw vector >

[jira] [Updated] (ARROW-6278) [R] Read parquet files from raw vector

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6278: --- Fix Version/s: 0.15.0 > [R] Read parquet files from raw vector >

[jira] [Commented] (ARROW-6278) [R] Read parquet files from raw vector

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911629#comment-16911629 ] Neal Richardson commented on ARROW-6278: It's easy enough to add. PR coming. > [R] Read parquet

[jira] [Updated] (ARROW-6278) [R] Read files from HDFS

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6278: --- Description: {{read_parquet}} currently handles a path to a local file or an Arrow input

[jira] [Assigned] (ARROW-6278) [R] Read parquet files from raw vector

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-6278: -- Assignee: Neal Richardson > [R] Read parquet files from raw vector >

[jira] [Updated] (ARROW-6278) [R] Read parquet files from raw vector

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6278?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6278: --- Summary: [R] Read parquet files from raw vector (was: [R] Read files from HDFS ) > [R]

[jira] [Resolved] (ARROW-5992) [C++] Array::View fails for string/utf8 as binary

2019-08-20 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques resolved ARROW-5992. --- Resolution: Fixed Issue resolved by pull request 5125

[jira] [Resolved] (ARROW-6067) [Python] Large memory test failures

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6067?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6067. - Fix Version/s: 0.15.0 Resolution: Fixed Issue resolved by pull request 5128

[jira] [Closed] (ARROW-6051) [C++][Python] Parquet float column of NaN writing performance regression from 0.13.0 to 0.14.1

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-6051. --- Resolution: Not A Problem So the plot thickens here a little bit Arrow 0.13.0 {code} import

[jira] [Updated] (ARROW-6238) [C++] Implement SimpleDataSource/SimpleDataFragment

2019-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6238: -- Labels: datasets pull-request-available (was: datasets) > [C++] Implement

[jira] [Updated] (ARROW-6229) [C++] Add a DataSource implementation which scans a directory

2019-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6229: -- Labels: pull-request-available (was: ) > [C++] Add a DataSource implementation which scans a

[jira] [Updated] (ARROW-6182) [R] Add note to README about r-arrow conda installation

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6182: --- Priority: Minor (was: Major) > [R] Add note to README about r-arrow conda installation >

[jira] [Updated] (ARROW-5501) [R] read/write_feather/arrow?

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-5501: --- Fix Version/s: (was: 0.15.0) 1.0.0 > [R] read/write_feather/arrow? >

[jira] [Updated] (ARROW-6214) [R] Sanitizer errors triggered via R bindings

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6214: --- Priority: Critical (was: Major) > [R] Sanitizer errors triggered via R bindings >

[jira] [Updated] (ARROW-6236) [R] Deduplicate strings using Arrow hash tables instead of passing all values through R's global hash table

2019-08-20 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6236?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-6236: --- Priority: Minor (was: Major) > [R] Deduplicate strings using Arrow hash tables instead of

[jira] [Closed] (ARROW-5993) [Python] Reading a dictionary column from Parquet results in disproportionate memory usage

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5993?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-5993. --- Resolution: Duplicate I confirmed that this is ARROW-6060. On master peak memory use reading _the

[jira] [Commented] (ARROW-5993) [Python] Reading a dictionary column from Parquet results in disproportionate memory usage

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5993?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911533#comment-16911533 ] Wes McKinney commented on ARROW-5993: - I was able to get the file from

[jira] [Updated] (ARROW-5888) [Python][C++] Add metadata to store Arrow time zones in Parquet file metadata

2019-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5888?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-5888: -- Labels: parquet pull-request-available (was: parquet) > [Python][C++] Add metadata to store

[jira] [Resolved] (ARROW-6161) [C++] Implements dataset::ParquetFile and associated Scan structures

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-6161. - Fix Version/s: 0.15.0 Resolution: Fixed Issue resolved by pull request 5083

[jira] [Updated] (ARROW-6058) [Python][Parquet] Failure when reading Parquet file from S3 with s3fs

2019-08-20 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6058: -- Labels: parquet pull-request-available (was: parquet) > [Python][Parquet] Failure when

[jira] [Updated] (ARROW-6058) [Python][Parquet] Failure when reading Parquet file from S3 with s3fs

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-6058: Summary: [Python][Parquet] Failure when reading Parquet file from S3 with s3fs (was:

[jira] [Commented] (ARROW-6058) [Python][Parquet] Failure when reading Parquet file from S3

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911468#comment-16911468 ] Wes McKinney commented on ARROW-6058: - It's a bug in s3fs. I found a minimal reproduction and

[jira] [Commented] (ARROW-2431) [Rust] Schema fidelity

2019-08-20 Thread Maximilian Roos (Jira)
[ https://issues.apache.org/jira/browse/ARROW-2431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911466#comment-16911466 ] Maximilian Roos commented on ARROW-2431: Yes - your call [~andygrove] - I've been out of this for

[jira] [Commented] (ARROW-6206) [Java][Docs] Document environment variables/java properties

2019-08-20 Thread Jim Northrup (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911465#comment-16911465 ] Jim Northrup commented on ARROW-6206: - > Could you provide a link to the text you quoted I'd be

[jira] [Commented] (ARROW-6298) [Rust] [CI] Examples are not being tested in CI

2019-08-20 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911462#comment-16911462 ] Krisztian Szucs commented on ARROW-6298: I'm on holiday so my availability is limited, but let me

[jira] [Commented] (ARROW-6298) [Rust] [CI] Examples are not being tested in CI

2019-08-20 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6298?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911458#comment-16911458 ] Krisztian Szucs commented on ARROW-6298: Add a ShellCommand to

[jira] [Commented] (ARROW-6301) atexit: pyarrow.lib.ArrowKeyError: 'No type extension with name arrow.py_extension_type found'

2019-08-20 Thread David Alphus (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6301?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911454#comment-16911454 ] David Alphus commented on ARROW-6301: - When recompiling I got a segfault during atexit. Not totally

[jira] [Commented] (ARROW-6058) [Python][Parquet] Failure when reading Parquet file from S3

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911443#comment-16911443 ] Wes McKinney commented on ARROW-6058: - I'm able to reproduce the issue. Going to dig in to see if I

[jira] [Assigned] (ARROW-6058) [Python][Parquet] Failure when reading Parquet file from S3

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6058?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-6058: --- Assignee: Wes McKinney > [Python][Parquet] Failure when reading Parquet file from S3 >

[jira] [Created] (ARROW-6301) atexit: pyarrow.lib.ArrowKeyError: 'No type extension with name arrow.py_extension_type found'

2019-08-20 Thread David Alphus (Jira)
David Alphus created ARROW-6301: --- Summary: atexit: pyarrow.lib.ArrowKeyError: 'No type extension with name arrow.py_extension_type found' Key: ARROW-6301 URL: https://issues.apache.org/jira/browse/ARROW-6301

[jira] [Commented] (ARROW-6300) [C++] Add io::OutputStream::Abort()

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911427#comment-16911427 ] Wes McKinney commented on ARROW-6300: - Seems reasonable to have the default implementation be a no-op

[jira] [Comment Edited] (ARROW-6300) [C++] Add io::OutputStream::Abort()

2019-08-20 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911427#comment-16911427 ] Wes McKinney edited comment on ARROW-6300 at 8/20/19 2:49 PM: -- Seems

[jira] [Commented] (ARROW-1644) [C++][Parquet] Read and write nested Parquet data with a mix of struct and list nesting levels

2019-08-20 Thread Brian Phillips (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1644?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911426#comment-16911426 ] Brian Phillips commented on ARROW-1644: --- My main use case for (py)arrow is converting very nested

[jira] [Assigned] (ARROW-5141) [C++] Share more of the IPC testing utils with the rest of Arrow

2019-08-20 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-5141: - Assignee: (was: Francois Saint-Jacques) > [C++] Share more of the

[jira] [Assigned] (ARROW-5082) [Python][Packaging] Reduce size of macOS and manylinux1 wheels

2019-08-20 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5082?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-5082: - Assignee: (was: Francois Saint-Jacques) > [Python][Packaging]

[jira] [Assigned] (ARROW-5630) [Python] Table of nested arrays doesn't round trip

2019-08-20 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5630?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Francois Saint-Jacques reassigned ARROW-5630: - Assignee: (was: Francois Saint-Jacques) > [Python] Table of nested

[jira] [Commented] (ARROW-6300) [C++] Add io::OutputStream::Abort()

2019-08-20 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911406#comment-16911406 ] Antoine Pitrou commented on ARROW-6300: --- [~emkornfi...@gmail.com] opinions? > [C++] Add

[jira] [Commented] (ARROW-6300) [C++] Add io::OutputStream::Abort()

2019-08-20 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6300?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16911402#comment-16911402 ] Antoine Pitrou commented on ARROW-6300: --- Context: S3 multipart uploads must be either completed or

  1   2   >