[jira] [Commented] (ARROW-1992) [Python] to_pandas crashes when using strings_to_categoricals on empty string cols on 0.8.0

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338640#comment-16338640 ] ASF GitHub Bot commented on ARROW-1992: --- wesm commented on a change in pull request #1508:

[jira] [Commented] (ARROW-2019) Control the memory allocated for inner vector in LIST

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338636#comment-16338636 ] ASF GitHub Bot commented on ARROW-2019: --- siddharthteotia commented on a change in pull request

[jira] [Commented] (ARROW-2019) Control the memory allocated for inner vector in LIST

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338632#comment-16338632 ] ASF GitHub Bot commented on ARROW-2019: --- vkorukanti commented on a change in pull request #1497:

[jira] [Updated] (ARROW-1706) [Python] StructArray.from_arrays should handle sequences that are coercible to arrays

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-1706: -- Labels: pull-request-available (was: ) > [Python] StructArray.from_arrays should handle

[jira] [Commented] (ARROW-1706) [Python] StructArray.from_arrays should handle sequences that are coercible to arrays

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338628#comment-16338628 ] ASF GitHub Bot commented on ARROW-1706: --- wesm opened a new pull request #1512: ARROW-1706: [Python]

[jira] [Commented] (ARROW-2031) HadoopFileSystem isn't pickleable

2018-01-24 Thread Jim Crist (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338622#comment-16338622 ] Jim Crist commented on ARROW-2031: -- Ah, good catch, sorry about that. Searching for "hdfs" didn't turn

[jira] [Assigned] (ARROW-1706) [Python] StructArray.from_arrays should handle sequences that are coercible to arrays

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1706?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-1706: --- Assignee: Wes McKinney > [Python] StructArray.from_arrays should handle sequences that are

[jira] [Updated] (ARROW-1385) [C++] Add Buffer implementation and helper functions for POSIX shared memory

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1385: Fix Version/s: (was: 0.9.0) 0.10.0 > [C++] Add Buffer implementation and

[jira] [Assigned] (ARROW-1385) [C++] Add Buffer implementation and helper functions for POSIX shared memory

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1385?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-1385: --- Assignee: (was: Wes McKinney) > [C++] Add Buffer implementation and helper functions for

[jira] [Updated] (ARROW-1393) [C++] Simplified CUDA IPC writer and reader for communicating a CPU + GPU payload to another process

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1393: Fix Version/s: (was: 0.9.0) 0.10.0 > [C++] Simplified CUDA IPC writer and

[jira] [Updated] (ARROW-300) [Format] Add buffer compression option to IPC file format

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-300?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-300: --- Fix Version/s: (was: 0.9.0) 0.10.0 > [Format] Add buffer compression option to

[jira] [Updated] (ARROW-1823) [C++] Add hash kernel benchmarks, investigate faster alternative non-SIMD hash functions

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1823: Fix Version/s: (was: 0.9.0) 0.10.0 > [C++] Add hash kernel benchmarks,

[jira] [Updated] (ARROW-1894) [Python] Treat CPython memoryview or buffer objects equivalently to pyarrow.Buffer in pyarrow.serialize

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1894: Fix Version/s: (was: 0.9.0) 0.10.0 > [Python] Treat CPython memoryview or

[jira] [Updated] (ARROW-1961) [Python] Writing Parquet file with flavor='spark' loses pandas schema metadata

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1961?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-1961: -- Labels: pull-request-available (was: ) > [Python] Writing Parquet file with flavor='spark'

[jira] [Commented] (ARROW-1961) [Python] Writing Parquet file with flavor='spark' loses pandas schema metadata

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1961?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338589#comment-16338589 ] ASF GitHub Bot commented on ARROW-1961: --- wesm opened a new pull request #1511: ARROW-1961: [Python]

[jira] [Resolved] (ARROW-2025) [Python/C++] HDFS Client disconnect closes all open clients

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-2025. - Resolution: Fixed Fix Version/s: 0.9.0 Issue resolved by pull request 1499

[jira] [Commented] (ARROW-2025) [Python/C++] HDFS Client disconnect closes all open clients

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338562#comment-16338562 ] ASF GitHub Bot commented on ARROW-2025: --- wesm closed pull request #1499: ARROW-2025: [C++] Creating

[jira] [Commented] (ARROW-2003) [Python] Do not use deprecated kwarg in pandas.core.internals.make_block

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338531#comment-16338531 ] ASF GitHub Bot commented on ARROW-2003: --- wesm commented on issue #1507: ARROW-2003: [Python] Remove

[jira] [Commented] (ARROW-2007) [Python] Sequence converter for float32 not implemented

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338513#comment-16338513 ] ASF GitHub Bot commented on ARROW-2007: --- wesm opened a new pull request #1509: ARROW-2007: [Python]

[jira] [Updated] (ARROW-2007) [Python] Sequence converter for float32 not implemented

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-2007: -- Labels: pull-request-available (was: ) > [Python] Sequence converter for float32 not

[jira] [Updated] (ARROW-1992) [Python] to_pandas crashes when using strings_to_categoricals on empty string cols on 0.8.0

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-1992: -- Labels: pull-request-available (was: ) > [Python] to_pandas crashes when using

[jira] [Commented] (ARROW-1992) [Python] to_pandas crashes when using strings_to_categoricals on empty string cols on 0.8.0

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338496#comment-16338496 ] ASF GitHub Bot commented on ARROW-1992: --- Licht-T opened a new pull request #1508: ARROW-1992:

[jira] [Commented] (ARROW-2003) [Python] Do not use deprecated kwarg in pandas.core.internals.make_block

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338482#comment-16338482 ] ASF GitHub Bot commented on ARROW-2003: --- jreback commented on issue #1507: ARROW-2003: [Python]

[jira] [Commented] (ARROW-1976) [Python] Handling unicode pandas columns on parquet.read_table

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1976?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338469#comment-16338469 ] ASF GitHub Bot commented on ARROW-1976: --- Licht-T commented on issue #1476: ARROW-1976: [Python] Fix

[jira] [Commented] (ARROW-2003) [Python] Do not use deprecated kwarg in pandas.core.internals.make_block

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338466#comment-16338466 ] ASF GitHub Bot commented on ARROW-2003: --- wesm opened a new pull request #1507: ARROW-2003: [Python]

[jira] [Updated] (ARROW-2003) [Python] Do not use deprecated kwarg in pandas.core.internals.make_block

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-2003: -- Labels: pull-request-available (was: ) > [Python] Do not use deprecated kwarg in

[jira] [Commented] (ARROW-2019) Control the memory allocated for inner vector in LIST

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338464#comment-16338464 ] ASF GitHub Bot commented on ARROW-2019: --- siddharthteotia commented on issue #1497: ARROW-2019:

[jira] [Assigned] (ARROW-2003) [Python] Do not use deprecated kwarg in pandas.core.internals.make_block

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-2003: --- Assignee: Wes McKinney > [Python] Do not use deprecated kwarg in

[jira] [Updated] (ARROW-2010) [C++] Compiler warnings with CHECKIN warning level in ORC adapter

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-2010: -- Labels: ORC pull-request-available (was: ORC) > [C++] Compiler warnings with CHECKIN warning

[jira] [Commented] (ARROW-2010) [C++] Compiler warnings with CHECKIN warning level in ORC adapter

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338462#comment-16338462 ] ASF GitHub Bot commented on ARROW-2010: --- wesm opened a new pull request #1506: ARROW-2010: [C++] Do

[jira] [Updated] (ARROW-2031) HadoopFileSystem isn't pickleable

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2031?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-2031: -- Labels: pull-request-available (was: ) > HadoopFileSystem isn't pickleable >

[jira] [Created] (ARROW-2032) [C++] ORC ep installs on each call to ninja build (even if no work to do)

2018-01-24 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-2032: --- Summary: [C++] ORC ep installs on each call to ninja build (even if no work to do) Key: ARROW-2032 URL: https://issues.apache.org/jira/browse/ARROW-2032 Project:

[jira] [Commented] (ARROW-2031) HadoopFileSystem isn't pickleable

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2031?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338460#comment-16338460 ] ASF GitHub Bot commented on ARROW-2031: --- jcrist opened a new pull request #1505: ARROW-2031:

[jira] [Assigned] (ARROW-2010) [C++] Compiler warnings with CHECKIN warning level in ORC adapter

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-2010: --- Assignee: Wes McKinney > [C++] Compiler warnings with CHECKIN warning level in ORC adapter >

[jira] [Commented] (ARROW-2019) Control the memory allocated for inner vector in LIST

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338450#comment-16338450 ] ASF GitHub Bot commented on ARROW-2019: --- siddharthteotia commented on a change in pull request

[jira] [Commented] (ARROW-2019) Control the memory allocated for inner vector in LIST

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338451#comment-16338451 ] ASF GitHub Bot commented on ARROW-2019: --- siddharthteotia commented on a change in pull request

[jira] [Commented] (ARROW-2019) Control the memory allocated for inner vector in LIST

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338454#comment-16338454 ] ASF GitHub Bot commented on ARROW-2019: --- siddharthteotia commented on a change in pull request

[jira] [Commented] (ARROW-2019) Control the memory allocated for inner vector in LIST

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338453#comment-16338453 ] ASF GitHub Bot commented on ARROW-2019: --- siddharthteotia commented on a change in pull request

[jira] [Updated] (ARROW-2030) NativeFile's Attributes are not exposed in child classes without explicit initialization

2018-01-24 Thread Phillip Cloud (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phillip Cloud updated ARROW-2030: - Description: This shows up in calling {{self._assert_readable()}} which tries to read from a

[jira] [Created] (ARROW-2030) NativeFile's Attributes are not exposed in child classes without explicit initialization

2018-01-24 Thread Phillip Cloud (JIRA)
Phillip Cloud created ARROW-2030: Summary: NativeFile's Attributes are not exposed in child classes without explicit initialization Key: ARROW-2030 URL: https://issues.apache.org/jira/browse/ARROW-2030

[jira] [Updated] (ARROW-2030) NativeFile's Attributes are not exposed in child classes without explicit initialization

2018-01-24 Thread Phillip Cloud (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phillip Cloud updated ARROW-2030: - Description: This shows up in calling {{self._assert_readable()}} which tries to read from a

[jira] [Updated] (ARROW-2017) Array initialization with large (>2**31-1) uint64 values fails

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-2017: -- Labels: pull-request-available (was: ) > Array initialization with large (>2**31-1) uint64

[jira] [Commented] (ARROW-2017) Array initialization with large (>2**31-1) uint64 values fails

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338443#comment-16338443 ] ASF GitHub Bot commented on ARROW-2017: --- wesm opened a new pull request #1504: ARROW-2017: [Python]

[jira] [Commented] (ARROW-2019) Control the memory allocated for inner vector in LIST

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338425#comment-16338425 ] ASF GitHub Bot commented on ARROW-2019: --- jacques-n commented on a change in pull request #1497:

[jira] [Commented] (ARROW-2019) Control the memory allocated for inner vector in LIST

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338423#comment-16338423 ] ASF GitHub Bot commented on ARROW-2019: --- jacques-n commented on a change in pull request #1497:

[jira] [Commented] (ARROW-2019) Control the memory allocated for inner vector in LIST

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338422#comment-16338422 ] ASF GitHub Bot commented on ARROW-2019: --- jacques-n commented on a change in pull request #1497:

[jira] [Commented] (ARROW-2019) Control the memory allocated for inner vector in LIST

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338424#comment-16338424 ] ASF GitHub Bot commented on ARROW-2019: --- jacques-n commented on a change in pull request #1497:

[jira] [Assigned] (ARROW-2017) Array initialization with large (>2**31-1) uint64 values fails

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-2017: --- Assignee: Wes McKinney > Array initialization with large (>2**31-1) uint64 values fails >

[jira] [Commented] (ARROW-1589) [C++] Fuzzing for certain input formats

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338415#comment-16338415 ] Wes McKinney commented on ARROW-1589: - I started a PR to test some malformed inputs here

[jira] [Commented] (ARROW-2023) [C++] Test opening IPC stream reader or file reader on an empty InputStream

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2023?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338411#comment-16338411 ] ASF GitHub Bot commented on ARROW-2023: --- wesm opened a new pull request #1503: WIP ARROW-2023: [C++]

[jira] [Updated] (ARROW-2023) [C++] Test opening IPC stream reader or file reader on an empty InputStream

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-2023: -- Labels: pull-request-available (was: ) > [C++] Test opening IPC stream reader or file reader

[jira] [Assigned] (ARROW-2023) [C++] Test opening IPC stream reader or file reader on an empty InputStream

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2023?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-2023: --- Assignee: Wes McKinney > [C++] Test opening IPC stream reader or file reader on an empty

[jira] [Commented] (ARROW-2029) [Python] Program crash on `HdfsFile.tell` if file is closed

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338365#comment-16338365 ] ASF GitHub Bot commented on ARROW-2029: --- jcrist opened a new pull request #1502: ARROW-2029:

[jira] [Updated] (ARROW-2029) [Python] Program crash on `HdfsFile.tell` if file is closed

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2029?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-2029: -- Labels: pull-request-available (was: ) > [Python] Program crash on `HdfsFile.tell` if file is

[jira] [Created] (ARROW-2029) [Python] Program crash on `HdfsFile.tell` if file is closed

2018-01-24 Thread Jim Crist (JIRA)
Jim Crist created ARROW-2029: Summary: [Python] Program crash on `HdfsFile.tell` if file is closed Key: ARROW-2029 URL: https://issues.apache.org/jira/browse/ARROW-2029 Project: Apache Arrow

[jira] [Commented] (ARROW-2028) [Python] extra_cmake_args needs to be passed through shlex.split

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338326#comment-16338326 ] ASF GitHub Bot commented on ARROW-2028: --- xhochy opened a new pull request #1501: ARROW-2028:

[jira] [Updated] (ARROW-2028) [Python] extra_cmake_args needs to be passed through shlex.split

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-2028: -- Labels: pull-request-available (was: ) > [Python] extra_cmake_args needs to be passed through

[jira] [Commented] (ARROW-1754) [Python] Fix buggy Parquet roundtrip when an index name is the same as a column name

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338325#comment-16338325 ] ASF GitHub Bot commented on ARROW-1754: --- cpcloud commented on issue #1408: ARROW-1754: [Python]

[jira] [Created] (ARROW-2028) [Python] extra_cmake_args needs to be passed through shlex.split

2018-01-24 Thread Uwe L. Korn (JIRA)
Uwe L. Korn created ARROW-2028: -- Summary: [Python] extra_cmake_args needs to be passed through shlex.split Key: ARROW-2028 URL: https://issues.apache.org/jira/browse/ARROW-2028 Project: Apache Arrow

[jira] [Commented] (ARROW-1754) [Python] Fix buggy Parquet roundtrip when an index name is the same as a column name

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338322#comment-16338322 ] ASF GitHub Bot commented on ARROW-1754: --- cpcloud commented on a change in pull request #1408:

[jira] [Updated] (ARROW-2026) [Python] µs timestamps saved as int64 even if use_deprecated_int96_timestamps=True

2018-01-24 Thread Uwe L. Korn (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe L. Korn updated ARROW-2026: --- Fix Version/s: 0.9.0 > [Python] µs timestamps saved as int64 even if >

[jira] [Updated] (ARROW-488) [Python] Implement conversion between integer coded as floating points with NaN to an Arrow integer type

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-488?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-488: --- Fix Version/s: (was: 0.9.0) 0.10.0 > [Python] Implement conversion between

[jira] [Updated] (ARROW-501) [C++] Implement concurrent / buffering InputStream for streaming data use cases

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-501: --- Fix Version/s: (was: 0.9.0) 0.10.0 > [C++] Implement concurrent / buffering

[jira] [Updated] (ARROW-412) [Format] Handling of buffer padding in the IPC metadata

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-412?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-412: --- Fix Version/s: (was: 0.9.0) 0.10.0 > [Format] Handling of buffer padding in the

[jira] [Commented] (ARROW-1860) [C++] Add data structure to "stage" a sequence of IPC messages from in-memory data

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338267#comment-16338267 ] ASF GitHub Bot commented on ARROW-1860: --- wesm commented on issue #1500: ARROW-1860: [C++] Introduce

[jira] [Commented] (ARROW-2019) Control the memory allocated for inner vector in LIST

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338266#comment-16338266 ] ASF GitHub Bot commented on ARROW-2019: --- siddharthteotia commented on issue #1497: ARROW-2019:

[jira] [Created] (ARROW-2027) [C++] ipc::Message::SerializeTo does not pad the message body

2018-01-24 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-2027: --- Summary: [C++] ipc::Message::SerializeTo does not pad the message body Key: ARROW-2027 URL: https://issues.apache.org/jira/browse/ARROW-2027 Project: Apache Arrow

[jira] [Commented] (ARROW-2025) [Python/C++] HDFS Client disconnect closes all open clients

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338234#comment-16338234 ] ASF GitHub Bot commented on ARROW-2025: --- jcrist commented on issue #1499: ARROW-2025: [C++] Creating

[jira] [Updated] (ARROW-2026) [Python] µs timestamps saved as int64 even if use_deprecated_int96_timestamps=True

2018-01-24 Thread Diego Argueta (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Diego Argueta updated ARROW-2026: - Summary: [Python] µs timestamps saved as int64 even if use_deprecated_int96_timestamps=True

[jira] [Updated] (ARROW-2026) [Python] Timestamps saved as int64 even if use_deprecated_int96_timestamps=True

2018-01-24 Thread Diego Argueta (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Diego Argueta updated ARROW-2026: - Summary: [Python] Timestamps saved as int64 even if use_deprecated_int96_timestamps=True (was:

[jira] [Updated] (ARROW-1860) [C++] Add data structure to "stage" a sequence of IPC messages from in-memory data

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-1860: -- Labels: pull-request-available (was: ) > [C++] Add data structure to "stage" a sequence of IPC

[jira] [Commented] (ARROW-633) [Java] Add support for FixedSizeBinary type

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338149#comment-16338149 ] ASF GitHub Bot commented on ARROW-633: -- alphalfalfa commented on a change in pull request #1492:

[jira] [Commented] (ARROW-1938) [Python] Error writing to partitioned Parquet dataset

2018-01-24 Thread Robert Dailey (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338081#comment-16338081 ] Robert Dailey commented on ARROW-1938: -- Added data to test with as well as the exact commands I was

[jira] [Updated] (ARROW-1938) [Python] Error writing to partitioned Parquet dataset

2018-01-24 Thread Robert Dailey (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Dailey updated ARROW-1938: - Attachment: ARROW-1938.py > [Python] Error writing to partitioned Parquet dataset >

[jira] [Updated] (ARROW-1938) [Python] Error writing to partitioned Parquet dataset

2018-01-24 Thread Robert Dailey (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robert Dailey updated ARROW-1938: - Attachment: ARROW-1938-test-data.csv.gz > [Python] Error writing to partitioned Parquet dataset >

[jira] [Commented] (ARROW-2022) [Format] Add custom metadata field specific to a RecordBatch message

2018-01-24 Thread Julian Hyde (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2022?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338058#comment-16338058 ] Julian Hyde commented on ARROW-2022: There's a whole class of batch-level data, including statistics,

[jira] [Commented] (ARROW-1938) [Python] Error writing to partitioned Parquet dataset

2018-01-24 Thread Robert Dailey (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338057#comment-16338057 ] Robert Dailey commented on ARROW-1938: -- Let me gather the data I was using for this. Here are the

[jira] [Updated] (ARROW-2020) [Python] Parquet segfaults if coercing ns timestamps and writing 96-bit timestamps

2018-01-24 Thread Diego Argueta (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Diego Argueta updated ARROW-2020: - Labels: timestamps (was: ) > [Python] Parquet segfaults if coercing ns timestamps and writing

[jira] [Updated] (ARROW-2026) Timestamps saved as int64 even if use_deprecated_int96_timestamps=True

2018-01-24 Thread Diego Argueta (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Diego Argueta updated ARROW-2026: - Description: When writing to a Parquet file, if `use_deprecated_int96_timestamps` is True,

[jira] [Updated] (ARROW-2026) Timestamps saved as int64 even if use_deprecated_int96_timestamps=True

2018-01-24 Thread Diego Argueta (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Diego Argueta updated ARROW-2026: - Description: When writing to a Parquet file, if `use_deprecated_int96_timestamps` is True,

[jira] [Created] (ARROW-2026) Timestamps saved as int64 even if use_deprecated_int96_timestamps=True

2018-01-24 Thread Diego Argueta (JIRA)
Diego Argueta created ARROW-2026: Summary: Timestamps saved as int64 even if use_deprecated_int96_timestamps=True Key: ARROW-2026 URL: https://issues.apache.org/jira/browse/ARROW-2026 Project: Apache

[jira] [Commented] (ARROW-2025) [Python/C++] HDFS Client disconnect closes all open clients

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338030#comment-16338030 ] ASF GitHub Bot commented on ARROW-2025: --- jcrist opened a new pull request #1499: ARROW-2025: [C++]

[jira] [Updated] (ARROW-2025) [Python/C++] HDFS Client disconnect closes all open clients

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-2025: -- Labels: pull-request-available (was: ) > [Python/C++] HDFS Client disconnect closes all open

[jira] [Commented] (ARROW-633) [Java] Add support for FixedSizeBinary type

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-633?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16338021#comment-16338021 ] ASF GitHub Bot commented on ARROW-633: -- icexelloss commented on a change in pull request #1492:

[jira] [Commented] (ARROW-2025) [Python/C++] HDFS Client disconnect closes all open clients

2018-01-24 Thread Jim Crist (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16337955#comment-16337955 ] Jim Crist commented on ARROW-2025: -- Actually, we should just use `hdfsBuilderSetForceNewInstance` to

[jira] [Commented] (ARROW-2025) [Python/C++] HDFS Client disconnect closes all open clients

2018-01-24 Thread Jim Crist (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2025?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16337932#comment-16337932 ] Jim Crist commented on ARROW-2025: -- Looking closer, I think this may be due to the libhdfs fs cache. From

[jira] [Assigned] (ARROW-1940) [Python] Extra metadata gets added after multiple conversions between pd.DataFrame and pa.Table

2018-01-24 Thread Phillip Cloud (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phillip Cloud reassigned ARROW-1940: Assignee: Phillip Cloud > [Python] Extra metadata gets added after multiple conversions

[jira] [Assigned] (ARROW-1974) [Python] Segfault when working with Arrow tables with duplicate columns

2018-01-24 Thread Phillip Cloud (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phillip Cloud reassigned ARROW-1974: Assignee: Phillip Cloud > [Python] Segfault when working with Arrow tables with duplicate

[jira] [Created] (ARROW-2025) [Python/C++] HDFS Client disconnect closes all open clients

2018-01-24 Thread Jim Crist (JIRA)
Jim Crist created ARROW-2025: Summary: [Python/C++] HDFS Client disconnect closes all open clients Key: ARROW-2025 URL: https://issues.apache.org/jira/browse/ARROW-2025 Project: Apache Arrow

[jira] [Assigned] (ARROW-1998) [Python] Table.from_pandas crashes when data frame is empty

2018-01-24 Thread Phillip Cloud (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phillip Cloud reassigned ARROW-1998: Assignee: Phillip Cloud > [Python] Table.from_pandas crashes when data frame is empty >

[jira] [Assigned] (ARROW-1999) [Python] from_numpy_dtype returns wrong types

2018-01-24 Thread Phillip Cloud (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1999?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phillip Cloud reassigned ARROW-1999: Assignee: Phillip Cloud > [Python] from_numpy_dtype returns wrong types >

[jira] [Assigned] (ARROW-2016) [Python] Fix up ASV benchmarking setup and document procedure for use

2018-01-24 Thread Phillip Cloud (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phillip Cloud reassigned ARROW-2016: Assignee: Phillip Cloud > [Python] Fix up ASV benchmarking setup and document procedure

[jira] [Assigned] (ARROW-2014) [Python] Document read_pandas method in pyarrow.parquet

2018-01-24 Thread Phillip Cloud (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2014?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phillip Cloud reassigned ARROW-2014: Assignee: Phillip Cloud > [Python] Document read_pandas method in pyarrow.parquet >

[jira] [Assigned] (ARROW-1938) [Python] Error writing to partitioned Parquet dataset

2018-01-24 Thread Phillip Cloud (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1938?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phillip Cloud reassigned ARROW-1938: Assignee: Phillip Cloud > [Python] Error writing to partitioned Parquet dataset >

[jira] [Assigned] (ARROW-1973) [Python] Memory leak when converting Arrow tables with array columns to Pandas dataframes.

2018-01-24 Thread Phillip Cloud (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Phillip Cloud reassigned ARROW-1973: Assignee: Phillip Cloud > [Python] Memory leak when converting Arrow tables with array

[jira] [Commented] (ARROW-1860) [C++] Add data structure to "stage" a sequence of IPC messages from in-memory data

2018-01-24 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1860?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16337899#comment-16337899 ] Wes McKinney commented on ARROW-1860: - I'm running into a slight issue with this refactoring around

[jira] [Commented] (ARROW-2019) Control the memory allocated for inner vector in LIST

2018-01-24 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16337819#comment-16337819 ] ASF GitHub Bot commented on ARROW-2019: --- jacques-n commented on issue #1497: ARROW-2019: [JAVA]