[jira] [Resolved] (ARROW-1192) [JAVA] Improve splitAndTransfer performance for List and Union vectors

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-1192. - Resolution: Fixed Issue resolved by pull request 901 [https://github.com/apache/arrow/pull/901]

[jira] [Updated] (ARROW-633) [Java] Add support for FixedWidthBinary type

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-633: --- Fix Version/s: (was: 0.6.0) 1.0.0 > [Java] Add support for FixedWidthBinary

[jira] [Updated] (ARROW-973) [Website] Add FAQ page about project

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-973: --- Fix Version/s: (was: 0.6.0) 1.0.0 > [Website] Add FAQ page about project >

[jira] [Updated] (ARROW-1175) [Java] Implement/test dictionary-encoded subfields

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1175: Fix Version/s: (was: 0.6.0) 1.0.0 > [Java] Implement/test dictionary-encoded

[jira] [Closed] (ARROW-1254) failing installation on osx / linux

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1254?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney closed ARROW-1254. --- > failing installation on osx / linux > --- > > Key:

[jira] [Resolved] (ARROW-276) Nullable Vectors should extend BaseValueVector and not BaseDataValueVector

2017-07-28 Thread Steven Phillips (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-276?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Phillips resolved ARROW-276. --- Resolution: Fixed Issue resolved by pull request 892

[jira] [Resolved] (ARROW-1267) [Java] Handle zero length case in BitVector.splitAndTransfer

2017-07-28 Thread Steven Phillips (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Steven Phillips resolved ARROW-1267. Resolution: Fixed Issue resolved by pull request 890

[jira] [Commented] (ARROW-968) [Python] RecordBatch [i:j] syntax is incomplete

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16105752#comment-16105752 ] Wes McKinney commented on ARROW-968: PR: https://github.com/apache/arrow/pull/908 > [Python]

[jira] [Commented] (ARROW-1287) [Python] Emulate "whence" argument of seek in NativeFile

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16105738#comment-16105738 ] Wes McKinney commented on ARROW-1287: - PR: https://github.com/apache/arrow/pull/907 > [Python]

[jira] [Assigned] (ARROW-968) [Python] RecordBatch [i:j] syntax is incomplete

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-968: -- Assignee: Wes McKinney > [Python] RecordBatch [i:j] syntax is incomplete >

[jira] [Commented] (ARROW-1291) [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric column names

2017-07-28 Thread Phillip Cloud (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16105728#comment-16105728 ] Phillip Cloud commented on ARROW-1291: -- That could work, but then the round trip conversion is no

[jira] [Commented] (ARROW-1291) [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric column names

2017-07-28 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16105716#comment-16105716 ] Li Jin commented on ARROW-1291: --- +1 > [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with

[jira] [Commented] (ARROW-1282) Large memory reallocation by Arrow causes hang in jemalloc

2017-07-28 Thread Jeff Knupp (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16105639#comment-16105639 ] Jeff Knupp commented on ARROW-1282: --- [~wesmckinn], I tried using the jemalloc with a prefix and

[jira] [Commented] (ARROW-1291) [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric column names

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16105640#comment-16105640 ] Wes McKinney commented on ARROW-1291: - How about we convert non-string column labels to strings for

[jira] [Updated] (ARROW-1291) [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric column names

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-1291: Fix Version/s: 0.6.0 > [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric >

[jira] [Created] (ARROW-1292) [C++/Python] Expand libhdfs feature coverage

2017-07-28 Thread Wes McKinney (JIRA)
Wes McKinney created ARROW-1292: --- Summary: [C++/Python] Expand libhdfs feature coverage Key: ARROW-1292 URL: https://issues.apache.org/jira/browse/ARROW-1292 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-1291) [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric column names

2017-07-28 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16105491#comment-16105491 ] Li Jin commented on ARROW-1291: --- The use case I have is that I am passing a user provided pandas dataframe

[jira] [Commented] (ARROW-1291) [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric column names

2017-07-28 Thread Phillip Cloud (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16105308#comment-16105308 ] Phillip Cloud commented on ARROW-1291: -- I'm -1 on allowing numeric column names since it adds an IMO

[jira] [Commented] (ARROW-1291) [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric column names

2017-07-28 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16105288#comment-16105288 ] Li Jin commented on ARROW-1291: --- I think stringifying non-string columns is fine. Having metadata containing

[jira] [Resolved] (ARROW-1289) [Python] Add PYARROW_BUILD_PLASMA option like Parquet

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1289?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-1289. - Resolution: Fixed Issue resolved by pull request 903 [https://github.com/apache/arrow/pull/903]

[jira] [Commented] (ARROW-1291) [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric column names

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1291?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16105153#comment-16105153 ] Wes McKinney commented on ARROW-1291: - This is a known limitation because Arrow schemas must have all

[jira] [Updated] (ARROW-1291) [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric column names

2017-07-28 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Jin updated ARROW-1291: -- Priority: Minor (was: Major) > [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric >

[jira] [Updated] (ARROW-1291) [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric column names

2017-07-28 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1291?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Jin updated ARROW-1291: -- Component/s: Python > [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric > column names

[jira] [Created] (ARROW-1291) [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric column names

2017-07-28 Thread Li Jin (JIRA)
Li Jin created ARROW-1291: - Summary: [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric column names Key: ARROW-1291 URL: https://issues.apache.org/jira/browse/ARROW-1291 Project:

[jira] [Commented] (ARROW-1234) [Java] publishing nightly snapshot java artifacts to maven repo

2017-07-28 Thread Li Jin (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16105110#comment-16105110 ] Li Jin commented on ARROW-1234: --- [~antonymayi], I can also help with this. I don't want to duplicate the

[jira] [Resolved] (ARROW-1273) [Python] Add convenience functions for reading only Parquet metadata or effective Arrow schema from a particular Parquet file

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1273?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-1273. - Resolution: Fixed Issue resolved by pull request 904 [https://github.com/apache/arrow/pull/904]

[jira] [Resolved] (ARROW-1290) [C++] Use array capacity doubling in arrow::BufferBuilder

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1290?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-1290. - Resolution: Fixed Issue resolved by pull request 905 [https://github.com/apache/arrow/pull/905]

[jira] [Commented] (ARROW-1282) Large memory reallocation by Arrow causes hang in jemalloc

2017-07-28 Thread Jeff Knupp (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1282?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16105051#comment-16105051 ] Jeff Knupp commented on ARROW-1282: --- Yeah, that's what I'm working on now. Just about finished making

[jira] [Resolved] (ARROW-1281) [C++/Python] Add Docker setup for running HDFS tests and other tests we may not run in Travis CI

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-1281. - Resolution: Fixed Issue resolved by pull request 895 [https://github.com/apache/arrow/pull/895]

[jira] [Assigned] (ARROW-1281) [C++/Python] Add Docker setup for running HDFS tests and other tests we may not run in Travis CI

2017-07-28 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-1281?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-1281: --- Assignee: Wes McKinney > [C++/Python] Add Docker setup for running HDFS tests and other