[jira] [Comment Edited] (ARROW-7628) [Python] read_csv problematic cases

2020-02-18 Thread Athanassios Hatzis (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039767#comment-17039767 ] Athanassios Hatzis edited comment on ARROW-7628 at 2/19/20 7:52 AM:

[jira] [Comment Edited] (ARROW-7628) [Python] read_csv problematic cases

2020-02-18 Thread Athanassios Hatzis (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039767#comment-17039767 ] Athanassios Hatzis edited comment on ARROW-7628 at 2/19/20 7:49 AM:

[jira] [Comment Edited] (ARROW-7628) [Python] read_csv problematic cases

2020-02-18 Thread Athanassios Hatzis (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039767#comment-17039767 ] Athanassios Hatzis edited comment on ARROW-7628 at 2/19/20 7:47 AM:

[jira] [Comment Edited] (ARROW-7628) [Python] read_csv problematic cases

2020-02-18 Thread Athanassios Hatzis (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039767#comment-17039767 ] Athanassios Hatzis edited comment on ARROW-7628 at 2/19/20 7:35 AM:

[jira] [Comment Edited] (ARROW-7628) [Python] read_csv problematic cases

2020-02-18 Thread Athanassios Hatzis (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039767#comment-17039767 ] Athanassios Hatzis edited comment on ARROW-7628 at 2/19/20 7:36 AM:

[jira] [Commented] (ARROW-7628) [Python] read_csv problematic cases

2020-02-18 Thread Athanassios Hatzis (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7628?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039767#comment-17039767 ] Athanassios Hatzis commented on ARROW-7628: --- Thanks [~apitrou] for clearing these cases. Yes, I

[jira] [Commented] (ARROW-7875) [Java] Decimal place getting shifted

2020-02-18 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039719#comment-17039719 ] Micah Kornfield commented on ARROW-7875: [~lparker] could you provide sample code for what you

[jira] [Commented] (ARROW-7808) [Java][Dataset] Implement Datasets Java API

2020-02-18 Thread Hongze Zhang (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7808?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039672#comment-17039672 ] Hongze Zhang commented on ARROW-7808: - I am not pretty sure but based on the mail discussion I would

[jira] [Resolved] (ARROW-7880) [CI][R] R sanitizer job is not really working

2020-02-18 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-7880. Resolution: Fixed Issue resolved by pull request 6450

[jira] [Resolved] (ARROW-7201) [GLib][Gandiva] Add support for BooleanNode

2020-02-18 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-7201. - Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 6389

[jira] [Commented] (ARROW-7877) [Packaging] Fix crossbow deployment to github artifacts

2020-02-18 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7877?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039592#comment-17039592 ] Kouhei Sutou commented on ARROW-7877: - We may resolve this by using GitHub Actions and

[jira] [Updated] (ARROW-7881) [C++] Fix pedantic warnings

2020-02-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7881: -- Labels: pull-request-available (was: ) > [C++] Fix pedantic warnings >

[jira] [Created] (ARROW-7881) [C++] Fix pedantic warnings

2020-02-18 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-7881: -- Summary: [C++] Fix pedantic warnings Key: ARROW-7881 URL: https://issues.apache.org/jira/browse/ARROW-7881 Project: Apache Arrow Issue Type: Improvement

[jira] [Updated] (ARROW-7880) [CI][R] R sanitizer job is not really working

2020-02-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7880: -- Labels: pull-request-available (was: ) > [CI][R] R sanitizer job is not really working >

[jira] [Created] (ARROW-7880) [CI][R] R sanitizer job is not really working

2020-02-18 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-7880: -- Summary: [CI][R] R sanitizer job is not really working Key: ARROW-7880 URL: https://issues.apache.org/jira/browse/ARROW-7880 Project: Apache Arrow Issue

[jira] [Resolved] (ARROW-7862) [R] Linux installation should run quieter by default

2020-02-18 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7862?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-7862. Fix Version/s: (was: 0.16.1) Resolution: Fixed Issue resolved by pull request

[jira] [Updated] (ARROW-7824) [C++][Dataset] Provide Dataset writing to IPC format

2020-02-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7824?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7824: -- Labels: pull-request-available (was: ) > [C++][Dataset] Provide Dataset writing to IPC format

[jira] [Updated] (ARROW-7873) [Python] Segfault in pandas version 1.0.1, read_parquet after creating a clickhouse odbc connection

2020-02-18 Thread Matt Calder (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Calder updated ARROW-7873: --- Attachment: foo.pkl > [Python] Segfault in pandas version 1.0.1, read_parquet after creating a >

[jira] [Commented] (ARROW-7873) [Python] Segfault in pandas version 1.0.1, read_parquet after creating a clickhouse odbc connection

2020-02-18 Thread Matt Calder (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039462#comment-17039462 ] Matt Calder commented on ARROW-7873: I added a pickle of the dataframe, as created in 1.0.1. >

[jira] [Commented] (ARROW-7873) [Python] Segfault in pandas version 1.0.1, read_parquet after creating a clickhouse odbc connection

2020-02-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039443#comment-17039443 ] Wes McKinney commented on ARROW-7873: - That would help. If you can provide a pickle of the offending

[jira] [Commented] (ARROW-3779) [C++/Python] Validate timezone passed to pa.timestamp

2020-02-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039440#comment-17039440 ] Wes McKinney commented on ARROW-3779: - If (but only if) pytz is available we could validate. I'm not

[jira] [Commented] (ARROW-1907) [C++/Python] Feather format cannot accommodate string columns containing more than a total of 2GB of data

2020-02-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1907?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039428#comment-17039428 ] Wes McKinney commented on ARROW-1907: - This will be handled by the Feather V2 project (since the IPC

[jira] [Updated] (ARROW-7764) [C++] Builders allocate a null bitmap buffer even if there is no nulls

2020-02-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7764: -- Labels: pull-request-available (was: ) > [C++] Builders allocate a null bitmap buffer even if

[jira] [Assigned] (ARROW-7764) [C++] Builders allocate a null bitmap buffer even if there is no nulls

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7764?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-7764: - Assignee: Antoine Pitrou > [C++] Builders allocate a null bitmap buffer even if there

[jira] [Updated] (ARROW-6521) [C++] Add function to arrow:: namespace that returns the current ABI version

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6521?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6521: -- Fix Version/s: 1.0.0 > [C++] Add function to arrow:: namespace that returns the current ABI

[jira] [Created] (ARROW-7879) [C++][Doc] Add doc for the Device API

2020-02-18 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-7879: - Summary: [C++][Doc] Add doc for the Device API Key: ARROW-7879 URL: https://issues.apache.org/jira/browse/ARROW-7879 Project: Apache Arrow Issue Type:

[jira] [Commented] (ARROW-7873) [Python] Segfault in pandas version 1.0.1, read_parquet after creating a clickhouse odbc connection

2020-02-18 Thread Matt Calder (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039323#comment-17039323 ] Matt Calder commented on ARROW-7873: If it would help, I can build pyarrow with debugging symbols and

[jira] [Updated] (ARROW-6699) [C++] Add Parquet docs

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6699?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6699: -- Fix Version/s: 1.0.0 > [C++] Add Parquet docs > -- > >

[jira] [Commented] (ARROW-7873) [Python] Segfault in pandas version 1.0.1, read_parquet after creating a clickhouse odbc connection

2020-02-18 Thread Matt Calder (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039320#comment-17039320 ] Matt Calder commented on ARROW-7873: I attached an example of foo.pq. In case it isn't clear from my

[jira] [Closed] (ARROW-1470) [C++] Add BufferAllocator abstract interface

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou closed ARROW-1470. - Fix Version/s: 1.0.0 Resolution: Fixed I believe the functionality is now provided by the

[jira] [Assigned] (ARROW-1470) [C++] Add BufferAllocator abstract interface

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-1470: - Assignee: (was: Wes McKinney) > [C++] Add BufferAllocator abstract interface >

[jira] [Commented] (ARROW-1565) [C++] "argtopk" and "argbottomk" functions for computing indices of largest or smallest elements

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1565?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039309#comment-17039309 ] Antoine Pitrou commented on ARROW-1565: --- Another possibility is to use quickselect. > [C++]

[jira] [Updated] (ARROW-7873) [Python] Segfault in pandas version 1.0.1, read_parquet after creating a clickhouse odbc connection

2020-02-18 Thread Matt Calder (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matt Calder updated ARROW-7873: --- Attachment: foo.pq > [Python] Segfault in pandas version 1.0.1, read_parquet after creating a >

[jira] [Updated] (ARROW-1565) [C++] "argtopk" and "argbottomk" functions for computing indices of largest or smallest elements

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-1565: -- Fix Version/s: 2.0.0 > [C++] "argtopk" and "argbottomk" functions for computing indices of

[jira] [Assigned] (ARROW-4286) [C++/R] Namespace vendored Boost

2020-02-18 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson reassigned ARROW-4286: -- Assignee: (was: Neal Richardson) > [C++/R] Namespace vendored Boost >

[jira] [Updated] (ARROW-4286) [C++/R] Namespace vendored Boost

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-4286: -- Fix Version/s: 1.0.0 > [C++/R] Namespace vendored Boost > > >

[jira] [Assigned] (ARROW-4286) [C++/R] Namespace vendored Boost

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4286?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-4286: - Assignee: Neal Richardson > [C++/R] Namespace vendored Boost >

[jira] [Commented] (ARROW-5074) [C++/Python] When installing into a SYSTEM prefix, RPATHs are not correctly set

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5074?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039292#comment-17039292 ] Antoine Pitrou commented on ARROW-5074: --- [~uwe] Is this still a problem? > [C++/Python] When

[jira] [Commented] (ARROW-3779) [C++/Python] Validate timezone passed to pa.timestamp

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-3779?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039290#comment-17039290 ] Antoine Pitrou commented on ARROW-3779: --- Should we keep this open? There's no obvious way to

[jira] [Updated] (ARROW-4133) [C++/Python] ORC adapter should fail gracefully if /etc/timezone is missing instead of aborting

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-4133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-4133: -- Fix Version/s: 2.0.0 > [C++/Python] ORC adapter should fail gracefully if /etc/timezone is

[jira] [Updated] (ARROW-971) [C++/Python] Implement Array.isvalid/notnull/isnull

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-971: - Component/s: C++ - Compute > [C++/Python] Implement Array.isvalid/notnull/isnull >

[jira] [Updated] (ARROW-971) [C++/Python] Implement Array.isvalid/notnull/isnull

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-971: - Fix Version/s: 2.0.0 > [C++/Python] Implement Array.isvalid/notnull/isnull >

[jira] [Updated] (ARROW-1907) [C++/Python] Feather format cannot accommodate string columns containing more than a total of 2GB of data

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-1907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-1907: -- Fix Version/s: 2.0.0 > [C++/Python] Feather format cannot accommodate string columns

[jira] [Commented] (ARROW-6469) [Python] HDFS documentation does not mention HDFS short circuit readings

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039284#comment-17039284 ] Antoine Pitrou commented on ARROW-6469: --- For reference, this is the doc for short-circuit reads:

[jira] [Commented] (ARROW-6469) [Python] HDFS documentation does not mention HDFS short circuit readings

2020-02-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039278#comment-17039278 ] Wes McKinney commented on ARROW-6469: - It seems that short-circuit reads are a configuration issue

[jira] [Updated] (ARROW-6582) [R] read_parquet() fails with embedded nuls in strings

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6582: -- Summary: [R] read_parquet() fails with embedded nuls in strings (was: R's read_parquet()

[jira] [Updated] (ARROW-6469) [Python] HDFS doc5mentation does not mention HDFS short circuit readings

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6469?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6469: -- Summary: [Python] HDFS documentation does not mention HDFS short circuit readings (was:

[jira] [Commented] (ARROW-7809) [R] vignette does not run on Win 10 nor ubuntu

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7809?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039276#comment-17039276 ] Antoine Pitrou commented on ARROW-7809: --- cc [~npr] > [R] vignette does not run on Win 10 nor

[jira] [Updated] (ARROW-7809) [R] vignette does not run on Win 10 nor ubuntu

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-7809: -- Summary: [R] vignette does not run o. Win 10 nor ubuntu (was: R vignette does not run on Win

[jira] [Updated] (ARROW-7809) [R] vignette does not run on Win 10 nor ubuntu

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7809?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-7809: -- Component/s: R > [R] vignette does not run on Win 10 nor ubuntu >

[jira] [Commented] (ARROW-6469) PyArrow HDFS documentation does not mention HDFS short circuit readings

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039274#comment-17039274 ] Antoine Pitrou commented on ARROW-6469: --- cc [~wesm] > PyArrow HDFS documentation does not mention

[jira] [Updated] (ARROW-6775) [C++] [Python] Proposal for several Array utility functions

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-6775: -- Summary: [C++] [Python] Proposal for several Array utility functions (was: Proposal for

[jira] [Updated] (ARROW-7782) [Python] Losing index information when using write_to_dataset with partition_cols

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-7782: -- Summary: [Python] Losing index information when using write_to_dataset with partition_cols

[jira] [Updated] (ARROW-7782) Losing index information when using write_to_dataset with partition_cols

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-7782: -- Component/s: Python > Losing index information when using write_to_dataset with partition_cols

[jira] [Commented] (ARROW-7875) [Java] Decimal place getting shifted

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039271#comment-17039271 ] Antoine Pitrou commented on ARROW-7875: --- cc [~emkornfi...@gmail.com] [~liyafan] > [Java] Decimal

[jira] [Updated] (ARROW-7875) [Java] Decimal place getting shifted

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-7875: -- Summary: [Java] Decimal place getting shifted (was: Decimal place getting shifted) > [Java]

[jira] [Updated] (ARROW-7875) Decimal place getting shifted

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7875?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-7875: -- Component/s: Java > Decimal place getting shifted > - > >

[jira] [Commented] (ARROW-7873) [Python] Segfault in pandas version 1.0.1, read_parquet after creating a clickhouse odbc connection

2020-02-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039252#comment-17039252 ] Wes McKinney commented on ARROW-7873: - Can you share a foo.pq that exhibits the problem? > [Python]

[jira] [Updated] (ARROW-7873) [Python] Segfault in pandas version 1.0.1, read_parquet after creating a clickhouse odbc connection

2020-02-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7873: Summary: [Python] Segfault in pandas version 1.0.1, read_parquet after creating a clickhouse odbc

[jira] [Updated] (ARROW-7788) [C++] Add schema conversion support for map type

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-7788: -- Summary: [C++] Add schema conversion support for map type (was: Add schema conversion support

[jira] [Updated] (ARROW-7788) Add schema conversion support for map type

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-7788: -- Component/s: C++ > Add schema conversion support for map type >

[jira] [Resolved] (ARROW-7788) Add schema conversion support for map type

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7788?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-7788. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 6379

[jira] [Commented] (ARROW-6895) [C++][Parquet] parquet::arrow::ColumnReader: ByteArrayDictionaryRecordReader repeats returned values when calling `NextBatch()`

2020-02-18 Thread Francois Saint-Jacques (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039212#comment-17039212 ] Francois Saint-Jacques commented on ARROW-6895: --- Thanks for the followup Adam, would you

[jira] [Created] (ARROW-7878) [C++] Implement LogicalPlan and LogicalPlanBuilder

2020-02-18 Thread Francois Saint-Jacques (Jira)
Francois Saint-Jacques created ARROW-7878: - Summary: [C++] Implement LogicalPlan and LogicalPlanBuilder Key: ARROW-7878 URL: https://issues.apache.org/jira/browse/ARROW-7878 Project: Apache

[jira] [Updated] (ARROW-7664) [C++] Extract localfs default from FileSystemFromUri

2020-02-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7664?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7664: -- Labels: pull-request-available (was: ) > [C++] Extract localfs default from FileSystemFromUri

[jira] [Commented] (ARROW-6895) [C++][Parquet] parquet::arrow::ColumnReader: ByteArrayDictionaryRecordReader repeats returned values when calling `NextBatch()`

2020-02-18 Thread Adam Hooper (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6895?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039184#comment-17039184 ] Adam Hooper commented on ARROW-6895: I just uploaded {{01-fix-arrow-6895.diff}}, which I _imagine_

[jira] [Updated] (ARROW-6895) [C++][Parquet] parquet::arrow::ColumnReader: ByteArrayDictionaryRecordReader repeats returned values when calling `NextBatch()`

2020-02-18 Thread Adam Hooper (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Hooper updated ARROW-6895: --- Attachment: 01-fix-arrow-6895.diff > [C++][Parquet] parquet::arrow::ColumnReader:

[jira] [Reopened] (ARROW-6895) [C++][Parquet] parquet::arrow::ColumnReader: ByteArrayDictionaryRecordReader repeats returned values when calling `NextBatch()`

2020-02-18 Thread Adam Hooper (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Adam Hooper reopened ARROW-6895: The code snippet given in the bug description still fails to read the {{bad.parquet}} file I

[jira] [Created] (ARROW-7877) [Packaging] Fix crossbow deployment to github artifacts

2020-02-18 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-7877: -- Summary: [Packaging] Fix crossbow deployment to github artifacts Key: ARROW-7877 URL: https://issues.apache.org/jira/browse/ARROW-7877 Project: Apache Arrow

[GitHub] [arrow-testing] fsaintjacques merged pull request #17: ARROW-7861: More corpus

2020-02-18 Thread GitBox
fsaintjacques merged pull request #17: ARROW-7861: More corpus URL: https://github.com/apache/arrow-testing/pull/17 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub

[GitHub] [arrow-testing] fsaintjacques opened a new pull request #17: ARROW-7861: More corpus

2020-02-18 Thread GitBox
fsaintjacques opened a new pull request #17: ARROW-7861: More corpus URL: https://github.com/apache/arrow-testing/pull/17 This is an automated message from the Apache Git Service. To respond to the message, please log on to

[jira] [Updated] (ARROW-7876) [R] Installation in the documentation generation image

2020-02-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7876?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7876: -- Labels: pull-request-available (was: ) > [R] Installation fails in the documentation

[jira] [Created] (ARROW-7876) [R] Installation fails in the documentation generation image

2020-02-18 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-7876: -- Summary: [R] Installation fails in the documentation generation image Key: ARROW-7876 URL: https://issues.apache.org/jira/browse/ARROW-7876 Project: Apache Arrow

[jira] [Commented] (ARROW-7875) Decimal place getting shifted

2020-02-18 Thread Larry Parker (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039112#comment-17039112 ] Larry Parker commented on ARROW-7875: - The JDBC driver is denodo-vdp-jdbcdriver.jar.   > Decimal

[jira] [Commented] (ARROW-7875) Decimal place getting shifted

2020-02-18 Thread Larry Parker (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7875?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039111#comment-17039111 ] Larry Parker commented on ARROW-7875: - FWIW, I just updated PyArrow to 0.16.0, and the Java Arrow

[jira] [Created] (ARROW-7875) Decimal place getting shifted

2020-02-18 Thread Larry Parker (Jira)
Larry Parker created ARROW-7875: --- Summary: Decimal place getting shifted Key: ARROW-7875 URL: https://issues.apache.org/jira/browse/ARROW-7875 Project: Apache Arrow Issue Type: Bug

[jira] [Comment Edited] (ARROW-7338) [C++] Improve InMemoryDataSource to support generator instead of static list

2020-02-18 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7338?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17014500#comment-17014500 ] Ben Kietzman edited comment on ARROW-7338 at 2/18/20 2:06 PM: -- I think it's

[jira] [Updated] (ARROW-7874) [Python][Archery] Validate docstrings with numpydoc

2020-02-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-7874: -- Labels: pull-request-available (was: ) > [Python][Archery] Validate docstrings with numpydoc

[jira] [Commented] (ARROW-7412) [C++][Dataset] Ensure that dataset code is robust to schemas with duplicate field names

2020-02-18 Thread Ben Kietzman (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17039101#comment-17039101 ] Ben Kietzman commented on ARROW-7412: - A more general solution would be provide an unambiguous way to

[jira] [Created] (ARROW-7874) [Python][Archery] Validate docstrings with numpydoc

2020-02-18 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-7874: -- Summary: [Python][Archery] Validate docstrings with numpydoc Key: ARROW-7874 URL: https://issues.apache.org/jira/browse/ARROW-7874 Project: Apache Arrow

[jira] [Resolved] (ARROW-7839) [Python][Dataset] Add IPC format to python bindings

2020-02-18 Thread Krisztian Szucs (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7839?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Krisztian Szucs resolved ARROW-7839. Resolution: Fixed Issue resolved by pull request 6409

[jira] [Created] (ARROW-7873) Segfault in pandas version 1.0.1, read_parquet after creating a clickhouse odbc connection

2020-02-18 Thread Matt Calder (Jira)
Matt Calder created ARROW-7873: -- Summary: Segfault in pandas version 1.0.1, read_parquet after creating a clickhouse odbc connection Key: ARROW-7873 URL: https://issues.apache.org/jira/browse/ARROW-7873

[jira] [Resolved] (ARROW-7462) [C++] Add CpuInfo detection for Arm64 Architecture

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7462?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-7462. --- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 6412

[jira] [Commented] (ARROW-7838) [C++] Installed plasma-store-server fails finding Boost

2020-02-18 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17038938#comment-17038938 ] Antoine Pitrou commented on ARROW-7838: --- >From the users' side, they're probably more improvements

[jira] [Updated] (ARROW-6393) [C++]Add EqualOptions support in SparseTensor::Equals

2020-02-18 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-6393?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-6393: -- Labels: pull-request-available (was: ) > [C++]Add EqualOptions support in

[jira] [Commented] (ARROW-7838) [C++] Installed plasma-store-server fails finding Boost

2020-02-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17038907#comment-17038907 ] Wes McKinney commented on ARROW-7838: - I tagged this with 0.16.1 since it probably all Boost-related

[jira] [Updated] (ARROW-7838) [C++] Installed plasma-store-server fails finding Boost

2020-02-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7838: Fix Version/s: 0.16.1 > [C++] Installed plasma-store-server fails finding Boost >

[jira] [Created] (ARROW-7872) [Python] Support conversion of list-of-struct in Array/Table.to_pandas

2020-02-18 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-7872: --- Summary: [Python] Support conversion of list-of-struct in Array/Table.to_pandas Key: ARROW-7872 URL: https://issues.apache.org/jira/browse/ARROW-7872 Project: Apache

[jira] [Resolved] (ARROW-7869) [Python] Boost::system and boost::filesystem not necessary anymore in Python wheels

2020-02-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney resolved ARROW-7869. - Fix Version/s: 0.16.1 Resolution: Fixed Issue resolved by pull request 6441

[jira] [Assigned] (ARROW-7869) [Python] Boost::system and boost::filesystem not necessary anymore in Python wheels

2020-02-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney reassigned ARROW-7869: --- Assignee: Antoine Pitrou > [Python] Boost::system and boost::filesystem not necessary

[jira] [Commented] (ARROW-7758) [Python] Wrong conversion of timestamps that are out of bounds for pandas (eg 0000-01-01)

2020-02-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17038878#comment-17038878 ] Wes McKinney commented on ARROW-7758: - In the next release. If we do a 0.16.1 release it will be

[jira] [Updated] (ARROW-7758) [Python] Wrong conversion of timestamps that are out of bounds for pandas (eg 0000-01-01)

2020-02-18 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-7758: Fix Version/s: 0.16.1 > [Python] Wrong conversion of timestamps that are out of bounds for pandas

[jira] [Commented] (ARROW-7870) [CI][Packaging] Host nightly wheels on Apache bintray

2020-02-18 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-7870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17038863#comment-17038863 ] Joris Van den Bossche commented on ARROW-7870: -- As mentioned elsewhere, I think using