[jira] [Commented] (ARROW-2369) Large (>~20 GB) files written to Parquet via PyArrow are corrupted

2018-04-12 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16435450#comment-16435450 ] ASF GitHub Bot commented on ARROW-2369: --- pitrou commented on a change in pull reques

[jira] [Commented] (ARROW-2369) Large (>~20 GB) files written to Parquet via PyArrow are corrupted

2018-04-12 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16435447#comment-16435447 ] ASF GitHub Bot commented on ARROW-2369: --- xhochy closed pull request #1866: ARROW-236

[jira] [Commented] (ARROW-2369) Large (>~20 GB) files written to Parquet via PyArrow are corrupted

2018-04-12 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16435443#comment-16435443 ] ASF GitHub Bot commented on ARROW-2369: --- pitrou commented on a change in pull reques

[jira] [Commented] (ARROW-2369) Large (>~20 GB) files written to Parquet via PyArrow are corrupted

2018-04-12 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16435440#comment-16435440 ] ASF GitHub Bot commented on ARROW-2369: --- xhochy commented on a change in pull reques

[jira] [Commented] (ARROW-2369) Large (>~20 GB) files written to Parquet via PyArrow are corrupted

2018-04-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16430650#comment-16430650 ] ASF GitHub Bot commented on ARROW-2369: --- pitrou opened a new pull request #1866: ARR

[jira] [Commented] (ARROW-2369) Large (>~20 GB) files written to Parquet via PyArrow are corrupted

2018-04-09 Thread Antoine Pitrou (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16430572#comment-16430572 ] Antoine Pitrou commented on ARROW-2369: --- Ok, there are two things going on: * when {

[jira] [Commented] (ARROW-2369) Large (>~20 GB) files written to Parquet via PyArrow are corrupted

2018-04-09 Thread Justin Tan (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16430571#comment-16430571 ] Justin Tan commented on ARROW-2369: --- Looks like the file is readable by early pyarrow ve

[jira] [Commented] (ARROW-2369) Large (>~20 GB) files written to Parquet via PyArrow are corrupted

2018-03-31 Thread Wes McKinney (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16421510#comment-16421510 ] Wes McKinney commented on ARROW-2369: - Sounds like there's a {{uint32_t}} overflow som

[jira] [Commented] (ARROW-2369) Large (>~20 GB) files written to Parquet via PyArrow are corrupted

2018-03-31 Thread Babak Alipour (JIRA)
[ https://issues.apache.org/jira/browse/ARROW-2369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16421506#comment-16421506 ] Babak Alipour commented on ARROW-2369: -- I've got the same issue on Win 10, Arrow v0.9