AvroParquetWriter write to s3

2022-05-13 Thread Regin Quinoa
Hi, we are trying to use org.apache.parquet.avro .AvroParquetWriter to write parquet file to s3 bucket. The file is successfully written to s3 bucket but get an exception com.amazonaws.SdkClientException: Unable to verify

[jira] [Commented] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-05-13 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17536902#comment-17536902 ] Timothy Miller commented on PARQUET-2069: - Yup. If I force prepareForRead() to ignore the avro

[jira] [Commented] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-05-13 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17536894#comment-17536894 ] Timothy Miller commented on PARQUET-2069: - So, where does the avro schema come from in the

[jira] [Comment Edited] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-05-13 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17536888#comment-17536888 ] Timothy Miller edited comment on PARQUET-2069 at 5/13/22 9:02 PM: -- I

[jira] [Commented] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-05-13 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17536888#comment-17536888 ] Timothy Miller commented on PARQUET-2069: - I managed to probe this just a bit. No idea why this

[jira] [Created] (PARQUET-2147) Can't run ParquetMR test in IDEs

2022-05-13 Thread Timothy Miller (Jira)
Timothy Miller created PARQUET-2147: --- Summary: Can't run ParquetMR test in IDEs Key: PARQUET-2147 URL: https://issues.apache.org/jira/browse/PARQUET-2147 Project: Parquet Issue Type: Bug

[jira] [Updated] (PARQUET-2147) Can't run ParquetMR tests in IDEs

2022-05-13 Thread Timothy Miller (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Timothy Miller updated PARQUET-2147: Summary: Can't run ParquetMR tests in IDEs (was: Can't run ParquetMR test in IDEs) >

[jira] [Created] (PARQUET-2146) AvroParquetWriter write to s3 bucket throws data intergrity exception

2022-05-13 Thread sean (Jira)
sean created PARQUET-2146: - Summary: AvroParquetWriter write to s3 bucket throws data intergrity exception Key: PARQUET-2146 URL: https://issues.apache.org/jira/browse/PARQUET-2146 Project: Parquet

[jira] [Commented] (PARQUET-2069) Parquet file containing arrays, written by Parquet-MR, cannot be read again by Parquet-MR

2022-05-13 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2069?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17536799#comment-17536799 ] ASF GitHub Bot commented on PARQUET-2069: - theosib-amazon commented on PR #957: URL:

[GitHub] [parquet-mr] theosib-amazon commented on pull request #957: PARQUET-2069: Allow list and array record types to be compatible.

2022-05-13 Thread GitBox
theosib-amazon commented on PR #957: URL: https://github.com/apache/parquet-mr/pull/957#issuecomment-1126305627 I won't be able to add a test any time soon. Here's why. First take note of the two parquet files attached to https://issues.apache.org/jira/browse/PARQUET-2069.