[jira] [Commented] (PARQUET-1115) Warn users when misusing parquet-tools merge

2022-08-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17584533#comment-17584533 ] ASF GitHub Bot commented on PARQUET-1115: - NickCrews commented on PR #433: URL:

[GitHub] [parquet-mr] NickCrews commented on pull request #433: PARQUET-1115: Warn users when misusing parquet-tools merge

2022-08-24 Thread GitBox
NickCrews commented on PR #433: URL: https://github.com/apache/parquet-mr/pull/433#issuecomment-1226667307 It might be nice if we actually suggested an alternative instead of just saying "don't do this." You can see my solution at

[jira] [Commented] (PARQUET-2175) Skip method skips levels and not rows for repeated fields

2022-08-24 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17584388#comment-17584388 ] Micah Kornfield commented on PARQUET-2175: -- I think the current signature is

Re: Interest in adding the float16 logical type to the Parquet spec

2022-08-24 Thread Xinli shang
Hi Anja, Thanks for your interest! We encourage people's new proposals. Go head to make a proposal and the community can review it. Xinli On Tue, Aug 23, 2022 at 4:53 PM Anja wrote: > Hello! > > Is there interest in having the float16 logical type standardised in the > Parquet spec? I am

[jira] [Commented] (PARQUET-2160) Close decompression stream to free off-heap memory in time

2022-08-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17584271#comment-17584271 ] ASF GitHub Bot commented on PARQUET-2160: - zhongyujiang commented on code in PR #982: URL:

[GitHub] [parquet-mr] zhongyujiang commented on a diff in pull request #982: PARQUET-2160: Close ZstdInputStream to free off-heap memory in time.

2022-08-24 Thread GitBox
zhongyujiang commented on code in PR #982: URL: https://github.com/apache/parquet-mr/pull/982#discussion_r953866331 ## parquet-hadoop/src/main/java/org/apache/parquet/hadoop/CodecFactory.java: ## @@ -109,7 +110,17 @@ public BytesInput decompress(BytesInput bytes, int

[jira] [Updated] (PARQUET-2176) Parquet writers should allow for configurable index/statistics truncation

2022-08-24 Thread patchwork01 (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] patchwork01 updated PARQUET-2176: - Description: ParquetWriter does not expose any way to set the properties for column index or

[jira] [Commented] (PARQUET-2176) Parquet writers should allow for configurable index/statistics truncation

2022-08-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17584199#comment-17584199 ] ASF GitHub Bot commented on PARQUET-2176: - patchwork01 opened a new pull request, #989: URL:

[GitHub] [parquet-mr] patchwork01 opened a new pull request, #989: PARQUET-2176: Column index/statistics truncation in ParquetWriter

2022-08-24 Thread GitBox
patchwork01 opened a new pull request, #989: URL: https://github.com/apache/parquet-mr/pull/989 Make sure you have checked _all_ steps below. ### Jira - [x] My PR addresses the following [Parquet Jira](https://issues.apache.org/jira/browse/PARQUET/) issues and references them

[jira] [Created] (PARQUET-2176) Parquet writers should allow for configurable index/statistics truncation

2022-08-24 Thread patchwork01 (Jira)
patchwork01 created PARQUET-2176: Summary: Parquet writers should allow for configurable index/statistics truncation Key: PARQUET-2176 URL: https://issues.apache.org/jira/browse/PARQUET-2176 Project: