[jira] [Commented] (PARQUET-2160) Close decompression stream to free off-heap memory in time

2023-01-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17656421#comment-17656421 ] ASF GitHub Bot commented on PARQUET-2160: - shangxinli commented on PR #982: URL:

[GitHub] [parquet-mr] shangxinli commented on pull request #1018: PARQUET-2219: ParquetFileReader skips empty row group

2023-01-09 Thread GitBox
shangxinli commented on PR #1018: URL: https://github.com/apache/parquet-mr/pull/1018#issuecomment-1376751333 @gszadovszky Nice to see you are back! -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[jira] [Commented] (PARQUET-2219) ParquetFileReader throws a runtime exception when a file contains only headers and no row data

2023-01-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17656422#comment-17656422 ] ASF GitHub Bot commented on PARQUET-2219: - shangxinli commented on PR #1018: URL:

[jira] [Commented] (PARQUET-2224) Publish SBOM artifacts

2023-01-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17656442#comment-17656442 ] ASF GitHub Bot commented on PARQUET-2224: - dongjoon-hyun commented on PR #1017: URL:

[GitHub] [parquet-mr] shangxinli commented on pull request #1008: PARQUET-2212: Add ByteBuffer api for decryptors to allow direct memory to be decrypted

2023-01-09 Thread GitBox
shangxinli commented on PR #1008: URL: https://github.com/apache/parquet-mr/pull/1008#issuecomment-1376755881 @wgtmac Do you have time to have a look?

[jira] [Commented] (PARQUET-2212) Add ByteBuffer api for decryptors to allow direct memory to be decrypted

2023-01-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2212?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17656428#comment-17656428 ] ASF GitHub Bot commented on PARQUET-2212: - shangxinli commented on PR #1008: URL:

[jira] [Commented] (PARQUET-2212) Add ByteBuffer api for decryptors to allow direct memory to be decrypted

2023-01-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2212?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17656427#comment-17656427 ] ASF GitHub Bot commented on PARQUET-2212: - shangxinli commented on code in PR #1008: URL:

[GitHub] [parquet-mr] shangxinli commented on pull request #1017: PARQUET-2224: Publish SBOM artifacts

2023-01-09 Thread GitBox
shangxinli commented on PR #1017: URL: https://github.com/apache/parquet-mr/pull/1017#issuecomment-1376754236 Thank you @dongjoon-hyun for working on it!

[GitHub] [parquet-mr] shangxinli commented on pull request #982: PARQUET-2160: Close ZstdInputStream to free off-heap memory in time.

2023-01-09 Thread GitBox
shangxinli commented on PR #982: URL: https://github.com/apache/parquet-mr/pull/982#issuecomment-1376750703 @alexeykudinkin We might release a new patch in the next 2 or 3 months. Can you elaborate why "this is a severe problem that does affect our ability to use Parquet w/ Zstd"?

[jira] [Commented] (PARQUET-2224) Publish SBOM artifacts

2023-01-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17656424#comment-17656424 ] ASF GitHub Bot commented on PARQUET-2224: - shangxinli commented on PR #1017: URL:

[GitHub] [parquet-mr] shangxinli merged pull request #1017: PARQUET-2224: Publish SBOM artifacts

2023-01-09 Thread GitBox
shangxinli merged PR #1017: URL: https://github.com/apache/parquet-mr/pull/1017

[jira] [Commented] (PARQUET-2224) Publish SBOM artifacts

2023-01-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17656425#comment-17656425 ] ASF GitHub Bot commented on PARQUET-2224: - shangxinli merged PR #1017: URL:

[GitHub] [parquet-mr] shangxinli commented on pull request #1014: PARQUET-2075: Implement unified file rewriter

2023-01-09 Thread GitBox
shangxinli commented on PR #1014: URL: https://github.com/apache/parquet-mr/pull/1014#issuecomment-1376754942 @gszadovszky I just want to check if you have time to have a look. @wgtmac Just be nice to take over the work that we discussed earlier to have an aggregated rewriter.

[GitHub] [parquet-mr] dongjoon-hyun commented on pull request #1017: PARQUET-2224: Publish SBOM artifacts

2023-01-09 Thread GitBox
dongjoon-hyun commented on PR #1017: URL: https://github.com/apache/parquet-mr/pull/1017#issuecomment-1376796050 Thank you all, @shangxinli , @ggershinsky , @sunchao , @wgtmac .

[GitHub] [parquet-mr] shangxinli commented on a diff in pull request #1008: PARQUET-2212: Add ByteBuffer api for decryptors to allow direct memory to be decrypted

2023-01-09 Thread GitBox
shangxinli commented on code in PR #1008: URL: https://github.com/apache/parquet-mr/pull/1008#discussion_r1065346689 ## parquet-hadoop/src/main/java/org/apache/parquet/hadoop/ColumnChunkPageReadStore.java: ## @@ -133,11 +135,36 @@ public DataPage readPage() { public

[jira] [Commented] (PARQUET-2075) Unified Rewriter Tool

2023-01-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2075?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17656426#comment-17656426 ] ASF GitHub Bot commented on PARQUET-2075: - shangxinli commented on PR #1014: URL:

[jira] [Updated] (PARQUET-2160) Close decompression stream to free off-heap memory in time

2023-01-09 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated PARQUET-2160: - Component/s: parquet-format > Close decompression stream to free off-heap memory in

[jira] [Updated] (PARQUET-2160) Close decompression stream to free off-heap memory in time

2023-01-09 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated PARQUET-2160: - Priority: Critical (was: Major) > Close decompression stream to free off-heap memory

[jira] [Updated] (PARQUET-2160) Close decompression stream to free off-heap memory in time

2023-01-09 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated PARQUET-2160: - Affects Version/s: 1.12.3 > Close decompression stream to free off-heap memory in time

[jira] [Updated] (PARQUET-2160) Close decompression stream to free off-heap memory in time

2023-01-09 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated PARQUET-2160: - Priority: Blocker (was: Critical) > Close decompression stream to free off-heap

[jira] [Updated] (PARQUET-2160) Close decompression stream to free off-heap memory in time

2023-01-09 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin updated PARQUET-2160: - Issue Type: Bug (was: Improvement) > Close decompression stream to free off-heap

[jira] [Resolved] (PARQUET-2160) Close decompression stream to free off-heap memory in time

2023-01-09 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alexey Kudinkin resolved PARQUET-2160. -- Resolution: Fixed > Close decompression stream to free off-heap memory in time >

[jira] [Commented] (PARQUET-2160) Close decompression stream to free off-heap memory in time

2023-01-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17656334#comment-17656334 ] ASF GitHub Bot commented on PARQUET-2160: - alexeykudinkin commented on PR #982: URL:

[GitHub] [parquet-mr] alexeykudinkin commented on pull request #982: PARQUET-2160: Close ZstdInputStream to free off-heap memory in time.

2023-01-09 Thread GitBox
alexeykudinkin commented on PR #982: URL: https://github.com/apache/parquet-mr/pull/982#issuecomment-1376498280 @gszadovszky @ggershinsky @shangxinli Folks, do we have an approximate timeline for the next patch release that will be including this patch? This is a severe

[jira] [Commented] (PARQUET-2160) Close decompression stream to free off-heap memory in time

2023-01-09 Thread Alexey Kudinkin (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17656336#comment-17656336 ] Alexey Kudinkin commented on PARQUET-2160: -- Corresponding Spark issue:

[jira] [Commented] (PARQUET-1980) Build and test Apache Parquet on ARM64 CPU architecture

2023-01-09 Thread Martin Tzvetanov Grigorov (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1980?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17655959#comment-17655959 ] Martin Tzvetanov Grigorov commented on PARQUET-1980: Thanks for the ping,

[GitHub] [parquet-mr] gszadovszky commented on a diff in pull request #1018: PARQUET-2219: ParquetFileReader skips empty row group

2023-01-09 Thread GitBox
gszadovszky commented on code in PR #1018: URL: https://github.com/apache/parquet-mr/pull/1018#discussion_r1064374553 ## parquet-hadoop/src/main/java/org/apache/parquet/hadoop/ParquetFileReader.java: ## @@ -1038,7 +1044,9 @@ public PageReadStore readNextFilteredRowGroup()

[jira] [Commented] (PARQUET-2219) ParquetFileReader throws a runtime exception when a file contains only headers and no row data

2023-01-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17655975#comment-17655975 ] ASF GitHub Bot commented on PARQUET-2219: - gszadovszky commented on code in PR #1018: URL:

[GitHub] [parquet-format] anjakefala commented on pull request #184: PARQUET-758: Add Float16/Half-float logical type

2023-01-09 Thread GitBox
anjakefala commented on PR #184: URL: https://github.com/apache/parquet-format/pull/184#issuecomment-1376199292 Hey @emkornfield! Is it reasonable for me to send a proposal to the mailing list for a vote? It seems @gszadovszky is not available for insight; is there anyone else that can

[jira] [Commented] (PARQUET-758) [Format] HALF precision FLOAT Logical type

2023-01-09 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17656264#comment-17656264 ] ASF GitHub Bot commented on PARQUET-758: anjakefala commented on PR #184: URL: