Parquet Sync meeting - July 26 2022

2022-07-26 Thread Xinli shang
Attendees ( Gidon Gershinsky, Xinli Shang, Tim Miller) 1. Release 1.12.3 1. Post release - no issue reported. 2. Parquet Cell-level encryption a. What if the user only partially has the keys but not all the hidden columns? Should we throw an

[jira] [Commented] (PARQUET-2126) Thread safety bug in CodecFactory

2022-07-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17571518#comment-17571518 ] ASF GitHub Bot commented on PARQUET-2126: - theosib-amazon commented on PR #959: URL:

[GitHub] [parquet-mr] theosib-amazon commented on pull request #959: PARQUET-2126: Make cached (de)compressors thread-safe

2022-07-26 Thread GitBox
theosib-amazon commented on PR #959: URL: https://github.com/apache/parquet-mr/pull/959#issuecomment-1195676003 I just thought of something that makes me nervous about this PR that requires further investigation. Consider the following scenario: - Thread A allocates a codec - Thread A

[GitHub] [parquet-mr] theosib-amazon commented on pull request #960: Performance optimization: Move all LittleEndianDataInputStream functionality into ByteBufferInputStream

2022-07-26 Thread GitBox
theosib-amazon commented on PR #960: URL: https://github.com/apache/parquet-mr/pull/960#issuecomment-1195568967 > Is this mostly a refactoring PR? I also don't see `LittleEndianDataInputStream` being marked as deprecated. I initially marked `LittleEndianDataInputStream` as

[jira] [Commented] (PARQUET-2161) Row positions are computed incorrectly when range or offset metadata filter is used

2022-07-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17571233#comment-17571233 ] ASF GitHub Bot commented on PARQUET-2161: - ggershinsky commented on PR #978: URL:

[GitHub] [parquet-mr] ggershinsky commented on pull request #978: PARQUET-2161: Fix row index generation in combination with range filtering

2022-07-26 Thread GitBox
ggershinsky commented on PR #978: URL: https://github.com/apache/parquet-mr/pull/978#issuecomment-1195083014 cc @shangxinli -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.