[jira] [Commented] (PARQUET-2122) Adding Bloom filter to small Parquet file bloats in size X1700

2022-02-14 Thread Junjie Chen (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17492418#comment-17492418 ] Junjie Chen commented on PARQUET-2122: -- That's the default size of the bloom filter. Please

[jira] [Created] (PARQUET-2125) ParquetFileReader has a currentBlock information in a private field

2022-02-14 Thread Tanuja Dubey (Jira)
Tanuja Dubey created PARQUET-2125: - Summary: ParquetFileReader has a currentBlock information in a private field Key: PARQUET-2125 URL: https://issues.apache.org/jira/browse/PARQUET-2125 Project:

[jira] [Commented] (PARQUET-2121) Remove descriptions for the removed modules

2022-02-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17492347#comment-17492347 ] ASF GitHub Bot commented on PARQUET-2121: - sekikn commented on a change in pull request #947:

[GitHub] [parquet-mr] sekikn commented on a change in pull request #947: PARQUET-2121: Remove descriptions for the removed modules

2022-02-14 Thread GitBox
sekikn commented on a change in pull request #947: URL: https://github.com/apache/parquet-mr/pull/947#discussion_r806426714 ## File path: README.md ## @@ -66,10 +66,8 @@ Parquet is a very active project, and new features are being added quickly. Here * Type-specific encoding

[jira] [Updated] (PARQUET-2124) Bad DCHECK For Intermixed Dictionary Encoding

2022-02-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2124?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-2124: Labels: pull-request-available (was: ) > Bad DCHECK For Intermixed Dictionary Encoding

[jira] [Created] (PARQUET-2124) Bad DCHECK For Intermixed Dictionary Encoding

2022-02-14 Thread William Butler (Jira)
William Butler created PARQUET-2124: --- Summary: Bad DCHECK For Intermixed Dictionary Encoding Key: PARQUET-2124 URL: https://issues.apache.org/jira/browse/PARQUET-2124 Project: Parquet

[jira] [Updated] (PARQUET-2123) Invalid memory access in ScanFileContents

2022-02-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated PARQUET-2123: Labels: pull-request-available (was: ) > Invalid memory access in ScanFileContents >

[jira] [Created] (PARQUET-2123) Invalid memory access in ScanFileContents

2022-02-14 Thread William Butler (Jira)
William Butler created PARQUET-2123: --- Summary: Invalid memory access in ScanFileContents Key: PARQUET-2123 URL: https://issues.apache.org/jira/browse/PARQUET-2123 Project: Parquet Issue

[jira] [Commented] (PARQUET-2120) parquet-cli dictionary command fails on pages without dictionary encoding

2022-02-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17492126#comment-17492126 ] ASF GitHub Bot commented on PARQUET-2120: - shangxinli commented on pull request #946: URL:

[GitHub] [parquet-mr] shangxinli commented on pull request #946: PARQUET-2120: Dictionary command should handle missing dictionary pages

2022-02-14 Thread GitBox
shangxinli commented on pull request #946: URL: https://github.com/apache/parquet-mr/pull/946#issuecomment-1039360003 Thanks for working on it! Can you squash the commits? -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and

[jira] [Commented] (PARQUET-2121) Remove descriptions for the removed modules

2022-02-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17492117#comment-17492117 ] ASF GitHub Bot commented on PARQUET-2121: - shangxinli commented on pull request #947: URL:

[GitHub] [parquet-mr] shangxinli commented on pull request #947: PARQUET-2121: Remove descriptions for the removed modules

2022-02-14 Thread GitBox
shangxinli commented on pull request #947: URL: https://github.com/apache/parquet-mr/pull/947#issuecomment-1039353753 @sekikn Thanks for working on it! Just leave some minor comments. -- This is an automated message from the Apache Git Service. To respond to the message, please log on

[jira] [Commented] (PARQUET-2121) Remove descriptions for the removed modules

2022-02-14 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2121?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17492108#comment-17492108 ] ASF GitHub Bot commented on PARQUET-2121: - shangxinli commented on a change in pull request

[GitHub] [parquet-mr] shangxinli commented on a change in pull request #947: PARQUET-2121: Remove descriptions for the removed modules

2022-02-14 Thread GitBox
shangxinli commented on a change in pull request #947: URL: https://github.com/apache/parquet-mr/pull/947#discussion_r806064746 ## File path: README.md ## @@ -66,10 +66,8 @@ Parquet is a very active project, and new features are being added quickly. Here * Type-specific

[jira] [Comment Edited] (PARQUET-2122) Adding Bloom filter to small Parquet file bloats in size X1700

2022-02-14 Thread Xinli Shang (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17492099#comment-17492099 ] Xinli Shang edited comment on PARQUET-2122 at 2/14/22, 4:56 PM:

[jira] [Commented] (PARQUET-2122) Adding Bloom filter to small Parquet file bloats in size X1700

2022-02-14 Thread Xinli Shang (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-2122?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17492099#comment-17492099 ] Xinli Shang commented on PARQUET-2122: -- [~junjie]Do you know why? > Adding Bloom filter to small

[jira] [Created] (PARQUET-2122) Adding Bloom filter to small Parquet file bloats in size X1700

2022-02-14 Thread Z M (Jira)
Z M created PARQUET-2122: Summary: Adding Bloom filter to small Parquet file bloats in size X1700 Key: PARQUET-2122 URL: https://issues.apache.org/jira/browse/PARQUET-2122 Project: Parquet Issue