[jira] [Commented] (PARQUET-1927) ColumnIndex should provide number of records skipped

2020-10-26 Thread Xinli Shang (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17221036#comment-17221036 ] Xinli Shang commented on PARQUET-1927: ParquetFileReader.getFilteredRecordCount() cannot be used

[jira] [Commented] (PARQUET-1396) Example of using EncryptionPropertiesFactory and DecryptionPropertiesFactory

2020-10-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17220599#comment-17220599 ] ASF GitHub Bot commented on PARQUET-1396: ggershinsky commented on a change in pull request

[GitHub] [parquet-mr] ggershinsky commented on a change in pull request #808: PARQUET-1396: Example of using EncryptionPropertiesFactory and DecryptionPropertiesFactory

2020-10-26 Thread GitBox
ggershinsky commented on a change in pull request #808: URL: https://github.com/apache/parquet-mr/pull/808#discussion_r511840433 ## File path: parquet-hadoop/src/main/java/org/apache/parquet/hadoop/ParquetWriter.java ## @@ -279,6 +279,11 @@ public ParquetWriter(Path file,

[jira] [Commented] (PARQUET-1396) Example of using EncryptionPropertiesFactory and DecryptionPropertiesFactory

2020-10-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17220597#comment-17220597 ] ASF GitHub Bot commented on PARQUET-1396: ggershinsky commented on a change in pull request

[GitHub] [parquet-mr] ggershinsky commented on a change in pull request #808: PARQUET-1396: Example of using EncryptionPropertiesFactory and DecryptionPropertiesFactory

2020-10-26 Thread GitBox
ggershinsky commented on a change in pull request #808: URL: https://github.com/apache/parquet-mr/pull/808#discussion_r511838838 ## File path: parquet-common/src/main/java/org/apache/parquet/io/OutputFile.java ## @@ -31,4 +33,5 @@ long defaultBlockSize(); + Path

[jira] [Commented] (PARQUET-1915) Add null command

2020-10-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17220595#comment-17220595 ] ASF GitHub Bot commented on PARQUET-1915: gszadovszky commented on a change in pull request

[GitHub] [parquet-mr] gszadovszky commented on a change in pull request #819: PARQUET-1915: Add nullify column

2020-10-26 Thread GitBox
gszadovszky commented on a change in pull request #819: URL: https://github.com/apache/parquet-mr/pull/819#discussion_r511835929 ## File path: parquet-hadoop/src/main/java/org/apache/parquet/hadoop/util/ColumnMasker.java ## @@ -0,0 +1,274 @@ +/* + * Licensed to the Apache

[jira] [Commented] (PARQUET-1927) ColumnIndex should provide number of records skipped

2020-10-26 Thread Gabor Szadovszky (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17220590#comment-17220590 ] Gabor Szadovszky commented on PARQUET-1927: [~sha...@uber.com], sorry for keep bothering

[jira] [Commented] (PARQUET-1396) Example of using EncryptionPropertiesFactory and DecryptionPropertiesFactory

2020-10-26 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/PARQUET-1396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17220578#comment-17220578 ] ASF GitHub Bot commented on PARQUET-1396: gszadovszky commented on a change in pull request

[GitHub] [parquet-mr] gszadovszky commented on a change in pull request #808: PARQUET-1396: Example of using EncryptionPropertiesFactory and DecryptionPropertiesFactory

2020-10-26 Thread GitBox
gszadovszky commented on a change in pull request #808: URL: https://github.com/apache/parquet-mr/pull/808#discussion_r511805317 ## File path: parquet-common/src/main/java/org/apache/parquet/io/OutputFile.java ## @@ -31,4 +33,5 @@ long defaultBlockSize(); + Path

Re: Current status of Data Page V2?

2020-10-26 Thread Gabor Szadovszky
Hi Micah, V2 pages are not only about the new encodings but a couple of other things the community thought would be better than V1. One of these improvements was to break pages at row boundaries. This one is outdated because during the development of column-indexes we had to implement the same

[GitHub] [parquet-mr] mauliksoneji closed pull request #837: Metadata fields

2020-10-26 Thread GitBox
mauliksoneji closed pull request #837: URL: https://github.com/apache/parquet-mr/pull/837 This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to

[GitHub] [parquet-mr] mauliksoneji opened a new pull request #837: Metadata fields

2020-10-26 Thread GitBox
mauliksoneji opened a new pull request #837: URL: https://github.com/apache/parquet-mr/pull/837