[
https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yujiang Zhong updated PARQUET-2160:
---
Description:
The decompressed stream in HeapBytesDecompressor$decompress now relies on the
[
https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yujiang Zhong updated PARQUET-2160:
---
Description:
The decompressed stream in HeapBytesDecompressor$decompress now relies on the
[
https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555142#comment-17555142
]
Timothy Miller commented on PARQUET-2159:
-
If this is already being generated at runtime, then
[
https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555116#comment-17555116
]
Fang-Xie edited comment on PARQUET-2159 at 6/16/22 3:14 PM:
We implemented
huaxingao commented on PR #975:
URL: https://github.com/apache/parquet-mr/pull/975#issuecomment-1157762209
> it should be good enough to also check the lower limit, eg exist >
totalCount * (testFpp[i] * 0.9) , or exist > totalCount * (testFpp[i] * 0.5) ,
or even exist > 0. What do you
[
https://issues.apache.org/jira/browse/PARQUET-2157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555118#comment-17555118
]
ASF GitHub Bot commented on PARQUET-2157:
-
huaxingao commented on PR #975:
URL:
[
https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555116#comment-17555116
]
Fang-Xie commented on PARQUET-2159:
---
We implemented Parquet bit packing en/decode using JDK Vector
[
https://issues.apache.org/jira/browse/PARQUET-2157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555115#comment-17555115
]
ASF GitHub Bot commented on PARQUET-2157:
-
huaxingao commented on code in PR #975:
URL:
huaxingao commented on code in PR #975:
URL: https://github.com/apache/parquet-mr/pull/975#discussion_r899177750
##
parquet-hadoop/src/test/java/org/apache/parquet/hadoop/TestParquetWriter.java:
##
@@ -282,6 +286,63 @@ public void testParquetFileWithBloomFilter() throws
[
https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555099#comment-17555099
]
Timothy Miller commented on PARQUET-2159:
-
I frequently wish Java had a preprocessor like C++
[
https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Yujiang Zhong updated PARQUET-2160:
---
Description:
The decompressed stream in HeapBytesDecompressor$decompress now relies on the
[
https://issues.apache.org/jira/browse/PARQUET-2160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555082#comment-17555082
]
Yujiang Zhong commented on PARQUET-2160:
[~shangxinli] [~dongjoon] Can you please take a look
Yujiang Zhong created PARQUET-2160:
--
Summary: Close decompression stream to free off-heap memory in time
Key: PARQUET-2160
URL: https://issues.apache.org/jira/browse/PARQUET-2160
Project: Parquet
[
https://issues.apache.org/jira/browse/PARQUET-2159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555081#comment-17555081
]
Fang-Xie commented on PARQUET-2159:
---
Thanks [~theosib-amazon], these improvements depend on Vector
[
https://issues.apache.org/jira/browse/PARQUET-2051?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andreas Hailu updated PARQUET-2051:
---
Fix Version/s: 1.12.3
> AvroWriteSupport does not pass Configuration to
[
https://issues.apache.org/jira/browse/PARQUET-2157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555041#comment-17555041
]
ASF GitHub Bot commented on PARQUET-2157:
-
ggershinsky commented on PR #975:
URL:
ggershinsky commented on PR #975:
URL: https://github.com/apache/parquet-mr/pull/975#issuecomment-1157577513
> The test takes about 2300 milli seconds on my laptop.
Ok, this is reasonable. If this time is sufficient for reliably testing the
upper limit of FPPs, it should be good
[
https://issues.apache.org/jira/browse/PARQUET-2157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17554932#comment-17554932
]
ASF GitHub Bot commented on PARQUET-2157:
-
chenjunjiedada commented on code in PR #975:
URL:
chenjunjiedada commented on code in PR #975:
URL: https://github.com/apache/parquet-mr/pull/975#discussion_r898756998
##
parquet-hadoop/src/test/java/org/apache/parquet/hadoop/TestParquetWriter.java:
##
@@ -282,6 +286,63 @@ public void testParquetFileWithBloomFilter() throws
19 matches
Mail list logo