[
https://issues.apache.org/jira/browse/PARQUET-1135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Le Dem resolved PARQUET-1135.
Resolution: Fixed
merged in:
[
https://issues.apache.org/jira/browse/PARQUET-1135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Julien Le Dem updated PARQUET-1135:
---
Fix Version/s: 1.9.1
Description:
thrift 0.7.0 -> 0.9.3
protobuf 3.2 -> 3.5.1
[
https://issues.apache.org/jira/browse/PARQUET-1135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16393851#comment-16393851
]
ASF GitHub Bot commented on PARQUET-1135:
-
julienledem closed pull request #427: PARQUET-1135:
[
https://issues.apache.org/jira/browse/PARQUET-1135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16393531#comment-16393531
]
ASF GitHub Bot commented on PARQUET-1135:
-
julienledem commented on issue #427: PARQUET-1135:
hello,
It sounds like you are talking about the C++ implementation in
https://github.com/apache/parquet-cpp/blob/master/src/parquet/arrow/reader.cc,
is that right?
Which data types are you benchmarking? My understanding is that we are
not appending 1 cell at a time. Let us know.
Thanks
Wes
On
Hi, I am testing parquet->arrow performance and find it's really slow to read
parquet file into arrow table. When I check the parquet source code, it seems
parquet need to check the null value and use arrow Append method to insert the
cell one by one. Although we can use multithread to speed up
Zoltan Ivanfi created PARQUET-1242:
--
Summary: parquet.thrift refers to wrong releases for the new
compressions
Key: PARQUET-1242
URL: https://issues.apache.org/jira/browse/PARQUET-1242
Project: