[jira] [Resolved] (PARQUET-1135) upgrade thrift and protobuf dependencies

2018-03-09 Thread Julien Le Dem (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Le Dem resolved PARQUET-1135. Resolution: Fixed merged in:

[jira] [Updated] (PARQUET-1135) upgrade thrift and protobuf dependencies

2018-03-09 Thread Julien Le Dem (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1135?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Le Dem updated PARQUET-1135: --- Fix Version/s: 1.9.1 Description: thrift 0.7.0 -> 0.9.3 protobuf 3.2 -> 3.5.1

[jira] [Commented] (PARQUET-1135) upgrade thrift and protobuf dependencies

2018-03-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16393851#comment-16393851 ] ASF GitHub Bot commented on PARQUET-1135: - julienledem closed pull request #427: PARQUET-1135:

[jira] [Commented] (PARQUET-1135) upgrade thrift and protobuf dependencies

2018-03-09 Thread ASF GitHub Bot (JIRA)
[ https://issues.apache.org/jira/browse/PARQUET-1135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16393531#comment-16393531 ] ASF GitHub Bot commented on PARQUET-1135: - julienledem commented on issue #427: PARQUET-1135:

Re: parquet performance

2018-03-09 Thread Wes McKinney
hello, It sounds like you are talking about the C++ implementation in https://github.com/apache/parquet-cpp/blob/master/src/parquet/arrow/reader.cc, is that right? Which data types are you benchmarking? My understanding is that we are not appending 1 cell at a time. Let us know. Thanks Wes On

parquet performance

2018-03-09 Thread mildwolf_jh
Hi, I am testing parquet->arrow performance and find it's really slow to read parquet file into arrow table. When I check the parquet source code, it seems parquet need to check the null value and use arrow Append method to insert the cell one by one. Although we can use multithread to speed up

[jira] [Created] (PARQUET-1242) parquet.thrift refers to wrong releases for the new compressions

2018-03-09 Thread Zoltan Ivanfi (JIRA)
Zoltan Ivanfi created PARQUET-1242: -- Summary: parquet.thrift refers to wrong releases for the new compressions Key: PARQUET-1242 URL: https://issues.apache.org/jira/browse/PARQUET-1242 Project: