[ 
https://issues.apache.org/jira/browse/PARQUET-1571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16835446#comment-16835446
 ] 

worker24h commented on PARQUET-1571:
------------------------------------

parquet::ReaderProperties _properties;
 _properties = parquet::ReaderProperties(); 
 _properties.enable_buffered_stream(); // used buffer stream. Don't set 
buffer-sizeparquet_reader = parquet::ParquetFileReader::Open(_parquet, 
_properties);

Don't invoke interface {color:#FF0000}_properties.set_buffer_size(){color} if I 
forgot ,then library throw exception. How to solve this problem??  You can't 
guarantee how the user will use it.

So I recommend providing the default, for example :size=1024 (one page size)

> [C++] Can't read data from parquet file in C++ library
> ------------------------------------------------------
>
>                 Key: PARQUET-1571
>                 URL: https://issues.apache.org/jira/browse/PARQUET-1571
>             Project: Parquet
>          Issue Type: Bug
>          Components: parquet-cpp
>            Reporter: worker24h
>            Priority: Critical
>              Labels: pull-request-available
>          Time Spent: 40m
>  Remaining Estimate: 0h
>
> Specified the second param *parquet::ReaderProperties* When I used 
> parquet::ParquetFileReader::Open, it can't work.
>  The following code:
> {code:java}
> parquet::ReaderProperties _properties;
> _properties = parquet::ReaderProperties(); 
> _properties.enable_buffered_stream();  // used  buffer stream.  Don't set 
> buffer-size
> parquet_reader = parquet::ParquetFileReader::Open(_parquet, _properties);
> ...
> int32_t value;
> parquet::Int32Reader* int32_reader =
> static_cast<parquet::Int32Reader*>(column_reader.get());
> int32_reader->Skip(_current_line_of_group);// skip lines of processed.
> rows_read = int32_reader->ReadBatch(1, nullptr, nullptr, &value, 
> &values_read);  
> {code}
> The interface *Skip* throw exception:
> {color:#FF0000}{{Couldn't deserialize thrift: TProtocolException: Invalid 
> data Deserializing page header failed.}}{color}
>  



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to