worker24h created ARROW-5282:
--------------------------------
Summary: Can't read data from parquet file in C++ library
Key: ARROW-5282
URL: https://issues.apache.org/jira/browse/ARROW-5282
Project: Apache Arrow
Issue Type: Bug
Reporter: worker24h
When I pass *parquet::ReaderProperties* as the second parameter to
parquet::ParquetFileReader::Open, reading data does not work.
The following code reproduces the problem:
{code:cpp}
parquet::ReaderProperties _properties;
_properties = parquet::ReaderProperties();
_properties.enable_buffered_stream(); // use a buffered stream; buffer size is not set
parquet_reader = parquet::ParquetFileReader::Open(_parquet, _properties);
...
int32_t value;
parquet::Int32Reader* int32_reader =
static_cast<parquet::Int32Reader*>(column_reader.get());
int32_reader->Skip(_current_line_of_group); // skip the rows already processed
rows_read = int32_reader->ReadBatch(1, nullptr, nullptr, &value, &values_read);
{code}
The *Skip* call throws an exception:
{color:#FF0000}{{Couldn't deserialize thrift: TProtocolException: Invalid data
Deserializing page header failed.}}{color}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)