[
https://issues.apache.org/jira/browse/PARQUET-843?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15838241#comment-15838241
]
Wes McKinney commented on PARQUET-843:
--------------------------------------
I was able to get an error trace:
{code}
I0125 12:54:51.033644 7786 status.cc:50] File
hdfs://localhost:20500/tmp/parquet-test-1/example.parquet corrupt. RLE level
data bytes = -2011166459
@ 0x7f82276ad2a2 impala::Status::Status()
@ 0x7f82273d933a impala::HdfsParquetScanner::LevelDecoder::Init()
@ 0x7f82273de478
impala::HdfsParquetScanner::BaseScalarColumnReader::ReadDataPage()
@ 0x7f82273de9e8
impala::HdfsParquetScanner::BaseScalarColumnReader::NextPage()
@ 0x7f82273ea6fd
impala::HdfsParquetScanner::BaseScalarColumnReader::NextLevels<>()
@ 0x7f82273e2781 impala::HdfsParquetScanner::ProcessSplit()
@ 0x7f82273a5866 impala::HdfsScanNode::ProcessSplit()
@ 0x7f82273a630b impala::HdfsScanNode::ScannerThread()
@ 0x7f82250e4b87 impala::Thread::SuperviseThread()
@ 0x7f82250e5564 boost::detail::thread_data<>::run()
@ 0x6133fa thread_proxy
@ 0x7f8224de2184 start_thread
@ 0x7f822215237d (unknown)
I0125 12:54:51.038568 7786 status.cc:50] Could not read definition level, even
though metadata states there are 29 values remaining in data page.
file=hdfs://localhost:20500/tmp/parquet-test-1/example.parquet
@ 0x7f82276ad2a2 impala::Status::Status()
@ 0x7f82273e8e19
impala::HdfsParquetScanner::BaseScalarColumnReader::SetLevelError()
@ 0x7f82273ea6b5
impala::HdfsParquetScanner::BaseScalarColumnReader::NextLevels<>()
@ 0x7f82273e2781 impala::HdfsParquetScanner::ProcessSplit()
@ 0x7f82273a5866 impala::HdfsScanNode::ProcessSplit()
@ 0x7f82273a630b impala::HdfsScanNode::ScannerThread()
@ 0x7f82250e4b87 impala::Thread::SuperviseThread()
@ 0x7f82250e5564 boost::detail::thread_data<>::run()
@ 0x6133fa thread_proxy
@ 0x7f8224de2184 start_thread
@ 0x7f822215237d (unknown)
I0125 12:54:51.038578 7786 runtime-state.
{code}
> [C++] Impala unable to read files created by parquet-cpp
> --------------------------------------------------------
>
> Key: PARQUET-843
> URL: https://issues.apache.org/jira/browse/PARQUET-843
> Project: Parquet
> Issue Type: Bug
> Components: parquet-cpp
> Reporter: Wes McKinney
> Priority: Blocker
> Attachments: example.parquet
>
>
> See attached example file. parquet-tools is able to read this. I have only
> tested on Impala 2.5.0, with some effort I could check on newer Impala, but
> it would be good to figure out what is the issue with older versions
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)