[ 
https://issues.apache.org/jira/browse/PARQUET-2109?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated PARQUET-2109:
------------------------------------
    Labels: pull-request-available  (was: )

> Parquet Cpp Reader Can Loop Forever If Page Values Overstated
> -------------------------------------------------------------
>
>                 Key: PARQUET-2109
>                 URL: https://issues.apache.org/jira/browse/PARQUET-2109
>             Project: Parquet
>          Issue Type: Bug
>          Components: parquet-cpp
>            Reporter: William Butler
>            Assignee: William Butler
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> If the page header states that there are more values than are actually 
> present in the page, the Parquet CPP can loop forever.  This is because 
> HasNext() will return true but the actual ReadBatch() will have nothing to 
> read and will not change reader state, causing an infinite loop. We first 
> noticed the bug via ScanFileContents(), but this impacts any code that does 
> not check to see if ReadBatch() consumed anything.



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to