[ https://issues.apache.org/jira/browse/IMPALA-7304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Zoltán Borók-Nagy updated IMPALA-7304: -------------------------------------- Description: Impala master branch can already write the [Parquet page index|https://github.com/apache/parquet-format/blob/master/PageIndex.md]. However, we still don't have well-defined ordering for floating-point numbers in Parquet: [PARQUET-1222|https://issues-test.apache.org/jira/browse/PARQUET-1222] Currently impala writes the page index with fmax()/fmin() semantics, but it might contradicts the future specification that will be defined once PARQUET-1222 is resolved. We should not write column indexes for floating-point columns until PARQUET-1222 is resolved. was: Impala master branch can already writes the [Parquet page index|https://github.com/apache/parquet-format/blob/master/PageIndex.md]. However, we still don't have well-defined ordering for floating-point numbers in Parquet: [PARQUET-1222|https://issues-test.apache.org/jira/browse/PARQUET-1222] Currently impala writes the page index with fmax()/fmin() semantics, but it might contradicts the future specification that will be defined once PARQUET-1222 is resolved. We should not write column indexes for floating-point columns until PARQUET-1222 is resolved. > Impala shouldn't write column indexes for float columns until PARQUET-1222 is > resolved > -------------------------------------------------------------------------------------- > > Key: IMPALA-7304 > URL: https://issues.apache.org/jira/browse/IMPALA-7304 > Project: IMPALA > Issue Type: Bug > Affects Versions: Impala 2.13.0, Impala 3.1.0 > Reporter: Zoltán Borók-Nagy > Assignee: Zoltán Borók-Nagy > Priority: Blocker > > Impala master branch can already write the [Parquet page > index|https://github.com/apache/parquet-format/blob/master/PageIndex.md]. > > However, we still don't have well-defined ordering for floating-point > numbers in Parquet: > [PARQUET-1222|https://issues-test.apache.org/jira/browse/PARQUET-1222] > > Currently impala writes the page index with fmax()/fmin() semantics, but it > might contradicts the future specification that will be defined once > PARQUET-1222 is resolved. > > We should not write column indexes for floating-point columns until > PARQUET-1222 is resolved. -- This message was sent by Atlassian JIRA (v7.6.3#76005) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org For additional commands, e-mail: issues-all-h...@impala.apache.org