[ https://issues.apache.org/jira/browse/PARQUET-686?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Andrew Duffy updated PARQUET-686:
---------------------------------
    Summary: Allow for Unsigned Statistics in Binary Type  (was: Add lexicographic ordering for FromStringBinary)

> Allow for Unsigned Statistics in Binary Type
> --------------------------------------------
>
>                 Key: PARQUET-686
>                 URL: https://issues.apache.org/jira/browse/PARQUET-686
>             Project: Parquet
>          Issue Type: Bug
>            Reporter: Andrew Duffy
>
> FromStringBinary is the Binary-type for UTF-8 strings, but uses the old
> comparison method of byte-by-byte instead of being UTF-8 aware like Avro and
> core Spark. This should be fixed so that the BinaryStatistics generated for
> String columns that contain non-ASCII text will actually be useful.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
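The ordering problem behind this issue can be illustrated with a small, self-contained sketch. It is not Parquet code: the class UnsignedBinaryOrdering and its two methods are hypothetical names chosen for illustration, and the sketch assumes the "old" comparison treats each byte as a signed Java byte, as the new summary suggests. Comparing UTF-8 bytes as unsigned values yields the same order as comparing Unicode code points, whereas a signed comparison sorts every non-ASCII lead byte (0x80 and above) before all ASCII characters.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical illustration only; not part of the Parquet API.
final class UnsignedBinaryOrdering {

    // Lexicographic comparison that treats each byte as signed (-128..127).
    static int compareSigned(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            if (a[i] != b[i]) {
                return Byte.compare(a[i], b[i]);
            }
        }
        // All shared bytes are equal: the shorter array sorts first.
        return Integer.compare(a.length, b.length);
    }

    // Lexicographic comparison that treats each byte as unsigned (0..255).
    static int compareUnsigned(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            int x = a[i] & 0xFF; // widen to int without sign extension
            int y = b[i] & 0xFF;
            if (x != y) {
                return Integer.compare(x, y);
            }
        }
        return Integer.compare(a.length, b.length);
    }

    public static void main(String[] args) {
        byte[] z = "z".getBytes(StandardCharsets.UTF_8);      // 0x7A
        byte[] e = "\u00e9".getBytes(StandardCharsets.UTF_8); // 0xC3 0xA9 (e with acute accent)

        // Signed bytes: 0xC3 is -61, so "z" compares greater than the accented letter.
        System.out.println("signed:   " + Integer.signum(compareSigned(z, e)));
        // Unsigned bytes: 0xC3 is 195, so "z" compares less, matching code point order.
        System.out.println("unsigned: " + Integer.signum(compareUnsigned(z, e)));
    }
}
```

Under the signed comparison, a column holding both ASCII and non-ASCII strings would record a min/max range that disagrees with the order a UTF-8 aware reader expects, which is why such statistics are of little use for filtering.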