Viktor Szathmary created PARQUET-98:
---------------------------------------
Summary: filter2 API performance regression
Key: PARQUET-98
URL: https://issues.apache.org/jira/browse/PARQUET-98
Project: Parquet
Issue Type: Bug
Reporter: Viktor Szathmary
The new filter API seems to be much slower (or perhaps I'm using it wrong :)
Code using an UnboundRecordFilter:
{code:java}
ColumnRecordFilter.column(column,
ColumnPredicates.applyFunctionToBinary(
input -> Binary.fromString(value).equals(input)));
{code}
vs. code using FilterPredicate:
{code:java}
eq(binaryColumn(column), Binary.fromString(value));
{code}
The latter performs twice as slow on the same Parquet file (built using
1.6.0rc2).
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)