Alexey Kudinkin created HUDI-3594:
-------------------------------------

             Summary: Support standard Spark functions in Filter Exprs in Data 
Skipping
                 Key: HUDI-3594
                 URL: https://issues.apache.org/jira/browse/HUDI-3594
             Project: Apache Hudi
          Issue Type: Bug
            Reporter: Alexey Kudinkin
            Assignee: Alexey Kudinkin


As part of this effort we're planning to (at the very least) support a suite of 
standard Spark functions when evaluating Data Filtering expressions w/in Data 
Skipping flow, for ex: when user is issuing a following query 

 
{code:java}
SELECT ... WHERE date_format(ts, 'dd-mm-yyyy') > '01-01-2022'
{code}
We're able to relate such query to our Column Stats Index appropriately, 
therefore being able to do Data Skipping not only on the "raw" columns, but 
also upon simple derivative expressions on top of them (like standard function 
calls){*}{*}



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to