Jorge created ARROW-9836:
----------------------------

             Summary: [Rust] [DataFusion] Improve API for usage of UDFs
                 Key: ARROW-9836
                 URL: https://issues.apache.org/jira/browse/ARROW-9836
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Rust, Rust - DataFusion
            Reporter: Jorge


TL;DR; currently, users call UDFs through
 
{color:#000000}df.select(scalar_functions(“sqrt”, vec![col(“a”)], 
DataType::Float64)){color}
 
Proposal:
 
{color:#000000}let udf = df.registry()?;{color}

{color:#000000}df.select(udf(“sqrt”, vec![col(“a”)])?){color}
 
so that they do not have to remember the UDFs return type when using it.
 
This API will in the future allow to declare the UDF as part of the planning, 
like spark, instead of having to register it in the registry before using it 
(we just need to check if the UDF is registered or not before doing so).
See complete proposal here: 
[https://docs.google.com/document/d/1Kzz642ScizeKXmVE1bBlbLvR663BKQaGqVIyy9cAscY/edit?usp=sharing]

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to