Yicong Huang created SPARK-54598:
------------------------------------

             Summary: Refactor UDF fetching logic out from invocation
                 Key: SPARK-54598
                 URL: https://issues.apache.org/jira/browse/SPARK-54598
             Project: Spark
          Issue Type: Task
          Components: PySpark
    Affects Versions: 4.2.0
            Reporter: Yicong Huang


Currently we always fetch UDFs (function and its arguments) 



Single UDF:

```
arg_offsets, f = read_single_udf(
pickleSer, infile, eval_type, runner_conf, udf_index=0, profiler=profiler
)
parsed_offsets=extract_key_value_indexes(arg_offsets)
```
 
Multi UDFs:
 
```
        udfs = []
        for i in range(num_udfs):
            udfs.append(
                read_single_udf(
                    pickleSer, infile, eval_type, runner_conf, udf_index=i, 
profiler=profiler
                )
            )
```
 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to