[jira] [Commented] (ARROW-15637) [C++][Python] UDF Optimizations

Vibhatha Lakmal Abeykoon (Jira) Wed, 09 Feb 2022 17:14:04 -0800


    [ 
https://issues.apache.org/jira/browse/ARROW-15637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17489896#comment-17489896
 ]


Vibhatha Lakmal Abeykoon commented on ARROW-15637:
--------------------------------------------------

In this context we can also analyze further about data conversions that may be 
happening within the UDFs for data structures not supported by Arrow. Most of 
the data science or data engineering applications in the Python space use 
Pandas or Numpy based data structures, so it won't be a serious problems, but 
it is nice to keep an eye on possible situations where there are exceptions to 
these cases. 

> [C++][Python] UDF Optimizations
> -------------------------------
>
>                 Key: ARROW-15637
>                 URL: https://issues.apache.org/jira/browse/ARROW-15637
>             Project: Apache Arrow
>          Issue Type: Sub-task
>            Reporter: Vibhatha Lakmal Abeykoon
>            Priority: Minor
>
> Need an interface to evaluate the memory footprint, execution time and health 
> of the UDFs and return a meaningful status ex: 
> `Status::HighMemoryUsageException()`, `Status::TimeLimitException()`
> Note: This is also aligned with resource monitoring in the parallel execution 
> space. 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

[jira] [Commented] (ARROW-15637) [C++][Python] UDF Optimizations

Reply via email to