[
https://issues.apache.org/jira/browse/ARROW-15637?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17489896#comment-17489896
]
Vibhatha Lakmal Abeykoon commented on ARROW-15637:
--------------------------------------------------
In this context we can also analyze further about data conversions that may be
happening within the UDFs for data structures not supported by Arrow. Most of
the data science or data engineering applications in the Python space use
Pandas or Numpy based data structures, so it won't be a serious problems, but
it is nice to keep an eye on possible situations where there are exceptions to
these cases.
> [C++][Python] UDF Optimizations
> -------------------------------
>
> Key: ARROW-15637
> URL: https://issues.apache.org/jira/browse/ARROW-15637
> Project: Apache Arrow
> Issue Type: Sub-task
> Reporter: Vibhatha Lakmal Abeykoon
> Priority: Minor
>
> Need an interface to evaluate the memory footprint, execution time and health
> of the UDFs and return a meaningful status ex:
> `Status::HighMemoryUsageException()`, `Status::TimeLimitException()`
> Note: This is also aligned with resource monitoring in the parallel execution
> space.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)