[
https://issues.apache.org/jira/browse/ARROW-15765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17497011#comment-17497011
]
Weston Pace commented on ARROW-15765:
-------------------------------------
This is indeed about user-defined functions. Vibhatha has been working on an
implementation. You can see the current progress here:
https://github.com/apache/arrow/compare/master...vibhatha:test-udf-vibhatha
I suspect the need has to do with registering a function like:
{code}
def function(array1: pa.Int64Array, arrya2: pa.Int64Array) -> pa.Int64Array:
return pc.call_function("add", [array1, array2])
{code}
with our function registry (which will want to know the arity and types of each
argument). Vibhatha can probably give a more complete answer.
> [Python] Extracting Type information from Python Objects
> --------------------------------------------------------
>
> Key: ARROW-15765
> URL: https://issues.apache.org/jira/browse/ARROW-15765
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++, Python
> Reporter: Vibhatha Lakmal Abeykoon
> Assignee: Vibhatha Lakmal Abeykoon
> Priority: Major
>
> When creating user defined functions or similar exercises where we want to
> extract the Arrow data types from the type hints, the existing Python API
> have some limitations.
> An example case is as follows;
> {code:java}
> def function(array1: pa.Int64Array, arrya2: pa.Int64Array) -> pa.Int64Array:
> return pc.call_function("add", [array1, array2])
> {code}
> We want to extract the fact that array1 is an `pa.Array` of `pa.Int32Type`.
> At the moment there doesn't exist a straightforward manner to get this done.
> So the idea is to expose this feature to Python.
--
This message was sent by Atlassian Jira
(v8.20.1#820001)