[ 
https://issues.apache.org/jira/browse/ARROW-15765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17497838#comment-17497838
 ] 

Vibhatha Lakmal Abeykoon commented on ARROW-15765:
--------------------------------------------------

I want to clarify a point, in case I did not clearly explain earlier in the 
thread why the typing information is needed. If I am not mistaken, the main 
issue here is not what the UDF does internally with the data. We just need to 
register it in the function registry without asking the user to pass the input 
and output types explicitly. This is a nice-to-have feature that would improve 
presentability and usability, particularly with newer Python versions. 
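
As a rough sketch of what I mean (not an existing Arrow API), the types could 
be read from the type hints at registration time. Everything named here is 
hypothetical: `infer_signature`, `ARRAY_TO_DTYPE`, and the stand-in 
`Int64Array` class, which only takes the place of `pa.Int64Array` so that the 
snippet runs without pyarrow installed.

```python
import inspect
from typing import get_type_hints


class Int64Array:
    """Hypothetical stand-in for pa.Int64Array (keeps the sketch stdlib-only)."""


# Hypothetical lookup from an annotated array class to its data type.
# With pyarrow this could be {pa.Int64Array: pa.int64()}.
ARRAY_TO_DTYPE = {Int64Array: "int64"}


def infer_signature(func, array_to_dtype):
    """Return ([(param_name, dtype), ...], return_dtype) read from type hints.

    Raises TypeError if a hint is missing and KeyError if a hinted class
    is not present in array_to_dtype.
    """
    hints = get_type_hints(func)
    in_types = []
    for name in inspect.signature(func).parameters:
        if name not in hints:
            raise TypeError(f"parameter {name!r} has no type hint")
        in_types.append((name, array_to_dtype[hints[name]]))
    if "return" not in hints:
        raise TypeError("missing return type hint")
    return in_types, array_to_dtype[hints["return"]]


def add(array1: Int64Array, array2: Int64Array) -> Int64Array:
    ...  # the body is irrelevant here; only the signature is inspected


in_types, out_type = infer_signature(add, ARRAY_TO_DTYPE)
```

The registration call could then use `in_types` and `out_type` instead of 
asking the user for them; how that lookup table is populated for every Arrow 
array class is exactly the part this issue is about.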

> [Python] Extracting Type information from Python Objects
> --------------------------------------------------------
>
>                 Key: ARROW-15765
>                 URL: https://issues.apache.org/jira/browse/ARROW-15765
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++, Python
>            Reporter: Vibhatha Lakmal Abeykoon
>            Assignee: Vibhatha Lakmal Abeykoon
>            Priority: Major
>
> When creating user-defined functions, or in similar cases where we want to 
> extract the Arrow data types from the type hints, the existing Python API 
> has some limitations. 
> An example case is as follows:
> {code:python}
> import pyarrow as pa
> import pyarrow.compute as pc
>
> def function(array1: pa.Int64Array, array2: pa.Int64Array) -> pa.Int64Array:
>     return pc.call_function("add", [array1, array2])
> {code}
> We want to extract the fact that array1 is a `pa.Array` of `pa.Int64Type`. 
> At the moment there is no straightforward way to do this. 
> So the idea is to expose this feature to Python. 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
