[ https://issues.apache.org/jira/browse/ARROW-15765?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17497838#comment-17497838 ]
Vibhatha Lakmal Abeykoon commented on ARROW-15765: -------------------------------------------------- I want to clarify a point, if I have not clearly mentioned the reason for the necessity of the typing information earlier in the thread. If I am not mistaken, here the main issue is not what UDF internally is doing for the data. We just need to register it in the function registry without taking the input and output types from the user explicitly. It is just a nice to have a feature which could look great in terms of presentability and usability with new Python upgrades. > [Python] Extracting Type information from Python Objects > -------------------------------------------------------- > > Key: ARROW-15765 > URL: https://issues.apache.org/jira/browse/ARROW-15765 > Project: Apache Arrow > Issue Type: Improvement > Components: C++, Python > Reporter: Vibhatha Lakmal Abeykoon > Assignee: Vibhatha Lakmal Abeykoon > Priority: Major > > When creating user defined functions or similar exercises where we want to > extract the Arrow data types from the type hints, the existing Python API > have some limitations. > An example case is as follows; > {code:java} > def function(array1: pa.Int64Array, arrya2: pa.Int64Array) -> pa.Int64Array: > return pc.call_function("add", [array1, array2]) > {code} > We want to extract the fact that array1 is an `pa.Array` of `pa.Int32Type`. > At the moment there doesn't exist a straightforward manner to get this done. > So the idea is to expose this feature to Python. -- This message was sent by Atlassian Jira (v8.20.1#820001)