[ 
https://issues.apache.org/jira/browse/ARROW-14319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17429651#comment-17429651
 ] 

Roee Shlomo commented on ARROW-14319:
-------------------------------------

I suspect that a better approach would be to create a new module and keep
pyarrow.jvm as is, for two reasons:
 # Backward compatibility seems like a challenge. The Python side needs a
reference to the org.apache.arrow.c package so that ArrowSchema, ArrowArray
and the various import/export functions are available. In addition, all C
data interface methods take an allocator as a parameter. The current
pyarrow.jvm API provides neither.
 # The current pyarrow.jvm module works with a pure Java build of Arrow Java,
while the C data interface requires building a small JNI library. Unless end
users are expected to build the Java jar themselves, the JNI library will
have to be packaged for every platform that pyarrow targets.
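
To make the first point concrete, here is a rough sketch of what a conversion
function in a new module might look like. It assumes JPype as the bridge, a JVM
that is already running with the arrow-c-data jar and its JNI library on the
classpath, and the private pyarrow.Array._import_from_c hook. The function name
java_vector_to_pyarrow is hypothetical. Note the extra allocator argument: as
far as I can tell, the existing pyarrow.jvm.array() takes only the Java vector.

```python
def java_vector_to_pyarrow(vector, allocator):
    """Convert an Arrow Java FieldVector to a pyarrow.Array (illustrative only).

    Hypothetical helper, not part of pyarrow. `vector` and `allocator` are
    assumed to be JPype proxies for a FieldVector and a BufferAllocator.
    """
    # Imported lazily so the sketch can be loaded without a JVM or pyarrow.
    import jpype
    import pyarrow as pa
    from pyarrow.cffi import ffi

    # The caller never passes these classes in, so the module itself needs a
    # reference to the org.apache.arrow.c package.
    c_data = jpype.JClass("org.apache.arrow.c.Data")
    c_array_cls = jpype.JClass("org.apache.arrow.c.ArrowArray")
    c_schema_cls = jpype.JClass("org.apache.arrow.c.ArrowSchema")

    # Allocate the C structs on the Python side and take their addresses.
    c_schema = ffi.new("struct ArrowSchema*")
    c_array = ffi.new("struct ArrowArray*")
    schema_ptr = int(ffi.cast("uintptr_t", c_schema))
    array_ptr = int(ffi.cast("uintptr_t", c_array))

    # Java exports into the structs; the allocator is a required argument.
    c_data.exportVector(
        allocator,
        vector,
        None,  # no DictionaryProvider in this sketch
        c_array_cls.wrap(array_ptr),
        c_schema_cls.wrap(schema_ptr),
    )

    # Python imports the exported data and takes ownership of it.
    return pa.Array._import_from_c(array_ptr, schema_ptr)
```

A caller would have to create or obtain a BufferAllocator on the Java side
first, which is exactly the piece that existing pyarrow.jvm users do not pass
today.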

> [Python] Rewrite pyarrow.jvm using the C data interface
> -------------------------------------------------------
>
>                 Key: ARROW-14319
>                 URL: https://issues.apache.org/jira/browse/ARROW-14319
>             Project: Apache Arrow
>          Issue Type: Task
>          Components: Java, Python
>            Reporter: Antoine Pitrou
>            Priority: Major
>             Fix For: 7.0.0
>
>
> The {{pyarrow.jvm}} module is currently a custom-written bridge between PyArrow and 
> Arrow Java, with limited datatype support. Now that Java implements the C 
> data interface (see ARROW-12965), we should be able to simplify the code 
> while making it more general.
> Also, we should reenable the {{conda-python-jpype}} build somewhere, for 
> example in the Crossbow nightly builds.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)