Yes, this is zero-copy and is probably the thing you are looking for. In future, it would be nice to migrate this codebase to the C-interface but for now, this should do the job.
On Mon, Jun 8, 2020, at 5:24 AM, Micah Kornfield wrote: > Uwe wrote a blog post [1] on how to do this with PY4J a while ago. I think > this ends up being zero copy but not 100% sure. > > [1] > https://uwekorn.com/2019/11/17/fast-jdbc-access-in-python-using-pyarrow-jvm.html > > On Fri, Jun 5, 2020 at 9:01 PM Chris Zheng <[email protected]> wrote: >> >> Hi there, >> >> I’m coming here from clojure, where we are hoping to use more of Arrow for >> https://github.com/clj-python/libpython-clj, a library that gives very easy >> python interop from the JVM. >> >> Here is the link to the initial question: >> https://clojurians.zulipchat.com/#narrow/stream/151924-data-science/topic/dplyr-like.20API.20to.20tech.2Eml.2Edataset/near/199914955 >> >> The use case would be to load from a database on the Java side and then >> convert it to arrow, allowing it to be consumed by a python ml library (with >> the whole zero copy thing). >> >> Once there is a Java Arrow Object created in the JVM, how would it be >> accessible by the Python Library? >> >> I was hoping to get some input or a simple example on how it might be done >> >> Chris. >> >>
