[ https://issues.apache.org/jira/browse/ARROW-7272 ]
Hongze Zhang deleted comment on ARROW-7272:
-------------------------------------
was (Author: zhztheplayer):
-Hi guys, would you suggest to just use the existing
*org.apache.arrow.vector.ipc.ArrowReader*? We already have a similar approach
in orc adaptor and it works fine. As schemas in Datasets API are always
predefined I think we don't have to convert the schema everytime.-
> [C++][Java][Dataset] JNI bridge between RecordBatch and VectorSchemaRoot
> ------------------------------------------------------------------------
>
> Key: ARROW-7272
> URL: https://issues.apache.org/jira/browse/ARROW-7272
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++, Java
> Reporter: Francois Saint-Jacques
> Assignee: Hongze Zhang
> Priority: Major
> Labels: pull-request-available
> Fix For: 9.0.0
>
> Time Spent: 7h 10m
> Remaining Estimate: 0h
>
> Given a C++ std::shared_ptr<RecordBatch>, retrieve it in java as a
> VectorSchemaRoot class. Gandiva already offer a similar facility but with raw
> buffers. It would be convenient if users could call C++ that yields
> RecordBatch and retrieve it in a seamless fashion.
> This would remove one roadblock of using C++ dataset facility in Java.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)