[ 
https://issues.apache.org/jira/browse/ARROW-13166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17371456#comment-17371456
 ] 

Sebastiaan Alvarez Rodriguez commented on ARROW-13166:
------------------------------------------------------

Hey, I later on heard that the scanning API is going to change. It might be 
easier to hold off a bit, before we find ourselves in a position were we have 
to reimplement stuff again.

> [Java] Java Dataset API ScanOptions expansion
> ---------------------------------------------
>
>                 Key: ARROW-13166
>                 URL: https://issues.apache.org/jira/browse/ARROW-13166
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Java
>            Reporter: Sebastiaan Alvarez Rodriguez
>            Priority: Major
>   Original Estimate: 3h
>  Remaining Estimate: 3h
>
> Currently, there are very few scanning options which we can set in the Java 
> Dataset API [1].
> Additionally, the options that exist now always must be set from Java, 
> without the possibility to use sensible default values from core Arrow.
> For my use-case, I want to be able to set the `fragment_readahead` option 
> from the Java-side.
>  
> It would be great if:
>  + `ScanOptions.java` would be expanded to allow us to set more, potentially 
> all options related to scanner creation.
>  + Java users can omit options to use the default values, e.g. [2].
> It would be good to know what others think, and whether a PR for this is 
> useful.
> [1][https://github.com/apache/arrow/blob/master/java/dataset/src/main/java/org/apache/arrow/dataset/scanner/ScanOptions.java]
> [2][https://github.com/apache/arrow/blob/ad5dc8207192abe71d3e88303252629041968508/cpp/src/arrow/dataset/scanner.h#L51-L53]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to