[
https://issues.apache.org/jira/browse/ARROW-13166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17371489#comment-17371489
]
Weston Pace commented on ARROW-13166:
-------------------------------------
Can you expand a bit more on what changes you think are coming up? I think if
you use the ScannerBuilder to set the options then I wouldn't expect any
changes to the API coming up soon. There is
https://issues.apache.org/jira/browse/ARROW-12311 but it is more about
simplifying the user API (so there aren't two ways, ScanOptions and
ScannerBuilder, to do the same thing) than it is about removing or changing
functionality.
> [Java] Java Dataset API ScanOptions expansion
> ---------------------------------------------
>
> Key: ARROW-13166
> URL: https://issues.apache.org/jira/browse/ARROW-13166
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Java
> Reporter: Sebastiaan Alvarez Rodriguez
> Priority: Major
> Original Estimate: 3h
> Remaining Estimate: 3h
>
> Currently, there are very few scanning options which we can set in the Java
> Dataset API [1].
> Additionally, the options that exist now always must be set from Java,
> without the possibility to use sensible default values from core Arrow.
> For my use-case, I want to be able to set the `fragment_readahead` option
> from the Java-side.
>
> It would be great if:
> + `ScanOptions.java` would be expanded to allow us to set more, potentially
> all options related to scanner creation.
> + Java users can omit options to use the default values, e.g. [2].
> It would be good to know what others think, and whether a PR for this is
> useful.
> [1][https://github.com/apache/arrow/blob/master/java/dataset/src/main/java/org/apache/arrow/dataset/scanner/ScanOptions.java]
> [2][https://github.com/apache/arrow/blob/ad5dc8207192abe71d3e88303252629041968508/cpp/src/arrow/dataset/scanner.h#L51-L53]
--
This message was sent by Atlassian Jira
(v8.3.4#803005)