Sebastiaan Alvarez Rodriguez created ARROW-13166:
----------------------------------------------------

             Summary: Java Dataset API ScanOptions expansion
                 Key: ARROW-13166
                 URL: https://issues.apache.org/jira/browse/ARROW-13166
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Java
            Reporter: Sebastiaan Alvarez Rodriguez


Currently, the Java Dataset API exposes very few scanning options that we can 
set [1].

Additionally, the options that do exist must always be set from Java, with no 
way to fall back on the sensible default values from core Arrow.

For my use case, I want to be able to set the `fragment_readahead` option from 
the Java side.

It would be great if:
 + `ScanOptions.java` were expanded to let us set more, potentially all, 
options related to scanner creation.
 + Java users could omit options to fall back on the default values, e.g. [2].
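
To make the request concrete, here is a minimal sketch of one possible shape. Everything in it is hypothetical: the class name `ScanOptionsSketch`, the builder, and the getters are illustrations and not the existing `ScanOptions` API. The idea is that an unset option stays empty, so that the native side could substitute its own default (how that is passed across JNI is left open here).

```java
import java.util.Optional;
import java.util.OptionalInt;
import java.util.OptionalLong;

// Hypothetical sketch only; this is NOT org.apache.arrow.dataset.scanner.ScanOptions.
final class ScanOptionsSketch {
  private final Optional<String[]> columns;
  private final OptionalLong batchSize;
  private final OptionalInt fragmentReadahead;

  private ScanOptionsSketch(Builder b) {
    this.columns = b.columns;
    this.batchSize = b.batchSize;
    this.fragmentReadahead = b.fragmentReadahead;
  }

  // An empty value means "not set from Java; let core Arrow pick its default".
  Optional<String[]> getColumns() { return columns; }
  OptionalLong getBatchSize() { return batchSize; }
  OptionalInt getFragmentReadahead() { return fragmentReadahead; }

  static Builder builder() { return new Builder(); }

  static final class Builder {
    private Optional<String[]> columns = Optional.empty();
    private OptionalLong batchSize = OptionalLong.empty();
    private OptionalInt fragmentReadahead = OptionalInt.empty();

    Builder columns(String... columns) {
      // Copy so later changes to the caller's array do not leak in.
      this.columns = Optional.of(columns.clone());
      return this;
    }

    Builder batchSize(long batchSize) {
      if (batchSize <= 0) {
        throw new IllegalArgumentException("batchSize must be positive");
      }
      this.batchSize = OptionalLong.of(batchSize);
      return this;
    }

    Builder fragmentReadahead(int fragmentReadahead) {
      // Assumption: negative readahead is meaningless, so it is rejected here.
      if (fragmentReadahead < 0) {
        throw new IllegalArgumentException("fragmentReadahead must not be negative");
      }
      this.fragmentReadahead = OptionalInt.of(fragmentReadahead);
      return this;
    }

    ScanOptionsSketch build() { return new ScanOptionsSketch(this); }
  }
}
```

With something like this, `ScanOptionsSketch.builder().fragmentReadahead(4).build()` would set only the readahead and leave the batch size and column projection empty, to be filled in with defaults elsewhere.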

It would be good to know what others think, and whether a PR for this would be useful.


[1] https://github.com/apache/arrow/blob/master/java/dataset/src/main/java/org/apache/arrow/dataset/scanner/ScanOptions.java
[2] https://github.com/apache/arrow/blob/ad5dc8207192abe71d3e88303252629041968508/cpp/src/arrow/dataset/scanner.h#L51-L53



