[ 
https://issues.apache.org/jira/browse/ARROW-17346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17629942#comment-17629942
 ] 

Apache Arrow JIRA Bot commented on ARROW-17346:
-----------------------------------------------

This issue was last updated over 90 days ago, which may be an indication it is 
no longer being actively worked. To better reflect the current state, the issue 
is being unassigned per [project 
policy|https://arrow.apache.org/docs/dev/developers/bug_reports.html#issue-assignment].
 Please feel free to re-take assignment of the issue if it is being actively 
worked, or if you plan to start that work soon.

> [Java][Docs] Document the use of the batchSize argument in Dataset ScanOptions
> ------------------------------------------------------------------------------
>
>                 Key: ARROW-17346
>                 URL: https://issues.apache.org/jira/browse/ARROW-17346
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Java
>    Affects Versions: 9.0.0
>            Reporter: Larry White
>            Assignee: Larry White
>            Priority: Major
>
> Several ScanOptions methods take a batchSize argument as shown: 
> {{public ScanOptions(long batchSize) {}}
> {{    this(batchSize, Optional.empty());}}
> {{}}}
> Since the scanner reads one ArrowRecordBatch per load invocation, setting the 
> parameter to a size larger than the RecordBatch has no effect. It only works 
> when it's smaller than the number of rows in the RecordBatch, (i.e., the 
> number or records read is equal to min(batchSize, recordBatch rowCount), 
> potentially leading to some confusion. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to