[
https://issues.apache.org/jira/browse/ARROW-11782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17291419#comment-17291419
]
Weston Pace commented on ARROW-11782:
-------------------------------------
Thanks for pointing this out. I think we can't get rid of ScanTask. I will
create a ScanTaskInternal (or some similar name) for the work I'm doing and
leave ScanTask's interface as is.
In the future though it might be better to change Scanner::Scan to return a
RecordBatchIterator instead of ScanTaskIterator. Otherwise we're putting the
burden on the user to manage how many files are loaded concurrently. I think
that should either be hidden completely or controlled by some option
(MaxConcurrentFiles or, better yet, MaxAllocatedBytes)
> [GLib][Dataset] Remove bindings for internal classes
> ----------------------------------------------------
>
> Key: ARROW-11782
> URL: https://issues.apache.org/jira/browse/ARROW-11782
> Project: Apache Arrow
> Issue Type: Improvement
> Components: GLib
> Affects Versions: 3.0.0
> Reporter: Ben Kietzman
> Priority: Major
> Fix For: 4.0.0
>
>
> GLib and ruby include bindings for internal classes such as ScanOptions,
> ScanContext, InMemoryScanTask, ScanTask, ... These are probably unnecessary
> and should be removed to present a simpler interface less prone to breakage
> under refactoring of the wrapped classes
> https://github.com/apache/arrow/pull/9532/checks?check_run_id=1974229719#step:8:2071
--
This message was sent by Atlassian Jira
(v8.3.4#803005)