[GitHub] [arrow] benibus commented on pull request #33732: GH-33209: [C++] Support for reading JSON Datasets

via GitHub Tue, 07 Feb 2023 15:25:16 -0800


benibus commented on PR #33732:
URL: https://github.com/apache/arrow/pull/33732#issuecomment-1421624442


   I attempted to implement the newer `ScanNode` methods. However, I ran into 
some issues that have prevented feature parity with the legacy API. Primarily, 
the methods that use `ScanOptions` are able to prevent loading top-level fields 
if none of its children are expected to be materialized. However, I haven't 
figured out a good way to do this using `FragmentScanRequest` since it only 
provides `FieldPath`s (relative to the inspected fragment).
   
   I suppose there's a possibility that the tests in `FileFormatScanNodeMixin` 
are expecting too much of the format implementation, though. From what I can 
tell, CSV is the only other format that implements the new API and even then it 
doesn't support nested fields - so the relevant tests in that fixture haven't 
been used anywhere before.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

[GitHub] [arrow] benibus commented on pull request #33732: GH-33209: [C++] Support for reading JSON Datasets

Reply via email to