drin commented on pull request #9810:
URL: https://github.com/apache/arrow/pull/9810#issuecomment-810607005


   @lidavidm I'm curious about your thoughts on the other things that 
`ScannerBuilder`s can be constructed from. Is that something we would want to 
add to these commits, or should it go in a separate commit?
   
   As an example, one thing our research group found difficult to figure out 
from the existing documentation was how to apply a scan to something already 
in memory:
   
   ```cpp
       using InMemDatasetPtr = std::shared_ptr<arrow::dataset::InMemoryDataset>;
       using ScanBuilderPtr  = std::shared_ptr<arrow::dataset::ScannerBuilder>;
   
       ...
   
       // Create an in-memory dataset from the parsed record batch
       InMemDatasetPtr batch_as_dataset =
           std::make_shared<arrow::dataset::InMemoryDataset>(
            table_batch->schema()
           ,RecordBatchVec({ table_batch })
       );
   
       // Create a scanner to pass the expression
       ARROW_ASSIGN_OR_RAISE(
            ScanBuilderPtr scanbuilder
           ,batch_as_dataset->NewScan()
       );
   
       ...
   ```
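
   For context, here is a minimal sketch of how the remaining steps might look 
once the builder exists: attach the filter expression, finish the builder, and 
materialize the result. It assumes a recent Arrow C++ release in which filters 
are `arrow::compute::Expression` values (the exact headers and namespaces have 
moved between releases). The helper name `ScanInMemoryBatch` and its parameters 
are hypothetical and only for illustration:

   ```cpp
   #include <memory>

   #include <arrow/api.h>
   #include <arrow/compute/api.h>
   #include <arrow/dataset/api.h>

   namespace ds = arrow::dataset;
   namespace cp = arrow::compute;

   // Hypothetical helper (not part of Arrow): wraps a single record batch in
   // an InMemoryDataset, applies `filter` through a scanner, and returns the
   // rows that pass the filter as a table.
   arrow::Result<std::shared_ptr<arrow::Table>> ScanInMemoryBatch(
       const std::shared_ptr<arrow::RecordBatch>& batch,
       const cp::Expression& filter) {
     // Create an in-memory dataset from the record batch
     auto dataset = std::make_shared<ds::InMemoryDataset>(
         batch->schema(), arrow::RecordBatchVector{batch});

     // Create a scanner builder and pass the expression to it
     ARROW_ASSIGN_OR_RAISE(auto builder, dataset->NewScan());
     ARROW_RETURN_NOT_OK(builder->Filter(filter));

     // Build the scanner and collect the filtered rows
     ARROW_ASSIGN_OR_RAISE(auto scanner, builder->Finish());
     return scanner->ToTable();
   }
   ```

   Under these assumptions, a call such as 
`ScanInMemoryBatch(table_batch, cp::greater(cp::field_ref("x"), cp::literal(int64_t{2})))` 
would keep only the rows whose (hypothetical) `int64` column `x` is greater than 2.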


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

