friendlymatthew opened a new pull request, #9117:
URL: https://github.com/apache/arrow-rs/pull/9117

   # Which issue does this PR close?
   
   - Closes https://github.com/apache/arrow-rs/issues/8641
   - Related https://github.com/apache/arrow-rs/issues/8799
   
   # Rationale for this change
   
   This PR adds support for storing row group indices as a virtual column, 
allowing users to determine which row group each row originated from
   
   The usage pattern is quite simple, something like: 
   
   ```rs
   use parquet::arrow::RowGroupIndex;
   
   let row_group_index_field = Arc::new(
       Field::new("row_group_index", DataType::Int64, false)
           .with_extension_type(RowGroupIndex)
   );
   
   let options = ArrowReaderOptions::new()
       .with_virtual_columns(vec![row_group_index_field])?;
   
   let reader = ParquetRecordBatchReaderBuilder::try_new_with_options(file, 
options)?
       .build()?;
   ```
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to