[PR] [python] Add row kind support for TableRead [paimon]

via GitHub Tue, 10 Mar 2026 10:53:36 -0700


tub opened a new pull request, #7394:
URL: https://github.com/apache/paimon/pull/7394


   ## Summary
   - Add `include_row_kind` parameter to `TableRead` for streaming change tracking
   - Prepend a `_row_kind` string column (`+I`, `-D`, `+U`, `-U`) to Arrow batches when enabled
   - Support row kind for both `RecordBatchReader` (default `+I`) and `OffsetRow`-based readers (from `RowKind`)
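
   The behaviour described in the summary can be sketched roughly as follows. This is a plain-Python model, not the PR's code: a batch is represented as a dict of column lists rather than an Arrow batch, and `RowKind`, `ROW_KIND_COLUMN`, and `with_row_kind` are hypothetical names chosen for illustration. Only the column name `_row_kind`, the four short strings, and the `+I` default come from the description above.

```python
from enum import Enum
from typing import Dict, List, Optional, Sequence


class RowKind(Enum):
    """Hypothetical enum holding the four short strings listed in the summary."""
    INSERT = "+I"
    UPDATE_BEFORE = "-U"
    UPDATE_AFTER = "+U"
    DELETE = "-D"


ROW_KIND_COLUMN = "_row_kind"  # column name taken from the PR summary


def with_row_kind(
    columns: Dict[str, List],
    row_kinds: Optional[Sequence[RowKind]] = None,
) -> Dict[str, List]:
    """Return a copy of a column-oriented batch with `_row_kind` as the first column.

    `columns` stands in for an Arrow batch (assumption: dict of equal-length lists).
    When `row_kinds` is None, every row is marked `+I`, mirroring the default the
    summary gives for readers that carry no per-row kind.
    """
    num_rows = len(next(iter(columns.values()))) if columns else 0
    if row_kinds is None:
        kinds = [RowKind.INSERT.value] * num_rows
    else:
        if len(row_kinds) != num_rows:
            raise ValueError("row_kinds must have one entry per row")
        kinds = [kind.value for kind in row_kinds]
    # Dicts preserve insertion order, so `_row_kind` ends up first.
    return {ROW_KIND_COLUMN: kinds, **columns}


batch = {"id": [1, 2, 3], "name": ["a", "b", "c"]}
default_kinds = with_row_kind(batch)
explicit_kinds = with_row_kind(
    batch, [RowKind.INSERT, RowKind.UPDATE_BEFORE, RowKind.DELETE]
)
```

   In the sketch the input batch is left unmodified, so a caller that does not ask for row kinds would see the same columns as before.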
   
   ## Stacked PR series
   This is **PR 1b2/5** in the Python streaming read series:
   - PR 1a: Caching infrastructure + utilities (#7347)
   - PR 1b: Scanners + sharding (#7348)
   - **PR 1b2** (this): Row kind support for TableRead
   - PR 2: Core streaming – `AsyncStreamingTableScan`, consumer management (#7350)
   - PR 3: CLI – `paimon tail` (#7351)
   
   Incremental diff (vs 1b): https://github.com/tub/paimon/compare/python-streaming-1b-scanners...tub:paimon:python-streaming-1b2-row-kind
   
   ## Test plan
   - [x] `flake8` passes
   - [ ] `python -m pytest` passes
   - [ ] Manually verify row kind column appears in streaming reads
   
   🤖 Generated with [Claude Code](https://claude.com/claude-code)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

