tub opened a new pull request, #7394: URL: https://github.com/apache/paimon/pull/7394
## Summary - Add `include_row_kind` parameter to `TableRead` for streaming change tracking - Prepend a `_row_kind` string column (`+I`, `-D`, `+U`, `-U`) to Arrow batches when enabled - Support row kind for both `RecordBatchReader` (default `+I`) and `OffsetRow`-based readers (from `RowKind`) ## Stacked PR series This is **PR 1b2/5** in the Python streaming read series: - PR 1a: Caching infrastructure + utilities (#7347) - PR 1b: Scanners + sharding (#7348) - **PR 1b2** (this): Row kind support for TableRead - PR 2: Core streaming – `AsyncStreamingTableScan`, consumer management (#7350) - PR 3: CLI – `paimon tail` (#7351) Incremental diff (vs 1b): https://github.com/tub/paimon/compare/python-streaming-1b-scanners...tub:paimon:python-streaming-1b2-row-kind ## Test plan - [x] `flake8` passes - [ ] `python -m pytest` passes - [ ] Manually verify row kind column appears in streaming reads 🤖 Generated with [Claude Code](https://claude.com/claude-code) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
