tub opened a new pull request, #7349: URL: https://github.com/apache/paimon/pull/7349
## Summary - Add `Consumer` dataclass for tracking consumption progress (snapshot ID) - Add `ConsumerManager` for persisting, loading, and expiring consumers via file IO - Consumer state stored as JSON files under table's `consumer/` directory ## Stacked PR series This is **PR 1c/5** in the Python streaming read series: - PR 1a: Caching infrastructure + utilities - PR 1b: Scanners, sharding, row kind - **PR 1c** (this): Consumer management (~406 lines) - PR 2: Core streaming (`AsyncStreamingTableScan`) - PR 3: CLI (`paimon tail`) Incremental diff (vs 1b): https://github.com/apache/paimon/compare/python-streaming-1b-scanners...tub:paimon:python-streaming-1c-consumer ## Test plan - [x] `flake8` passes on all changed files - [x] `python -m pytest` passes - [x] New tests: `consumer_test.py` (11 tests) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
