JunRuiLee opened a new pull request, #7989: URL: https://github.com/apache/paimon/pull/7989
## Summary pypaimon currently implements 8 system tables: `$snapshots`, `$schemas`, `$options`, `$manifests`, `$files`, `$partitions`, `$tags`, `$branches`. Compared to the Java side, it still lacks `$buckets`, `$audit_log`, `$read_optimized`, `$consumers`, `$statistics`, `$aggregation_fields`, `$file_key_ranges`, `$table_indexes`, etc. This PR adds `$buckets`, which is one of the more commonly used system tables for diagnosing data skew. It aggregates manifest entries by (partition, bucket) and exposes per-bucket record_count, file_size, file_count, and last_update_time. ## Changes - New: `buckets_table.py` — `BucketsTable` implementation - New: `buckets_table_test.py` — 5 end-to-end tests (schema validation, empty snapshot, aggregation correctness, sort order, catalog dispatch) - Modified: `system_table_loader.py` — register `"buckets"` - Modified: `system_table_loader_test.py` — update expected table list -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
