JunRuiLee opened a new pull request, #7989:
URL: https://github.com/apache/paimon/pull/7989

   ## Summary
   
   pypaimon currently implements 8 system tables: `$snapshots`, `$schemas`, 
`$options`, `$manifests`, `$files`, `$partitions`, `$tags`, `$branches`. 
Compared to the Java side, it still lacks `$buckets`, `$audit_log`, 
`$read_optimized`, `$consumers`, `$statistics`, `$aggregation_fields`, 
`$file_key_ranges`, `$table_indexes`, etc.
   
   This PR adds `$buckets`, which is one of the more commonly used system 
tables for diagnosing data skew. It aggregates manifest entries by (partition, 
bucket) and exposes per-bucket record_count, file_size, file_count, and 
last_update_time.
   
   ## Changes
   - New: `buckets_table.py` — `BucketsTable` implementation
   - New: `buckets_table_test.py` — 5 end-to-end tests (schema validation, 
empty snapshot, aggregation correctness, sort order, catalog dispatch)
   - Modified: `system_table_loader.py` — register `"buckets"`
   - Modified: `system_table_loader_test.py` — update expected table list


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to