JunRuiLee opened a new pull request, #7916:
URL: https://github.com/apache/paimon/pull/7916
## Purpose
Internal infrastructure for ANALYZE TABLE:
- `ColStats` / `Statistics` / `StatsFileHandler` — Java-compatible stats
JSON format
- `FileStoreCommit.commit_statistics()` — creates ANALYZE snapshot,
preserves watermark/next_row_id, inherits stats on subsequent commits
- `StatisticsCollector` — reads merged data via read path, computes
distinctCount/nullCount/min/max/avgLen/maxLen with correct type gating and
Java-compatible serialization (DATE→epoch days, TIME→millis,
TIMESTAMP→formatted string,
string/binary→no min/max)
## Tests
- ColStats/Statistics serialization roundtrip + Java key format
- Empty colStats always emitted (avoids Java NPE)
- StatsFileHandler write/read via snapshot reference
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]