c-dickens opened a new pull request, #105: URL: https://github.com/apache/datasketches-characterization/pull/105
## Summary
- Extend `cms_point_query_profile` to run 3 Zipf skew regimes (α=1.5, 1.0, 0.5) with U=8192, N=2^20, 8192 trials
- Add per-item relative error tracking via second set of KLL sketches and bound violation rate
- Rewrite plotting script for 3-panel estimated frequency profile + relative error vs true frequency
- Add website documentation drafts in `docs/cms/`

## Test plan
- [x] `make cms-point-query` builds, runs all 3 regimes, produces 3 TSV files
- [x] `make cms-point-query-plot` generates `cms_point_query_error.svg` and `cms_rel_error_vs_freq_rank.svg`
- [x] TSV metadata shows bound violations well below theoretical δ=0.05
- [x] Relative error plot shows decreasing error for heavy hitters (matches Cormode et al.)
- [ ] Review docs for correctness

🤖 Generated with [Claude Code](https://claude.com/claude-code)
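For readers unfamiliar with the "bound violation" check mentioned in the test plan, here is a minimal sketch of the idea. It is not the code in this PR: the profile itself runs against the DataSketches count-min sketch and aggregates errors with KLL sketches, whereas this is a toy pure-Python count-min sketch. It assumes that a "violation" means an estimate exceeding the true count by more than εN, which is the standard count-min guarantee (it should happen with probability at most δ). The names `CountMin`, `zipf_stream`, and `violation_rate`, the value ε=0.01, and the reduced U and N are all hypothetical choices made for illustration.

```python
import math
import random
from collections import Counter


def zipf_stream(universe, n, alpha, rng):
    """Draw n items from {0..universe-1} with P(rank i) proportional to 1/(i+1)^alpha."""
    weights = [1.0 / (i + 1) ** alpha for i in range(universe)]
    return rng.choices(range(universe), weights=weights, k=n)


class CountMin:
    """Toy count-min sketch (illustrative stand-in, not the DataSketches class)."""

    P = (1 << 61) - 1  # Mersenne prime used by the (a*x + b) mod P hash family

    def __init__(self, epsilon, delta, rng):
        # Standard sizing: width = ceil(e / epsilon), depth = ceil(ln(1 / delta)).
        self.width = math.ceil(math.e / epsilon)
        self.depth = math.ceil(math.log(1.0 / delta))
        self.rows = [[0] * self.width for _ in range(self.depth)]
        self.coeffs = [
            (rng.randrange(1, self.P), rng.randrange(self.P))
            for _ in range(self.depth)
        ]

    def _col(self, row, item):
        a, b = self.coeffs[row]
        return ((a * item + b) % self.P) % self.width

    def update(self, item, count=1):
        for r in range(self.depth):
            self.rows[r][self._col(r, item)] += count

    def estimate(self, item):
        # Point query: the minimum over rows never undercounts the true frequency.
        return min(self.rows[r][self._col(r, item)] for r in range(self.depth))


def violation_rate(alpha, universe=256, n=20000, epsilon=0.01, delta=0.05, seed=1):
    """Fraction of items whose overestimate exceeds epsilon * n (assumed definition)."""
    rng = random.Random(seed)
    stream = zipf_stream(universe, n, alpha, rng)
    truth = Counter(stream)
    cms = CountMin(epsilon, delta, rng)
    for x in stream:
        cms.update(x)
    violations = sum(
        1 for x in range(universe) if cms.estimate(x) - truth[x] > epsilon * n
    )
    return violations / universe


if __name__ == "__main__":
    # Same three skew regimes as the PR, at a much smaller scale.
    for alpha in (1.5, 1.0, 0.5):
        print(f"alpha={alpha}: violation rate = {violation_rate(alpha):.4f}")
```

The universe and stream length are deliberately much smaller than the U=8192, N=2^20 used in the profile so the sketch runs in a moment. A single run per regime only gives a rough picture; the profile's 8192 trials are what make the comparison against δ meaningful.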
