Hello All,
This is the core Java component of the DataSketches library that includes
all the sketch algorithms in production-ready packages. These sketches can
be called directly from this component or used in conjunction with the
adaptor components such as Hadoop Pig, Hadoop Hive, or the aggregator
adaptors built into Apache Druid.
Major new features and enhancements:
- Quantile Sketches
- The major APIs for all the quantile sketches now derive from
interfaces common to all the quantile sketches. This makes it much easier
for the user to move from one quantile sketch to another with only very
minor API changes.
- All the quantile sketches now have a "SortedView", which is iterable
and makes analysis of the quantile distribution even easier.
- HLL Sketches
- Major speed performance improvements for HLL union/merge operations.
- Major improvements to the HLL Javadocs.
- Theta Sketches
- The Theta sketch has been enhanced with an optional compress
operation that makes the serialized theta sketch smaller.
- TestNG has been updated to version 7.5.1 (works with Java 8), which
includes the Zip Slip Vulnerability fix.
Thank you to the PMC members and community for taking the time to review
this release!
Lee