Re: Implementation for approx_count_distinct_sketch and associated functions

2023-01-20 Thread Ryan Berti
Hello, I wanted to follow up and link to the Spark PR associated with these changes; I'm excited to open up the implementation for community review! For reference, I worked with @Daniel Tenedorio and the Databricks team on a pre-review

Implementation for approx_count_distinct_sketch and associated functions

2023-01-11 Thread Ryan Berti
Hello! I've recently wanted to write the sketches associated with the approx_count_distinct function to allow for distinct count re-aggregation. This 2019 Databricks post proposes the use of spark-alchemy, and