fulmicoton-dd opened a new pull request, #113: URL: https://github.com/apache/datasketches-rust/pull/113
We need update_with_coupon to be public for the following reason. We are running the equivalent of a ``` SELECT DISTINCT(some_field) GROUP BY bucket_field; ``` some_field is dictionary encoded. Fetching the actual string value from value id associated to it is not a cheap operation. For this reason, we first perform the group by, collecting, for each bucket the value ids encounterred. We then compute once and for all the mapping from term_id to term and store it in a cache. We can then reuse this mapping when building the hll sketch for each independent bucket. In order to avoid dealing with strings, we would like to store, not the strings, but their coupons. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
