fulmicoton-dd opened a new pull request, #113:
URL: https://github.com/apache/datasketches-rust/pull/113

   We need update_with_coupon to be public for the following reason.
   
   
   We are running the equivalent of the following query:
   ```
   SELECT DISTINCT(some_field)
   GROUP BY bucket_field;
   ```
   
   some_field is dictionary encoded. Fetching the actual string value from 
the value id associated with it is not a cheap operation.
   For this reason, we first perform the group by, collecting, for each bucket, 
the value ids encountered.
   
   We then compute, once and for all, the mapping from value id (term_id) to 
term and store it in a cache.
   We can then reuse this mapping when building the HLL sketch for each 
independent bucket.
   
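   To avoid dealing with strings, we would like the cache to store their 
coupons rather than the strings themselves. A public update_with_coupon would 
let us feed the cached coupons directly into each bucket's sketch.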
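
To illustrate the caching pattern described above, here is a minimal, self-contained sketch. Everything in it is an assumption made for illustration: `Coupon`, `coupon_of`, `ToySketch` and `sketches_per_bucket` are hypothetical names, the coupon is just a truncated standard-library hash, and `ToySketch` stores coupons exactly instead of implementing HLL. Only the method name `update_with_coupon` is borrowed from this PR, and its signature here is a guess, not the crate's actual API.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::{HashMap, HashSet};
use std::hash::{Hash, Hasher};

/// Hypothetical stand-in for an HLL coupon: a 32-bit value derived from the item's hash.
type Coupon = u32;

/// Toy coupon function (assumption: this is NOT the datasketches encoding).
fn coupon_of(term: &str) -> Coupon {
    let mut hasher = DefaultHasher::new();
    term.hash(&mut hasher);
    hasher.finish() as u32 // keep the low 32 bits
}

/// Toy sketch that keeps the exact set of coupons it has seen.
/// A real HLL sketch would fold each coupon into its registers instead.
#[derive(Default)]
struct ToySketch {
    coupons: HashSet<Coupon>,
}

impl ToySketch {
    /// Regular update path: hash the string, then insert the coupon.
    fn update(&mut self, term: &str) {
        self.update_with_coupon(coupon_of(term));
    }

    /// Update path that skips string handling entirely.
    fn update_with_coupon(&mut self, coupon: Coupon) {
        self.coupons.insert(coupon);
    }

    /// Number of distinct coupons seen (exact in this toy version).
    fn estimate(&self) -> usize {
        self.coupons.len()
    }
}

/// Builds one sketch per bucket from the value ids collected by the group by.
/// Each distinct value id is resolved to a term, and hashed, at most once.
fn sketches_per_bucket(
    ids_per_bucket: &HashMap<u32, Vec<u32>>,
    resolve_term: impl Fn(u32) -> String,
) -> HashMap<u32, ToySketch> {
    let mut coupon_cache: HashMap<u32, Coupon> = HashMap::new();
    let mut sketches: HashMap<u32, ToySketch> = HashMap::new();
    for (&bucket, ids) in ids_per_bucket {
        let sketch = sketches.entry(bucket).or_default();
        for &id in ids {
            // The expensive dictionary lookup only runs on a cache miss.
            let coupon = *coupon_cache
                .entry(id)
                .or_insert_with(|| coupon_of(&resolve_term(id)));
            sketch.update_with_coupon(coupon);
        }
    }
    sketches
}
```

In this sketch the cache maps value ids to coupons, so the term strings are dropped as soon as they have been hashed, and a value id that appears in several buckets costs a single dictionary lookup.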


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

