ianvkoeppe commented on issue #4293: Add support for an aggregation function 
returning serialized hyperlog…
URL: https://github.com/apache/incubator-pinot/pull/4293#issuecomment-502202592
 
 
   @kishoreg, thanks for your feedback.
   
   > I think it's better to return byte[] as hexString instead of string. 
HllUtil has a toBytes method. The HLLUtil.toString has some additional overhead 
that can be avoided.
   
   Awesome. I've converted it to use a HexString representation, and updated 
the tests accordingly.
   
   > But what you are suggesting is solution 1 - batch by pageId. I am not sure 
why this will be better unless there is some relationship between pageId and 
jobTitle such that restricting pageId will automatically limit jobTitle. Does 
this line of reasoning make sense?
   
   Absolutely, we are definitely in agreement overall. I think not batching 
makes sense in any case possible. My example above may not exactly outline the 
use case for needing to batch and merge large responses. I still think it could 
exist. In any case, being able to merge across columns or tables which have 
HLLs and use the same key for the HLL is a more straightforward use case and 
justification.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to