ianvkoeppe commented on issue #4293: Add support for an aggregation function 
returning serialized hyperlog…
URL: https://github.com/apache/incubator-pinot/pull/4293#issuecomment-500128536
 
 
   @sunithabeeram, thank you for the quick reply. I'm happy to speak more with 
@kishoreg about our use case.
   
   Interestingly, we actually already are planning to use serialized HLLs, but 
still want Pinot to do the aggregation across multiple segments before 
returning the serialized HLL.
   
   A more concrete example would be; given a Pinot table which has...
   **Unique Page Views** [SerializedHLL]
   **PageId** [Int]
   
   ...and I want to get the total unique page views for 10,000 pages. I can't 
supply those 10,000 page ids in a single query because URI length will be 
exceeded; especially when ids are represented as 40+ character urns. So I batch 
calls by 1,000 pages and get 10 responses.
   
   Today, I would have 10 responses with unique page views as a longs. I can't 
logically add them. With this change, I'm hoping I can get 10 serialized HLLs 
back, and use the HLLUtil client side to deserialize, aggregate, and then find 
cardinality.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to