leerho commented on issue #7187: Improve topN algorithm
URL: https://github.com/apache/incubator-druid/issues/7187#issuecomment-471056305
 
 
   @peferron  
   1. How is the group-by (your step 1) performed under the covers? With a hash 
table? What do you do if n or m doesn't fit into memory? (Or is that out of 
scope by definition?)
   2. "per-segment results are merged on the node". How? By merging the priority 
queues from each segment? How big are those priority queues: size k, or bigger 
than k?
   3. What exactly is passed from the node to the broker? Effectively a 
priority queue? And how big is it relative to k?
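
To make questions 2 and 3 concrete, here is a minimal Python sketch of why the queue size matters. It is not Druid's code and makes no claim about how Druid merges results: `segment_top`, `merge_top`, and the sample data are hypothetical, and it assumes a sum aggregation with ties broken by key.

```python
import heapq
from collections import Counter


def segment_top(rows, size):
    """Aggregate one segment in a hash table, then keep its `size` largest entries."""
    totals = Counter()
    for key, value in rows:
        totals[key] += value
    return heapq.nlargest(size, totals.items(), key=lambda kv: (kv[1], kv[0]))


def merge_top(partials, k):
    """Sum the partial per-segment results by key and keep the k largest totals."""
    merged = Counter()
    for partial in partials:
        for key, value in partial:
            merged[key] += value
    return heapq.nlargest(k, merged.items(), key=lambda kv: (kv[1], kv[0]))


# Two hypothetical segments; key "c" is third in each but largest overall.
segments = [
    [("a", 10), ("b", 9), ("c", 8)],
    [("d", 10), ("e", 9), ("c", 8)],
]
k = 2

# Per-segment queues large enough to hold every key: the merge is exact.
exact = merge_top([segment_top(s, size=3) for s in segments], k)

# Per-segment queues of size k: "c" is dropped before the merge ever sees it.
truncated = merge_top([segment_top(s, size=k) for s in segments], k)

print(exact)      # [('c', 16), ('d', 10)]
print(truncated)  # [('d', 10), ('a', 10)]
```

In this sketch, queues of exactly size k lose the true top entry, which is why I am asking whether the per-segment and node-to-broker structures are sized at k or larger.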

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
[email protected]


With regards,
Apache Git Services
