noahprince22 commented on issue #6187:
URL: 
https://github.com/apache/incubator-pinot/issues/6187#issuecomment-718771493


   https://eng.uber.com/operating-apache-pinot/
   
   Reading this blog:
   
   > As the scale of data grew, we also experienced several issues caused by 
too many segments. Pinot leverages Apache Helix over Apache Zookeeper for 
cluster management. For example, when a server transitioned from offline to 
online, Pinot will propagate state transition messages via Helix to notify 
other instances. The number of such state transition messages are proportional 
to the number of the segments on the server. When a server hosts too many 
segments, there could be a spike of state transition messages on Helix, 
resulting in lots of zookeeper nodes. If the number of zookeeper nodes is 
beyond the buffer threshold, the Pinot server and controller will crash. To 
solve this issue, we added message throttling to Pinot controllers to flatten 
the state transition surge.
   
   
   At large scale of data that requires this kind of lazy loading, you're going 
to have _a lot_ of segments. Do we see this causing an issue with helix state 
management messages?


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to