If gRPC connections are persistent, a single client will always talk to a 
single backend in that configuration. That's fine if you have lots of 
clients, but how is load balancing done?
How is the server able to handle so many open connections? Won't it hit 
the open file descriptor limit?
Can someone please explain how this is implemented at the socket level to 
handle that many connections?

