We really need to get positive confirmation that this plan is both required
and sufficient.  Also we need confirmation of usage numbers and dates.



On Wed, Aug 13, 2014 at 5:43 PM, Ben Bangert <[email protected]> wrote:

> Hello Dev-Media!
>
> I'm Ben Bangert, engineer in Cloud Services, leading technical development
> on Push.
>
> This is a general update on the status of simplepush as it pertains to
> usage by loop for call push notifications.
>
> Currently we are proceeding with the assumption that the only viable way
> to alert the client of a call is via simplepush, which loop currently uses
> with our FxOS simplepush cluster. I've talked with Adam Roach about some
> ways to alter the loop client's use slightly so that we can stand up a
> separate set of 'simplepush' clusters specifically for use by loop. These
> separate clusters will use a lighter-weight version of simplepush (in
> development now) that doesn't do longer-term state maintenance which is
> unnecessary for loop's use.
>
> Simplepush, the version currently running, has been benchmarked to about 1
> million clients using a cluster of 5 machines. This cluster required
> memcached as it stored some state that loop's version (loopPush) doesn't
> require.
>
> Adam indicated that it seems reasonable that 20% of Firefox users might
> click the Loop icon, which would be around 100 million connections.
> Confirmation or refinement of that number would be helpful. For this load
> we would deploy multiple loopPush clusters and provide a URL that the loop
> client would query before initially connecting to determine which clusters
> have capacity to handle more connections.
>
> Remaining work to be done:
> - simplepush codebase
>   - refactoring already under way for not using memcached
>   - additional clean-ups, performance optimizations
>   - cluster setup automation
>
> - loads (or some other testing tool)
>   - needs dynamic test client handling (we will need over 1,000 instances
> to be spun up for larger scale test runs)
>   - ideally some automation integration (so that we can wire it into
> jenkins for automatic runs)
>
> - integrated complete testing (Tokbox, Simplepush, Loop)
> Tokbox has contacted us regarding how we plan on handling load testing of
> the push system and ensuring their TURN/STUN servers can meet requirements
> at scale. If we're looking at 100 million people possibly using this, how
> many are behind firewalls that will require complete TURN server proxying?
> If we have some ideas about what capacity Tokbox can/should handle, they'll
> want to sync up with us on that.
>
> We will also need to do some failure-case testing of the loop client to
> determine behavior under various failures in the SimplePush service.
>
> Regarding timelines, we believe it is possible to have a basic service up
> around Firefox 33 hitting the public but as we haven't gotten to a point
> where we've done any load-testing its hard to determine what kind of
> capacity can be handled. I'm also not sure what timeline Loop or Tokbox is
> operating with when it comes to having production deployments ready that
> can handle 100 million users.
>
> Does anyone have roadmaps and timelines for these components?
>
> For those interested, we have the Push Meeting on Weds at 11am PST in the
> Vidyo Services channel.
>
> Cheers,
> Ben
> _______________________________________________
> dev-media mailing list
> [email protected]
> https://lists.mozilla.org/listinfo/dev-media
>



-- 
Ian Bicking  |  Cloud Services Engineering Manager
_______________________________________________
dev-media mailing list
[email protected]
https://lists.mozilla.org/listinfo/dev-media

Reply via email to