I thought this might be of interest to everyone.

Cheers,
Blake
---------- Forwarded message ----------
From: Grant Slater <[email protected]>
Date: Fri, May 13, 2016 at 2:59 PM
Subject: Re: [OSM-talk] Upload slowness - what's going on?
To: Tim Waters <[email protected]>
Cc: "[email protected]" <[email protected]>

Hi All,

On Monday 9th May 2016 the master OSM database server was moved to York (Bytemark) from London (Imperial). This was to avoid multiple upcoming weekends of planned power testing & maintenance at the Imperial data centre.

For the last few years Imperial has housed all our main critical systems, including master & slave DB servers and frontend & backend web/api servers. We also added 4 new frontend/backend web/api servers to York on Monday.

We now have the master database server in York and the secondary database server in Imperial. We also have a warm standby slave db in AWS Ireland. A fourth SSD (NVMe) based DB server was delivered yesterday (Thursday), but it needs testing (burn-in, reliability, performance etc) before we can start using it.

Slave DB servers can be promoted to master if required. The slave db servers serve Web/API read traffic and writes go to the master.

When the frontend + backend servers were in the same data centre as the master db server the latency was <1ms. We now run a VPN to connect the servers up and the latency is ~8ms Imperial to Bytemark.

Currently we are using the frontend & backend servers at Imperial (closest to the slave db read server) and sending writes over the VPN to Bytemark. The extra 8ms roundtrip is triggered multiple times based on the size of the upload changeset; this is the root cause of the slower uploads. The link between Imperial & Bytemark can handle gigabit speeds.

Over the last few days we've been tweaking the VPN settings to get optimal latency & throughput over the links.

Over today (for at least the weekend) we are switching to the new frontend & backend servers in York (Bytemark).
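As a rough illustration of why the extra latency described above adds up: if every database round trip during an upload costs about 8ms more, the added time grows linearly with the number of round trips. The message does not say how many round trips a given changeset triggers, so the counts below are purely hypothetical, and the function name is made up for this sketch.

```python
def extra_upload_seconds(round_trips: int, extra_rtt_ms: float = 8.0) -> float:
    """Estimate the extra wall-clock time for an upload.

    round_trips  -- hypothetical number of sequential database round trips
                    the upload triggers (not stated in the original message)
    extra_rtt_ms -- added latency per round trip; ~8ms is the Imperial to
                    Bytemark figure quoted in the message
    """
    if round_trips < 0:
        raise ValueError("round_trips must be non-negative")
    return round_trips * extra_rtt_ms / 1000.0


if __name__ == "__main__":
    # Hypothetical round-trip counts, chosen only to show the linear growth.
    for n in (100, 1_000, 10_000):
        print(f"{n:>6} round trips -> about {extra_upload_seconds(n):.1f} s extra")
```

This only models sequential round trips; any batching or pipelining on the server side would reduce the effect, so treat the numbers as an upper-bound style estimate rather than a measurement.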
London Imperial will be offline from approximately 5pm (GMT+1) for the first weekend of power maintenance.

In summary: The slow uploads are a known issue and we'll fix as soon as practical. Our main concern has been setting up multiple data center redundancy to avoid extended downtime.

Here is the list of all core hardware and hosting locations:
https://hardware.openstreetmap.org/

Hope that answers the questions. ;-)

Photos or it didn't happen:
* Syncing & powering down before we start London -> York DB move:
  https://twitter.com/OSM_Tech/status/729582996685213696
* Staged photo of racking up the master DB server at Bytemark:
  https://twitter.com/OSM_Tech/status/729693392737832961
* Testing the new Frontend / Backend servers a week ago:
  https://twitter.com/OSM_Tech/status/728286193696292865

Bytemark are a fantastic hosting company and their ongoing support of the OpenStreetMap project is highly commendable. Please support them ;-)
https://twitter.com/bytemark/status/729698435339853824

Kind regards,
Grant

Part of the OSM Ops team.

_______________________________________________
HOT mailing list
[email protected]
https://lists.openstreetmap.org/listinfo/hot
