Re: [Starlink] fiber IXPs in space

Ulrich Speidel via Starlink Sun, 16 Apr 2023 17:52:17 -0700

On 17/04/2023 11:22 am, David Lang wrote:

On Mon, 17 Apr 2023, Ulrich Speidel wrote:
On 17/04/2023 10:03 am, David Lang wrote:
On Mon, 17 Apr 2023, Ulrich Speidel via Starlink wrote:
On 17/04/2023 5:54 am, David Fernández via Starlink wrote:
In case you put a DNS server in the satellite, so that it replies
instead of a DNS server on ground, the RTT is reduced by half.

The idea would be that the satellite inspects IP packets and when it
detects a DNS query, instead of forwarding the packet to ground
station, it just answers back to the sender of the query.
Understood - it's just that the gain you have from this is quite small. DNS queries only happen the first time a host needs to resolve a name, and then again after cache expiry much later, so they account for only a tiny fraction of the traffic, and also for only a small amount of the total delay in page loads. RTT isn't really the big issue in Starlink - yes it's larger than it perhaps needs to be, and bufferbloat seems to be present, but compared to GEO, it's now in the range seen for terrestrial Internet.
DNS time is more significant than you think, due to the fact that so many websites pull data from many different locations, you end up with a lot of DNS queries when hitting a new site for the first time (and many of these queries are serial not parallel) so it adds quite a bit to the first rendering time of a page.
But most people don't hit new sites most of the time, and a lot of cascading loads hit the same CDNs you've seen previously.
the timeouts on DNS are short enough that they hit them every day when they wake up

That's OK (and has to be that way or else DNS changes would never propagate).

But an end client typically hits the same site many times within a relatively short time window. For this, it only needs to do one lookup. The client will then cache the entry. Whether it needs to look up again 15 or more minutes later doesn't really matter in terms of a LEO system - the client will be talking to a different satellite by then.

Also, the percentage of DNS queries relating to CDN servers is very high nowadays, because CDN use is so pervasive that people will claim that there is an "Internet outage" when a CDN goes down.

CDNs or even datacenters (Cloud) in GEO or LEO is even more complex.
Indeed. In so many ways.
Mind though that CDNs are generally tied in with DNS nowadays, and there's another snag: Take two users, Alice in the UK and Bob in New Zealand - pretty much antipodean, using Starlink in bent-pipe configuration, i.e., their traffic goes through, say, the London gateway in the UK and the Clevedon gateway in NZ. Now imagine both trying to resolve the same CDN hostname some time apart, but via the same satellite DNS as the satellite has moved from the UK to NZ in the interim. Say Alice resolves first and gets the IP address of a CDN server in the UK. If the satellite DNS now caches this, and Bob queries the same hostname, he gets directed to a server in the UK literally a world away instead of the Auckland one closest to him. So unless each satellite carries a geolocated copy of the world's DNS entries with it and makes a decision based on user location, you have a problem.
This is true when the DNS answer is dynamic, but such cases also have short cache timeouts. Even with a 90 min orbit, a 15 min timeout would significantly lessen the impact (and I would expect that an orbital DNS would detect short timeouts and treat them as a signal to shorten the timeout even more)
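To make the Alice/Bob snag concrete, here is a minimal sketch in Python. Everything in it is hypothetical: the hostname, the two documentation-range IP addresses standing in for CDN nodes, and both cache classes. It only contrasts a cache keyed by hostname alone with one keyed by hostname plus gateway region; it is not a claim about how any real satellite resolver works.

```python
# Hypothetical authoritative answers for one CDN hostname, per gateway region.
# The IPs are documentation-range addresses, used purely as placeholders.
AUTHORITATIVE = {
    ("cdn.example.com", "UK"): "192.0.2.10",     # stands in for a UK CDN node
    ("cdn.example.com", "NZ"): "198.51.100.20",  # stands in for an Auckland CDN node
}


class NaiveSatCache:
    """Caches by hostname only, like a location-unaware resolver on the satellite."""

    def __init__(self):
        self._cache = {}

    def resolve(self, hostname, region):
        # The first query fills the cache; later queries reuse that answer
        # regardless of where the querying user is.
        if hostname not in self._cache:
            self._cache[hostname] = AUTHORITATIVE[(hostname, region)]
        return self._cache[hostname]


class RegionAwareSatCache:
    """Caches by (hostname, region), so answers do not leak between regions."""

    def __init__(self):
        self._cache = {}

    def resolve(self, hostname, region):
        key = (hostname, region)
        if key not in self._cache:
            self._cache[key] = AUTHORITATIVE[key]
        return self._cache[key]


naive = NaiveSatCache()
alice_naive = naive.resolve("cdn.example.com", "UK")
bob_naive = naive.resolve("cdn.example.com", "NZ")   # Bob gets Alice's UK answer

aware = RegionAwareSatCache()
alice_aware = aware.resolve("cdn.example.com", "UK")
bob_aware = aware.resolve("cdn.example.com", "NZ")   # Bob gets the NZ answer
```

The region-aware variant avoids the misdirection, but only because Bob's first query is a cache miss that has to go to the ground anyway, which is the cost the argument is about.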
Timeout where? At the end user client or at the satellite?
at the DNS cache and at the client. If you are using DNS to redirect people to the closest/least loaded site, you need to have your DNS timeouts set short so that you can change where they go with minimal downtime. Many clients refuse to honor extremely short timeouts (IIRC about 15 min is the low end)
At the end user client, a short timeout makes no sense at all because their host-to-CDN-IP server mapping shouldn't really change in bent pipe - only the sat hop changes.
If the timeout is meant to be on the satellite, it means that the satellite knows nothing about anything when it arrives to assist you, and needs to query some sort of (probably ground-based) DNS server anyway.
Also, the assumption that a satellite will return to the same spot after a full orbital period (of say 90 minutes) only applies to satellites in equatorial orbits (or polar orbits, and then only to the poles). In all other cases, the Earth's rotation will assure that the satellite's return to the same location takes many orbital periods.
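A back-of-the-envelope sketch of the Earth-rotation effect, in Python. It assumes the 90-minute period mentioned above and accounts only for the Earth turning underneath the orbit; nodal precession and the details of orbit inclination are ignored, and the function name is made up for illustration.

```python
SIDEREAL_DAY_S = 86164.1   # Earth's rotation period relative to the stars, in seconds
EQUATOR_KM = 40075.0       # Earth's equatorial circumference, in km


def ground_track_shift_deg(orbital_period_s):
    """Longitude (degrees) the Earth rotates through during one orbit.

    Rotation only: precession of the orbital plane is deliberately ignored.
    """
    return 360.0 * orbital_period_s / SIDEREAL_DAY_S


shift_deg = ground_track_shift_deg(90 * 60)
shift_km = shift_deg / 360.0 * EQUATOR_KM   # the same shift as distance along the equator
print(f"{shift_deg:.1f} degrees per orbit, about {shift_km:.0f} km at the equator")
```

For a 90-minute orbit this comes to a bit over 22 degrees of longitude per orbit, i.e. roughly 2500 km at the equator, so one orbit later the satellite passes over a quite different set of users.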
when the satellite first comes into an area, it won't know what's appropriate to cache for the area, but it will start caching when people start using it, the first person suffers the full hit, but everyone after that benefits.

Yes, fully understood. That's how it works on terrestrial DNS (and even on GEO this would be a really good argument). But on LEO, that benefit only materialises if the second client and any others in the same area get to query the same satellite that handled the first client's query, because that's where the information would be cached if we had a DNS server of sorts on each bird. For this to happen, the second client and any subsequent ones have to query within minutes if not seconds of the first one. What is the probability of this happening? This depends on the total number of active users hanging off that satellite and the popularity of the target host/site among them. The larger the number of users and the higher the site popularity, the more likely that cached entries will see a second or subsequent query. "Active" in this context means users navigating to new sites during the visibility window of that satellite.
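One way to put rough numbers on that probability is the sketch below, in Python. It is a toy model, not a measurement: it assumes every query during one satellite pass picks a hostname independently from a fixed popularity distribution, that nothing expires during the pass, and (purely as an assumption for illustration) that popularity follows a Zipf law. The function names, the catalogue size and the exponent are all made up.

```python
def zipf_weights(n_sites, exponent=1.0):
    """Normalised Zipf popularity for n_sites hostnames (an assumed distribution)."""
    raw = [1.0 / (rank ** exponent) for rank in range(1, n_sites + 1)]
    total = sum(raw)
    return [w / total for w in raw]


def expected_cache_hits(n_queries, popularity):
    """Expected number of queries that repeat a hostname already seen in this pass.

    Each distinct hostname costs exactly one miss (its first query), so
    hits = queries - distinct hostnames queried. A hostname with popularity p
    is queried at least once with probability 1 - (1 - p) ** n_queries.
    """
    expected_distinct = sum(1.0 - (1.0 - p) ** n_queries for p in popularity)
    return n_queries - expected_distinct


# Example: 100 new-site lookups in one pass, spread over a 10,000-hostname catalogue.
hits = expected_cache_hits(100, zipf_weights(10_000))
hit_fraction = hits / 100
```

Varying `n_queries` and the catalogue size shows the dependence described above: more active users and more concentrated popularity both raise the expected number of cache hits.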

Practically speaking, we know from various sources that each Starlink satellite provides - ballpark - a couple of dozen Gb/s in capacity, and that active users on a "busy" satellite see a couple of dozen Mb/s of that. "Busy" means most active users, and so we can conclude that the number of users per satellite who use any site is at most around 1000. The subset of users navigating to new sites among them is probably in the low hundreds at best. If we're excluding new sites that are dynamic, we're probably down to a couple of dozen new static sites being queried per satellite pass. How many of these queries will be duplicates? Not a lot. If we're including sites that are dynamic, we're still not getting a huge probability of cache entry re-use.
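The ballpark above works out as follows; a tiny Python sketch in which "a couple of dozen" is read as 24 in both places. That reading, and the function name, are assumptions made only to have concrete numbers.

```python
def max_active_users(capacity_gbps, per_user_mbps):
    """Upper bound on simultaneously active users if each one gets per_user_mbps."""
    return capacity_gbps * 1000 / per_user_mbps   # 1 Gb/s = 1000 Mb/s


# "A couple of dozen" taken as 24 for both figures (assumption).
users = max_active_users(capacity_gbps=24, per_user_mbps=24)
```

With equal "couple of dozen" figures the ratio is simply the Gb/s-to-Mb/s factor, which is where the estimate of around 1000 users comes from.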

DNS data is not that large, getting enough storage into the satellites to serve 90% of the non-dynamic data should not be a big deal. The dynamic data expires fast enough (and can be detected as being dynamic and expired faster in the satellite) that I'm not worried about serving data from one side of the world to the other.

Yes, but the only advantage we'd get here is faster resolution for a very small subset of DNS queries.


--
****************************************************************
Dr. Ulrich Speidel

School of Computer Science

Room 303S.594 (City Campus)

The University of Auckland
[email protected]
http://www.cs.auckland.ac.nz/~ulrich/
****************************************************************



_______________________________________________
Starlink mailing list
[email protected]
https://lists.bufferbloat.net/listinfo/starlink
