Re: [HACKERS] Replication on the backend

J. Andrew Rogers Wed, 07 Dec 2005 01:26:39 -0800


On Dec 6, 2005, at 9:09 PM, Gregory Maxwell wrote:

Eh, why would light limited delay be any slower than a disk on FC the
same distance away? :)


In any case, performance of PG on iscsi is just fine. You can't blame
the network... Doing multimaster replication is hard because the
locking primitives that are fine on a simple multiprocessor system
(with a VERY high bandwidth very low latency interconnect between
processors) just don't work across a network, so you're left finding
other methods and making them work...

Speed of light latency shows up pretty damn often in real networks,even relatively local ones. The number of people that wonder why atranscontinental SLA of 10ms is not possible is astonishing. Thesilicon fabrics are sufficiently fast that most well-designednetworks are limited by how fast one can push photons through afiber, which is significantly slower than photons through a vacuum.Silicon switch fabrics add latency measured in nanoseconds, which iseffectively zero for many networks that leave the system board.

Compared to single system simple SMP, a local cluster built on afirst-rate fabric will have about an order of magnitude higherlatency but very similar bandwidth. On the other hand, at thoselatencies you can increase the number of addressable processors withthat kind of bandwidth by an order of magnitude, so it is a bit of atrade. However, latency matters a lot such that one would have to bea lot smarter about partitioning synchronization across that fabriceven though one would lose nothing in the bandwidth department.

But again, multimaster isn't hard because there of some inherently
slow property of networks.

Eh? As far as I know, the difficulty of multi-master is almostentirely a product of the latency of real networks such that they aretoo slow for scalable distributed locks. SMP is little more than adistributed lock manager implemented in silicon. Therefore, multi-master is hard in practice because we cannot drive networks fastenough. That said, current state-of-the-art network fabrics arewithin an order of magnitude of SMP fabrics such that they could bereal contenders, particularly once you get north of 8-16 processors.

The really sweet potential is in Opteron system boards withInfiniband directly attached to HyperTransport. At that level ofbandwidth and latency, both per node and per switch fabric, thearchitecture possibilities start to become intriguing.



J. Andrew Rogers



---------------------------(end of broadcast)---------------------------
TIP 5: don't forget to increase your free space map settings

Re: [HACKERS] Replication on the backend

Reply via email to