On Tue, 23 Oct 2018, Chris Jones wrote:

> I've stumbled into the same issue twice in recent days, with two different ports, which is the use of
>
>     clock_gettime(CLOCK_REALTIME, &wait);
>
> which is only available in macOS 10.12 or newer. See for instance the issue I found yesterday in xrootd.
>
> https://github.com/xrootd/xrootd/issues/846
>
> I am still waiting to see what upstream says, but I am hopeful they will consider it a bug. (It would seem quite extreme to reduce the supported OS X releases from 10.7+ to 10.12+ in a minor patch revision...)
>
> But I was wondering, is this something anyone else has stumbled over, and do we have a way of fixing this particular issue in the older OSes?

Yes, in both GPSD and ntpsec. Rather than trying to figure out which of the many messages in this thread to answer directly, I'll just throw out what I know about this issue.

There are three "global" timescales potentially provided by clock_gettime(): CLOCK_REALTIME, CLOCK_MONOTONIC, and CLOCK_MONOTONIC_RAW. Only the first is eligible for clock_settime().

CLOCK_REALTIME is the same "Unix" timescale as the original time() and later gettimeofday(), but with (ostensibly) nanosecond resolution. It's subject to both slewing and step adjustments, as needed to synchronize its value to some time source.

CLOCK_MONOTONIC was created to avoid problems (including crashes) sometimes caused by the backward step adjustments that may be applied to CLOCK_REALTIME. Although the official documentation is woefully underspecified, it's typically implemented as a variation on CLOCK_REALTIME that excludes all step adjustments (including the initial one that "sets the clock"), but includes all slewing. This makes it continuous as well as monotonic, but its rate accuracy is corrupted by the slewing adjustments. In practice, it's almost never what you really want.

CLOCK_MONOTONIC_RAW is also woefully underspecified, but is usually just the raw hardware time source scaled to standard units based on the assumed clock rate, but not steered at all. Since even the cheapest crystals are typically rated at +/- 100ppm or better, and since the slewing adjustments applied to CLOCK_MONOTONIC can easily be much larger than that, CLOCK_MONOTONIC_RAW is usually a more accurate timescale for rates, durations, and delays than CLOCK_MONOTONIC.


There are basically two fallback options for CLOCK_REALTIME: gettimeofday() and the Mach-specific clock_get_time() based on CALENDAR_CLOCK. The latter ostensibly has nanosecond resolution, but in reality it's only the *representation* that has nanosecond resolution, while the values are all multiples of 1000 nanoseconds. In addition, it's more than an order of magnitude slower than gettimeofday() even in the best case, and the commonly circulated example of its use is even slower, as well as having a "port leak" bug. Thus, it's best to simply use gettimeofday() with a microseconds->nanoseconds conversion as a fallback. This approach also works for substituting settimeofday() for clock_settime(CLOCK_REALTIME, ...). IMO, the microsecond->nanosecond conversion should be done without "rounding", but the nanosecond->microsecond conversion should round by adding 500ns prior to the floored division by 1000. The unrounded conversion is consistent with both clock_get_time() and the "official" clock_gettime() in 10.12+, which still only has microsecond actual resolution.
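As a sketch of that fallback (function names here are hypothetical, not taken from GPSD or ntpsec), the unrounded microsecond->nanosecond substitution and the rounded conversion in the other direction might look like:

```c
#include <sys/time.h>
#include <time.h>

/* Hypothetical drop-in fallback for clock_gettime(CLOCK_REALTIME, ...)
 * on systems that lack it: microseconds are scaled up to nanoseconds
 * without rounding, matching the actual (microsecond) resolution. */
static int fallback_realtime(struct timespec *ts)
{
    struct timeval tv;

    if (gettimeofday(&tv, NULL) != 0)
        return -1;
    ts->tv_sec = tv.tv_sec;
    ts->tv_nsec = (long)tv.tv_usec * 1000;   /* us -> ns, no rounding */
    return 0;
}

/* The reverse conversion, for substituting settimeofday() for
 * clock_settime(): round by adding 500 ns before the floored
 * division by 1000. */
static long ns_to_us_rounded(long ns)
{
    return (ns + 500) / 1000;
}
```

The clock_settime() substitute would build its struct timeval from the incoming timespec via the rounded conversion; everything else is just the same fields in the other direction.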

For CLOCK_MONOTONIC, I believe the only functionally correct fallback is to use clock_get_time() with SYSTEM_CLOCK. As noted above, programs shouldn't really be using CLOCK_MONOTONIC anyway, but it's necessary to include it for compatibility with programs too dumb to know that. The problem with clock_get_time() is that it requires messing with Mach ports. The most efficient way to do this is to obtain a SYSTEM_CLOCK port once initially, and then reuse it on each call. Even with this, it takes over 700ns on a 3.46GHz Mac Pro, as compared to ~40ns for gettimeofday(), but that's the price of correctness. Since something intended to be a drop-in replacement for clock_gettime() can't rely on initialization or cleanup functions, the best it can do is to allocate the Mach port on first call, and then rely on exit cleanup to eventually deallocate it.
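A sketch of that approach (macOS-only, names hypothetical; note that the first-call port setup as written is not thread-safe, and that deallocating the host port avoids the "port leak" of the commonly circulated example):

```c
#include <mach/mach.h>
#include <mach/clock.h>
#include <mach/mach_host.h>
#include <time.h>

/* Hypothetical CLOCK_MONOTONIC fallback via the Mach SYSTEM_CLOCK.
 * The clock port is obtained once on first call and reused; since a
 * drop-in replacement can't rely on a cleanup function, exit cleanup
 * eventually deallocates it. */
static int fallback_monotonic(struct timespec *ts)
{
    static clock_serv_t clk;
    static int have_clk = 0;
    mach_timespec_t mts;

    if (!have_clk) {
        mach_port_t host = mach_host_self();
        kern_return_t kr = host_get_clock_service(host, SYSTEM_CLOCK, &clk);

        mach_port_deallocate(mach_task_self(), host);  /* avoid port leak */
        if (kr != KERN_SUCCESS)
            return -1;
        have_clk = 1;
    }
    if (clock_get_time(clk, &mts) != KERN_SUCCESS)
        return -1;
    ts->tv_sec = mts.tv_sec;
    ts->tv_nsec = mts.tv_nsec;
    return 0;
}
```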

For CLOCK_MONOTONIC_RAW, the straightforward approach is to use mach_absolute_time() with the proper scaling. As long as the scale factors are constant, they can be obtained once initially and then cached for later use. Unlike the clock_get_time() case, cleanup isn't even an issue. However, I've seen some mention of the possibility that the rate of mach_absolute_time() may not be constant. I'm not aware of any cases where this actually happens, and perhaps it's only theoretical, but if it did actually happen, it would complicate things significantly. In order to convert a variable-rate clock to standard units, it's necessary to know not only the current scale factor, but also the last time that the factor changed and what the correspondence was at that time. Since that information isn't provided, either the scale is actually constant or the API is deficient. Hopefully it's the former.
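The straightforward version of that (macOS-only, names hypothetical; the 64-bit tick * numer product can in principle overflow after very long uptimes, which a production version would guard against):

```c
#include <mach/mach_time.h>
#include <stdint.h>
#include <time.h>

/* Hypothetical CLOCK_MONOTONIC_RAW fallback: scale mach_absolute_time()
 * ticks to nanoseconds using the timebase factors, fetched once and
 * cached -- valid only as long as the factors really are constant. */
static int fallback_monotonic_raw(struct timespec *ts)
{
    static mach_timebase_info_data_t tb;   /* cached scale factors */
    uint64_t ns;

    if (tb.denom == 0 && mach_timebase_info(&tb) != KERN_SUCCESS)
        return -1;
    ns = mach_absolute_time() * tb.numer / tb.denom;
    ts->tv_sec = (time_t)(ns / 1000000000ULL);
    ts->tv_nsec = (long)(ns % 1000000000ULL);
    return 0;
}
```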


My clock_gettime() replacement for ntpsec is defined directly in a header file as an inline function. Although it currently only supports CLOCK_REALTIME, it does include a switch() on the clock_id for extensibility. A significant advantage of the inline approach is that whenever the clock_id is a compile-time constant (as is almost always the case in real use cases), the optimizer can completely remove the switch() and degenerate into just the inline code needed (quite simple for CLOCK_REALTIME) in the relevant case. And of course it also avoids adding new link-time dependencies.

Fred Wright
