This is interesting.
I know this is not what you need but for the sake of experiment, what
happens if you switch the clock source from tsc to hpet? What are all the
clock sources available on that system?
Can you please post the relevant entries from dmesg?


On Fri, 11 May 2018, 10:02 Himanshu Sharma, <[email protected]> wrote:

> Hi guys
>
>
> We recenty got new Supermicros SYS-1029UX-LL1-S16 20 core servers for
> testing. We are running it with RedHat 7.4. In our testing we have observed
> that the system clock on these servers is running way too fast.
>
> Evidence of observation
>
> 1. Chrony
>
> We sync our servers using chronyd and the frequency for this one is
> something we havent seen anywhere. I am not sure that clock can be slewed
> to accommodate this much frequency offset. (SYS-1029UX-LL1-S16 is gaining
> 26 millisecond every second).
>
> > chronyc tracking
> Reference ID    : xxxxxxxx
> Stratum         : 4
> Ref time (UTC)  : Fri May 11 08:48:20 2018
> System time     : 0.000001912 seconds fast of NTP time
> Last offset     : +0.000001952 seconds
> RMS offset      : 0.000001457 seconds
> Frequency       : 26814.393 ppm fast
> Residual freq   : +0.134 ppm
> Skew            : 0.124 ppm
> Root delay      : 0.194362879 seconds
> Root dispersion : 0.000912781 seconds
> Update interval : 4.0 seconds
> Leap status     : Normal
>
> It is taking forever to adjust clock rate to accomodate this much
> frequency difference.
>
>
> 2. PTPD
>
> We also tried running ptpd to sync time with max_offset_ppm = 1000 to
> adjust for maximum possible frequency difference, but this was also futile.
>
> > /var/run/ptpd2.event.log
>
> 2018-05-11 14:21:21.249612 ptpd2[113396].enp94s0 (info)      (slv)
> TimingService.PTP0: acquired clock control
> *2018-05-11 14:21:49.976752 ptpd2[113396].enp94s0 (critical)  (slv) Offset
> above 1 second (1.001336627 s). Clock will step.*
> 2018-05-11 14:21:48.975450 ptpd2[113396].enp94s0 (warning)   (slv) Stepped
> the system clock to: 05/11/18 14:21:48.975425504
> 2018-05-11 14:21:49.169540 ptpd2[113396].enp94s0 (notice)    (lstn_reset)
> Now in state: PTP_LISTENING
> 2018-05-11 14:21:51.022561 ptpd2[113396].enp94s0 (info)      (lstn_reset)
> UTC offset is now 37
> 2018-05-11 14:21:51.022639 ptpd2[113396].enp94s0 (info)      (lstn_reset)
> New best master selected: 88f031fffec32ec1(10.30.128.1)/0
> 2018-05-11 14:21:51.022788 ptpd2[113396].enp94s0 (notice)    (slv) Now in
> state: PTP_SLAVE, Best master: 88f031fffec32ec1(10.30.128.1)/0
> (IPv4:172.27.210.123)
> 2018-05-11 14:21:51.024175 ptpd2[113396].enp94s0 (notice)    (slv)
> Received first Sync from Master
> 2018-05-11 14:21:52.057232 ptpd2[113396].enp94s0 (notice)    (slv)
> Received first Delay Response from Master
> 2018-05-11 14:21:52.057246 ptpd2[113396].enp94s0 (notice)    (slv)
> Received new Delay Request interval 1 from Master (was: 0)
> 2018-05-11 14:21:59.257989 ptpd2[113396].enp94s0 (notice)    (slv) Servo:
> Going to slew the clock with the maximum frequency adjustment
> *2018-05-11 14:22:28.029476 ptpd2[113396].enp94s0 (critical)  (slv) Offset
> above 1 second (1.011173248 s). Clock will step.*
> 2018-05-11 14:22:27.018337 ptpd2[113396].enp94s0 (warning)   (slv) Stepped
> the system clock to: 05/11/18 14:22:27.18314450
> 2018-05-11 14:22:27.196373 ptpd2[113396].enp94s0 (notice)    (lstn_reset)
> Now in state: PTP_LISTENING
> 2018-05-11 14:22:29.056742 ptpd2[113396].enp94s0 (info)      (lstn_reset)
> UTC offset is now 37
> 2018-05-11 14:22:29.056797 ptpd2[113396].enp94s0 (info)      (lstn_reset)
> New best master selected: 88f031fffec32ec1(10.30.128.1)/0
> 2018-05-11 14:22:29.056843 ptpd2[113396].enp94s0 (notice)    (slv) Now in
> state: PTP_SLAVE, Best master: 88f031fffec32ec1(10.30.128.1)/0
> (IPv4:172.27.210.123)
> 2018-05-11 14:22:29.063858 ptpd2[113396].enp94s0 (notice)    (slv)
> Received first Sync from Master
> 2018-05-11 14:22:30.066198 ptpd2[113396].enp94s0 (notice)    (slv)
> Received first Delay Response from Master
> 2018-05-11 14:22:30.066215 ptpd2[113396].enp94s0 (notice)    (slv)
> Received new Delay Request interval 1 from Master (was: 0)
>
> Approximately every 40 seconds the clock has to be adjusted back 1 second
> to accomodate for the time gained which cannot be offset by the frequency
> adjustment.
>
>
>
> There is clearly nothing we can do using only frequency adjustment to
> accommodate a clock frequency this high. We also dont want the clock to go
> back in time because it can affect some of the mathematical calculations
> based on time difference which are written (not by me) on the premise that
> time moves only forward (which indeed it should :)).
>
> Would really appreciate any solutions to rectify this issue.
>
> --
> You received this message because you are subscribed to the Google Groups
> "mechanical-sympathy" group.
> To unsubscribe from this group and stop receiving emails from it, send an
> email to [email protected].
> For more options, visit https://groups.google.com/d/optout.
>

-- 
You received this message because you are subscribed to the Google Groups 
"mechanical-sympathy" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
For more options, visit https://groups.google.com/d/optout.

Reply via email to