ServicePulse is showing negative graph

MiBo · March 12, 2020, 3:10pm

Hi,

Our stats in ServicePulse are showing negative stats on Critical Timings. I have tried to find out what is causing this - since our other endspoints do not show this kind of behaviour.

This specific endpoint is running on a on-premise windows machine (self-hosted dotnet core).
This issue is occuring for our endpoint in ACC adn PROD - for the endpoint in TST this is not the case.

The other endpoints you see in the screenshot are running in Kubernetes. And we use RabbitMq as transport.

Any suggestions on what may be causing this?

ramonsmits · March 12, 2020, 4:43pm

Negative critical time can occur when system clocks are not synchronized. However, I would assume negative values to be visualized 0 or as an error.

indualagarsamy · March 13, 2020, 11:28pm

Hi,

Thanks for letting us know. I’ve reached out the team to report this.

Is there anything that’s unique about these endpoints? Are they deployed on a different server compared to the ones that report correctly that can exhibit the time sync issues like Ramon suggested?

What’s the version of ServicePulse and ServiceControl monitoring?

We’ll get back to you if we need more details on this or if we have any updates.

Thanks,
Indu Alagarsamy

MiBo · March 16, 2020, 7:26am

Hi Indu,

Yes, this specific endpoint is running on a Windows host. While the others are hosted in Kubernetes. So the time difference could well be an issue, but what should be a correct fix for this? How could we synchronize these?

The versions we are using are:

ServicePulse: 1.23.0
ServiceControl: 4.5.1

Thanks and awaiting your response.

tmasternak · March 16, 2020, 8:09am

Hi Michael,

the most common approach to time synchronization is to configure NTP (or Time Synchronization on Windows). This is done at the operating system level (Kubernetes node).

Cheers,
Tomek

MiBo · March 16, 2020, 12:25pm

Hi Tomek,

Ok, will give this a try.
Will let you know what the outcome is.

Regards,
Michael

MiBo · March 16, 2020, 7:46pm

Hello Tomek,

Have configured the NTP on the machine on which the Legacy endpoint was running and this seems to be the fix! Will monitor some more the upcoming days, but this looks promissing.

Thank you for the help.

Regards,
Michael