Random 'Monitoring stopped' on one server
RichyD
Posts: 8
I'm using SQL Monitor 3.5 to support about 15 servers, and am used to the occasional connection issue. One of my servers, however, is suffering from intermittent 'Monitoring stopped (SQL Server credentials)' errors throughout the day - on average about six times a day, at random times. Each time, the alert ends within a few seconds as SQL Monitor successfully reconnects.
The account being used for this server is the same Windows account for all, and no other servers are showing a similar problem...
I've checked the target SQL and Windows event logs, the monitoring server logs, and can't find anything at all to indicate a problem.
Has anyone else experienced this kind of thing, and/or have pointers on where to investigate next?
Cheers,
Rich
Comments
The full SQL Monitor log shows all sorts of random exceptions, but none with times that correspond with the monitoring failures...
This is one of the odd things about the problem - SQL Monitor saying that it has had SQL credential problems, but the SQL Server itself denies all knowledge. Most peculiar.
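For anyone wanting to repeat that comparison, here is a minimal Python sketch of checking alert times against log timestamps. It assumes you have already pulled both sets of timestamps out by hand; the `find_nearby` helper, the 30-second window, and the sample times are all hypothetical and not anything SQL Monitor provides.

```python
from datetime import datetime, timedelta

def find_nearby(alert_times, log_times, window_seconds=30):
    """Map each alert time to the log entries within +/- window_seconds of it."""
    window = timedelta(seconds=window_seconds)
    return {
        alert: [t for t in log_times if abs(t - alert) <= window]
        for alert in alert_times
    }

# Hypothetical timestamps, for illustration only
FMT = "%Y-%m-%d %H:%M:%S"
alerts = [datetime.strptime(s, FMT)
          for s in ["2013-06-03 09:14:05", "2013-06-03 13:42:50"]]
log_entries = [datetime.strptime(s, FMT)
               for s in ["2013-06-03 09:14:20", "2013-06-03 11:00:00"]]

matches = find_nearby(alerts, log_entries)
for alert, hits in matches.items():
    nearby = [h.strftime(FMT) for h in hits] or "no log entries nearby"
    print(alert.strftime(FMT), "->", nearby)
```

An empty list for every alert would match what I saw: nothing in the logs lines up with the failures.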
As an experiment, I moved the Base Monitor to a different server, and I haven't had a connect error since the move. It's only been two hours, but I'll keep my eye on it and hope that it was a problem with the monitor host.
Things couldn't be totally fixed, of course - I've now got 100% CPU usage on the new host. I'll raise that in a new thread if it doesn't settle down this afternoon...
Did you find that moving the base monitor to a new server resolved this issue? I am having exactly the same symptoms, with one server always having SQL Monitoring Stopped errors and no login failures on the server. The server is working fine and the error ends within a few seconds.
Thanks.
I've scheduled a hotfix to be applied, but my OS team is slow to roll these things out, so I can't say yet whether this will definitely solve the problem...
For ref, the Windows 2008 R2 hotfix is here: http://support.microsoft.com/kb/2832248, and the vanilla 2008 one is here: http://support.microsoft.com/kb/958124
If that sorts out your issues, please let me know.
Rich
Thanks again.
I'll be rolling that hotfix out to all Windows 2008/R2 servers over the next month to eliminate the rest of the connection failures I get.