Error in system event log on x64 cluster system
qcjims
Posts: 38
getting the following error several times per second in the system event log of several of our x64 cluster systems after installing red-gate Bakcup 6.
Event Type: Error
Event Source: ClusSvc
Event Category: Database Mgr
Event ID: 1080
Date: 9/1/2009
Time: 10:12:36 PM
User: N/A
Computer: xxxxxxxxxxxxx
Description:
Cluster service could not write to a file (C:\DOCUME~1\MCS64\LOCALS~1\Temp\CLS623.tmp). The disk may be low on disk space, or some other serious condition exists.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
Data:
0000: 05 00 00 00 ....
I have upgraded to 6.2 from 6.0 and still have the same error. i have also clean installed and still have the same problem. The disk is not low on space, and the account the cluster service runs under (MCS64) does have access to write to that directory. Any ideas?
-js
Event Type: Error
Event Source: ClusSvc
Event Category: Database Mgr
Event ID: 1080
Date: 9/1/2009
Time: 10:12:36 PM
User: N/A
Computer: xxxxxxxxxxxxx
Description:
Cluster service could not write to a file (C:\DOCUME~1\MCS64\LOCALS~1\Temp\CLS623.tmp). The disk may be low on disk space, or some other serious condition exists.
For more information, see Help and Support Center at http://go.microsoft.com/fwlink/events.asp.
Data:
0000: 05 00 00 00 ....
I have upgraded to 6.2 from 6.0 and still have the same error. i have also clean installed and still have the same problem. The disk is not low on space, and the account the cluster service runs under (MCS64) does have access to write to that directory. Any ideas?
-js
Comments
Could you send us as much information about your cluster as possible? (OS + SP + any other updates, installed instances + version + bitness, any other software installed on the system, how you installed SQL Backup, any recent fail overs etc). We have had one other similar report but have not managed to reproduce it yet.
Feel free to email support@red-gate.com with the information if you do not want to publish it here - plesae include the url to this page when you do so.
Sorry for the problems you are facing with SQL Backup.
Regards,
James
Head of DBA Tools
Red Gate Software Ltd
This issue may be related to a registry key replication issue.
As a tempory workaround you can disable this feature for SQL Backup.
For each SQL Server group In Cluster Administrator, right click ‘SQLBackupAgent_[INSTANCE]’ and select ‘properties’. Under the Registry Replication tab, remove the entries – ignoring any errors. Once registry replication is removed, the event log errors will no longer be produced. The obvious drawback to this technique is that any special reg keys you want to set will have to be manually copied over (such as BrowsingUserList, templates etc).
On my cluster, re-entering keys for replication did not cause the problem to reoccur - though I am still testing the full behaviour.
Please let me know how things go if you try this,
Thanks
Development
Red-Gate Software
Can you confirm that there are no entries here? This bug can also occur when using Microsoft Exchange on your cluster.
Development
Red-Gate Software
Development
Red-Gate Software
If I look in the cluster.log for errors, I see the following entries (or similar). The .tmp file mentioned matches up with the error in the event logs.
Here are the cluster.log entries:
Here is the corresponding error in the system event log:
When I add the replicated registry entries back using CLUADMIN, the errors do not return. However, it does not appear that registry replication is actually doing anything at this point (which might explain the lack of errors). When I attempt to make changes to these values using the SQLBackup GUI (changing smtp setting for example), the values are recorded on the node that the cluster group is running on, but the values are not replicated to the other nodes.
I have also noticed that there are inconsistencies between the instances listed under the BackupSettings, BackupSettingsGlobal and InstalledInstances registry keys across the cluster nodes. Shouldn't all the instances listed under each key be the same across the entire cluster?
-js
It's been explained to me that the keys aren't actually replicated unless you fail the generic service resource. I can't seem to find any Microsoft documentation about this, though.
Any inconsistencies are probably a result of the fact that key replication breaks when this bug occurs.
Development
Red-Gate Software
so taking the SQLBACKUP_AGENT offline and then bringing it back online is not enough? This KB article seems to indicate that registry changes are replicated when the resource goes offline. So, if i just fail the resource then it should replicate across all nodes right?
http://support.microsoft.com/kb/174070
What bug are you referring to?
-js
The active/active registry key replication issue. So far, only observed on 64-bit Windows 2003 multi-instance clusters. On such systems I would recommend turning off key replication by the aforementioned method whilst we investigate (logged as SB-4349)
Development
Red-Gate Software
Having removed the registry keys from replication in 1 clustered instance and then attempting to remove it from the next instance either resulted in a message telling me that it wasn't there or when the dialog box was opened, no registry keys were present.
That's what I think it is based on my own experience so far and would probably explain why testing in an attempt to replicate the issue may have failed, the testing was attempted on 1 clustered instance of SQL Server rather than multiple instances.
The registry keys are meant to be instance specific. If they're not in your deployment then something has gone wrong with the install.
For each SQL Backup entry in the cluster, the two keys for replication should be:
SOFTWARE\Wow6432Node\Red Gate\SQL Backup\BackupSettings\(local)
SOFTWARE\Wow6432Node\Red Gate\SQL Backup\BackupSettingsGlobal\(local)
where '(local)' would be replaced with the instance name if not default.
What were the keys for replication on your system? Were they all set to one instance?
Development
Red-Gate Software
Sorry, yes, you are correct. I made an assumption based on the error in the cluster log. One thing I have noticed is that for some clusters, the data path key points to the clustered drive and in others, it's a local path to the C drive.
I'm sorry to hear this has affected you. I discovered this myself very recently on 2008R2. The last step of the installer dialog offers the user the chance to choose where the data is kept - but the default entry sometimes points to C:\ rather than the shared drive. This should be corrected, either at install time or manually via the registry.
Development
Red-Gate Software
The 6.3 release does not include any changes to the key replication behaviour. I can reproduce the issue, but have not yet been able to determine the cause. If you are affected by this bug, please disable key replication for SQL Backup whilst we investigate. I'm sorry for any inconvenience this may cause.
Regards,
Development
Red-Gate Software
Is red-gate working with Microsoft to try and determine the cause of this issue?
-js
I just noticed this error on our Windows 2003 Enterprise R2 x64, SQL Backup 6.3.048 (so far, 2 clusters have the error and 3 don't)
Interesting... they're all SQL 2005 Standard
I'm gonna remove the registry replication from those failed ones
DBA, MCITP
please fix this ASAP.
We are currently undergoing release testing, however, if you would like to receive the patch early please get in touch via private message or email (robin.anderson@red-gate.com).
Regards,
Development
Red-Gate Software
Great news, eager to wait for the new release!!
DBA, MCITP
I just upgraded all my production from 6.3 to 6.4 and Registry Replication has re-surfaced in the SQLBackupAgent
HOWEVER............ the error continues non-stop UNTIL a restart of SQLBackupAgent (take offline, bring online), keep that in mind!!!
I had to do this for 2 * 4 clusters we have
DBA, MCITP
I just upgrade a 64_bit cluser from 6.3 to 6.4. if I look in the Cluster Admin, right click 'SQLBackupAgent' amd select properties I can see 4 entries inder the Registry replication tab. 2 are for Software\WOW6432Node\Red Gate and the other 2 are Software\Red Gate.
Which ones should I remove?
Thanks
Chris
We are seeing the Event 1080 in some 64_bit clusters but not all of ours.
If the Microsoft clustering service was attempting key replication whilst you were upgrading, then you will still see errors appearing in the log until it gives up or completes. However, so long as there are no wow6432node paths set and the actual registry keys do exist then the issue will not reoccur.
Regards,
Development
Red-Gate Software