SCOM 2007 R2 CU4 has been released to the web. You can download it here
CU 4 contains a fix that I have been waiting for since SCOM 2007 RTM. It has the ability for the RMS to recover from a SQL disconnect, restart, or failure. This is obviously super important for a monitoring system.
Automatic recovery of Health Service in a SQL Server failure scenario
After SQL Server goes offline, Health Service of Root Management Server (RMS) stops responding. For example, Health Service stops responding after SQL Server disconnects, restarts, or fails. To recover from this issue when SQL Server is available again, you must restart Health Service.
By default, this automatic recovery feature is disabled. The DALInitiateClearPoolSeconds setting controls when RMS drops the current connection pool and when RMS tries to reestablish an SQL connection. We recommend that you set this setting to 60 seconds or more to avoid performance issues. To configure this recommended setting, set the following values:
- SOFTWARE\Microsoft\Microsoft Operations Manager\3.0\DAL\DALInitiateClearPool = true
- SOFTWARE\Microsoft\Microsoft Operations Manager\3.0\DAL\DALInitiateClearPoolSeconds = 60
*Note* To apply the DALInitiateClearPoolSeconds setting, restart Health Service of RMS.
Kevin Holman has done a nice job providing step-by-step instructions on CU4. Link