Pages

torsdag 27. februar 2014

OpsMgr 2012: NiCE Log File Management Pack

I had the chance to take a look at a new management pack from NiCE. It will be available for FREE next week and it is a most welcome addition to the existing functionality that Opertations Manager already provide for monitoring log files. Another great blogger, Stefan Roth, already blogged about it here so I will not go into a lot of details here.

To get started you download a Quick Start guide in PDF format and the MSI installer file from www.nice.de.

The installer, like most Management Pack installers, will extract the management pack to a specified folder. Then you have to import it to your Management Group using the Operations Manager Console. You can uninstall it from Windows Programs and Features after that. However, I would recommend to keep the management pack in a file repository, by version, so you can easily revert to older versions if a new version have problems or changed functionality that you do not like.

The Quick Start guide will tell you with NiCE Step-By-Step guides, how to get started. Quite helpfull.

After playing around with it for a bit I have to say that this is awesome, you should give it a try. Highly recommended.

Highly recommended

UPDATE 28. february 2014: I have found two problems with the current version (1.0.26.0):

1) Self Monitoring Rules targets Windows Computers
The purpose of the Self Monitoring Rules are to monitor the Operations Manager Event Log for warnings and errors in the Operations Manager event log related to this Management Pack. The problem is if you have Windows Clusters. Then the rules will also target the cluster address (the virtual node). This may result in event 26004 being logged on the active node. The event would look something like this:

Log Name: Operations Manager
Source: Health Service Modules
Date: 27.02.2014 14:41:49
Event ID: 26004
Task Category: None
Level: Error
Keywords: Classic
User: N/A
Computer: host.lab.internal
Description:
The Windows Event Log Provider is still unable to open the Operations Manager event log on computer 'cluster.lab.internal'. The Provider has been unable to open the Operations Manager event log for 720 seconds.

Most recent error details: The RPC server is unavailable.

This will be picked up by the monitor "Failed Accessing Windows Event Log" targeting the "Health Service" and make the agent go into a warning state.

The workarround is to disable the following Rules for Class Windows Computers:
Self Monitoring: NiCE Log File Provider (Errors)
Self Monitoring: NiCE Log File Provider (Warnings)

2) Missing Console Task
This problem may be related to my environment, however it is of greater impact. In my console I noticed that the console tasks was missing. For example in the Windows Computers view, I would normally see the following sections in the Tasks pane: State Actions, Tasks, Navigation, Windows Computer Tasks and Report Tasks. After importing the NiCE Log File Management Pack, only State Actions remained. Closing the console did not help. Starting the console with /clearcache switch did not help.

Only after removing the NiCE Log File Management Pack and restarting the console did the Tasks reappear. I tried to import the management pack once more and after restarting the console, the tasks was missing again. In the event log I found two events that occured at the time I imported the Management Pack, so they are likely to be related:

Log Name:      Operations Manager
Source:        DataAccessLayer
Date:          28.02.2014 16:43:55
Event ID:      33333
Task Category: None
Level:         Warning
Keywords:      Classic
User:          N/A
Computer:      scom.lab.internal
Description:
Data Access Layer rejected retry on SqlError:
Request: ResourceByCriteria -- (LanguageCode1=ENU), (LanguageCode2=), (Category0=ad3be2e1-d1e2-bbf7-0c05-67ae8781a16a)
Class: 16
Number: 10316
Message: The app domain with specified version id (25189) was unloaded due to memory pressure and could not be found.

The other event was:

Log Name:      Operations Manager
Source:        OpsMgr SDK Service
Date:          28.02.2014 16:43:55
Event ID:      26319
Task Category: None
Level:         Error
Keywords:      Classic
User:          N/A
Computer:      scom.lab.internal
Description:
An exception was thrown while processing GetResourcesByCriteria for session ID uuid:89adfa9b-ddca-4143-a1e0-cf91c49e0703;id=35.
Exception message: The creator of this fault did not specify a Reason.
Full Exception: System.ServiceModel.FaultException`1[Microsoft.EnterpriseManagement.Common.UnknownDatabaseException]: The creator of this fault did not specify a Reason. (Fault Detail is equal to The app domain with specified version id (25189) was unloaded due to memory pressure and could not be found.).

UPDATE 6. march 2014: A new release has arrived 1.0.27.0 where the second problem (Missing Console Tasks) outlined above has been resolved. To get this great management pack you need to register at the www.nice.de site and then you will be able to download it for free.