This article was updated to support v12.1.3 of Goliath Performance Monitor.
The process watch monitoring rule has the ability to monitor the following scenarios for processes on your Windows servers/workstations using our Goliath Intelligent Agent to alert on specified conditions in real-time:
- If a process should always be running, alert if the process is stopped and optionally attempt to restart the process.
- If a process should never be running, alert if the process is running and optionally terminate the process.
- If a process is running more the X instances, alert and optionally terminate the processes.
- Monitor process utilization, whenever the process runs, and alert if the process is exceeding the desired threshold(s) and optionally terminate the process.
Configure a Process Watch rule
- To create a new monitoring condition, navigate to the Configure - Monitoring Rules page and click the New button
- A selection pane will appear, select the radio button option for Process Watch and then click OK
- Now the monitoring rule pane will appear. At the top of the pane name the Monitoring Rule via the Rule Name field, as well as define the description and the severity.
- The first tab, ProcessWatch is where you will define condition(s) to be monitored
- In the Process Name field, define the process to be monitored.
-
The single asterisk (‘*’) Wildcard character is supported, and when specified, all processes are monitored against the specified thresholds.
- When selecting this option it will also disable the ‘should be running or not running’ options along with the remediation to restart the process.
- Use the asterisk (‘*’) Wildcard character as part of a Process Name to refer to a group of like named processes, for example, a Process Name of ‘abc*’ (exclude the quotes when actually using) would match any process name that started out with the characters ‘abc… '
- You can also use the ‘…’ button to find processes that are currently running on a specific server
-
The single asterisk (‘*’) Wildcard character is supported, and when specified, all processes are monitored against the specified thresholds.
-
In the Process Path field, define the fully qualified path for the Process being monitored.
- This is only applicable and required if the Restart option is selected.
- The path includes the driver letter but does NOT include the machine name. For example, "C:\Program Files\MyProcesses\Process.exe" is a correct specification.
-
In the Should be field, define whether or not the process(es) being monitoring should always be running or should never be running using the radio button options.
- You can disable this option by editing the process name field as described in steps 5.1-5.3.
- The Notify Only check box, when checked, will alert you on the monitoring condition and will not try to terminate or restart the process.
-
The Restart radio button, when selected, will alert you on the monitoring condition AND will attempt to restart the process.
- This field is defaulted when the ‘Running’ option is selected.
- You can disable this option by using the process name field as described in steps 2.1-2.3.
-
The Terminate radio button, when selected, will alert you on the monitoring condition AND will attempt to terminate the process.
- This field is defaulted when then Not Running radio button for Should be is selected.
- In the Delay field, optionally specify a duration value in minutes that the process is detected as running or not running before the alert condition triggers.
- In the Thresholds field, click the arrow button to view the optional threshold parameters to monitor CPU and memory utilization of a running process.
-
In the Instance Count field, optionally define the minimum number of instances for a process that should be running; if the number of instances of the specified process running is less than this then an alert condition exists.
- If the Not Running radio button for Should be is selected, then this count defines a 'ceiling' such that if the instance count of the specified process is equal to or greater than this count, then an alert condition exists
- If the Terminate option is selected then instances of the specified process will be terminated until the number of instances is less than this count.
-
The WildCard Exclusions field is only applicable if there is a WildCard ‘*’ specified in the Process Name field.
- Optional parameter, if specified defines one or more processes, separated by a semi-colon, that are excluded from the process check.
- The Incl All check box, meaning include all, when enabled when using a wild-card specification for the Process Name and you want the CUMULATIVE CPU Utilization of all processes that match the wild-card to be checked against the Threshold ‘CPU (%)’.
- In the Selections tree, select the servers/workstations that you want to monitor the specified condition on.
Configure the Schedule
The Schedule tab of a monitoring rule allows users to define how frequently the rule will alert. This can be done by adjusting the following fields:
Required Options (one of the below must be selected):
-
Alert Every Time: Defines whether an alert is generated every time the conditions are on the previous tab are met.
- When checked, an alert is generated every time the specified condition is met.
- When unchecked, the alert is only generated if the alert conditions are met, and the Minimal Notification Interval, see below, is exceeded since the last alert for this type.
-
Minimal Notification Interval: Defines the minimum amount of time that must elapse between events for the specified condition before another alert will be generated.
-
For example, if the interval is 15 minutes and the condition is being met every 3 mins, you will receive 1 alert every 15 minutes instead of being alerted at each occurrence.
- However, each alert occurrence is considered unique based on the details. For example, an Event Log alert is considered the same based on being the same Event Type and ID, from the same server/workstation.
- The Alert Every Time checkbox must be unchecked in order to use this option.
- For ServerWatch IP Services, this also defines the minimum elapsed time since a service is first detected as down or failed before an alert is generated.
-
For example, if the interval is 15 minutes and the condition is being met every 3 mins, you will receive 1 alert every 15 minutes instead of being alerted at each occurrence.
Additional Options:
-
Maximum Notification Interval: Defines the maximum number of times you want to be notified during a continuous failure situation.
- The default value of '0' means infinite; no maximum is defined so you will continue to be notified according to your Alert Every Time and Minimal Notification Interval settings.
- A non-zero value means that after you have been notified the number of times defined in the Maximum Alert Notifications, and according to your Alert Every Time and Minimal Notification Interval settings, you will not be notified again.
- For example, if "5" is selected, the event will alert for the first 5 events and all additional events will be ignored.
- Active Only if Server 'Owns".... This Cluster Group: This option is for monitoring machines that are apart of a cluster. If the checkbox is checked, then the rule is monitoring and will alert only if the server is a member of a cluster, 'owns' the specified Cluster Group and the specified Cluster Group is 'Online'.
Additional Configuration
For additional configuration options please see the following articles: