Table of ContentsView in Frames

Commands for monitoring the health of your system

You can use the system health commands to display information about the health of system resources, to respond to alerts, and to configure future alerts. Using the CLI commands enables you to view in-depth information about how health monitoring is configured. For more information, see the man pages for the commands.

Displaying the status of system health

If you want to... Use this command...
Display the health status of the system, which reflects the overall status of individual health monitors system health status show
Display the health status of subsystems for which health monitoring is available system health subsystem show

Displaying the status of cluster connectivity

If you want to... Use this command...
Display the status of shelves from the cluster-level view system health system-connectivity shelf show
Display detailed information about each shelf, including the shelf's UUID and ID, its connected nodes, and the number of paths to the shelf system health system-connectivity shelf show-instance

Displaying the status of node connectivity

If you want to... Use this command...

Display the status of shelves from the node-level view, along with other information, such as the owner node, shelf name, and how many disks and paths the shelf has

system health node-connectivity shelf show

Use the -instance parameter to display detailed information about each shelf.

Display the status of disks, along with other information, such as the owner node, disk name and bay number, and the number of paths to the disk

system health node-connectivity disk show

Use the -instance parameter to display detailed information about each disk.

Display the status of adapters, along with other information, such as the owner node, whether they are used and enabled, and the number of shelves attached

system health node-connectivity adapter show

Use the -instance parameter to display detailed information about each adapter.

Managing the discovery of cluster and management network switches

If you want to... Use this command...
Display the switches that the cluster monitors system health cluster-switch show
Display the switches that the cluster currently monitors, including switches that you deleted (shown in the Reason column in the command output), and configuration information that you need for network access to the cluster and management network switches

This command is available at the advanced privilege level.

system health cluster-switch show-all
Configure discovery of an undiscovered switch system health cluster-switch create
Modify information about a switch that the cluster monitors (for example, device name, IP address, SNMP version, and community string) system health cluster-switch modify
Disable monitoring of a switch system health cluster-switch modify -disable-monitoring
Display the interval in which the health monitor polls switches to gather information system health cluster-switch polling-interval show
Modify the interval in which the health monitor polls switches to gather information

This command is available at the advanced privilege level.

system health cluster-switch polling-interval modify
Disable discovery and monitoring of a switch and delete switch configuration information system health cluster-switch delete
Permanently remove the switch configuration information which is stored in the database (doing so reenables automatic discovery of the switch) system health cluster-switch delete -force

Responding to generated alerts

If you want to... Use this command...
Display information about generated alerts, such as the resource and node where the alert was triggered, and the alert's severity and probable cause system health alert show
Display information about each generated alert system health alert show -instance
Indicate that someone is working on an alert system health alert modify
Acknowledge an alert system health alert modify -acknowledge
Suppress a subsequent alert so that it does not affect the health status of a subsystem system health alert modify -suppress
Delete an alert that was not automatically cleared system health alert delete

Display information about the AutoSupport messages that alerts triggered within the last week—for example, to determine if an alert triggered an AutoSupport message

system health autosupport trigger history show

Configuring future alerts

If you want to... Use this command...

Enable or disable the policy that controls whether a specific resource state raises a specific alert

system health policy definition modify

Displaying information about how health monitoring is configured

If you want to... Use this command...

Display information about health monitors, such as their nodes, names, subsystems, and status

system health config show
Note: Use the-instance parameter to display detailed information about each health monitor.

Display information about the alerts that a health monitor can potentially generate

system health alert definition show
Note: Use the-instance parameter to display detailed information about each alert definition.

Display information about health monitor policies, which determine when alerts are raised

system health policy definition show
Note: Use the -instance parameter to display detailed information about each policy. Use other parameters to filter the list of alerts—for example, by policy status (enabled or not), health monitor, alert, and so on.

Downgrading software versions between minor releases

When downgrading a cluster from Data ONTAP 8.2.1 to Data ONTAP 8.2, Data ONTAP clears the history of deleted switches. It also sets the Model column to OTHER if a switch that it automatically discovers is not supported for health monitoring.

If you want to... Use this command...
Downgrade the software version command system health cluster-switch prepare-to-downgrade