Manual Pages


Table of Contents

NAME

na_system_health_alert - Displays and modifies system health alert and alert definition.

DESCRIPTION

system health alert definition show - Display system health alert definition.

system health alert delete - Delete system health alert.

system health alert modify - Modify system health alert.

system health alert show - View system health alert.

USAGE

system health alert definition show [ -fields ] [ -instance ] [ -monitor ] [ -alert-id ] [ -perceivedseverity ] [ -probable-cause ] [ -probable-causedescription ] [ -subsystem ] [ -possible-effect ] [ -corrective-actions ] [ -additional-information ] [ -tags ]

This command displays information about the various alerts defined in the system health monitor policy file. Using -instance will display detailed information on each alert defined.

{ [-fields <fieldname>, ...]

Displays the fields specified.

| [-instance] }

Displays additional information on each alert definition.

[-monitor <hm_type>] - Monitor

Displays information about all the alert definitions with the specified monitor name.

[-alert-id <text>] - Class of Alert

Displays information about all the alert definitions with the specified alert identifier.

[-perceived-severity <hm_perceived_sev>] - Severity of Alert

Displays information about all the alert definitions with the specified perceived severity.

[-probable-cause <hm_probable_cause>] - Probable Cause

Displays alert definitions for the specified probable cause of the alert.

[-probable-cause-description <text>] - Probable Cause Description

Displays alert definitions for the specified probable cause description.

[-subsystem <hm_subsystem>] - Subsystem Name

Displays alert definitions for the specified subsystem.

[-possible-effect <text>] - Possible Effect

Displays alert definitions for the specified possible effect.

[-corrective-actions <text>] - Corrective Actions

Displays alert definitions for the specified corrective action.

[-additional-information <text>] - Additional Relevant Data

Displays definitions for the specified additional information.

[-tags <hm_alert_type>, ...] - Additional Alert Tags

Query the alerts based on keywords.

system health alert delete [ -monitor ] [ -alert-id ] [ -alerting-resource ]

This command deletes all the alerts with the specified input options.

[-monitor <hm_type>] - Monitor

Deletes alerts generated on the monitor specified.

[-alert-id <text>] - Alert ID

Deletes alerts generated on the alert ID specified.

[-alerting-resource <text>] - Alerting Resource

Deletes alerts generated on the alerting resource specified.

system health alert modify [ -monitor ] [ -alert-id ] [ -alerting-resource ] [ -acknowledge ] [ -suppress ] [ -acknowledger ] [ -suppressor ]

This command suppresses alerts generated and sets the acknowledgement state for an alert.

[-monitor <hm_type>] - Monitor

Specifies the monitor name that you want to change the state.

[-alert-id <text>] - Alert ID

Specifies the alert ID that you want to change the state.

[-alerting-resource <text>] - Alerting Resource

Specifies the alerting resource name that you want to change the state.

[-acknowledge {true|false}] - Acknowledge

Sets the acknowledgement state to true or false.

[-suppress {true|false}] - Suppress

Sets the suppress state to true or false.

[-acknowledger <text>] - Acknowledger

Sets the acknowledger as the filter for setting state.

[-suppressor <text>] - Suppressor

Sets the suppressor as the filter for setting state.

system health alert show [ -fields ] [ -instance ] [ -monitor ] [ -alert-id ] [ -alerting-resource ] [ -subsystem ] [ -indication-time ] [ -perceived-severity ] [ -probable-cause ] [ -probable-cause-description ] [ -possible-effect ] [ -corrective-actions ] [ -acknowledge ] [ -suppress ] [ -policy ] [ -acknowledger ] [ -suppressor ] [ -tags ] [ -additional-info ] [ -alertingresource-name ]

This command displays all the alerts generated on the system. Using -instance will display detailed information on each alert that was generated.

{ [-fields <fieldname>, ...]

Displays the fields you specify.

| [-instance] }

Displays additional information on each alert generated.

[-monitor <hm_type>] - Monitor

Displays information about all of the alerts with the specified monitor name.

[-alert-id <text>] - Alert ID

Displays information about all the alerts with the specified alert ID.

[-alerting-resource <text>] - Alerting Resource

Displays information about all the alerts with the specified alerting resource name.

[-subsystem <hm_subsystem>] - Subsystem

Displays information about all the alerts generated on the monitoring subsystem.

[-indication-time <Date>] - Indication Time

Displays information about all the alerts with the specified indicated time.

[-perceived-severity <hm_perceived_sev>] Perceived Severity

Displays information about all the alerts with the perceived severity level.

[-probable-cause <hm_probable_cause>] - Probable Cause

Displays information about all the alerts that contain the specified probable cause.

[-probable-cause-description <text>] - Description

Displays information about all of the alerts containing the specified probable cause description.

[-corrective-actions <text>] - Corrective Actions

Displays information about all the alerts with the specified recommended corrective action.

[-possible-effect <text>] - Possible Effect

Displays information about all the alerts with the specified possible effect.

[-acknowledge {true|false}] - Acknowledge

Displays information about all the alerts with the specified acknowledgement status.

[-suppress {true|false}] - Suppress

Displays information about all of the alerts with the specified suppressor field status of true or false.

[-policy <text>] - Policy

Displays information about all the alerts with the specified policy name.

[-acknowledger <text>] - Acknowledger

Displays information about all the alerts with the specified acknowledger field.

[-suppressor <text>] - Suppressor

Displays information about all the alerts with the specified suppressor field.

[-additional-info <text>, ...] - Additional Information

Displays information about all the alerts with the specified additional information.

[-alerting-resource-name <text>] - Alerting Resource Name

Displays information about all the alerts with the specified alerting resource name.

[-tags <hm_alert_type>, ...] - Additional Alert Tags

Query the alert based on keywords.

EXAMPLES

  This example displays information about all the alert definitions that are present in the alert definition file:

  node> system health alert definition show

  Node          Monitor                Subsystem         Alert ID
  ------------- ---------------------- ----------------- -----------------------
  csiptc-2240-23 node-connect          SAS-connect       ControllerToShelfIomA_
                                                         Alert
                    Severity: Major
              Probable Cause: Cable_tamper
  Probable Cause Description: Disk shelf $(nschm_shelf_info.id) is connected to
                              controller $(LOCALHOST) through IOM A only.
             Possible Effect: Access to disk shelf $(nschm_shelf_info.id) will
                              be lost with a single hardware failure of IOM A,
                              HBA $(nschm_shelf_info.ioma-adapter), or any
                              intervening IOM or SAS cable.
          Corrective Actions: 1. Halt controller $(LOCALHOST) and all controllers attached to disk shelf $(nschm_shelf_info.id).
                              2. Connect disk shelf $(nschm_shelf_info.id) IOM A and IOM B to controller $(LOCALHOST) following the rules in the Universal SAS and ACP Cabling Guide.
                              3. Reboot the halted controllers.
                              4. Contact support personnel if the alert
                              persists.
             Additional Info: -
                        Tags: quality_of_service


  This example displays information about all the alert definitions in detail that are present in the alert definition file:

  node> system health alert definition show -instance

                        Node: csiptc-2240-23
                     Monitor: node-connect
              Class of Alert: ControllerToShelfIomA_Alert
           Severity of Alert: Major
              Probable Cause: Cable_tamper
  Probable Cause Description: Disk shelf $(nschm_shelf_info.id) is connected to controller $(LOCALHOST) through IOM A only.
             Possible Effect: Access to disk shelf $(nschm_shelf_info.id) will be lost with a single hardware failure of IOM A, HBA $(nschm_shelf_info.ioma-adapter), or any intervening IOM or SAS cable.
          Corrective Actions: 1. Halt controller $(LOCALHOST) and all controllers attached to disk shelf $(nschm_shelf_info.id).
  2. Connect disk shelf $(nschm_shelf_info.id) IOM A and IOM B to controller $(LOCALHOST) following the rules in the Universal SAS and ACP Cabling Guide.
  3. Reboot the halted controllers.
  4. Contact support personnel if the alert persists.
              Subsystem Name: SAS-connect
    Additional Relevant Data: -
       Additional Alert Tags: quality_of_service


  This example shows how to delete an alert with the specified alert-id:

  node> system health alert delete -alert-id ControllerToShelfIomA_Alert -alerting-resource *


  This example modifies the alert field states:

  node> system health alert modify -alert-id ControllerToShelfIomA_Alert -suppress true

  This example displays information about all the alerts generated:

  node> system health alert show

                 Node: csiptc-2240-23
             Resource: ha_node_pair_info
             Severity: Major
       Probable Cause: The administrator has disabled storage failover on the
                       node "csiptc-2240-23".
      Possible Effect: There will be storage downtime if the node
                       "csiptc-2240-23" fails.
   Corrective Actions: Enable the storage failover feature on the node
                       "csiptc-2240-23" using the command "storage failover
                       modify -enabled true".


  This example displays additional information about a specific alert generated:

  node> system health alert show -monitor node-connect -instance


                        Node: csiptc-2240-23
                     Monitor: node-connect
                    Alert ID: StorageFailoverDisabled_Alert
           Alerting Resource: ha_node_pair_info
                   Subsystem: HA-health
             Indication Time: Fri Sep 28 14:58:39 2012
          Perceived Severity: Major
              Probable Cause: Loss_of_redundancy
                 Description: The administrator has disabled storage failover on the node "csiptc-2240-23".
          Corrective Actions: Enable the storage failover feature on the node "csiptc-2240-23" using the command "storage failover modify -enabled true".
             Possible Effect: There will be storage downtime if the node "csiptc-2240-23" fails.
                 Acknowledge: false
                    Suppress: false
                      Policy: StorageFailoverDisabled_Policy
                Acknowledger: -
                  Suppressor: -
      Additional Information: node_id: 1785241917
                              node_name: csiptc-2240-23
                              partner_node_name:
                              partner_id: 0
                              takeover_status: to_admin_disabled
      Alerting Resource Name: ha_node_pair_info
       Additional Alert Tags: -

SEE ALSO


Table of Contents