Sun Microsystems, Inc.  Sun System Handbook - ISO 4.1 October 2012 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-71-1455819.1
Update Date:2012-06-18
Keywords:

Solution Type  Technical Instruction Sure

Solution  1455819.1 :   How to set up a temporary alert from a cell which does more than "alter cell validate mail"  


Related Items
  • Oracle Exadata Storage Server Software
  •  
  • Exadata Database Machine V2
  •  
Related Categories
  • PLA-Support>Database Technology>Engineered Systems>Oracle Exadata>DB: Exadata_EST
  •  
  • .Old GCS Categories>ST>Server>Engineered Systems>Exadata>Administration and Configuration
  •  


For a confidence that alerts are working it is sometimes required that an alert be generated from an Exadata Cell, and do something more than the standard "alter cell validate mail" command.

Created from <SR 3-5660863111>

Applies to:

Exadata Database Machine V2 - Version Not Applicable and later
Oracle Exadata Storage Server Software - Version 11.1.0.3.0 and later
Information in this document applies to any platform.

Goal

 The customer want to test the alerting system on an Exadata Cell more fully than "alter cell validate mail"

Fix

From a Cell Server as celladmin

cellcli> list threshold 

 

You will probably see no thresholds set up.


List some metrics which have a value

cellcli> list metriccurrent gd_io_rq_r_sm_sec

 

This lists the current value of I/O Read /second requests of small size for grid disks. Most systems will have a non-zero value for this metrics

Then to set up a temporary alert

cellcli> create threshold gd_io_rq_r_sm_sec comparison = '>', critical = 0

 

Confirm that the threshold is set up

 

CellCLI> list threshold

gd_io_rq_r_sm_sec

 

Or

 

CellCLI> list threshold detail
name: gd_io_rq_r_sm_sec
comparison: >
critical: 0.0

 

This comparison means a critical alert is generated for any value above 0.

Wait for the alert (assuming your e-mail smtp settings are correct). This will take less than 5 minutes.

Then to get rid of the alert

cellcli> drop threshold gd_io_rq_r_sm_sec



To confirm that the threshold has gone

cellcli> list threshold



You should get an e-mail like this in HTML format

Critical: Threshold Alert 25_1
Event Time 2012-05-04T12:46:44-04:00
Description The critical threshold for the following metric has been crossed.

Metric Name GD_IO_RQ_R_SM_SEC
Metric Description Number of requests to read small blocks per second from a grid disk
Object Name DBFS_DG_CD_09_dmorlcel08
Current Value 9.4 IO/sec
Threshold Value 0.0 IO/sec
Affected Cell
Name dmorlcel08
Server Model Oracle Corporation SUN FIRE X4270 M2 SERVER SAS
Chassis Serial Number 1036FMM04H
Release Version 11.2.2.4.2
Release Label OSS_11.2.2.4.2_LINUX.X64_111221
Recommended Action Examine the metric value that is violating the specified threshold, and take appropriate actions if needed.



When the alert is dropped or the problem resolved you get a message like this

Resolved: Threshold Alert 25_2
Event Time 2012-05-04T12:47:37-04:00
Description Threshold GD_IO_RQ_R_SM_SEC.DBFS_DG_CD_09_dmorlcel08 was dropped, that triggered alert to state clear
Affected Cell
Name dmorlcel08
Server Model Oracle Corporation SUN FIRE X4270 M2 SERVER SAS
Chassis Serial Number 1036FMM04H
Release Version 11.2.2.4.2
Release Label OSS_11.2.2.4.2_LINUX.X64_111221
Recommended Action The threshold GD_IO_RQ_R_SM_SEC.DBFS_DG_CD_09_dmorlcel08 was dropped and related active alert was cleared. No further action is required.

 

 


Attachments
This solution has no attachment
  Copyright © 2012 Sun Microsystems, Inc.  All rights reserved.
 Feedback