Sun Microsystems, Inc.  Sun System Handbook - ISO 4.1 October 2012 Internal/Partner Edition
   Home | Current Systems | Former STK Products | EOL Systems | Components | General Info | Search | Feedback

Asset ID: 1-72-1495746.1
Update Date:2012-10-08
Keywords:

Solution Type  Problem Resolution Sure

Solution  1495746.1 :   Exadata: MS crashed- RS-7445 [Serv MS Is Absent] [It Will Be Restarted]  


Related Items
  • Oracle Exadata Storage Server Software
  •  
  • Exadata Database Machine X2-2 Half Rack
  •  
  • Exadata Database Machine X2-2 Full Rack
  •  
  • Exadata Database Machine X2-2 Hardware
  •  
  • Exadata Database Machine X2-8
  •  
  • Oracle Exadata Hardware
  •  
  • Exadata Database Machine X2-2 Qtr Rack
  •  
Related Categories
  • PLA-Support>Database Technology>Engineered Systems>Oracle Exadata>DB: Exadata_EST
  •  




Created from <SR 3-6260163251>

Applies to:

Exadata Database Machine X2-2 Half Rack - Version All Versions to All Versions [Release All Releases]
Exadata Database Machine X2-2 Full Rack - Version All Versions to All Versions [Release All Releases]
Exadata Database Machine X2-2 Hardware - Version All Versions to All Versions [Release All Releases]
Exadata Database Machine X2-2 Qtr Rack - Version All Versions to All Versions [Release All Releases]
Exadata Database Machine X2-8 - Version All Versions to All Versions [Release All Releases]
Information in this document applies to any platform.

Symptoms

Cell  image  versions lower than 11.2.3.2.0

MS process crashed and got restarted automatically .  Cell alert log had  RS-7445 [Serv MS is absent] [It will be restarted] [] [] [] [] [] [] [] [] [] [] signalling the restart

+ No obvious errors in the ms-odl.log /cell alert log and the incident (rs*) traces on why the MS crashed ,  apart from the RS-7445 signalling the detection of its absence.

+ Callstack in incident trace shows a very generic stack:

Problem Key: RS 7445
Error: RS-7445 [Serv MS is absent] [It will be restarted] [] [] [] [] [] [] [] [] [] []
[00]: dbgePostErrorDirect [diag_dde]
[01]: ossrsutl_dump_incident []<-- Signaling
[02]: ossrsutl_monitor_srvc []
[03]: ossrsutl_monitor_srvc_prc []
[04]: sossrs_prc_start []
[05]: ossrsutl_monitor_monpr_thd []
[06]: start_thread []
[07]: clone []
[08]: 0000000000000000 []


+ Reviewing the /var/log/oracle/deploy/hs_err_pid<PID #>.log

Stack: [0x0000000040b8d000,0x0000000040c8e000),  sp=0x0000000040c8c540,  free space=1021k
Native frames: (J=compiled Java code, j=interpreted, Vv=VM code, C=native code)
V  [libjvm.so+0x65099e]
V  [libjvm.so+0x56163b]
V  [libjvm.so+0x38612b]
V  [libjvm.so+0x3aa15b]
C  [libmsosscomm11.so+0x2fdc]                                                                                                                                             <<<<<<<<<<
C  [libmsosscomm11.so+0x142e]  Java_oracle_ossmgmt_ms_core_MSOSSComm_static_1sendrecv+0x1a2
j  oracle.ossmgmt.ms.core.MSOSSComm.static_sendrecv(I[CLjava/lang/Object;)I+0
j  oracle.ossmgmt.ms.core.MSOSSComm.getOSSMetrics(Loracle/ossmgmt/ms/core/OSSMetricList;Loracle/ossmgmt/ms/core/Position;)I+66


Cause

<Bug 14521381> - RS-7445 [SERV MS IS ABSENT] [IT WILL BE RESTARTED]

Which has been closed as a duplicate of :

<Bug 11903713> - CELL-2628 DURING LOOP TEST OF CELLCLI LIST QUERIES
 

Solution

1. Ignore the error as MS will be automatically restarted upon crash, and this will not affect any functionality

2. Apply the 11.2.3.2.0 image for a permanent fix.
 

References

<BUG:11903713> - CELL-2628 DURING LOOP TEST OF CELLCLI LIST QUERIES
<BUG:14521381> - RS-7445 [SERV MS IS ABSENT] [IT WILL BE RESTARTED]

Attachments
This solution has no attachment
  Copyright © 2012 Sun Microsystems, Inc.  All rights reserved.
 Feedback