Asset ID: |
1-75-1339901.1 |
Update Date: | 2012-03-21 |
Keywords: | |
Solution Type
Troubleshooting Sure
Solution
1339901.1
:
Sun SPARC[TM] Enterprise M3000, M4000, M5000, M8000, M9000: Troubleshooting a Noisy FAN on OPL Servers
Related Items |
- Sun SPARC Enterprise M9000-64 Server
- Sun SPARC Enterprise M9000-32 Server
- Sun SPARC Enterprise M8000 Server
- Sun SPARC Enterprise M3000 Server
- Sun SPARC Enterprise M4000 Server
- Sun SPARC Enterprise M5000 Server
|
Related Categories |
- PLA-Support>Sun Systems>SPARC>Enterprise>SN-SPARC: Mx000
- .Old GCS Categories>Sun Microsystems>Servers>OPL Servers
|
In this Document
Purpose
Last Review Date
Instructions for the Reader
Troubleshooting Details
Applies to:
Sun SPARC Enterprise M9000-64 Server - Version: Not Applicable and later [Release: N/A and later ] Sun SPARC Enterprise M9000-32 Server - Version: Not Applicable and later [Release: N/A and later] Sun SPARC Enterprise M8000 Server - Version: Not Applicable and later [Release: N/A and later] Sun SPARC Enterprise M5000 Server - Version: Not Applicable and later [Release: N/A and later] Sun SPARC Enterprise M3000 Server - Version: Not Applicable and later [Release: N/A and later] Information in this document applies to any platform.
Purpose
This resolution path addresses problems with reported noisy FAN on OPL Mx000 systems.
Last Review Date
July 20, 2011
Instructions for the Reader
A Troubleshooting Guide is provided to assist
in debugging a specific issue. When possible, diagnostic tools are included in the document
to assist in troubleshooting.
Troubleshooting Details
The following are the steps to be taken in case one is under the impression that the FANs of an OPL server are too noisy.
NOTE: Typically the perceived noise of a FAN is due to the fact that it is running at a higher speed than normal.
Step #1 - Verify fault presence that generate FAN rotation speed increaseCheck the output of "showstatus" command to see if there are faults present on the platform. A system without HW issues will provide an output like this one:
XSCF> showstatus No failures found in System Initialization.
Check the output of the following command to see if the FANs are rotating at a higher speed than the expected in a sane situation (Low speed - Level-4 on M3000 are the values expected in case of normal operation)
XSCF> showenvironment Fan
this is an example of a "normal" situation (took from an M5000):
XSCF> showenvironment Fan FAN_A#0:Low speed FAN_A#0: 3367rpm FAN_A#1:Low speed FAN_A#1: 3317rpm FAN_A#2:Low speed FAN_A#2: 3367rpm FAN_A#3:Low speed FAN_A#3: 3317rpm PSU#0 PSU#0:Low speed PSU#0: 3668rpm PSU#0: 3479rpm PSU#1 PSU#1:Low speed PSU#1: 3552rpm PSU#1: 3515rpm PSU#2 PSU#2:Low speed PSU#2: 3308rpm PSU#2: 3276rpm PSU#3 PSU#3:Low speed PSU#3: 3154rpm PSU#3: 3245rpm
If the output of the showstatus command is showing non healthy HW present in the platform and the FANs are not running at Low speed (or Level-4 on M3000) than there could be a relationship between the reported FAN noise and the HW faults.
This situation needs to be analyzed and properly addressed.
Please open a Service Request to Oracle Support (see Step #5)
Step #2 - Check if all the power cords are plugged into PSUs
When some PSUs are not used the FAN raise their speed to compensate this situation. Typically unused PSUs appear similar to the following "showhardconf" command output: PSU#0 Status:Normal; Serial:1357FYG-0845AD0036; + FRU-Part-Number:CF00300-2011 0202 /300-2011-02-02; + Power_Status:Input fail; AC: - ;
In this scenario the lack of power on PSU(s) is the issue to be investigated.
If power cords are plugged in but the command output field is still " - " then the first thing to do should be to have an electrician check the datacenter power source. There may be an issue on that side.
If PSUs are plugged in and Power Source is good then the situation has to be analyzed and properly addressed.
Please open a Service Request to Oracle Support (see Step #5)
Service personnel should replace the PSU.
Step #3 - Check for environmental situations that could generate FAN
rotation speed increases
See the following doc for a guidance on expected
behavior of FAN functionality based on the environmental
situations:
Sun
SPARC Enterprise[R] Mx000 server fan/fantray temperature and
failure behavior. (Doc ID 1019147.1)
According to the information present in this document an increased rotation speed doesn't always mean the presence of an issue.
Step #4 - Verify the maintenance flag is not set on the platform
While performing maintenance activities the FAN speed is raised. If, for some reason, that flag is not removed at the end of the maintenance the FAN speed stays high.
The maintenance flag is set at two levels.
- system level (visible from showhardconf output)
SPARC Enterprise M9000; + Serial:AKD0907115; Operator_Panel_Switch:Locked; + Power_Supply_System:Dual-3Phase; Ex:Dual-3Phase; SCF-ID:XSCF#0; + System_Power:Maintenance; System_Phase:Cabinet Power On;
To release the maintenance flag run the unlockmaintenance command - FRU level (visible form showboards -va output)
XSB R DID(LSB) Assignment Pwr Conn Conf Test Fault COD ---- - -------- ----------- ---- ---- ---- ------- -------- ---- 00-0 00(00) Assigned y y y Passed Normal n 00-1 02(00) Assigned y y y Passed Normal n 00-2 04(00) Assigned y y y Passed Normal n 00-3 01(00) Assigned y y y Testing Normal n <... snip ...>
An example of the "showenvironment Fan" output in this situation is:
FAN_A#0:High speed FAN_A#0: 5113rpm FAN_A#0: 5037rpm FAN_A#0: 4891rpm
FAN_A#1:High speed FAN_A#1: 5273rpm FAN_A#1: 5273rpm FAN_A#1: 5357rpm
FAN_A#2:High speed FAN_A#2: 5113rpm FAN_A#2: 5037rpm FAN_A#2: 4963rpm
FAN_A#3:High speed FAN_A#3: 5192rpm FAN_A#3: 5357rpm FAN_A#3: 5357rpm
<... snip ...>
The maintenance flag is managed by replacefru command which raises it at the start of the maintenance task and remove it when/if the activity finish correctly
Similar situations have to be analyzed and addressed. Typically in that case, any further replacefru will fail.
Please open a Service Request to Oracle Support (see Step #5)
Step #5 - Raise a Service Request to Oracle Support
If the previous steps did not helped you in understanding the situation or further action has to be taken as explicitly stated, then a SR has to be open. In this case you'll have to gather and send to Oracle the snapshot of the Active XSCF.
See:
If Customer is reporting an issue regarding the noise of the FAN and a SR has been raised then you should collect a snapshot as described here: Gathering diagnostic data for Sun SPARC Enterprise[TM] Mx000 (OPL) Servers (Doc ID 1008229.1)
With that data you can follow the previous reported troubleshooting steps.
Additional possible cause to be checked (Internal Only information)
On M4000/M5000 it has been reported
that Manufacturing is using 2 different vendors for the fans to be
built into the fan modules. Some customer are finding that the noise
of the type 2 fan is "unpleasant" and loud. Although the noise generated
is within the specification, the pitch is different. Both types have the same P/N, a visual inspection is required to determine the type.
Manufacturing is using 2 different vendors for the fans to be built into the fan modules.
Type 1: San Ace 172, 7 blades (Model 9GV5748P5H03) sound file

Type 2: Japan Servo, 5 blades (Model G1751M48BYAP-22) sound file

Some customer are finding that the noise of the type 2 fan is "unpleasant" and loud. Although the noise generated is within the specification, the pitch is different. Both Type 1: San Ace 172, 7 blades (Model 9GV5748P5H03) and Type 2: Japan Servo, 5 blades (Model G1751M48BYAP-22) can be either p/n 541-3447 and 541-0573, so visual inspection is always needed.
If you have a case / task where a customer complains about the noise, first ensure that the fan are really at low speed:
XSCF> showenvironment Fan The values should be around 3400 RPM, then fill out the table located at Noisy Fan Tracking Page (Beehive Workspace).
If
you're dealing with a Customer complaining about fans being too noisy
in M4000 / M5000 systems and you've already excluded the possible
situation described above then
raise a collaboration SR.
NOTE: in case of - multiple fan failures
- all fans running at the exact same speed
the problem is probably a controller failure. Keywords: OPL, Mx000, thermal, temp, fan, fantray, M4000, M5000, M8000, M9000, M9000+ speed, maintenance, environment, noise, noisy
Attachments
This solution has no attachment
|