Sun System Handbook - ISO 3.4 June 2011 Internal/Partner Edition
Solution Type: Problem Resolution Sure

Solution 1003864.1: Invalid ID 125 on Sun StorEdge[TM] 3510 causing multiple drive/loop problems.
PreviouslyPublishedAs 205426

Symptoms

The symptoms for this problem were as follows:-

1. Multiple disk drives failed (LEDs turned amber)

During troubleshooting, an invalid ID 125 was found in the loop maps from both channel 2 and channel 3, and this invalid ID was causing the failures described above. This document describes how to troubleshoot the problem and remove the invalid ID, which resolves it.

Resolution

Invalid ID 125 on Sun StorEdge[TM] 3510 causing multiple drive/loop problems.

This problem occurred on the following configuration, but it can occur on any SE3510, with or without a JBOD connected, and the resolution is the same.

Configuration:- RAID dual controller with 2 JBODs running 3.27R. The RAID head has chassis ID 0 and the JBODs have chassis IDs 1 and 2.

The IDs on the SE3510 are set based on the chassis ID. The valid ID ranges for any SE3510 configuration are as follows:-

Chassis ID Switch Setting    ID Range

Please check <Document: 1007692.1> (Sun StorEdge[TM] 3510 FC Array switch settings and disk IDs) for more details.

Considering chassis ID = 7, we have from the above: target disk drive loop IDs are 112-123. ID 125 does not fall within this range.

When the problem occurs, the show loop-map command reports the following:

sccli> show loop-map channel 2
40 devices found in loop map
Channel Loop Map retrieved from CH 2 ID 12
AL_PA SEL_ID SEL_ID TYPE ENCL_ID SLOT

When the problem occurred, the following messages were logged in the event log of the SE3510.

snippet from the show events....
Mon Oct 18 15:33:39 2004
Mon Oct 18 15:33:40 2004
Mon Oct 18 15:33:41 2004
Mon Oct 18 15:33:41 2004
end snippet.

These messages were seen for many drives, and the paths to the drives were repeatedly failing and recovering because of this problem.
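The ID-range reasoning above can be sketched in a few lines of Python. This is an illustration only and is not part of sccli or any Sun tool: the function names are hypothetical, and the spacing of 16 loop IDs per chassis ID switch setting (settings 0-7) is an assumption that merely agrees with the 112-123 range quoted for chassis ID 7. Document 1007692.1 remains the authoritative source for the ranges. The loop IDs are entered by hand as decimal values read from the loop map, and any IDs legitimately used by non-drive devices (for example SES or controller IDs) must be supplied by the caller.

```python
# Hypothetical helper for sanity-checking loop IDs; not an sccli feature.

DRIVES_PER_CHASSIS = 12  # 112-123 for chassis ID 7 spans 12 IDs (from this document)
IDS_PER_CHASSIS = 16     # ASSUMPTION: each switch setting reserves a block of 16 IDs


def drive_id_range(chassis_id):
    """Return the expected drive loop IDs (decimal) for one chassis ID."""
    if not 0 <= chassis_id <= 7:  # ASSUMPTION: switch settings run from 0 to 7
        raise ValueError("chassis ID switch setting must be 0-7")
    base = chassis_id * IDS_PER_CHASSIS
    return range(base, base + DRIVES_PER_CHASSIS)


def unexpected_ids(loop_ids, chassis_ids, other_valid_ids=()):
    """Return the loop IDs that match no expected drive ID.

    loop_ids        -- decimal IDs read by hand from the loop map
    chassis_ids     -- chassis ID switch settings present in the configuration
    other_valid_ids -- caller-supplied IDs of non-drive devices (SES, controllers)
    """
    allowed = set(other_valid_ids)
    for chassis_id in chassis_ids:
        allowed.update(drive_id_range(chassis_id))
    return sorted(i for i in set(loop_ids) if i not in allowed)


if __name__ == "__main__":
    # RAID head on chassis ID 0 with JBODs on chassis IDs 1 and 2.
    print(unexpected_ids([0, 1, 16, 32, 125], chassis_ids=[0, 1, 2]))
```

With the sample IDs shown, only 125 is reported, because it belongs to none of the expected drive ranges for chassis IDs 0, 1 and 2.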
Resolution :-

Unfortunately, there is no one-step process when more than one chassis is present. To isolate the problem, disconnect each chassis in turn, re-run the loop map command, and check whether the invalid ID still appears in the loop map. In the case described above, the problem was resolved once the JBOD with chassis ID 2 was disconnected.

Once the chassis is isolated, use the following procedure to isolate the bad IOM in that chassis.

1. Reconnect the suspect in the loop and replace the top IOM and check with loop

Notes:

1. In the above example, the problem happened to be on the second JBOD but it
2. To isolate the problem, downtime is needed as the problem can't be solved
3. Instead of physically disconnecting the JBODs, we can use the "bypass"
4. This problem happened with 3.27R but the chances are that it can also happen
5. Lastly, there is a possibility that this problem may also happen with SE3511

Additional Information

With the RAID head chassis ID set to "0", one JBOD on ID "1", and the other on ID "2", the loop-map output as seen from channel 2 is as follows. This is GOOD output, taken from a system with NO problem.

sccli> show loop-map channel 2
41 devices found in loop map
Channel Loop Map retrieved from CH 2 ID 12
AL_PA SEL_ID SEL_ID TYPE ENCL_ID SLOT
(hex) (hex) (dec)

Product

Sun StorageTek 3510 FC Array

Internal Comments

There are at least a couple of escalations with this problem. See escalations 1-4541318 and 1-9523889.
sccli, 3510, loop-map, SES, JBOD

Previously Published As 82392

Change History

Date: 2007-06-29
User Name: 7058
Action: Approved
Comment: Activated link for Normalization (ease of tracking & reference)
Notes for Normalization: This document is referenced by: 86947
Subset Root path: 82392-->86947-->89127-->89034-->89031-->89050/86520
This document references: 80185
Project: Minnow Normalization
Version: 4

Date: 2007-06-29
User Name: 7058
Action: Update Started
Comment: Activating link to 80185
Version: 0

Date: 2005-08-24
User Name: 95826
Action: Approved
Comment: - verified metadata - changed review date to 2006-08-24 - checked for TM - 2 added - checked audience : contract Publishing
Version: 3

Attachments

This solution has no attachment