Sun Oracle SPARC T3-1B Service Manual

Server
Hide thumbs Also See for SPARC T3-1B:
Table of Contents

Advertisement

Quick Links

SPARC T3-1B Server Module
Service Manual
Part No.: E29280-02
July 2012

Advertisement

Table of Contents
loading
Need help?

Need help?

Do you have a question about the SPARC T3-1B and is the answer not in the manual?

Questions and answers

Summary of Contents for Sun Oracle SPARC T3-1B

  • Page 1 SPARC T3-1B Server Module Service Manual Part No.: E29280-02 July 2012...
  • Page 2 Copyright © 2010, 2012 Oracle and/or its affiliates. All rights reserved. This software and related documentation are provided under a license agreement containing restrictions on use and disclosure and are protected by intellectual property laws. Except as expressly permitted in your license agreement or allowed by law, you may not use, copy, reproduce, translate, broadcast, modify, license, transmit, distribute, exhibit, perform, publish, or display any part, in any form, or by any means.
  • Page 3: Table Of Contents

    Contents Using This Documentation ix Identifying Components 1 Front and Rear Panel Components 2 Illustrated Parts Breakdown 3 Detecting and Managing Faults 5 Diagnostics Overview 5 Diagnostics Process 7 Diagnostics LEDs 10 Managing Faults (Oracle ILOM) 12 Oracle ILOM Troubleshooting Overview 12 Fault Management 13 Fault Clearing 13 Oracle Solaris Fault Manager Commands in Oracle ILOM 14...
  • Page 4 Checking if Oracle VTS Software Is Installed 48 Oracle VTS Overview 49 ▼ Check if Oracle VTS Software Is Installed 50 Preparing for Service 51 General Safety Information 51 Safety Symbols 52 SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 5 ESD Safety Measures 52 Antistatic Wrist Strap Use 52 Antistatic Mat 52 Tools Needed for Service 53 ▼ Find the Modular System Serial Number 53 ▼ Find the Server Module Serial Number 54 ▼ Locate the Server Module 55 Removing the Server Module From the Modular System for Service 55 ▼...
  • Page 6 Replacing the Server Module Enclosure Assembly 107 ▼ Transfer Components to Another Enclosure Assembly 108 Returning the Server Module to Operation 111 ▼ Replace the Cover 111 ▼ Install the Server Module Into the Modular System 112 SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 7 ▼ Start the Server Module Host 114 Glossary 115 Index 121 Contents...
  • Page 8 SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 9: Using This Documentation

    Using This Documentation This service manual explains how to identify faults, replace parts, and add additional options in the SPARC T3-1B server module from Oracle. This document is written for technicians, system administrators, authorized service providers, and users who have advanced experience troubleshooting and replacing hardware.
  • Page 10 Oracle Integrated Lights http://www.oracle.com/pls/topic/lookup?ctx=ilom30 Out Manager (Oracle ILOM) Oracle Solaris OS and http://www.oracle.com/technetwork/indexes/documentation/#sys_sw other system software Oracle VTS software http://www.oracle.com/pls/topic/lookup?ctx=OracleVTS7.0 SAS-1/SAS-2 http://www.oracle.com/pls/topic/lookup?ctx=E22513_01 Compatibility Feedback Provide feedback about this documentation at: http://www.oracle.com/goto/docfeedback SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 11 Support and Accessibility Description Links Access electronic support http://support.oracle.com through My Oracle Support For hearing impaired: http://www.oracle.com/accessibility/support.html Learn about Oracle’s http://www.oracle.com/us/corporate/accessibility/index.html commitment to accessibility Using This Documentation...
  • Page 12 SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 13: Identifying Components

    Identifying Components These topics explain the components of the server module, focusing on the components that can be removed and replaced for service. “Front and Rear Panel Components” on page 2 ■ “Illustrated Parts Breakdown” on page 3 ■ Related Information “Detecting and Managing Faults”...
  • Page 14: Front And Rear Panel Components

    Amber LED: Drive Service Action Required Blue LED: Drive Ready to Remove RFID (sticker indicates serial number of the server module) Universal connector port (UCP) Chassis power connector Chassis data connector SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 15: Illustrated Parts Breakdown

    Related Information “Diagnostics LEDs” on page 10 ■ “Illustrated Parts Breakdown” on page 3 ■ Illustrated Parts Breakdown This topic identifies components in the server module that you can install, or remove and replace. The following table provides information about the replaceable components. Identifying Components...
  • Page 16 63 Related Information “Front and Rear Panel Components” on page 2 ■ “Detecting and Managing Faults” on page 5 ■ “Replacing the Server Module Enclosure Assembly” on page 107 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 17: Detecting And Managing Faults

    Detecting and Managing Faults These topics explain how to use various diagnostic tools to monitor server module status and troubleshoot faults in the server module. “Diagnostics Overview” on page 5 ■ “Diagnostics Process” on page 7 ■ “Diagnostics LEDs” on page 10 ■...
  • Page 18 “Managing Faults (Oracle Solaris PSH)” on page 25 ■ “Managing Faults (POST)” on page 31 ■ “Managing Components (ASR Commands)” on page 44 ■ “Checking if Oracle VTS Software Is Installed” on page 48 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 19: Diagnostics Process

    Diagnostics Process The following flowchart illustrates the complementary relationship of the different diagnostic tools and indicates a default sequence of use. Detecting and Managing Faults...
  • Page 20 POST performs basic tests of the server module • “Managing Faults (POST)” components and reports faulty FRUs. on page 31 Run POST. • “Oracle ILOM Properties That Affect POST Behavior” on page 33 SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 21 Diagnostic Flowchart Reference Table (Continued) TABLE: Diagnostic Action Possible Outcome Additional Information Flowchart item 6. Determine if the fault is an environmental fault or a • “Check for Faults (show configuration fault. faulty Command)” on Check if the fault is page 18 environmental.
  • Page 22: Diagnostics Leds

    The Oracle ILOM show faulty command provides details about any faults that cause this indicator to light. Under some fault conditions, individual component fault LEDs are turned on in addition to the Service Action Required LED. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 23 LED or Button Icon or Label Color Description Power OK LED Green Indicates the following conditions: • Off – System is not running in its normal state. System power might be off. The SP might be running. • Steady on – System is powered on and is running in its normal operating state.
  • Page 24: Managing Faults (Oracle Ilom)

    The SP runs independently of the server module, using the server module’s standby power. Therefore, Oracle ILOM firmware and software continue to function when the server module OS goes offline or when the server module is powered off. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 25: Fault Management

    Fault Management Error conditions detected by Oracle ILOM, POST, and the Oracle Solaris PSH technology are forwarded to Oracle ILOM for fault handling. The Oracle ILOM fault manager evaluates error messages it receives to determine whether the condition being reported should be classified as an alert or a fault. Alerts –...
  • Page 26: Oracle Solaris Fault Manager Commands In Oracle Ilom

    Oracle Integrated Lights Out Manager (ILOM) 3.0 Concepts Guide ■ SPARC T3 Series Servers Administration Guide ■ “Oracle ILOM Troubleshooting Overview” on page 12 ■ “Access the SP (Oracle ILOM)” on page 15 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 27: Access The Sp (Oracle Ilom)

    “Display FRU Information (show Command)” on page 17 ■ “Check for Faults (show faulty Command)” on page 18 ■ “Check for Faults (fmadm faulty Command)” on page 20 ■ “Clear Faults (clear_fault_action Property)” on page 21 ■ “Service-Related Oracle ILOM Command Summary” on page 21 ■...
  • Page 28 “Check for Faults (show faulty Command)” on page 18 ■ “Check for Faults (fmadm faulty Command)” on page 20 ■ “Clear Faults (clear_fault_action Property)” on page 21 ■ “Service-Related Oracle ILOM Command Summary” on page 21 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 29: Display Fru Information (Show Command)

    “Oracle ILOM Properties That Affect POST Behavior” on page 33 ■ ▼ Display FRU Information (show Command) Use the Oracle ILOM show command to display information about individual FRUs. ● At the -> prompt, enter the show command. In the following example, the show command displays information about a memory module.
  • Page 30: Check For Faults (Show Faulty Command)

    | sunw-msg-id | SPT-8000-5X faults/0 /SP/faultmgmt/0 | uuid | 64d52ce4-614e-693f-bb71-ea3f829d faults/0 | ad73 /SP/faultmgmt/0 | timestamp | 2010-10-14/20:14:13 faults/0 /SP/faultmgmt/0 | detector | /SYS/PS0/S1/V_IN_ERR faults/0 /SP/faultmgmt/0 | product_serial_number | 1030NND0D2 faults/0 SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 31 /SP/faultmgmt/0 | chassis_serial_number | 0000000-0000000000 faults/0 -> Example of the show faulty command displaying a fault that was detected by ■ POST. These kinds of faults are identified by the message Forced fail reason, where reason is the name of the power-on routine that detected the fault. ->...
  • Page 32: Check For Faults (Fmadm Faulty Command)

    “Display FRU Information (show Command)” on page 17 ■ “Check for Faults (show faulty Command)” on page 18 ■ “Clear Faults (clear_fault_action Property)” on page 21 ■ “Service-Related Oracle ILOM Command Summary” on page 21 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 33: Clear Faults (Clear_Fault_Action Property)

    ▼ Clear Faults (clear_fault_action Property) Use the clear_fault_action property with the set command to manually clear PSH-detected faults for a FRU. If Oracle ILOM detects a FRU replacement, it will automatically clear the fault so that you do not have to clear the fault manually. For PSH-diagnosed faults, if the replacement of the FRU is detected by the system or the fault is manually cleared on the host, the fault will also be cleared from Oracle ILOM.
  • Page 34 Displays information about the operating state of the show /HOST host system, whether the hardware is providing service, and system firmware version information. Displays information about the system serial number. show /SYS SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 35: Interpreting Log Files And System Messages

    Related Information “Oracle ILOM Troubleshooting Overview” on page 12 ■ “Access the SP (Oracle ILOM)” on page 15 ■ “Display FRU Information (show Command)” on page 17 ■ “Check for Faults (show faulty Command)” on page 18 ■ “Check for Faults (fmadm faulty Command)” on page 20 ■...
  • Page 36: Check The Message Buffer (Dmesg Command)

    1. Log in as superuser. 2. Type: # more /var/adm/messages Or, if you want to view all logged messages, type: # more /var/adm/messages* Related Information “Check the Message Buffer (dmesg Command)” on page 24 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 37: List Fru Status (Prtdiag Command)

    From an Oracle Solaris OS command line, run the prtdiag command. FRU status information is displayed. Example: # prtdiag System Configuration: Sun Microsystems sun4v SPARC T3-1B Memory size: 130560 Megabytes ================================ Virtual CPUs ================================ CPU ID Frequency Implementation Status ------ --------- ---------------------- -------...
  • Page 38: Oracle Solaris Psh Technology Overview

    I/O subsystem ■ The PSH console message provides the following information about each detected fault: Type ■ Severity ■ Description ■ Automated response ■ Impact ■ Suggested action for system administrator ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 39: Psh-Detected Fault Example

    If the PSH facility detects a faulty component, use the fmadm faulty command to display information about the fault. Alternatively, you can use the Oracle ILOM command show faulty for the same purpose. Related Information “Check for Faults (show faulty Command)” on page 18 ■...
  • Page 40: Check For Psh-Detected Faults

    Date and time of the fault (Aug 13 11:48:33). ■ UUID, which is unique for every fault ■ (21a8b59e-89ff-692a-c4bc-f4c5cccca8c8). Message identifier, which can be used to obtain additional fault information ■ (SUN4V-8002-6E). SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 41 Faulted FRU. The information provided in the example includes the part ■ number of the FRU (part=511127809) and the serial number of the FRU (serial=1005LCB-1019B100A2). The FRU field provides the name of the FRU (/SYS/MB for motherboard in this example). 2.
  • Page 42: Clear Psh-Detected Faults

    If no fault is reported, you do not need to do anything else. Do not perform the ■ subsequent steps. If a fault is reported, continue to the next step. ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 43: Managing Faults (Post)

    3. Clear the fault from all persistent fault records. In some cases, even though the fault is cleared, some persistent fault information remains and results in erroneous fault messages at boot time. To ensure that these messages are not displayed, type the following Oracle Solaris command: # fmadm repair UUID For the UUID in the example shown in Step...
  • Page 44: Post Overview

    “Run POST With Maximum Testing” on page 37 ■ “Interpret POST Fault Messages” on page 39 ■ “Clear POST-Detected Faults” on page 40 ■ “POST Error Message Syntax” on page 42 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 45: Oracle Ilom Properties That Affect Post Behavior

    Oracle ILOM Properties That Affect POST Behavior The following table describes the Oracle ILOM properties that determine how POST performs its operations. Note – The value of keyswitch_state must be normal when individual POST parameters are changed. Parameter Values Description The system can power on and run POST (based on the /SYS keyswitch_state normal...
  • Page 46 No POST output is displayed. none The following flowchart illustrates the same set of Oracle ILOM set command variables. The following table shows combinations of Oracle ILOM parameters and associated POST modes. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 47: Configure How Post Runs

    Normal Diagnostic Mode Service Mode Using the Oracle ILOM Parameter (Default Settings) No POST Execution Keyswitch_state keyswitch_state normal normal diag /HOST/diag mode normal /HOST/diag level /HOST/diag trigger hw-change error-reset none /HOST/diag verbosity normal Description of POST This is the default POST POST does not run, POST runs the full Execution...
  • Page 48 = max error_reset_verbosity = normal hw_change_level = max hw_change_verbosity = normal level = max mode = normal power_on_level = max power_on_verbosity = normal trigger = hw-change error-reset verbosity = normal SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 49: Run Post With Maximum Testing

    Commands: show -> Related Information “POST Overview” on page 32 ■ “Oracle ILOM Properties That Affect POST Behavior” on page 33 ■ “Run POST With Maximum Testing” on page 37 ■ “Interpret POST Fault Messages” on page 39 ■ “Clear POST-Detected Faults” on page 40 ■...
  • Page 50 “Oracle ILOM Properties That Affect POST Behavior” on page 33 ■ “Configure How POST Runs” on page 35 ■ “Interpret POST Fault Messages” on page 39 ■ “Clear POST-Detected Faults” on page 40 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 51: Interpret Post Fault Messages

    ▼ Interpret POST Fault Messages 1. Run POST. “Run POST With Maximum Testing” on page 2. View the output and watch for messages that look similar to the following syntax descriptions and example: POST error messages use the following syntax, where c = the core number, s = ■...
  • Page 52: Clear Post-Detected Faults

    | Value ----------------------+------------------------+----------------------------- /SP/faultmgmt/0 | fru | /SYS/MB/CMP0/BOB1/CH0/D0 /SP/faultmgmt/0 | timestamp | Dec 21 16:40:56 /SP/faultmgmt/0/ | timestamp | Dec 21 16:40:56 faults/0 /SP/faultmgmt/0/ | sp_detected_fault | /SYS/MB/CMP0/BOB1/CH0/D0 faults/0 | Forced fail(POST) SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 53 2. Take one of the following actions based on the show faulty output: No fault is reported – The system cleared the fault and you do not need to ■ manually clear the fault. Do not perform the subsequent steps. Fault reported –...
  • Page 54: Post Error Message Syntax

    UE if VEU = 1, or VEF = 1, or higher priority error in same cycle. 2010-07-03 18:44:14.614 0:7:2> MEC 60 R/W1C Set to 1 on a CE if VEC = 1, or VEU = 1, or VEF = 1, or another error in same cycle. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 55 2010-07-03 18:44:14.804 0:7:2> VEU 57 R/W1C Set to 1 on an UE, if VEF = 0 and no fatal error is detected in same cycle. 2010-07-03 18:44:14.983 0:7:2> VEC 56 R/W1C Set to 1 on a CE, if VEF = VEU = 0 and no fatal or UE is detected in same cycle. 2010-07-03 18:44:15.169 0:7:2>...
  • Page 56: Managing Components (Asr Commands)

    In most cases, POST automatically disables a faulty component. After the cause of the fault is repaired (FRU replacement, loose connector reseated, and so on), you might need to remove the component from the ASR blacklist. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 57: Display System Components

    The following ASR commands enable you to view and add or remove components (asrkeys) from the ASR blacklist. You run these commands from the Oracle ILOM -> prompt. ASR Commands TABLE: Command Description Displays system components and their current state. show components set asrkey component_state= Removes a component from the asr-db blacklist,...
  • Page 58 | component_state | Enabled CH1/D0 /SYS/MB/GBE | component_state | Enabled /SYS/MB/USB | component_state | Enabled /SYS/MB/VIDEO | component_state | Enabled /SYS/MB/PCI- | component_state | Enabled SWITCH0 /SYS/MB/PCI- | component_state | Enabled SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 59: Disable System Components

    SWITCH1 -> Related Information “View the System Message Log Files” on page 24 ■ “Disable System Components” on page 47 ■ “Enable System Components” on page 48 ■ ▼ Disable System Components You disable a component by setting its component_state property to Disabled. This adds the component to the ASR blacklist.
  • Page 60: Enable System Components

    For comprehensive VTS information, refer to the Oracle VTS 7.0 documentation. “Oracle VTS Overview” on page 49 ■ “Check if Oracle VTS Software Is Installed” on page 50 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 61: Oracle Vts Overview

    Related Information “Diagnostics Overview” on page 5 ■ “Diagnostics Process” on page 7 ■ “Managing Faults (Oracle ILOM)” on page 12 ■ “Interpreting Log Files and System Messages” on page 23 ■ “Managing Faults (Oracle Solaris PSH)” on page 25 ■...
  • Page 62: Check If Oracle Vts Software Is Installed

    You can obtain the VTS software from the following places: Oracle Solaris OS media kit (DVDs) ■ As a download from the web ■ Related Information Oracle VTS documentation ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 63: Preparing For Service

    ■ Follow all cautions and instructions described in the documentation that shipped ■ with your system and in the SPARC T3-1B Server Module Safety and Compliance Guide. Ensure that the voltage and frequency of your power source match the voltage ■...
  • Page 64: Safety Symbols

    Following this practice equalizes the electrical potentials between you and the server module. Antistatic Mat Place ESD-sensitive components such as cards and DIMMs on an antistatic mat. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 65: Tools Needed For Service

    Antistatic wrist strap ■ Antistatic mat ■ Stylus or pencil (to operate the power button) ■ UCP-3 dongle (UCP-4 dongle can be used, but see instructions in the SPARC T3-1B ■ Server Module Installation Guide) Blade filler panel ■ Related Information “General Safety Information”...
  • Page 66: Find The Server Module Serial Number

    Targets: SERVICE LOCATE PS_FAULT TEMP_FAULT FAN_FAULT Properties: type = Host System keyswitch_state = Normal product_name = SPARC T3-1B product_serial_number = 0723BBC006 <- fault_state = OK clear_fault_action = (none) power_state = On SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 67: Locate The Server Module

    Related Information “Locate the Server Module” on page 55 ■ “Find the Modular System Serial Number” on page 53 ■ ▼ Locate the Server Module To identify a specific server module from others in the modular system, perform the following steps. 1.
  • Page 68: Shut Down The Oracle Solaris Os

    THE SYSTEM server1 IS BEING SHUT DOWN NOW ! ! ! Log off now or risk your files being damaged # svc.startd: The system is coming down. Please wait. svc.startd: 100 system services are now being stopped. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 69: Power Off The Server Module (Power Button - Standby Mode)

    Jun 28 13:06:34 dt90-366 syslogd: going down on signal 15 svc.startd: The system is down. syncing file systems... done Program terminated SPARC T3-1B, No Keyboard OpenBoot 4.30, 16256 MB memory available, Serial # 87305111. Ethernet address 0:21:28:34:2b:90, Host ID: 85342b90. {0} ok 6.
  • Page 70: Power Off The Server Module (Emergency Shutdown)

    3. Type: -> set /SYS/ prepare_to_remove_action=true Set ‘prepare_to_remove_action’ to ‘true’ The server module is in standby mode. Power is removed from the host while standby power is applied to the SP. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 71: Remove The Server Module From The Modular System

    4. Confirm that the server module is in standby mode by viewing the blue Ready to Remove LED on the front of the server module. “Front and Rear Panel Components” on page 2 to locate this LED. If the Ready to Remove LED is on, the server module is ready for removal from the modular system chassis.
  • Page 72 2. Open both ejector arms (panel 2). Squeeze both latches on each of the two ejector arms. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 73 3. Pull the server module halfway out (panel 3). 4. Close the ejector arms. 5. Remove the server module from the modular system. Lift the server module with two hands. 6. Place the server module on an antistatic mat or surface. 7.
  • Page 74: Remove The Cover

    (1 cm). 3. Lift the cover off the server module chassis. Related Information “Illustrated Parts Breakdown” on page 3 ■ “Replace the Cover” on page 111 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 75: Servicing Hard Drives

    Servicing Hard Drives The following topics apply to hard drives installed in the external slots of the server module. Description Links Determine if you can remove and replace a “Drive Hot-Plugging Rules” on page 63 drive using hot-plugging capabilities. Replace a drive. “Remove a Drive”...
  • Page 76: Remove A Drive

    It will not be illuminated if Oracle Solaris was shut down. 4. Remove the drive as described in the following steps: a. Push the latch release button on the drive (panels 1 and 2). SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 77: Replace Or Add A Drive

    b. Grasp the latch and pull the drive out of the drive slot (panel 3). 5. Insert a drive filler if you are not replacing the drive in this slot. “Install a Drive Filler” on page 67 Related Information “Install a Drive Filler” on page 67 ■...
  • Page 78 If the disk is not in the list, such as with a newly installed disk, you can use ■ devfsadm to configure it into the tree. See the devfsadm man page for details. Related Information “Remove a Drive” on page 64 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 79: Remove A Drive Filler

    ▼ Remove a Drive Filler All drive bays must be populated by either a drive or a filler. 1. Open the filler lever (panels 1 and 2). 2. Pull to remove the filler (panel 3). Related Information “Replace or Add a Drive” on page 65 ■...
  • Page 80 2. Push the filler into place. 3. Close the filler lever (panels 2 and 3). Related Information “Remove a Drive” on page 64 ■ “Remove a Drive Filler” on page 67 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 81: Servicing Memory

    Servicing Memory The following topics describe how to determine which DIMMs are faulty, remove DIMMs, install DIMMs, and verify DIMM functionality after installation. Description Links Understand memory faults. “Memory Faults” on page 69 Replace a faulty DIMM. “Locate a Faulty DIMM (LEDs)” on page 70 “Remove a DIMM”...
  • Page 82: Locate A Faulty Dimm (Leds)

    “Detecting and Managing Faults” on page 5 ■ ▼ Locate a Faulty DIMM (LEDs) This procedure describes how to use the DIMM LEDs on the motherboard to pinpoint the physical location of a faulty DIMM. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 83 Note – You can also obtain the location of the faulty DIMM using the Oracle ILOM show faulty command. This command displays the FRU name (such as /SYS/MB/CMP0/BOB0/CH0). Use the FRU name and information to locate the faulty DIMM. See “DIMM Configuration Reference”...
  • Page 84 Locate button for LEDs of faulty DIMMs 4. Remove the faulty DIMM. “Remove a DIMM” on page Related Information “DIMM Configuration Reference” on page 81 ■ “Remove a DIMM” on page 73 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 85: Remove A Dimm

    ▼ Remove a DIMM Caution – This procedure involves handling circuit boards that are extremely sensitive to static electricity. Ensure that you follow ESD preventative practices to avoid damaging the circuit boards. Caution – Components inside the chassis might be hot. Use caution when servicing components inside the chassis.
  • Page 86: Install A Replacement Dimm

    4. Line up the replacement DIMM with the connector. Align the DIMM notch with the key in the connector, as in panel 3. This action ensures that the DIMM is oriented correctly. Panel 2 shows an incorrect alignment. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 87: Clear The Fault And Verify The Functionality Of The Replacement Dimm

    5. Push the DIMM into the connector until the ejector tabs lock the DIMM in place. If the DIMM does not easily seat into the connector, verify that the orientation of the DIMM is correct. Never apply excessive force. 6. Return the server module to operation. “Returning the Server Module to Operation”...
  • Page 88 DIMM and clear the fault. Example: -> set /SYS/MB/CMP0/BOB0/CH0/D0 component_state=Enabled 3. Perform the following steps to verify the repair: SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 89 a. Set the virtual keyswitch to diag so that POST will run in Service mode. -> set /SYS/keyswitch_state=Diag Set ‘keyswitch_state’ to ‘Diag’ b. Power cycle the system. -> stop /SYS Are you sure you want to stop /SYS (y/n)? y Stopping /SYS ->...
  • Page 90 Use the same UUID that was displayed from the output of the Oracle ILOM show faulty command. # fmadm repair 3aa7c854-9667-e176-efe5-e487e5207a8a Related Information “Install a Replacement DIMM” on page 74 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 91: Verify Dimm Functionality

    “Verify DIMM Functionality” on page 79 ■ ▼ Verify DIMM Functionality 1. Access the Oracle ILOM -> prompt. Refer to the SPARC T3 Series Servers Administration Guide for instructions. 2. Use the show faulty command to determine how to clear the fault. If show faulty indicates a POST-detected fault, go to Step ■...
  • Page 92 Switch to the system console and type the Oracle Solaris OS fmadm faulty command. # fmadm faulty If any faults are reported, see the diagnostics instructions in “Oracle ILOM Troubleshooting Overview” on page 5. Switch to the Oracle ILOM command shell. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 93: Dimm Configuration Reference

    There are 16 DIMM slots that support industry-standard DIMMs. ■ You can install quantities of 4, 8, or 16 DIMMs. ■ Supported DIMM capacities: 2 Gbyte, 4 Gbyte, and 8 Gbyte. ■ Refer to the SPARC T3-1B Server Module Product Notes for the latest information. Servicing Memory...
  • Page 94 Figure Legend DIMM slots controlled by BOB0 DIMM slots controlled by BOB1 DIMM slots controlled by BOB3 DIMM slots controlled by BOB2 Fault remind button Memory fault LED for the adjacent DIMM SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 95 The slots are color coded to indicate which slots to use to install different quantities of DIMMs. 4 DIMMs: Blue slots ■ 8 DIMMs: White and blue slots ■ 16 DIMMs: Black, white, and blue slots ■ The following table summarizes details on using each of the 16 DIMM slots. Slot Is Used For DIMM This Quantity of...
  • Page 96 “Clear the Fault and Verify the Functionality of the Replacement DIMM” on ■ page 75 SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 97: Servicing A Rem

    Servicing a REM The server module support the installation of one REM. Only certain REMs are supported. For a list of supported REMs, refer to the SPARC T3-1B Server Module Product Notes. Description Links Replace a REM. “Remove a REM” on page 85 “Install a REM”...
  • Page 98: Install A Rem

    REM, refer to the REM documentation. 1. Prepare for service by performing the following tasks: “Shut Down the Oracle Solaris OS” on page 56 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 99 “Prepare the Server Module for Removal” on page 58 ■ “Remove the Server Module From the Modular System” on page 59 ■ “Remove the Cover” on page 62 ■ “ESD Safety Measures” on page 52 ■ (If needed) “Remove a REM” on page 85 ■...
  • Page 100 Related Information “Remove a REM” on page 85 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 101: Servicing A Fem

    Servicing a FEM The server module supports the installation of one FEM. To see a list of supported FEMs for this server module, refer to the SPARC T3-1B Server Module Product Notes. Description Links Replace a FEM. “Remove a FEM” on page 89 “Install a FEM”...
  • Page 102: Install A Fem

    This procedure applies to any of the form factors of FEM cards that are supported by this server module. 1. Prepare for service by performing the following tasks: “Shut Down the Oracle Solaris OS” on page 56 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 103 “Prepare the Server Module for Removal” on page 58 ■ “Remove the Server Module From the Modular System” on page 59 ■ “Remove the Cover” on page 62 ■ “ESD Safety Measures” on page 52 ■ (If needed) “Remove a FEM” on page 89 ■...
  • Page 104 If the card has rubber bumpers you can press directly on them to seat the card into the connectors. 5. Return the server module to operation. “Returning the Server Module to Operation” on page 111. Related Information “Remove a FEM” on page 89 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 105: Servicing A Service Processor Card

    Servicing a Service Processor Card The server module has a service processor card with firmware that provides the SP. “Remove the Service Processor Card” on page 93 ■ “Install the Service Processor Card” on page 94 ■ Related Information “Detecting and Managing Faults” on page 5 ■...
  • Page 106: Install The Service Processor Card

    Related Information “Install the Service Processor Card” on page 94 ■ ▼ Install the Service Processor Card 1. (If needed) Remove the service processor card. “Remove the Service Processor Card” on page SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 107 2. Insert the replacement service processor card into the retainer (panel 1). Make sure the tab is aligned with the key (panel 2) 3. Lower the service processor card until it is aligned with the connector (panel 3). 4. Seat the service processor card into the connector by pressing the card toward the tabs while pressing down (panel 4).
  • Page 108 8. If you created a backup of the SP configuration, use the Oracle ILOM restore utility to restore the configuration. 9. Return the server module to operation. Related Information “Remove the Service Processor Card” on page 93 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 109: Servicing The Id Prom

    Servicing the ID PROM The system ID PROM, sometimes referred to as the SCC, provides the server module with the host ID, MAC addresses, and some Oracle ILOM configuration information. The system ID PROM does not typically require replacement. However, if you replace the ID PROM, be aware that the host ID and MAC address will change.
  • Page 110: Install The Id Prom

    “Verify the ID PROM” on page 99 ■ ▼ Install the ID PROM 1. (If needed) Remove the ID PROM. “Remove the ID PROM” on page 2. Locate the ID PROM socket on the motherboard. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 111: Verify The Id Prom

    3. Align the ID PROM notched end with the notched end on the motherboard socket and press in place. 4. Return the server module to operation. “Returning the Server Module to Operation” on page 111. 5. Verify the ID PROM. “Verify the ID PROM”...
  • Page 112 1500 index inet 10.6.91.117 netmask fffffe00 broadcast 10.6.91.255 ether 0:21:28:7f:68:44 Related Information “Remove the ID PROM” on page 97 ■ “Install the ID PROM” on page 98 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 113: Servicing A Usb Flash Drive

    Servicing a USB Flash Drive You can install one USB flash drive in the server module. Description Links Replace a USB flash drive. “Remove a USB Flash Drive” on page 101 “Install a USB Flash Drive” on page 102 Add a USB flash drive. “Install a USB Flash Drive”...
  • Page 114: Install A Usb Flash Drive

    “Remove the Cover” on page 62 ■ “ESD Safety Measures” on page 52 ■ (If needed) “Remove a USB Flash Drive” on page 101 ■ 2. Locate the USB connector on the motherboard. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 115 3. Plug your USB flash drive into the upper port of the USB connector (panels 1 and 2). Do not use the lower port of this connector. 4. Return the server module to operation. “Returning the Server Module to Operation” on page 111.
  • Page 116 SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 117: Servicing The Battery

    Servicing the Battery The battery operates the clock for the server module. “Replace the Battery” on page 105 ■ Related Information “Detecting and Managing Faults” on page 5 ■ “Preparing for Service” on page 51 ■ ▼ Replace the Battery The battery maintains system time when the server module is powered off.
  • Page 118 = Thu JUN 17 16:19:56 2010 timezone = GMT (GMT) usentpserver = disabled Related Information “Servicing a FEM” on page 89 ■ “Returning the Server Module to Operation” on page 111 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 119: Replacing The Server Module Enclosure Assembly

    Replacing the Server Module Enclosure Assembly When certain parts and components in the server module, such as the motherboard, require replacing, you must replace a high-level assembly called the enclosure assembly. This includes a new server module chassis with the motherboard and many other components already installed.
  • Page 120: Transfer Components To Another Enclosure Assembly

    8. Transfer the service processor card from the original server module to the enclosure assembly. “Servicing a Service Processor Card” on page 9. Transfer the ID PROM from the original server module to the enclosure assembly. “Servicing the ID PROM” on page SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 121 17. Transfer the serial number and product number to the FRUID of the new enclosure assembly. Refer to the SPARC T3-1B knowledge article for specific instructions for updating FRUID. Note – The replacement enclosure assembly does not have a label with the serial number on the front of the system, as was present on the original server module.
  • Page 122 RFID on the new enclosure assembly. The RFID on the original server module contained different values. Related Information “Detecting and Managing Faults” on page 5 ■ “Identifying Components” on page 1 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 123: Returning The Server Module To Operation

    Returning the Server Module to Operation The following topics describe hot to return the server module to operation after removing it from the modular system for service: “Replace the Cover” on page 111 ■ “Install the Server Module Into the Modular System” on page 112 ■...
  • Page 124: Install The Server Module Into The Modular System

    Caution – Hold the server module firmly with both hands so that you do not drop it. The server module weighs approximately 17 pounds (8.0 kg). 1. Remove the rear connector cover from the server module before inserting it in the modular system. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 125 2. Remove a filler panel from the modular system chassis slot you intend to use. When the modular system is operating, you must fill every slot with a filler panel or a server module within 60 seconds. 3. Hold the server module in a vertical position so that both ejector levers are on the right (panel 1).
  • Page 126 By default, the server module boots the Oracle Solaris OS. 2. Perform any diagnostics that verify the results of servicing the server module. Related Information “Detecting and Managing Faults” on page 5 ■ SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 127 Glossary ANSI SIS American National Standards Institute Status Indicator Standard. Automatic system recovery. Generic term for server modules and storage modules. blade blade server Server module. chassis Modular system enclosure. Command-line interface. Chassis monitoring module. ILOM runs on the CMM, providing lights out management of the components in the modular system chassis.
  • Page 128 Oracle systems. ILOM enables you to remotely manage your Oracle servers regardless of the state of the host system. ID PROM Chip that contains system information for the server module. Internet Protocol. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 129 Keyboard, video, mouse. Refers to using a switch to enable sharing of one keyboard, one display, and one mouse with more than one computer. MAC or MAC address Media access controller address. MSGID Message ID. Top-level ILOM CMM target. name space Network express module.
  • Page 130 REMs and FEMs. Service processor. Secure shell. storage module Modular component that provides computing storage to the server modules. Universal connector port. User interface. Coordinated Universal Time. UUID Universal unique identifier. SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 131 WWID World-wide identifier. A unique number that identifies a SAS target. Glossary...
  • Page 132 SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 133 Index front and rear panel, 2 identifying, 1 accessing the service processor, 15 location, 3 accounts, ILOM, 15 managing with ASR, 44 airflow, blocked, 9 configuration guidelines, memory, 81 antistatic mat and wrist strap, 52 configuration reference DIMMs, 81 disabling components, 47 configuring how POST runs, 35 enabling components, 48 cover...
  • Page 134 65 flash drive installing, 102 removing, 101 LEDs topics, 101 DIMMs, 70 fmadm command, 30, 75 front panel, 2 fmadm faulty command, 20 interpreting, 10 fmdump command, 28 Remind Power, 70 SPARC T3-1B Server Module Service Manual • July 2012...
  • Page 135 locating faulty running in Diag Mode, 37 DIMMs, 70 troubleshooting with, 9 using for fault diagnosis, 8 locating the server module to be serviced, 55 POST-detected faults, 18 log files, 8, 24 power button, 57, 58 logging into ILOM, 15 powering on, 114 power-on self-test See POST...
  • Page 136 files, 24 time setting, 105 tools for service, 53 troubleshooting by checking Oracle Solaris OS log files, 8 using POST, 8, 9 using the show faulty command, 8 using VTS, 8 SPARC T3-1B Server Module Service Manual • July 2012...

Table of Contents