HP ProLiant Servers Troubleshooting Guide June 2006 (Fifth Edition) Part Number 375445-005
Introduction 10 Introduction In this section What's new...
Error messages 100 1. Enable OBDR 2. Exit Audible Beeps: None Possible Cause: A USB tape device that supports One Button Disaster Recovery (OBDR) is
Error messages 101 100 Series 101-I/O ROM Error Audible Beeps: None Possible Cause: Options ROM on a PCI, PCI-X, or PCI Express device is corrupt. A
Error messages 102 102-System Board Failure, DMA Test Failed Audible Beeps: None Possible Cause: 8237 DMA controllers, 8254 timers, and similar devi
Error messages 103 180-Log Reinitialized Audible Beeps: None Possible Cause: The IML has been reinitialized due to corruption of the log. Action: Ev
Error messages 104 207-Invalid Memory Configuration - Incomplete Bank Detected in Bank X Audible Beeps: 1 long, 1 short Possible Cause: Bank is miss
Error messages 105 207-Invalid Memory Configuration - Unsupported DIMM in Socket X Audible Beeps: 1 long, 1 short Possible Cause: Unregistered DIMMs
Error messages 106 210-Memory Board Power Fault on board X Audible Beeps: 1 long, 1 short Possible Cause: A problem exists with a memory board power
Error messages 107 303-Keyboard Controller Error Audible Beeps: None Possible Cause: System board, keyboard, or mouse controller failure occurred. A
Error messages 108 600 Series 601-Diskette Controller Error Audible Beeps: None Possible Cause: Diskette controller circuitry failure occurred. Acti
Error messages 109 2. Run Insight Diagnostics ("HP Insight Diagnostics" on page 61) and replace failed components as indicated. 1100 Seri
Getting started 11 • Updated contacting HP: • Contacting HP technical support or an authorized reseller • Server information you need Getting st
Error messages 110 1611-CPU Zone Fan Assembly Failure Detected. Single fan... ...failure. Assembly will provide adequate cooling. Audible Beeps: Non
Error messages 111 1611-Fan x Not Present (Fan Zone I/O) Audible Beeps: 2 short Possible Cause: Required fan is not installed or spinning. Action: 1
Error messages 112 1615-Power Supply Configuration Error Audible Beeps: None Possible Cause: The server configuration requires an additional power s
Error messages 113 Audible Beeps: None Possible Cause: Upgrade the Array Accelerator module to a larger size. Action: Migrate logical drives to RAID
Error messages 114 1720-S.M.A.R.T. Hard Drive Detects Imminent Failure Audible Beeps: None Possible Cause: A hard drive SMART predictive failure con
Error messages 115 1727-Slot X Drive Array - New Logical Drive(s) Attachment Detected... ...If more than 32 logical drives, this message will be fol
Error messages 116 Expansion will resume when automatic data recovery has been completed. Audible Beeps: None Possible Cause: The capacity expansion
Error messages 117 1774-Slot X Drive Array - Obsolete Data Found in Array Accelerator Audible Beeps: None Possible Cause: Drives were used on anothe
Error messages 118 1776-Drive Array Reports Improper SCSI Port 1 Cabling Audible Beeps: None Possible Cause: • The integrated array enabler board f
Error messages 119 1779-Slot X Drive Array - Replacement drive(s) detected OR previously failed drive(s) now operational:... ...Port Y: SCSI ID Z: R
Getting started 12 Pre-diagnostic steps WARNING: To avoid potential problems, ALWAYS read the warnings and cautionary information in the server d
Error messages 120 2. Be sure all drives are fully seated. 3. Replace defective cables, drive X, or both. 1785-Slot X Drive Array Not Configured..
Error messages 121 1786-Slot 1 Drive Array Recovery Needed. Automatic Data Recovery Previously Aborted!... ...The following SCSI drive(s) need Autom
Error messages 122 a. Repair the connection and press the F2 key. b. If the problem persists, run ADU ("Array Diagnostic Utility" on page
Error messages 123 1794-Drive Array - Array Accelerator Battery Charge Low... ...Array Accelerator is temporarily disabled. Array Accelerator will b
Error messages 124 Audible Beeps: None Possible Cause: One or more logical drives failed due to loss of data in posted-writes memory. Action: • Pr
Error messages 125 Automatic operating system shutdown initiated due to fan failure Event Type: Fan failure Action: Replace the fan. Automatic Oper
Error messages 126 Real-Time Clock Battery Failing Event Type: System configuration battery low Action: Replace the system configuration battery. S
Error messages 127 Event Type: Host bus error CAUTION: Only authorized technicians trained by HP should attempt to remove the system board. If yo
Error messages 128 1. Press the server blade management module reset button. 2. Replace the server blade management module. Server blade managemen
Error messages 129 Interconnect B Error Code LED code: 14-1, 14-2, 14-3, or 14-4 Location: Interconnect device - side B Action: Perform the followin
Getting started 13 weight in kg weight in lb This symbol indicates that the component exceeds the recommended weight for one individual to handle
Error messages 130 For more information, refer to the HP BladeSystem Maintenance and Service Guide on the HP website (http://www.hp.com/products/ser
Error messages 131 Power management module board error codes LED code: 7-1, 7-2, 7-3, 7-4, 7-5, 7-6, 7-7, 7-8, 7-9, 7-10, 7-11, 7-12, or 7-13 Locati
Error messages 132 IMPORTANT: Reboot the server after completing each numbered step. If the error condition continues, proceed with the next step
Error messages 133 3. Reseat the remaining memory boards, rebooting after each installation to isolate any failed memory boards, if applicable. 4.
Error messages 134 IMPORTANT: Processor socket 1 and PPM slot 1 must be populated at all times or the server does not function properly. • PPMs,
Error messages 135 Description: The system encountered an NMI prior to this boot. The NMI source was: Uncorrectable cache memory error. Action: Repl
Error messages 136 MSG_CPU_RR_7 Event type: CPU speed is out of range. Action: Replace the processor. MSG_CPU_RR_8 Event type: Unable to update the
Error messages 137 Action: Replace the processor. MSG_CPU_RR_17 Event type: Stress integer math test has failed. Action: • Ensure proper ventilati
Contacting HP 138 Contacting HP In this section Contacting HP technical support or an authorized reseller...
Contacting HP 139 Server information you need Before contacting HP technical support, collect the following information: • Explanation of the issue
Getting started 14 CAUTION: The server is designed to be electrically grounded (earthed). To ensure proper operation, plug the AC power cord int
Contacting HP 140 • An updated Emergency Repair Diskette • If HP drivers are installed: • Version of the PSP used • List of drivers from the PSP
Contacting HP 141 Novell NetWare operating systems Collect the following information: • Whether the operating system was factory installed • Opera
Contacting HP 142 • If management agents are installed, version number of the agents • System dumps, if they can be obtained (in case of panics) •
Contacting HP 143 • DU number • List of drivers in the DU diskette • The drive subsystem and file system information: • Number and size of partit
Acronyms and abbreviations 144 Acronyms and abbreviations ACPI Advanced Configuration and Power Interface ACU Array Configuration Utility ADG Adva
Acronyms and abbreviations 145 IDE integrated device electronics iLO Integrated Lights-Out IMD Integrated Management Display IML Integrated Manag
Acronyms and abbreviations 146 ORCA Option ROM Configuration for Arrays OS operating system POST Power-On Self Test PPM processor power module P
Acronyms and abbreviations 147 SMART self-monitoring analysis and reporting technology SNMP Simple Network Management Protocol SSD support softwar
Index 148 1 120PCI.HAM 50 A accelerator error log 73 accelerator status 74, 75, 76 ACPI support 50 ACU (Array Configuration Utility) 54 ada
Index 149 diskette image creation 57 DMA error 94 documentation 69, 70 drive errors 36, 37, 78, 79, 80, 90, 97, 108 drive failure, detecting
Common problem resolution 15 Common problem resolution In this section Loose connections ...
Index 150 Management Agents 59 Management CD 58, 70, 71 management tools 58 media issue, tape drive 38 MEGA4 XX.HAM 50 memory 41, 56, 71,
Index 151 read/write errors 36, 37 read/write issue, tape drive 38 redundant ROM 65, 98, 113 registering the server 71 Remote Insight Lights-
Index 152 W warnings 13, 72 Web-Based Enterprise Service 63 website, HP 69, 70 white papers 70, 72 Windows Event Log processor error codes
Common problem resolution 16 Components for option firmware updates are also available from the HP Storage Products Software and Drivers website (ht
Common problem resolution 17 Activity LED (1) Online LED (2) Fault LED (3) Interpretation On Off Off Do not remove the drive. The drive is being ac
Common problem resolution 18 Online/activity LED (green) Fault/UID LED (amber/blue) Interpretation Flashing irregularly Amber, flashing regularly (1
Diagnostic flowcharts 19 Diagnostic flowcharts In this section Troubleshooting flowcharts ...
© Copyright 2004-2006 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice. The only warra
Diagnostic flowcharts 20 Start diagnosis flowchart Use the following flowchart to start the diagnostic process. General diagnosis flowchart
Diagnostic flowcharts 21 The General diagnosis flowchart provides a generic approach to troubleshooting. If you are unsure of the problem, or if the
Diagnostic flowcharts 22 Power-on problems flowchart Server power-on problems flowchart Symptoms: • The server does not power on. • The system pow
Diagnostic flowcharts 23 p-Class server blade power-on problems flowchart Symptoms: • The server does not power on. • The system power LED is o
Diagnostic flowcharts 24 • Loose or faulty power cord • Power source problem • Power-on circuit problem • Improperly seated component or interloc
Diagnostic flowcharts 25 c-Class server blade power-on problems flowchart Symptoms: • The server does not power on. • The system power LED is off
Diagnostic flowcharts 26 POST problems flowchart Symptoms: • Server does not complete POST NOTE: The server has completed POST when the system a
Diagnostic flowcharts 27 Server and p-Class server blade POST problems flowchart
Diagnostic flowcharts 28 c-Class server blade POST problems flowchart Operating system boot problems flowchart Symptoms: • Server does not boot a
Diagnostic flowcharts 29 • Use iLO to remotely attach virtual devices to mount the SmartStart CD onto the server blade. • Use a local I/O cable and
Contents 3 Contents Introduction...
Diagnostic flowcharts 30 NOTE: For the location of server LEDs and information on their statuses, refer to the server documentation. Possible cau
Diagnostic flowcharts 31 c-Class server blade fault indications flowchart
Hardware problems 32 Hardware problems In this section Procedures for all ProLiant servers ...
Hardware problems 33 UPS problems UPS is not working properly Action: 1. Be sure the UPS batteries are charged to the proper level for operation. S
Hardware problems 34 2. Refer to the release notes included with the hardware to be sure the problem is not caused by a change to the hardware relea
Hardware problems 35 • If the system boots and video is working, add each component back to the server one at a time, restarting the server after ea
Hardware problems 36 3. Be sure no loose connections (on page 15) exist. 4. Be sure the media from which you are attempting to boot is not damaged
Hardware problems 37 Drive is not found Action: Be sure no loose connections (on page 15) exist with the drive. Non-system disk message is displaye
Hardware problems 38 Read/write issue Action: 1. Run the Acceptance Test in HP StorageWorks Library and Tape Tools. CAUTION: Running the Accepta
Hardware problems 39 Hard drive problems System completes POST but hard drive fails Action: 1. Be sure no loose connections (on page 15) exist. 2.
Contents 4 Video problems...
Hardware problems 40 1. Be sure the files are not corrupt. Run the repair utility for the operating system. 2. Be sure no viruses exist on the serv
Hardware problems 41 Memory problems General memory problems are occurring Action: • Isolate and minimize the memory configuration. • Be sure th
Hardware problems 42 Server fails to recognize new memory Action: 1. Be sure the memory is the correct type for the server and is installed accordi
Hardware problems 43 c. Replace the remaining processor with a known functional processor. If the problem is resolved after you restart the server,
Hardware problems 44 7. Be sure a video expansion board, such as a RILOE board, has not been added to replace onboard video, making it seem like the
Hardware problems 45 Audio problems Action: Be sure the server speaker is connected. Refer to the server documentation. Printer problems Printer d
Hardware problems 46 Data is displayed as garbled characters after the connection is established Action: 1. Be sure both modems have the same setti
Hardware problems 47 3. Be sure no line interference exists. Retry the connection by dialing the number several times. If conditions remain poor, co
Hardware problems 48 2. Be sure the correct network driver is installed for the controller and that the driver file is not corrupted. Reinstall the
Software problems 49 Software problems In this section Operating system problems and resolutions...
Contents 5 Clustering software...
Software problems 50 If neither of these actions resolve the problem, contact an authorized service provider. For more information about debugging t
Software problems 51 3. Install the current drivers. If you apply the update and have problems, refer to the Software and Drivers Download website (
Software problems 52 • Linux—Refer to the operating system documentation for information. Linux operating systems For troubleshooting information s
Software problems 53 • One or more remote servers with system ROMs requiring upgrade • An administrative user account on each target system. The ad
Software tools and solutions 54 Software tools and solutions In this section Configuration tools...
Software tools and solutions 55 • Installing software drivers directly from the CD. With systems that have internet connection, the SmartStart Autor
Software tools and solutions 56 Auto-configuration process The auto-configuration process automatically runs when you boot the server for the first
Software tools and solutions 57 7. Press the Esc key to exit the current menu, or press the F10 key to exit RBSU. For more information on online spa
Software tools and solutions 58 Management CD The Management CD contains the latest tools available for easily managing the server, such as HP SIM (
Software tools and solutions 59 • Access advanced troubleshooting features through the iLO and iLO 2 interface. • Diagnose iLO and iLO 2 using HP S
Contents 6 Teardown procedures, part numbers, specifications ... 72 Technical
Software tools and solutions 60 The Virtual Machine Management Pack provides the following functionality: • Central management and control of VMwar
Software tools and solutions 61 System Management homepage To access the System Management homepage of a server, go to https://localhost:2381 (htt
Software tools and solutions 62 Smart Array SCSI Diagnosis feature NOTE: This feature is only available in HP Insight Diagnostics Online Edition.
Software tools and solutions 63 Array Diagnostic Utility ADU is a tool that collects information about array controllers and generates a list of det
Software tools and solutions 64 If you do not use the SmartStart CD to install an operating system, drivers for some of the new hardware are require
Software tools and solutions 65 Care Pack HP Care Pack Services offer upgraded service levels to extend and expand standard product warranty with ea
Software tools and solutions 66 For additional information, refer to the HP Online ROM Flash User Guide on the HP website (http://h18023.www1.hp.com
Software tools and solutions 67 2. Shut down each server where the system or option ROM images are to be upgraded and reboot using the correct ROMPa
Software tools and solutions 68 4. Verify the firmware update by checking the version of the current firmware.
HP resources for troubleshooting 69 HP resources for troubleshooting In this section Online resources ...
Contents 7 Drive Time-Out Occurred on Physical Drive Bay X... 80 Drive X I
HP resources for troubleshooting 70 White papers White papers are electronic documentation on complex technical topics. Some white papers contain in
HP resources for troubleshooting 71 Management of the server Refer to the HP Systems Insight Manager Help Guide on the Management CD or the HP websi
HP resources for troubleshooting 72 Server and option specifications, symbols, installation warnings, and notices Refer to the server documentation
Error messages 73 Error messages In this section ADU error messages...
Error messages 74 Accelerator Parity Write Errors: X Description: Number of times that write memory parity errors were detected during transfers to
Error messages 75 Description: The number of cache lines experiencing excessive ECC errors has reached a preset limit. Therefore, the cache has been
Error messages 76 Accelerator Status: Valid Data Found at Reset Description: Valid data was found in posted-write memory at reinitialization. Data w
Error messages 77 Cache Has Been Disabled; Likely Caused By a Loose Pin on One of the RAM Chips Description: Cache has been disabled due to a large
Error messages 78 page 63) examines each physical drive and looks for drives that have been moved to a different drive bay. Action: Look for message
Error messages 79 4. If the problem persists, power down the system and replace the cable. 5. If the problem persists, power down the system and re
Contents 8 Swapped Cables or Configuration Error Detected. A Drive Rearrangement... 88 Swapped Cables or Conf
Error messages 80 Drive Monitoring Features Are Unobtainable Description: ADU ("Array Diagnostic Utility" on page 63) is unable to get mon
Error messages 81 Identify Logical Drive Data did not Match with NVRAM Description: The identify unit data from the array controller does not match
Error messages 82 Action: Check for drive failures, wrong drive replaced, or loose cable messages. If a drive failure occurred, replace the failed d
Error messages 83 Logical Drive X Status = Wrong Drive Replaced Description: A physical drive in this logical drive has failed. The incorrect drive
Error messages 84 Other Controller Indicates Different Firmware Version Description: The other controller in the redundant controller configuration
Error messages 85 5. If the error persists after completing steps 1 through 4, contact an HP authorized service provider. SCSI Port X Drive ID Y Fa
Error messages 86 Description: A predictive failure warning for this hard drive has been generated, indicating that a drive failure is imminent. Act
Error messages 87 Description: A power supply in the external storage unit has failed. Action: Replace the power supply. Storage Enclosure on SCSI
Error messages 88 Swapped cables or configuration error detected. A configured array of drives... ...was moved from another controller that supporte
Error messages 89 Swapped cables or configuration error detected. The configuration information on the attached drives... ...is not backward compati
Contents 9 System Power Supply Failure (Power Supply X)... 126 Unrecover
Error messages 90 Description: ADU detected two different controller models installed in a redundant controller configuration. This is not supported
Error messages 91 Unsupported Processor Configuration (Processor Required in Slot #1) Description: Processor required in slot 1. Action: If you do n
Error messages 92 WARNING: Storage Enclosure on SCSI Bus X Indicated it is Operating in Single Ended Mode... ...SOLUTION: This usually occurs when a
Error messages 93 Non-numeric messages or beeps only Advanced Memory Protection mode: Advanced ECC Audible Beeps: None Possible Cause: Advanced ECC
Error messages 94 Critical Error Occurred Prior to this Power-Up Audible Beeps: None Possible Cause: A catastrophic system error, which caused the s
Error messages 95 Fatal Hub Link Error Audible Beeps: None Possible Cause: The hub link interface has experienced a critical failure that caused an
Error messages 96 Invalid memory types were found on the same node. Please check DIMM compatibility. - Some DIMMs may not be used Description: Inval
Error messages 97 NMI - Undetermined Source Audible Beeps: None Possible Cause: An NMI event has occurred. Action: Reboot the server. Node Interlea
Error messages 98 Power Fault Detected in Hot-Plug PCI Slot x Audible Beeps: 2 short Possible Cause: PCI-X Hot Plug expansion slot was not powered u
Error messages 99 Temperature violation detected - system Shutting Down in x seconds Audible Beeps: 1 long, 1 short Possible Cause: The system has r
Comentários a estes Manuais