Thursday, May 9, 2013

Single-bit ECC errors RAID controller

Yes, firmware can be applied to fix this memory error (Controller's memory bucket size).  The initial problem was error  at POST, Dell Cloud C6220 running LSI 9265 SAS RAID card.

Single-bit ECC errors were detected during the previous boot of the RAID controller. The DIMM on the controller needs replacement. Please contact technical support to resolve this issue. Press 'X' to continue or else power off the system and replace the DIMM module and reboot. If you have replaced the DIMM press 'X' to continue

Searching to find answer I found FW with ECC error fix, on Dells support drivers website.
Filename:

Tuesday, March 12, 2013

Megacli utility PERC controller logs


I finally found it... some where in my notes i found the best LSI MegaCli tty log Linux command, to pull logs from the LSI controllers.   Especially Dell PERC5, PERC6i, PERC H700/H800, PERC H710. Replace Megacli with Megacli64 if needed.
Command:
MegaCli -FwTermLog -Dsply -aALL > perc.txt
My other favorite/important commands:

MegaCli -LdPdInfo -aALL > ldpdinfo.txt

MegaCli -PdList -aALL > pdlist.txt
MegaCli -AdpEventLog -GetEvents -f events.log -aALL > adapterevents.txt

NOTE: the -aALL is for all LSI PERC Adapters installed, or use enumeration value assigned to adapter. Example for 'integrated' PERC ID 0  use -a0 and for PERC H800 ID 1 use -a1.  Using -aALL with multiple PERC Adapters installed, the logs will run together.



You can install the MegaCli program, from LSI, on either rpm based (RHEL/CentOS) or deb based (Debian or Ubuntu) OS.  Ubuntu install can use alien command to convert the rpm to deb (filename: MegaCli-8.07.14-1.noarch.rpm).
Then execute /opt/MegaRAID/MegaCli (./MegaCli or ./MegaCli64) file to pull the best logs from PERC controller.