A server crash caused by a memory fault

Hardware: R410, E5606 x 2, 4G x 6, Hynix 4GB18-H9
System: CentOS 5.5

The server hangs about once every half day.
tail /var/log/messages

Nov 25 09:28:20 c1g kernel: Machine check events logged
Nov 25 09:33:20 c1g kernel: Machine check events logged
Nov 25 09:38:20 c1g kernel: Machine check events logged
Nov 25 09:43:20 c1g kernel: Machine check events logged
Nov 25 09:48:20 c1g kernel: Machine check events logged
Nov 25 09:53:20 c1g kernel: Machine check events logged
Nov 25 10:03:20 c1g kernel: Machine check events logged
Nov 25 10:08:20 c1g kernel: Machine check events logged
Nov 25 10:13:20 c1g kernel: Machine check events logged
Nov 25 10:18:20 c1g kernel: Machine check events logged
Nov 25 10:23:20 c1g kernel: Machine check events logged
Nov 25 10:28:20 c1g kernel: Machine check events logged
Nov 25 10:44:46 c1g syslogd 1.4.1: restart.
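
The timestamps above show the kernel message recurring about every five minutes until the restart. As a rough sketch (not something from the original troubleshooting), the spacing can be checked with a few lines of Python. The function name `mce_times`, the placeholder year, and the shortened sample are assumptions made for illustration; syslog lines carry no year, so any value works when only the gaps matter.

```python
import re
from datetime import datetime

# A few lines copied from the /var/log/messages excerpt above.
SAMPLE = """\
Nov 25 09:48:20 c1g kernel: Machine check events logged
Nov 25 09:53:20 c1g kernel: Machine check events logged
Nov 25 10:03:20 c1g kernel: Machine check events logged
Nov 25 10:08:20 c1g kernel: Machine check events logged
Nov 25 10:44:46 c1g syslogd 1.4.1: restart.
"""

LINE_RE = re.compile(
    r"^(\w{3} +\d+ \d\d:\d\d:\d\d) \S+ kernel: Machine check events logged"
)


def mce_times(text, year=2000):
    """Return the timestamps of 'Machine check events logged' lines.

    The year is a placeholder (assumption): syslog does not record it.
    """
    times = []
    for line in text.splitlines():
        m = LINE_RE.match(line)
        if m:
            stamp = f"{year} {m.group(1)}"
            times.append(datetime.strptime(stamp, "%Y %b %d %H:%M:%S"))
    return times


times = mce_times(SAMPLE)
# Seconds between consecutive machine check messages.
gaps = [(b - a).total_seconds() for a, b in zip(times, times[1:])]
print(len(times), gaps)
```

A steady stream of these messages is a hint to look at /var/log/mcelog for the decoded records, which is the next step here.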

tail -n100 /var/log/mcelog

HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 2 BANK 8 TSC 69cd4ba150c [at 2128 Mhz 0 days 0:56:56 uptime (unreliable)]
MISC c1ac44000081282 ADDR 5fa5c8580
MCG status:
MCi status:
Error overflow
MCi_MISC register valid
MCi_ADDR register valid
MCA: MEMORY CONTROLLER RD_CHANNELunspecified_ERR
Transaction: Memory read error
STATUS cc0001800001009f MCGSTATUS 0
MCE 2
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 6 BANK 8 TSC 69cd4ba18ca [at 2128 Mhz 0 days 0:56:56 uptime (unreliable)]
MISC c1ac44000081282 ADDR 5fa5c8580
MCG status:
MCi status:
Error overflow
MCi_MISC register valid
MCi_ADDR register valid
MCA: MEMORY CONTROLLER RD_CHANNELunspecified_ERR
Transaction: Memory read error
STATUS cc0001800001009f MCGSTATUS 0
MCE 3
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
CPU 4 BANK 8 TSC 69cd4ba1595 [at 2128 Mhz 0 days 0:56:56 uptime (unreliable)]
MISC c1ac44000081282 ADDR 5fa5c8580
MCG status:
MCi status:
Error overflow
MCi_MISC register valid
MCi_ADDR register valid
MCA: MEMORY CONTROLLER RD_CHANNELunspecified_ERR
Transaction: Memory read error
STATUS cc0001800001009f MCGSTATUS 0
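
The decoded lines above ("Error overflow", "MCi_ADDR register valid", "Memory read error") all come from the single STATUS value. The sketch below shows how that value breaks down, assuming the standard Intel IA32_MCi_STATUS layout (flag bits 63 to 57, MCA error code in the low 16 bits, memory controller codes of the form 0000 0000 1MMM CCCC). The function `decode_status` and its return format are illustrative names, not part of mcelog.

```python
# Flag bits of IA32_MCi_STATUS (assumed layout, per the Intel SDM).
FLAG_BITS = [
    (63, "VAL"),    # register contents are valid
    (62, "OVER"),   # error overflow: an earlier error was overwritten
    (61, "UC"),     # uncorrected error
    (60, "EN"),     # error reporting enabled
    (59, "MISCV"),  # MCi_MISC register valid
    (58, "ADDRV"),  # MCi_ADDR register valid
    (57, "PCC"),    # processor context corrupt
]

# MMM field of a memory controller error code.
MEM_OPS = {
    0b000: "generic",
    0b001: "read",
    0b010: "write",
    0b011: "address/command",
    0b100: "scrub",
}


def decode_status(status):
    """Split a 64-bit MCi_STATUS value into flags and MCA error code."""
    flags = {name: bool((status >> bit) & 1) for bit, name in FLAG_BITS}
    code = status & 0xFFFF
    result = {"flags": flags, "mca_code": code}
    # Memory controller errors have the form 0000 0000 1MMM CCCC.
    if (code & 0xFF80) == 0x0080:
        result["mem_op"] = MEM_OPS.get((code >> 4) & 0x7, "reserved")
        channel = code & 0xF
        # CCCC == 1111 means the channel is not specified.
        result["channel"] = None if channel == 0xF else channel
    return result


decoded = decode_status(0xCC0001800001009F)
print(decoded)
```

For the value in this log, VAL, OVER, MISCV and ADDRV are set while UC is clear, and the error code is a memory read with no channel given, which agrees with the text mcelog printed.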

The logs record memory errors. The machine originally had 24G of memory; after one 4G module was pulled, the problem did not recur.
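
One detail in the mcelog output that supports this reading: CPUs 2, 6 and 4 all report the same ADDR. A small sketch like the following groups records by address to make that visible. The helper name `cpus_by_address` and the trimmed sample are assumptions for illustration, and the physical address alone does not say which DIMM slot is at fault.

```python
import re
from collections import defaultdict

# CPU and MISC/ADDR lines copied from the mcelog excerpt above.
SAMPLE = """\
CPU 2 BANK 8 TSC 69cd4ba150c [at 2128 Mhz 0 days 0:56:56 uptime (unreliable)]
MISC c1ac44000081282 ADDR 5fa5c8580
MCi_ADDR register valid
CPU 6 BANK 8 TSC 69cd4ba18ca [at 2128 Mhz 0 days 0:56:56 uptime (unreliable)]
MISC c1ac44000081282 ADDR 5fa5c8580
CPU 4 BANK 8 TSC 69cd4ba1595 [at 2128 Mhz 0 days 0:56:56 uptime (unreliable)]
MISC c1ac44000081282 ADDR 5fa5c8580
"""

CPU_RE = re.compile(r"^CPU (\d+) BANK (\d+)")
ADDR_RE = re.compile(r"^MISC \S+ ADDR ([0-9a-fA-F]+)")


def cpus_by_address(text):
    """Map each reported physical address to the set of reporting CPUs."""
    grouped = defaultdict(set)
    cpu = None
    for line in text.splitlines():
        m = CPU_RE.match(line)
        if m:
            cpu = int(m.group(1))  # start of a new record
            continue
        m = ADDR_RE.match(line)
        if m and cpu is not None:
            grouped[int(m.group(1), 16)].add(cpu)
            cpu = None
    return dict(grouped)


for addr, cpus in cpus_by_address(SAMPLE).items():
    print(hex(addr), sorted(cpus))
```

A single address reported by several CPUs is what one would expect from one bad location in memory rather than from a fault in one CPU.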

Posted in Technology.
