Forum Moderators: open

Message Too Old, No Replies

Does anyone know what may be the problem?

         

LewisM

6:52 am on Dec 27, 2011 (gmt 0)

10+ Year Member



Hi, does anyone know what could be the problem with the server? It went offline with this error (1) and then each time we reboot it, it goes offline again with the error (2) while overloading with apache/php processes. Can it be RAM? Other which other hardware failure?

IMG (1) [imageshack.us ]

IMG (2) [imageshack.us ]

lammert

10:02 am on Dec 27, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Hi LewisM,

Look likes typical errors generated by a server with ECC memory where parity errors in the RAM are detected. The best thing you can do is to remove/replace the memory modules in the banks mentioned and see if the problem is solved.

LewisM

11:20 am on Dec 27, 2011 (gmt 0)

10+ Year Member



Well, the RAM tests where fine, so we are considering CPU failure. Could this be possible?

lammert

11:57 am on Dec 27, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



This type of error normally occurs during high load/temperatures of the components. Therefore a RAM test procedure may not show the error until all components like CPU, disks and power supply are back under full load.

J_RaD

4:39 pm on Dec 28, 2011 (gmt 0)



what did you use to test the ram?

LewisM

7:08 am on Jan 12, 2012 (gmt 0)

10+ Year Member



We have found the hard drive failure and needed to replace it.