[rescue] weird Opteron 865 / Tyan 4882 problem
patrick at zill.net
Tue Jul 14 21:20:31 CDT 2009
Jonathan Patschke wrote:
> On Tue, 14 Jul 2009, Patrick Giagnocavo wrote:
>> My thinking is that either:
>> 1. CPU1 is partially busted (specifically its memory controller) and
>> should be replaced
> Swap CPU0 and CPU1 to see if the problem follows the CPU.
Each CPU is connected with a heat pipe to another one, e.g.
CPU0 <--> CPU2
CPU1 <--> CPU3
Looks like I may have to hold off on deployment until I can fully debug
>> 2. There is a problem with the DIMM slots itself, like maybe a resistor
>> or some other electrical channel problem.
> Swap the memory between CPU0 and CPU1 to see if the problem follows the
I have swapped the memory, it works in the other slot.
>> Any ideas? Suggestions for further testing?
> Same as it ever was: set up two scenarios identical but for one variable
> (or a matrix with multiple changes) until you find the one variable that
Yeah, that sucks :-)
More information about the rescue