|
I’ve got a couple of dozen servers running taroon with the 2.4.21-9.0.3-smp packages. The servers are Dell 1750s
with 2 or 4gigs of memory, and either 2 72 gig drives or 3 146gb drives with
the Perc/4i raid controller. I’m not sure if I can rule out hardware failure,
because these perc/4i raid controllers are junk, and I’ve had 3 of them fail on me in
the past 6 months. Whatever the case, these boxes don’t seem to hold up
well under heavy file system load. I use rsnapshot to backup
once a night, and on a few of these servers I get occasional kernel panics. With the past 3 failures, I’ve had my / partition
wiped out.. I’ll reboot, run fsck,
it’ll delete the directory structure and put everything In lost+found. I haven’t found any evidence in errata
that going to the latest kernel will cure this, so I’d like to try to
rule out bad hardware if
possible. We’ve got “level 3” support on all of these boxes
from redhat, but not even redhat
can tell me what number to call at Dell to actually get them to
answer questions (apparently we have the joy of getting Dell to try to learn
how to pickup their phone for redhat support). Any thoughts on the subject matter? Michael T. Halligan -------------------- Mypoints.com Infrastructure Engineer 415-615-1160 |