The main file server blew up yesterday (Saturday) morning.  Fixed it with a reboot, but the RAID controller said it had a bad disk.

Called Dell, and they wanted me to do low-level (i.e., take the system down) verification before they'd send me another disk, which is reasonable.

Of course, it would be too much to ask for the user community to actually take the (expletive deleted) weekend off - we had people screaming at 7:50am about the server being down...  There are only 200-some people left; you'd think they might want to spend time with their families.

Anyway, decided to run a backup against the degraded volume before I played with it, and the backup took until 8:30pm - BUT the CRC verification on the tape took until midnight.

So, now I get to spend the night in the 65-degree computer room babysitting disk verifications and a RAID rebuild.  The first disk said it had *ONE* error, and when I rebooted, Windows said the volume was corrupt and I needed to run CHKDSK.  Now, this is WHY I'm running RAID on the (there's that expletive again) thing - to keep that from happening.  So now I'm expecting that one of the other disks blew chunks (of course, the RAID controller could say something other than "all the disks are fine, really!"), and when the controller tried to fail over from that disk it hit the bad sector on the first disk - or maybe not.  Maybe it's all just a REALLY, REALLY BAD DREAM.

At least I've got the backup - hope I didn't have bozos in here playing with the data all night while the tape was getting verified.

I'm just waiting for the server to say "I can't do that, Dave"...