Chronology | Current Month | Current Thread | Current Date |
[Year List] [Month List (current year)] | [Date Index] [Thread Index] | [Thread Prev] [Thread Next] | [Date Prev] [Date Next] |
we have something like six or seven servers for the
network, and several different Raid-5 systems. We have generally stocked
two replacement drives, but once we lost three in the same power failure. I
think two were on one server and the third was on a different server. Since
we only had two replacements in stock, we had to have the third drive
shipped overnight in order to get the system back up. That is an easy way
to make it take over 24 hours to get something back up... not having
sufficient replacement parts on hand. How many spare disk drives and spare
server computers do you think an organization should have on hand?
It can take 4 or 5 hours after power is restored just to figure out
what all is working and what is not working. Then, if we indeed lost two
Raid-5 disks, we have to restore from the backup, and that can indeed take a
long time (another 4 or 5 hours). ... Then, once the system is up, the
manager does some reliability testing before making things available
to the public again.
We were locked out of the
old colo building for *10 days* and could not get a single phone call
returned. Finally, the old colo owner, who knew the secrets of the
building, helped us *break in,* whereupon we "stole" our box out in
the dead of night, enabling us to bring it back up on a spare IP I
have at work. Still no word from the new (obviously fired) colo, like
for example "hey you can't just break in and take stuff!"
I contend my story competes,