Hi guys,
Not sure if this should be posted here but I'm gonna do it here anyways.
This past weekend, we took a hard hit at our office. We are in Melbourne, Florida where Hurricane Frances hit. Our generator failed on us after 24 hours so our UPS kicked in. Our UPS then couldn't handle the AC so that failed. I have Net Botz rack mount systems set up that takes the temperature, humidity, etc. readings and if it gets too hot, it will page all of the system admins. Well, it worked but our T1 provider line was down so it couldn't get through to the outside world.
So our 91 servers and a lot of Cisco equipment had no AC for three hours before my boss ventured in to make sure everything was okay. It was so hot that you couldn't touch the metal because you would of been burned.
Most of the servers will power back up but it's obvious we are gonna have a lot of problems, some of them are already flaking out. We got the go to gut everything and replace it, probably close to 350 to 400k in damages. We are gonna let the insurance battle it out later, we need to get this stuff replaced right away.
Lesson learned, but we need to set up some kind of system that will remotely shut down servers if it reaches a certain temperature. We have Windows, Solaris, Linux, and SGI servers so it would have to be able to shut down them all.
Does anyone have experience with this? I'm only the intern so this really isn't my responsibility, but it would really impress everyone if I could come up with a project that we could integrate in our server room.
Thanks guys!
Dan
Not sure if this should be posted here but I'm gonna do it here anyways.
This past weekend, we took a hard hit at our office. We are in Melbourne, Florida where Hurricane Frances hit. Our generator failed on us after 24 hours so our UPS kicked in. Our UPS then couldn't handle the AC so that failed. I have Net Botz rack mount systems set up that takes the temperature, humidity, etc. readings and if it gets too hot, it will page all of the system admins. Well, it worked but our T1 provider line was down so it couldn't get through to the outside world.
So our 91 servers and a lot of Cisco equipment had no AC for three hours before my boss ventured in to make sure everything was okay. It was so hot that you couldn't touch the metal because you would of been burned.
Most of the servers will power back up but it's obvious we are gonna have a lot of problems, some of them are already flaking out. We got the go to gut everything and replace it, probably close to 350 to 400k in damages. We are gonna let the insurance battle it out later, we need to get this stuff replaced right away.
Lesson learned, but we need to set up some kind of system that will remotely shut down servers if it reaches a certain temperature. We have Windows, Solaris, Linux, and SGI servers so it would have to be able to shut down them all.
Does anyone have experience with this? I'm only the intern so this really isn't my responsibility, but it would really impress everyone if I could come up with a project that we could integrate in our server room.
Thanks guys!
Dan