Thursday, January 10, 2008

My Last EC2 Availability Post

Tomorrow is my last day at the current gig, so this will be my last EC2 uptime report. I won't have access to these graphs/machines any longer.

So far, 100% availability of this EC2 dating back to November 27.

Unfortunately, there have been two other incidences with different EC2 hosted machines in the past couple of weeks. One machine rebooted itself. I'm not sure if it crashed or if there was another issue. It came back up fine, other than a few services that weren't configured to start at boot (that's mostly how we knew it rebooted).

Another machine disappeared for about an hour. It wasn't reachable at all -- ping, ssh, etc. When it came back to life, everything was fine. No change in uptime, so it didn't reboot. It was clustered with another EC2, which was fine during the outage, though it also couldn't access the down machine.

We're still in development, so at any one time, we're currently running 6-8 EC2's. With that rate of failure, be sure to back up your data and keep tabs on them with some sort of remote monitoring.

No comments: