A Case of Extreme Uptime

Edit: They tell us this should not be a badge of honor because it just means you are running unpatched machines, but I still think it is interesting. I was surprised that the links still work.

Originally posted November 27, 2012 on AIXchange

Ongoing maintenance of our machines is important. You should schedule change windows and make sure servers have the latest firmware and OS patches. Performing regular maintenance is the simplest way to avoid security vulnerabilities. Keeping current on fixes can save you from calling IBM Support; often their first response to a question is to tell you that your issue has been resolved in an already released service pack.

That said, AIX systems provides us with world-class technology. These machine are capable of running for a very long time without any care. And a few do.

A friend recently forwarded this email concerning one of his AIX machines:

            I have a production server that was here when I started in 1999. It was last booted on Jan. 14, 2000, almost 13 years ago…

            It was renamed after applications were migrated off of the server two weeks ago. It is now going to be used as a DR box. As you can see below, it was up 4,675 days before I rebooted it this morning. And yes, it came up just fine.

            # oslevel -r

            4330-11

            # uptime

            09:35AM   up 4675 days,   2:21, 2 users, load average: 1.22, 1.29, 1.28

This is, of course, first and foremost a tribute to the quality of AIX systems. However, a not insignificant amount of good fortune is also involved. This box ran continuously for almost 13 years. Power outages were never an issue. Any hardware issues were resolved through hot swapping. No one accidentally logged into this production server and accidentally ran a shutdown –Fr. The firewall that this box must have operated behind kept it safe from constant attacks.

I was impressed to hear of a production AIX server running for this amount of time without even a reboot. I imagine there are systems that have been up even longer, though I couldn’t find anything specific. If you’d care to do your own research, there are threads devoted to this sort of thing. See here, here, here, here and here.

Frankly, I wouldn’t recommend treating a machine this way. I always want to be sure I’m running a supported operating system with the latest fixes. Still, these types of stories surface every now and again, maybe you have your own. What’s the longest-running production system that you know of? What were the circumstances? Please share your anecdotes in Comments.