Sononaco: The Blog

Announcements

A Holiday “Gift” From the Server Gremlins

If you have been following our Twitter stream you will know we have been having problems with one of our web servers. Bad things happen. This was about the worst case scenario we could envision.

Fortunately we had backups of everything since about 12 hours before the crash. Everything has been restored, all sites are up and we are getting all of the accounts properly configured.

Here is how it all went down:

On Wednesday at approximately 3PM we received notices that sites were behaving erratically – database connections were dropping and required libraries we not being loaded. When we tried to log on to the server the connections were refused.

We called the server room and hooked up a terminal which showed a kernel panic – similar to the Windows Blue Screen of Death or the Mac’s “grey screen with white text” so we rebooted the machine. From there it never came back.

Our techs then ran a file system check to make sure the files were not corrupt. The system passed the check but on boot-up the server would kernel panic.

We attempted to resurrect the files directly on the server but the repeating kernel panic prevented us from booting the server.

So we provisioned a new server and began the time-consuming process of restoring the backups from the full backup on December 23rd. This would give us all of the files that were on the server the day it went down.

After the backups were complete we started the process of restoring each site individually which has been completed.

We are now in the process of updating the databases and files from the incremental backups performed over the last few days.

How do we prevent this in the future? We have taken the measure of installing a new server architecture and chipset, newer, more reliable hard drives and a new upgraded RAID system. And yes, we are continuing to back up everything on the server.

We were 8 days shy of this server being up for 1,000 days. We have never had a downtime experience that has lasted this long. Bad things happen. Electronics break. But we can make sure procedures are in place to minimize data loss and get your information back online as soon as possible.

Thank you for your patience and understanding while we worked to restore your data.

Mail server upgrade scheduled for Memorial Day Weekend

Several months ago a major upgrade to our mail server was released, Zimbra 7.0. We are currently running version 6.

I have been reluctant to upgrade the server because there were several changes made which concerned me, most notably the absence of the Instant Messaging and Documents tabs/applications. Plus, mission-critical operations such as e-mail are not the technologies to blindly jump onto the latest and greatest. When was the last time the mail server was down for something other than maintenance? Right: February 23rd, 2011. And before that? 1,010 days prior. Not a bad track record.

So now the software is at version 7.1. That “dot-1″ is key to me because it means the release-version bugs have been fixed. During the testing I have found how to re-enable the instant messaging application and access the Documents, which have been moved to the Briefcase application.

Unless there are no objections I am hoping to apply this upgrade to the mail server over Memorial Day weekend. If you have any questions, concerns or wish to try a demo of the new software please let me know and I will set up a demo account.

Mother’s Day Mail Server Upgrade

We will be upgrading the mail server this weekend by applying the latest security patch. For the most part these upgrades are extremely smooth, only requiring about 20-30 minutes of downtime.

The upgrade will be performed Saturday or Sunday. If you have any questions about this please contact us.

This upgrade will have *no* effect on web sites or e-commerce.

Fox: Final Two Episodes of Glee Lost to Data Center Crash

LOS ANGELES: Fox Studios reported today that the last two episodes of the of the popular sitcom Glee, set to air later this month, have been lost due to a “catastrophic” failure at their primary data center.

The episodes “A Night to Neglect” and “Born This Way” were scheduled to air April 19th and 26, respectively.

“We are working to restore as much of the episodes as we can from the production material,” said Sydney Tomlinson, a spokesman for Fox. He added, “even our backup systems were wiped out.”

Since moving to a purely digital format, studios have been increasingly reliant on technology, with all shows now stored on digital medium.

“Losing the last two episodes of such a popular show is not something you can plan for” said Mr. Tomlinson. Fox plans to air recently discovered “lost” episodes of Parker Lewis Can’t Lose until the missing episodes of Glee can be restored or re-shot.

All information © 2010 Sononaco, Inc.