Sononaco: The Blog

News

Downtime issues 6/16 around 3AM

In the wee hours of Thursday there came a bump in the night at our server host. The details of the outage are explained below.

In short, Cox Communications – an upstream internet provider for our hosting company – did something stupid by allocating a block of IP address to themselves. This essentially redirected the IP addresses to Cox’s network preventing the traffic from reaching the hosting company.

=====

At approximately 3:20 AM EDT, CARI.net internal monitoring began reporting problems with DNS resolution. The problem was immediately escalated to our on call senior network admins. Due to the nature of the problem, remote access was not possible to resolve the issue, onsite access would be required. Once onsite it was established that two of our upstream Bandwidth providers (Level3 and COX) were not passing traffic, however the connections themselves were functional. Both providers were contacted and tickets were opened with tier 1 support. Working with Level3, we were able to jointly identify that the problem was originating from the COX network.

COX was apparently routing 3 of CARI.net’s 5 IP allocations incorrectly causing traffic to be dropped in the COX network.

At 5:30 AM COX’s on call Hi-Cap engineer contacted us. Since this was a routing problem he had to transfer the issue to the routing group. At 6:06 AM the on call COX routing engineer contacted us and confirmed what we already knew and stated that he would work on the problem and call us back. At 6:40 AM CARI.net internal monitoring indicated that DNS was once again functioning and some traffic was once again flowing to Level3. At 7:05 AM COX called back indicating that the problem was fixed.

COX will be working to create a full report of the incident. We will not be using the COX service until we receive this report. During the outage, all of CARI.net’s services were internally functioning normally.

=====

Mail System Upgraded to Version 7.0

The mail server upgrade has been completed. You will find a few things have been moved around.

1. The instant messaging system is still there.

2. Your Documents have been moved into the Briefcase

3. New Zimlets are activated. Please check Preferences > Zimlets to enable them if they are not enabled already. The new “social” Zimlet has been in hot demand.

4. The new “Carbon” theme is beautiful. Find it under Preferences > General Theme

Enjoy the new system!

Server downtime April 17th

In the wee hours of Sunday morning April 17th something went wrong. But first, a little history and a quick lesson.

First a little Computers 101: Each file and folder have users and permissions. Users “own” the files/folders and permissions allow other users to take action such as reading, writing and executing on the files/folders. This is important for later.

Earlier this week, as we do every week, we applied security patches to our hosting servers. Usually this process goes so smoothly no one ever knows it’s happening or happened. The process was flawless and we went about our business.

Fast forward to Sunday morning at 4AM. That’s when our servers run through their weekly maintenance routines (cleaning up logs, clearing out caches, rotating logs). It usually lasts about 5 minutes and is a most unspectacular event. That is, unless there is a problem.

With the system security patch earlier in the week a tiny little bug was introduced when patching the module that handles all of the security routines of the web site. In human speak, it’s the thing that makes web addresses that begin with “https” secure.

When the upgrade was applied the “aliases” folder in the web root had permissions and ownership changed. During the maintenance routines the system tried to restart the Apache web server. With the permissions and ownership different on the “aliases” the server could not be restarted.

You may think this is kind of stupid that a silly permissions issue would prevent the server from restarting but it’s a good thing. We don’t want to grant access to everything on the server. That would be bad.

So Sunday morning we spent our time changing permissions and restarting the services. We apologize for the downtime and appreciate your understanding.

Mail Maintenance 12/16 10PM-11PM

The mail system will be offline momentarily this evening between 10 & 11PM for routine maintenance.  You will not be able to send or receive e-mail during this period.  All incoming e-mail will be held for delivery after the maintenance window.

Other services such as Calendar, Instant Messaging, File Sharing and Documents will not be available during this period.

Web sites and payment/donations gateways are not affected by this maintenance window.

If you have any questions about this maintenance please let us know.  And if you want to see some of the features coming up in the next version of the Mail/Calendar server (version 7.0), have a look here. And if you haven’t had the chance to download the new version of Zimbra Desktop it’s a free download!  You can get here for Mac, Linux and Windows.

Note: As with any milestone software release we will likely wait until version 7.0.1 before deploying it for everyone.

All information © 2010 Sononaco, Inc.