Just a bit of advice, but a post from Jin explaining why the site is down so much and what he's trying to do to improve it, instead of putting it through you guys will be *much* more well received.
Printable View
Just a bit of advice, but a post from Jin explaining why the site is down so much and what he's trying to do to improve it, instead of putting it through you guys will be *much* more well received.
Sorry for my recent lack of activity but things have been mounting up upon me.
One problem is since our rebuild we have lost a lot of our optimizations and monitoring tools which inform me whenever we have an issue. At the moment I only receive the basic notification when the server has gone completely down. I will restore these monitoring tools when I have the opportunity.
My inactivity has been due to some work issues currently I am trying to deal with little over 500 members of staff for Glastonbury alone and as soon as I got on top of that I received the PO for the Royal wedding just a week before which meant I had to squeeze 12 days of planning into six. The last 3 weeks have been a hellish nightmare of bureaucracy, politics and meetings.
One major issue with the server has been resolved, it was caused by the primary disk on the server filling up with backups which should have been written to a separate disk. The problem didn't emerge until the server was rebooted on the 18/04 and continued to go unnoticed until several notifications were received regarding the server frequently going down.
I hate to beat a dead horse or however the saying goes, but this is why a second administrator is crucial, even with basic access. Even for something as simple as the server I run for hosting/minecraft etc I appointed 3 different people who I knew could manage different aspects of the server whilst I was on holiday, this meant if one of them was rogue, not all my services would have been affected.
I respect that, Tom and I agree that the more server technicians a site has the more of a chance of getting fixed quicker, however if Jin doesn't want to hire extra help he doesn't have to, with it being his site and I'm sure he has reasons for this.
Although, as revealed on the "HabboxDev" twitter Jin does occasionally have helping hands, however they are not users of the site - which is naturally fine as a back end administrator doesnt need to, and perhaps the problem recently is what Jin highlighted about the notification system not being fully in place, so these administrators dont know that the sites having problems :P
I obviously wasn't online in the early hours, can anyone verify that the back-up problem Jin highlighted is now less of a down-time?
Downtime has become quite common in the past couple of days. So common that i'm no longer posting anything without copying it first incase it gets lost :/ (Hate it when that happens after typing an incredibly long post). Forum was down for around 5-10 minutes just a few minutes ago.