Forum Moderators: phranque
They say they doing nightly backups from 4am to 5am EST
which explains extreme lag but this is getting worrysome.
They are always open to my suggestions and do things for me
like minor software updates within an hour so I don't
want to bail out on them (also have like a dozen clients on
that server that I am responsible for).
What technical solutions can I ask them to do to their server backup
to speed up the process or allow some bandwidth in and out of it while this is happening?
(it's a "standard" apache server)
Thanks for any ideas! -aV-
seems like my host does nightly rsync backups and then weekly full backups,
and somehow they both triggered at the same time while there was decent server traffic
caused the server memory to go too low and it crashed!
just bothers me it took someone two hours to discover and correct the situation...
they have at least 98%+ uptime since I have been with them though...
-aV-
If this happens once a year I can live with it... but if it
happens again this year I might have to have a serious talk
with a senior tech there...
<added>Zero downtime would require physical redundancy of the server with automatic switchover mechanisms in place. That can be had, but is very expensive.</added>
That to me is acceptable and is what our host offers. I'm just an amateur, but would assume that this situation may have been a bit unprofessional on the part of your host.
Im sure large hosts have all sorts of back up systems and system monitors to prevent that exact case happening, as Bird sugegsted. I think our host does mention that "redundancy" word in their FAQ's on their server set, and it impressed me though i wouldnt have a clue what they were on about.
Timing is no excuse. Early morning in the US is actually our own peak times, so unless their clients have 90% or more North American users its not that much of a good excuse.
If i was happy with all the other aspects of their services however i would be happy with a report if what they have learned from the experience and what they are doing in future to prevent them, and would give them another chance or two.
Looks like a slipped a decimal. 99.9% is 8.76 hours a year. Consequently, 99.99% is less than an hour (53.56 minutes).
Just for the sake of completeness, the 98% uptime mentioned in the second post would allow for almost a week of downtime per year. I'm not sure if I would be happy with that.
Btw: Redundancy can be applied to many different things. Most commonly that is to connectivity and routing. But to prevent downtime on genuine server crashes, your site would need to be hosted on two seperate machines at the same time, which is rare.