2) Be aware of problems as soon as possible and start working on the critical ones without any delay. As you wrote, your team doesn't cover 24 hours a day, but it probably should cover the critical days and times, and monitoring should be set up so that the staff on emergency call get a message whenever a critical issue occurs, at any time.
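To make the idea concrete, here is a rough sketch of such a routing rule in Python. The severity names, the business hours, and the `route_alert` function are all made up for illustration; a real monitoring tool would have its own way of configuring this.

```python
from datetime import datetime, time

# Hypothetical business hours (assumption, not from the original post).
BUSINESS_START = time(9, 0)
BUSINESS_END = time(17, 0)


def route_alert(severity: str, now: datetime) -> str:
    """Return who should receive an alert raised at `now`.

    "on_call" = staff on emergency call, "team" = regular team,
    "queue"   = held until the next working period.
    """
    if severity == "critical":
        # Critical issues reach the on-call staff at any time of day.
        return "on_call"
    # Monday..Friday are weekday() 0..4.
    in_hours = now.weekday() < 5 and BUSINESS_START <= now.time() < BUSINESS_END
    # Everything else goes to the team during working hours, otherwise it waits.
    return "team" if in_hours else "queue"
```

For example, `route_alert("critical", datetime(2024, 1, 6, 3, 0))` (a Saturday at 03:00) goes to the on-call staff, while a non-critical alert at the same moment is simply queued.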