Reliability Glossary

What is uptime monitoring?

Uptime monitoring is the practice of repeatedly checking a service's availability and response from outside the system, so you learn it's down the moment your users would.

Uptime monitoring, defined

Uptime monitoring works by sending requests to your service from external vantage points on a fixed schedule — say every 30 seconds — and recording whether each one succeeds and how long it takes. Because the checks originate outside your infrastructure, they catch failures that internal monitoring can miss: a dead load balancer, an expired TLS certificate, or a DNS misconfiguration that makes the whole service unreachable.

A monitor defines what "up" means: a successful HTTP status, a response within a latency threshold, or the presence of expected text in the body. When enough consecutive checks fail, the monitor declares the service down and triggers an alert. The interval, the geographic locations of the checkers, and the failure criteria are all things you configure.

How uptime monitoring works

A few configuration choices determine how fast, how reliable, and how meaningful your uptime monitoring is.

Check interval

How often the monitor probes your service. A shorter interval detects outages faster but generates more traffic; one minute is a common balance for most services.

Check locations

Probing from multiple regions distinguishes a real outage from a single network glitch and reveals problems that only affect users in certain geographies.

What counts as down

A monitor can assert on status code, response time, body content, or certificate validity. To avoid false alarms, most require several consecutive failures before declaring an outage.

Alerting

When a monitor trips, it notifies your team through the channels you configure, so the people who can fix it find out before a flood of customer complaints does.

Certificate & DNS checks

Beyond plain availability, monitors can warn before a TLS certificate expires or detect DNS problems — failures that take a whole service offline but aren't application bugs.

Why uptime monitoring matters

Uptime monitoring is your outside-in safety net. Your internal dashboards can look perfectly green while customers can't reach you at all, because the failure is in a layer your own telemetry never sees. Checking from outside is the only way to be confident the service is actually reachable.

It's also the foundation of your availability SLI: the percentage of successful external checks over time is a clean, user-aligned measure of whether you're meeting your reliability targets. Fast detection here directly lowers MTTD — and therefore MTTR.

Uptime monitoring in AllStak

AllStak's uptime monitoring checks your endpoints on a schedule and alerts your team when they fail, so you learn about outages from your tools rather than from your customers. Pair it with notification rules to route alerts to the right people.

Because uptime lives in the same platform as your status pages, error tracking, and incident management, an external failure can flow straight from detection to a customer-facing status update without leaving AllStak.

Uptime monitoring FAQ

How is uptime monitoring different from APM?

Uptime monitoring checks availability from outside your system, answering 'can users reach it?'. APM instruments code from inside to explain why a request is slow or failing. They're complementary: outside-in detection plus inside-out diagnosis.

How often should checks run?

One minute is a common default. Shorter intervals detect outages faster, which lowers your detection time, but generate more probe traffic — pick based on how critical the endpoint is.

What counts as downtime?

Whatever your monitor's criteria define — an error status code, a response slower than your threshold, missing expected content, or an invalid certificate. To reduce false positives, most tools require several consecutive failures.

Why monitor from multiple locations?

A single checker can report a failure that's really just its own network blip. Checking from several regions confirms whether an outage is real and reveals problems affecting only some geographies.

Know about outages before your users do

AllStak's uptime monitoring checks your endpoints and alerts your team the moment they fail — wired into your status pages and incidents. Start free.