LearnSeverity Levels
Fundamentals

Incident Severity Levels Explained

How to define SEV0 through SEV4 for consistent incident triage and response.

10 min readLast updated: January 2026

Why Severity Levels Matter

Severity levels create a shared language for incident urgency. Without them, every incident feels equally urgent (or equally ignorable). With them, your team knows exactly how to respond:

  • Appropriate response: A SEV0 needs all hands. A SEV3 can wait until Monday.
  • Clear communication: Stakeholders know what "SEV1" means without explanation.
  • Consistent triage: Different on-call engineers make the same classification decisions.
  • Resource planning: You can staff appropriately for different severity levels.

Severity Level Definitions

Most organizations use a 4 or 5-level system. Here's a common framework that works for most engineering teams:

SEV0
Critical

Complete service outage or data loss affecting all users

Examples

  • Website completely down
  • Database corruption with data loss
  • Security breach with active exploitation
  • Payment processing completely broken

Response Expectations

Acknowledgment:< 5 minutes
Status updates:Every 15 minutes
Resolution:All hands until resolved
SEV1
High

Major functionality broken affecting most users

Examples

  • Core feature completely unavailable
  • Significant performance degradation (>10s latency)
  • Authentication broken for all users
  • Major revenue impact

Response Expectations

Acknowledgment:< 15 minutes
Status updates:Every 30 minutes
Resolution:Dedicated team, after-hours escalation
SEV2
Medium

Functionality degraded affecting subset of users

Examples

  • Feature broken for specific user segment
  • Non-critical integrations failing
  • Performance issues affecting some regions
  • Workaround available but inconvenient

Response Expectations

Acknowledgment:< 1 hour
Status updates:Every 2 hours
Resolution:Normal business hours priority
SEV3
Low

Minor issues with limited impact

Examples

  • Cosmetic bugs
  • Non-critical feature issues
  • Minor performance degradation
  • Issues affecting <1% of users

Response Expectations

Acknowledgment:< 4 hours
Status updates:Daily
Resolution:Next sprint or backlog
SEV4
Informational

Tracking items, tech debt, or improvements

Examples

  • Deprecation warnings
  • Non-urgent technical debt
  • Feature requests from incidents
  • Documentation gaps discovered

Response Expectations

Acknowledgment:Best effort
Status updates:As needed
Resolution:Backlog prioritization

Triage Criteria

When an incident occurs, use these questions to determine severity:

Impact Assessment

  • How many users are affected? All users → higher severity. Small subset → lower.
  • Which functionality is broken? Core revenue path → higher. Nice-to-have feature → lower.
  • Is there a workaround? No workaround → higher. Easy workaround → lower.
  • Is data at risk? Data loss or security → automatic SEV0/SEV1.

Urgency Assessment

  • Is the situation getting worse? Spreading impact → higher severity.
  • Are SLAs at risk? Burning error budget → higher severity.
  • Is there external visibility? Customer complaints or press → higher severity.

Pro Tip: When in Doubt, Go Higher

It's easier to downgrade a SEV1 to SEV2 than to catch up after treating a SEV1 as SEV3. Encourage your team to err on the side of higher severity. You can always adjust.

Customizing for Your Organization

The framework above is a starting point. Customize it for your context:

  • Define "most users" — Is that 50%? 80%? Be specific.
  • List your critical paths — What constitutes "core functionality" for you?
  • Set SLA thresholds — At what error rate do you escalate?
  • Consider business context — A SEV2 during Black Friday might be a SEV1.

Common Mistakes

  • Severity inflation: Everything is SEV1 → nothing is. Reserve SEV0/1 for real emergencies.
  • Severity deflation: Downplaying incidents to avoid escalation. This hides real problems.
  • Arguing about severity during incidents: Pick one, move on. Argue later in the post-mortem.
  • Not adjusting mid-incident: Severity can change as you learn more. Update it.

Document and Train

Your severity definitions are only useful if everyone knows them:

  • Write them down in your runbooks or wiki
  • Include examples specific to your systems
  • Review classifications in post-mortems ("Was SEV1 appropriate?")
  • Train new team members during onboarding

Next Steps

Continue learning about incident management:

Track Incidents with OpsBrief

OpsBrief automatically categorizes and tracks incidents from all your tools. Get AI-powered analysis of incident trends and severity distributions.