Critical Controls Risk Management Safety Bowtie Analysis Mining

Control Effectiveness: How to Measure and Monitor Your Barriers

RiskSight Team

Most organisations are reasonably good at identifying controls. They appear in risk registers, safety plans, and bowtie diagrams. The hard part — the part most organisations do poorly — is knowing whether those controls are actually working.

Control effectiveness is the measure of how well a control is doing its job: preventing threats from reaching a top event, or limiting consequences when they do. Without ongoing monitoring of control effectiveness, your risk register reflects what you intended, not what’s actually protecting your people.

This is the gap that produces repeat incidents. Not a failure to identify hazards or select controls — but a failure to verify that the controls are functioning as designed, and to act when they’re not.

What Control Effectiveness Actually Means

A control is “effective” when it reliably interrupts the causal pathway it’s designed to interrupt, under the conditions it’s expected to operate in, at the time it’s needed.

That definition has three important components:

Reliably — not just once, under ideal conditions, but consistently over time and across personnel, shifts, locations, and environmental conditions.

Under the conditions it’s expected to operate in — a control that works on day shift in good weather may fail on night shift in wet conditions. Effectiveness is context-dependent.

At the time it’s needed — a control that works most of the time but fails at precisely the moment a threat occurs provides less protection than it appears to.

Most control assessments at the design stage consider whether a control can work. Monitoring control effectiveness is about whether it does work — continuously, not just at the point of assessment.

Leading vs Lagging Indicators of Control Effectiveness

The most common mistake in monitoring control effectiveness is relying exclusively on lagging indicators — measures that tell you something has already gone wrong.

Lagging indicators include:

  • Lost-time injury frequency rate (LTIFR)
  • Number of recordable incidents
  • Number of regulatory notices or prosecutions
  • Near-miss counts (which are lagging relative to the control failures that enabled them)

Lagging indicators are necessary for understanding historical performance. But they’re poor monitors of current control effectiveness because they only register failure after harm has occurred. By the time your LTIFR ticks up, the control failures that produced it have already happened — possibly multiple times.

Leading indicators measure conditions and activities that precede harm — they tell you the control is at risk of failing before the incident.

Useful leading indicators for control effectiveness include:

Verification completion rates — are the scheduled checks on your critical controls actually being completed on time? A verification completion rate below 90% for a critical control is a signal that the monitoring system itself is breaking down.

Defect and non-conformance rates — how often do field inspections find controls in a non-conforming state? A rising non-conformance rate on a specific control suggests it’s degrading.

Near-miss reports involving the control — near-misses where the control was relevant but either failed or wasn’t in place are direct evidence of degradation, even if no harm resulted.

Overdue maintenance on control-dependent equipment — if a critical control relies on equipment function (an interlock, a detection system, a physical barrier), overdue maintenance increases the probability of failure.

Workforce competency currency — for controls that depend on trained behaviour, are the relevant workers current in their training and competency assessment?

Permit and procedure compliance rates — how often are the administrative controls (permits to work, isolation procedures, confined space entry procedures) being completed correctly versus being back-signed or skipped?

The power of leading indicators is early warning. When verification completion rates fall, when non-conformances rise, when near-misses cluster around a specific control — these signals allow you to intervene before the incident, not after.

Levels of Control Verification

In critical control management frameworks, verification activities are structured in levels that correspond to different frequencies and perspectives.

Level 1 — Operational verification (daily to weekly)

Conducted by frontline workers or supervisors as part of normal operations. The goal is confirming that the control is in place and functioning before and during the work it’s protecting.

Examples:

  • Operator confirms proximity detection system function during pre-start
  • Supervisor checks that isolation points have been tagged out before maintenance begins
  • Worker confirms edge protection is in place before starting elevated work

Level 1 verification is frequent and brief. It doesn’t require specialist knowledge — it’s a field confirmation that the control is present and appears functional.

Level 2 — Supervisory verification (weekly to monthly)

Conducted by supervisors, safety professionals, or technical specialists. More detailed than Level 1, this verification often includes checking the quality of Level 1 records and assessing whether the control is performing to its defined standard.

Examples:

  • Supervisor reviews a sample of pre-start records and spot-checks corresponding equipment
  • Safety advisor audits permit-to-work completion quality for the previous two weeks
  • Maintenance team confirms calibration is current for all proximity detection units

Level 2 verification catches the gap between paperwork and reality — where Level 1 records show compliance but field conditions don’t match.

Level 3 — Management review (monthly to quarterly)

Conducted by operational management or HSEQ leadership. Focuses on trends, patterns, and systemic issues rather than individual instances.

Examples:

  • Operations manager reviews aggregated verification data for all critical controls and identifies controls with declining performance
  • HSEQ manager presents control health summary to site leadership team
  • Business unit review of critical control status across all sites

Level 3 verification is where leading indicator trends are interpreted and where decisions about escalation, additional resources, or control redesign are made.

How to Define a Control Performance Standard

You can’t measure control effectiveness without first defining what “effective” looks like. This is the performance standard — the specific, observable condition that confirms the control is functioning as intended.

A good performance standard answers the question: If someone came to verify this control right now, what would they look for to confirm it’s in place and working?

Weak performance standard: “Isolation procedure is in place”

Strong performance standard: “All energy sources are isolated, locked, and tagged. Lock box is in use. Isolation register is signed. A verification check confirms zero energy is present at the point of work before work commences.”

The strong version is specific enough that a field verification is unambiguous — either it’s done or it isn’t. The weak version invites interpretation and allows a paperwork-only check to pass as adequate.

Performance standards should also define:

  • Who is responsible for the control’s function
  • The verification method (physical check, instrument reading, record review)
  • The frequency of verification at each level
  • The escalation trigger if the standard is not met

Responding to Control Degradation

Monitoring only has value if it produces action. When a control shows signs of degradation, the response must be proportionate to the severity of the control and the nature of the signal.

Immediate signals (respond now):

  • A critical control is confirmed absent or non-functional
  • A near-miss occurs where a critical control failed or was absent
  • Field verification finds the control in a grossly non-conforming state

Response: Stop the associated high-risk work. Investigate the cause of the failure. Do not resume until the control is restored and the cause of failure is understood.

Trend signals (respond within days):

  • Verification completion rates falling below threshold
  • Increasing frequency of non-conformances on the same control
  • Multiple near-misses involving the same control pathway over a short period

Response: Escalate to site management. Assign an investigation. Review the control performance standard and verification approach. Implement interim compensating controls while the root cause is addressed.

Slow degradation signals (respond within weeks):

  • Gradual decline in verification quality
  • Increasing overdue maintenance on control-dependent equipment
  • Workforce competency for the control falling below target

Response: Include in the next management review agenda. Assign corrective actions with owners and due dates. Monitor trend to confirm the direction reverses.

The goal of the response framework is to match the urgency of the response to the significance of the signal — while ensuring no signal is ignored entirely.

Why Controls Degrade Over Time

Understanding why controls degrade is as important as detecting when they degrade. Common root causes:

Production pressure — the control slows work down, and when schedule pressure is high, it gets bypassed. This is particularly common with administrative controls (permits, pre-starts, sign-offs) where the record is created after the fact.

Familiarity — the control has always been there, nothing bad has happened, and it starts to feel like formality rather than protection. Workers and supervisors become less rigorous about verification.

Unclear ownership — when nobody is specifically responsible for the control’s performance, maintenance and monitoring slip. Multiple owners often means no owner.

Design mismatch — the control was designed for conditions that no longer exist. New equipment, changed processes, rotated personnel — the control no longer fits the risk it was meant to address.

Verification fatigue — if critical controls are too numerous, or verification activities too frequent relative to available resources, quality declines. The form gets completed but the physical check doesn’t happen.

Understanding these drivers shapes the corrective action. A control degrading due to production pressure needs a different response than one degrading due to unclear ownership.

Connecting Control Effectiveness to Your Risk Register

Control effectiveness shouldn’t be assessed in isolation. It should feed directly back into your risk ratings.

If a control that you’ve rated as “effective” in your risk register is showing degradation signals — rising non-conformances, falling verification rates, incident linkage — the residual risk rating for every hazard that control manages is higher than the register shows. That gap between assessed risk and actual risk is exactly what produces surprise events.

An integrated risk management platform closes this loop: when control effectiveness data updates, residual risk ratings update. When an incident links to a control failure, it flags the control for review. When verification rates fall below threshold, the risk rating escalates.

RiskSight’s critical risk software connects control verification, incident management, and risk register in a single model — so your risk register reflects reality, not the snapshot from the last formal assessment. Start your 30-day free trial with demo data included so you can see how it works before committing.

Ready to modernise your risk management?

Start your 30-day free trial. No credit card required.

Start free trial