Classical root cause analysis techniques for quality improvement and incident investigation. Covers 5 Whys (with Card's 2017 critique and boundary conditions), Ishikawa/fishbone diagrams, Fault Tree Analysis (FTA), Failure Modes and Effects Analysis (FMEA), Cause Mapping, and Doggett's method-selection framework. Use when investigating a failure, running a post-incident analysis, building a fishbone, calculating fault tree cut sets, running an FMEA pass, or deciding which classical technique fits the problem. Not suitable alone for complex socio-technical failures — escalate to rca-systems-theoretic or rca-causal-inference when the incident has multiple interacting causal chains.
Classical root cause analysis techniques are the inherited toolkit of quality and reliability engineering. They are best understood as a family with different strengths and known failure modes — not as interchangeable recipes. This skill teaches when to reach for each one, how to run it rigorously, and when to stop and escalate to a more capable method.
| Technique | Origin | Best for | Known weakness |
|---|---|---|---|
| 5 Whys | Toyota Production System (Sakichi Toyoda, Taiichi Ohno) | Simple, linear, single-cause problems on the shop floor | Forces single pathway, non-reproducible across analysts (Card 2017) |
| Fishbone (Ishikawa) | Kaoru Ishikawa, 1960s | Brainstorming categorized contributing factors | Can produce unwieldy diagrams, no intrinsic prioritization |
| Fault Tree Analysis (FTA) | Bell Labs, Minuteman ICBM (1960s) | Safety-critical systems with quantifiable failure probabilities |
| Requires known failure modes upfront, expensive to build |
| FMEA | U.S. military (MIL-P-1629, 1949); automotive (AIAG) | Proactive design review, risk prioritization via RPN | RPN ordinal-math is statistically unsound (Bowles 2003) |
| Cause Mapping | ThinkReliability | Multi-pathway causal narratives without imposing linearity | Tooling-dependent, visual sprawl |
| Doggett's framework | A. Mark Doggett (2005) | Choosing among the above based on problem characteristics | It is a meta-method, not an analysis technique itself |
Iterative questioning that walks backward from a symptom to a causal chain. Originally formalized by Taiichi Ohno within the Toyota Production System as "the basis of Toyota's scientific approach" (Ohno, 1988). Ohno's canonical example:
Problem: A machine stopped.
Why 1: A fuse blew because of overload.
Why 2: The bearing lubrication was inadequate.
Why 3: The lubrication pump wasn't pumping enough.
Why 4: The pump shaft was worn and rattling.
Why 5: There was no strainer attached, and metal scraps got in.
Alan J. Card's 2017 critique in BMJ Quality & Safety is the definitive takedown and must be understood before using this technique:
Card's recommendation for healthcare: abandon 5 Whys in favor of fishbone and multi-causal diagrams. Teruyuki Minoura, former Toyota managing director, independently called the technique "too basic" for adequate root-cause depth.
1. State the problem in observable terms. No causes, no blame.
2. Ask "why" — answer with a *factual* antecedent, not an opinion.
3. Before accepting an answer, ask: "Are there other reasons this could happen?"
If yes, branch. You are no longer doing 5 Whys; you are building a cause map.
4. Continue until the chain reaches an actionable factor — NOT a preset depth of 5.
5. Require two independent analysts to reach the same chain. If they do not,
the method has failed for this incident; escalate to Fishbone or STAMP.
A cause-and-effect diagram that organizes contributing factors into categories attached to a spine leading to the effect (the incident). Kaoru Ishikawa developed it in the 1960s at Kawasaki Heavy Industries as part of TQM.
Six M's (manufacturing):
Four P's (service industries): Policies, Procedures, People, Plant.
Eight P's (marketing): Product, Price, Place, Promotion, People, Process, Physical Evidence, Performance.
For software incidents, a modern adaptation used by SRE teams: Code, Config, Data, Dependency, Infrastructure, Process, People, Environment.
Empirical survey of 43 Polish industrial organizations found Ishikawa was "by far the quality tool used most with 5 Whys, far exceeding all other quality tools combined." This validates the industry-standard pattern of 5 Whys as probe + Fishbone as map rather than treating them as alternatives.
A top-down deductive technique for tracing a specified undesired event ("top event") back through combinations of lower-level events and basic causes. Invented at Bell Labs in 1962 for the Minuteman ICBM and standardized in NUREG-0492 and IEC 61025.
A proactive, forward-looking technique: list every way a component or process step can fail, and rate each failure mode on three axes. Developed by the U.S. military in 1949 (MIL-P-1629), adopted by NASA for Apollo, and institutionalized in automotive via the AIAG FMEA Handbook.
RPN = Severity × Occurrence × Detection
Bowles (2003) and later work demonstrate that multiplying ordinal ranks produces nonsense distances: the difference between RPN 120 and 125 is not meaningful, and two failure modes with the same RPN can have wildly different action priorities. AIAG's 2019 handbook replaces RPN with Action Priority (AP) — a lookup table that partitions S/O/D combinations into High/Medium/Low action categories without ordinal arithmetic.
Modern guidance: Use AP tables, not RPN, for any new FMEA. If a tool still emits RPN, ignore it and read the S/O/D triplet directly.
The 20-year scoping review (Paper 5 in our local research) found FMEA widely adopted in hospital settings but with poor follow-through on corrective actions. FMEA that stops at the risk table is a paperwork exercise; FMEA that drives design changes and monitoring is a risk-reduction engine.
A ThinkReliability technique that builds a cause-and-effect map without forcing linearity. You write the incident as a problem, then ask "why did this happen" and attach all causes that jointly produced it with AND/OR logic, recursing outward. The result resembles a fault tree but is built inductively from incident narrative rather than deductively from a top event.
A. Mark Doggett (2005, Quality Management Journal) published the first published framework for choosing an RCA technique based on problem characteristics. The decision matrix:
| Factor | 5 Whys | Fishbone | FTA | FMEA | Cause Map |
|---|---|---|---|---|---|
| Simple, linear | ✓ | ✓ | — | — | ✓ |
| Multi-factor | — | ✓ | ✓ | — | ✓ |
| Quantitative | — | — | ✓ | ✓ | — |
| Proactive (pre-failure) | — | — | ✓ | ✓ | — |
| Reactive (post-failure) | ✓ | ✓ | ✓ | — | ✓ |
| Brainstorming-friendly | ✓ | ✓ | — | — | ✓ |
| Evidence-linked | — | — | ✓ | ✓ | ✓ |
Use the framework as a sanity check before reaching for a familiar tool: "Is this problem actually shaped like a 5 Whys?"
The strongest classical RCA workflow chains techniques together:
1. Incident narrative → 5 Whys probes (rough causal chains)
2. Probes → Fishbone (structured contributing factors)
3. Fishbone → Cause Map (evidence-linked, multi-causal)
4. Critical components → FMEA (prospective prevention)
5. Safety-critical paths → FTA (quantified probabilities)
This chain upgrades rigor as evidence accumulates and never stops at a single technique.
Classical techniques assume the system is decomposable into components whose failure modes can be enumerated. Escalate to a different skill when:
rca-systems-theoretic (STAMP/STPA).rca-causal-inference (Pearl, Bayesian networks).rca-human-factors (HFACS, Just Culture).rca-distributed-systems.