Introduction &
Causal Concepts

Fundamental Epidemiological Concepts and Approaches

Kiffer G. Card, PhD, Faculty of Health Sciences, Simon Fraser University

Learning objectives for this lesson:

Trace the history of causal thinking in epidemiology
Understand component-cause and causal-web models
Describe the counterfactual concept for estimating causal effects
Explain how observational studies and experiments seek causal evidence
Distinguish inductive and deductive reasoning in science
Identify the key components of epidemiologic research
Apply causal criteria to evaluate associations

This course was developed by Kiffer G. Card, PhD, as a companion to Dohoo, I. R., Martin, S. W., & Stryhn, H. (2012). Methods in Epidemiologic Research. VER Inc.

Section 3

Seeking Causes & Models of Causation

⏱ Estimated reading time: 15 minutes

Learning Objectives

Define what constitutes a "cause" in epidemiology.
Explain the component-cause model including necessary, sufficient, and component causes.
Describe how causal complements affect the strength of association.
Understand the causal-web model and distinguish direct from indirect causes.

What Is a "Cause"?

For practical purposes in epidemiology, a cause is any factor that produces a change in the severity or frequency of an outcome. Some causes operate at the biological level within individuals (such as a specific microorganism), while others operate at the group or population level (such as lifestyle, nutrition, or weather).

Epidemiology deals with groups of individuals because the methods for determining causality require it. Researchers take a holistic approach, striving to study and measure every suspected causal factor for the outcome of interest — while recognizing that not every factor can be captured in a single study.

Pragmatic Focus

Epidemiologists prefer to identify causal factors that can be manipulated to prevent disease. But some non-manipulable factors (like genetic predisposition) may also be crucial for understanding disease patterns in populations.

The Component-Cause Model

This foundational model is based on the concepts of necessary and sufficient causes:

Necessary Cause ▼

A necessary cause is one without which the disease cannot occur. The factor will always be present if the disease occurs. For example, Mycobacterium tuberculosis is a necessary cause of tuberculosis — you cannot develop TB without the bacterium being present.

Sufficient Cause ▼

A sufficient cause is a set of conditions that, when present, will invariably produce the disease. In practice, very few single exposures are sufficient on their own. Instead, different groupings of factors combine to form sufficient causes.

Component Cause ▼

A component cause is one of a number of factors that, in combination, constitutes a sufficient cause. The factors might be present at the same time or follow one another in a temporal chain. When there are a number of causal chains with one or more factors in common, we can conceptualize the web of causal chains as a causal web.

Example: Childhood Respiratory Disease (CRD)

Consider four risk factors for CRD: the bacterium Streptococcus pneumoniae (STREP), a virus (RSV), environmental stressors like cold weather, and other bacteria like Mycoplasma pneumoniae (MP). Different two-factor combinations of these can form sufficient causes:

Component Causes	Sufficient Cause I	Sufficient Cause II	Sufficient Cause III	Sufficient Cause IV
STREP	+	+
RSV	+		+
Stressors		+	+	+
Other organism (MP)				+

Key Points from this Model

No single factor is a necessary cause of CRD (none appears in every sufficient cause). STREP is a component of 2 of the 4 sufficient causes. A child exposed to any complete combination will develop CRD. And critically, because the causal complements (the other factors in a sufficient cause) can vary in prevalence, the observed strength of association between an exposure like STREP and CRD can change even though the underlying causal mechanism has not changed.

Causal Complements and Strength of Association

A critical insight from the component-cause model is that the prevalence of causal complements — the other factors needed to complete a sufficient cause — directly affects the strength of association we observe between an exposure and outcome. Even when the causal mechanism stays the same, changes in the distribution of co-factors in the population can make the association appear stronger or weaker.

Worked Example: How Co-Factor Prevalence Matters

Imagine STREP requires RSV or Stressors as a co-factor to cause CRD. In Population A, where RSV prevalence is 30%, the risk ratio for STREP is 4.83. In Population B, where RSV prevalence rises to 70%, the risk ratio drops to 2.93 — even though the causal relationship between STREP and CRD has not changed at all.

The difference is due entirely to the change in the frequency of the co-factor RSV. This is why strength of association is not a fixed measure and is considered "population specific."

The Causal-Web Model

An alternative way to visualize how multiple factors combine to cause disease is the causal web, consisting of interconnected direct and indirect causal chains:

Direct (Proximal) Causes

A direct cause has no known intervening variable between it and the disease. Diagrammatically, the exposure is adjacent to the outcome. Examples often include specific microorganisms or toxins. However, in disease control, direct causes are not necessarily more valuable than indirect ones — many large-scale control efforts work by manipulating indirect rather than direct causes.

Indirect Causes

An indirect cause is one whose effects on the outcome are mediated through one or more intervening variables. For example, Stressors (cold weather) may make a child susceptible to STREP, RSV, and MP — so Stressors act as an indirect cause of CRD. Removing stress could reduce CRD even though stress itself is not a direct cause.

Implications of the Causal Web

The causal-web model complements the component-cause model but is not equivalent. It shows that we can control disease by preventing the action of direct causes (e.g., vaccination against RSV) or by removing indirect causes (e.g., reducing environmental stressors). The diagram also reveals gaps in our knowledge — apparent direct connections might actually reflect unmeasured intervening factors.

Proportion of Disease Explained

Using the concepts of necessary and sufficient causes, we can estimate the population attributable fraction (AF_p) — the proportion of disease in the population that is attributable to a given exposure. Because component causes can appear in multiple sufficient causes, the AF_p for all factors can sum to more than 100%. This is not an error; it reflects the reality of multicausal disease.

The Prevention Paradox

Even when a factor has a high AF_p (say a vaccine with AF_p = 50%), the benefit at the individual level may appear modest. If disease prevalence was 6%, universal vaccination would reduce it to 3%. While 94% of the vaccinated population would not have gotten the disease anyway, the 3% reduction is still a major population-level achievement. However, half of those who would have gotten sick will still get the disease despite being vaccinated. This creates a paradox: the average person may not perceive the same benefit that population-level data shows.

Key Takeaways

A cause in epidemiology is any factor that changes disease severity or frequency.
The component-cause model shows how different groupings of factors form sufficient causes, and why no single factor need be necessary for a disease.
The strength of association can vary between populations even when the underlying causal mechanism is unchanged, due to differences in the prevalence of causal complements.
The causal-web model distinguishes direct and indirect causes and guides study design and disease control strategies.
The population attributable fraction can exceed 100% because components are shared across multiple sufficient causes.

✦ Pass the knowledge check with 100% to continue

HSCI 341 — Lesson 1

Fundamental Epidemiological Concepts and Approaches

Introduction &Causal Concepts

Learning objectives for this lesson:

What Is Epidemiology?

Learning Objectives

Defining Epidemiology

Core Insight

A Brief History of Causal Thinking

Key Historical Milestones

Why the History Matters

Key Takeaways

Scientific Inference & Key Research Components

Learning Objectives

Why Scientific Inference Matters

Two Forms of Reasoning

Inductive Reasoning

Deductive Reasoning

Bayesian Thinking & Scientific Consensus

Key Components of Epidemiologic Research

The Central Goal

Key Takeaways

Seeking Causes & Models of Causation

Learning Objectives

What Is a "Cause"?

Pragmatic Focus

The Component-Cause Model

Example: Childhood Respiratory Disease (CRD)

Key Points from this Model

Causal Complements and Strength of Association

Worked Example: How Co-Factor Prevalence Matters

The Causal-Web Model

Direct (Proximal) Causes

Indirect Causes

Implications of the Causal Web

Proportion of Disease Explained

The Prevention Paradox

Key Takeaways

The Counterfactual Concept

Learning Objectives

What Is the Counterfactual?

The Thought Experiment

From Individuals to Populations

The Role of Randomization

Why Randomization Works

Confounding: A Threat to Causal Inference

Confounding in Action

Observational Studies and the Counterfactual

Reflection

Key Takeaways

Lesson Review & Final Assessment

Lesson Summary

Final Reflection

Final Knowledge Assessment

Lesson 1 Complete!

Introduction &
Causal Concepts