Design-Specific &
Temporal Biases

Evaluating Epidemiological Research

Learning objectives for this lesson:

Identify biases unique to randomized controlled trials, including placebo effects, Hawthorne effects, and contamination
Explain how inadequate allocation concealment and blinding inflate treatment effect estimates
Define immortal time bias and describe how misclassifying person-time produces spurious protective associations
Distinguish lead-time bias from genuine survival benefit in cancer screening evaluations
Recognize time-window bias in case-control studies of medication effects
Differentiate period effects from cohort effects in repeated cross-sectional data
Critically evaluate whether epidemiological studies have adequately addressed design-specific and temporal bias threats

This course was developed by Dr. Kiffer G. Card, Faculty of Health Sciences, Simon Fraser University.

Reference

Glossary: Key Terms, People & Concepts

📚 Reference page, available throughout the lesson

This glossary collects the key concepts, people, and ideas you will meet in this lesson. Use it as a reference while you work through the material, or as a review before assessments. Type in the search box to filter entries.

Trial Design Concepts

Randomization Random assignment of participants to treatment groups so that, in expectation, measured and unmeasured confounders are balanced. The defining feature of an RCT.

Allocation Concealment Procedures (sealed envelopes, central randomization) preventing those enrolling participants from knowing the next assignment. Distinct from blinding; protects against selection bias at entry.

Blinding (Masking) Concealing assignment from participants, providers, outcome assessors, or analysts to prevent expectations from influencing outcomes or reporting. Single, double, and triple blinding refer to who is masked.

Intention-to-Treat (ITT) Analyzing trial participants in the group to which they were randomized, regardless of adherence or crossover. Preserves the benefits of randomization at the cost of estimating effectiveness rather than efficacy.

Per-Protocol Analysis An analysis restricted to participants who adhered to the assigned intervention. Estimates a different parameter than ITT and is vulnerable to selection bias.

Person-Time The accumulated follow-up time contributed by participants, used as the denominator for incidence rates. Misallocating person-time across exposure categories produces immortal time bias.

Temporal Ambiguity Uncertainty about whether the exposure preceded the outcome, a fundamental weakness of cross-sectional studies and a precondition for many temporal biases.

Design-Specific & Temporal Biases

Placebo Effect A response to receiving an inert treatment driven by expectation, conditioning, and clinical context. Why trials require placebo or active comparator arms.

Hawthorne Effect A behaviour change driven simply by being observed or studied, independent of any specific intervention. Threatens both arms of a trial unless the comparison is well chosen.

Contamination When participants in the control arm receive elements of the intervention (or vice versa), pulling estimated treatment effects toward the null.

Compliance / Adherence Bias Systematic differences in adherence between trial arms that distort treatment effect estimates, particularly in per-protocol analyses.

Immortal Time Bias A bias in observational drug studies arising when person-time during which the outcome could not occur is misclassified as exposed. Produces spurious protective associations for treatments.

Lead-Time Bias Apparent improvement in survival in screened populations that arises only because disease is detected earlier, not because death is delayed. Survival is measured from diagnosis, so earlier diagnosis lengthens it artefactually.

Length-Biased Sampling Screening preferentially detects slower-progressing disease (which spends more time in a detectable preclinical phase), inflating apparent survival benefits of screening.

Prevalence–Incidence (Neyman) Bias A bias in cross-sectional or prevalent-case studies whereby rapidly fatal cases are underrepresented, distorting exposure–outcome estimates.

Healthy Worker Effect An apparent occupational health advantage caused by selection: people too unwell to work are excluded from worker cohorts, biasing comparisons with the general population toward the null.

Ascertainment / Detection Bias Differential effort to detect outcomes across exposure groups. Screened or closely monitored participants accumulate diagnoses faster, even if true incidence is identical.

Time-Window Bias A pharmacoepidemiology bias arising in case-control studies when controls have shorter exposure-eligible windows than cases, exaggerating apparent drug effects.

Period Effect A change in outcomes that affects all age groups simultaneously at a given calendar time (e.g., the launch of a new vaccine, an economic shock).

Cohort Effect A change in outcomes specific to people born or exposed in a particular era (e.g., the smoking-prevalence cohort of mid-20th-century men). Distinct from period and age effects but easily confused with them.

Key People

Samy Suissa Pharmacoepidemiologist whose work named and dissected immortal time bias and time-window bias in observational drug studies.

Archie Cochrane (1909–1988) Scottish epidemiologist whose advocacy for randomized trials and synthesis of evidence inspired the Cochrane Collaboration.

Sir Austin Bradford Hill (1897–1991) Designed the first modern double-blind RCT (streptomycin for tuberculosis, 1948) and articulated the “viewpoints” for causal inference still used today.

Miguel Hernán Epidemiologist whose target-trial framework operationalizes how observational pharmacoepidemiology can avoid immortal-time and other temporal biases.

No matching entries. Try a different search term.

Section 1 of 4

Randomized Trial Biases

⏱ Estimated reading time: 20 minutes

Section 1 of 4

Randomized Trial Biases

Allocation concealment, blinding, placebo and nocebo effects, the Hawthorne effect, adherence, and contamination.

The central question

How does study architecture create spurious associations?

Good randomization balances the groups at baseline. It does not protect against what happens after assignment: how people are observed, measured, and treated.

Randomized trials that inflate effects through failures of blinding and the psychology of being in a study
Observational drug studies that manufacture apparent survival benefits out of bookkeeping errors
Screening evaluations that confuse earlier detection with genuine benefit
Time-trend analyses caught by a constraint that is mathematical, not just a nuisance

Studies of studies

Allocation concealment and blinding

Flaw	Average inflation	Most affected outcome
No allocation concealment	30 to 40% exaggerated effect	All outcomes
No double blinding	15 to 25% exaggerated effect	Subjective (pain, function)
Both flaws	Up to 50% exaggerated effect	Subjective outcomes

Participant psychology

Placebo and nocebo effects

Placebo

20 to 40% improvement in placebo pain arms. Activates the body's own opioid pathways (Wager et al., 2004). In depression trials, about 80% of the drug arm's gain is reproduced in the placebo arm (Kirsch et al., 2008).

Nocebo

Negative expectations produce real adverse effects. Statin-trial participants warned about muscle pain report significantly more of it from identical placebo tablets (Gupta et al., 2017).

Both effects require placebo-controlled, double-blind designs to separate the pharmacology from the psychology.

Being watched changes behavior

The Hawthorne effect

People modify their behavior when they know they are being observed, regardless of whether they received an active intervention.

Srigley et al. (2014): hand hygiene compliance was about 70% under direct observation but only about 25% with covert electronic monitoring.

If the effect is equal across arms it biases toward the null. Unequal observation creates directional bias.

Adherence and contamination

Compliance bias and its mirror

Healthy adherer effect

Coronary Drug Project (1980): sticking to the placebo predicted 15% lower mortality. Adherence marks health-promoting behavior, not the pill. Per-protocol analyses are vulnerable; intention-to-treat is the safeguard.

Contamination

The comparison group receives elements of the intervention from outside the trial. The Community Intervention Trial for Smoking Cessation (1995): national anti-smoking campaigns reached control communities, diluting the contrast. Biases toward the null.

Carry forward

What to take into the next section

Effect inflation

Concealment failures and unblinded designs exaggerate effects by 30 to 50%. Placebo and nocebo effects require placebo control to isolate the drug.

Direction matters

Hawthorne and contamination bias toward the null. The healthy adherer effect and blinding failures bias away from the null. Knowing the direction helps you read the literature.

Introduction and Overview

Earlier lessons catalogued the three classical sources of bias one at a time: causal-specification, selection, and information bias. This lesson picks up where an earlier lesson left off: rather than introducing a fourth category, it works through biases that are specific to particular study designs or that arise from the way time is handled in the analysis. These usually combine elements from the three classical categories in characteristic ways. The three content sections move from RCT-specific biases (this section: allocation concealment, blinding, placebo, Hawthorne, adherence) to time-related biases that haunt observational pharmacoepidemiology and screening evaluation (a later section: immortal time bias, lead-time bias, length-biased sampling, overdiagnosis) to time-window and age–period–cohort effects that arise in any analysis of trends (a later section). A later lesson will then bring the lesson on confounding to a close, and a later lesson will integrate the entire course.

Learning Objectives

Explain why even properly randomized trials can yield biased effect estimates without adequate allocation concealment and blinding.
Distinguish performance, detection, and reporting biases that operate after randomization.
Describe how placebo effects, Hawthorne effects, contamination, and noncompliance distort intention-to-treat versus per-protocol estimates.
Use meta-epidemiologic evidence to predict the direction and magnitude of bias from specific RCT design flaws.

Biases in Randomized Controlled Trials

Randomized controlled trials (RCTs) are often described as the “gold standard” of causal inference. However, RCTs are not immune to bias. Design-specific biases can systematically distort treatment effect estimates even within a randomized framework. Understanding these biases is critical for interpreting trial results and for designing robust studies.

Key Concept: Why RCTs Can Still Be Biased

Randomization addresses confounding by balancing known and unknown prognostic factors between groups at baseline. However, biases can arise after randomization through inadequate allocation concealment, lack of blinding, differential compliance, contamination, or the psychological effects of being in a trial. These biases affect internal validity even when randomization itself is successful.

Allocation Concealment and Blinding

Meta-epidemiologic studies, that is, studies of studies, have documented that trials without adequate allocation concealment or blinding report systematically larger treatment effect sizes than trials with these safeguards. The Cochrane Collaboration has repeatedly confirmed these findings across hundreds of meta-analyses.

Case Study: Cochrane Meta-Epidemiologic Evidence

Schulz et al. (1995) analyzed 250 controlled trials from 33 meta-analyses and found that trials with inadequate allocation concealment yielded odds ratios that were exaggerated by an average of 30–40% compared to adequately concealed trials. Trials that were not double-blinded showed similar inflation. Wood et al. (2008) extended these findings across 146 meta-analyses, confirming that lack of blinding particularly inflated subjective outcome estimates (e.g., pain, functional status) while having less impact on objective outcomes like mortality.

Allocation concealment refers to procedures that prevent those enrolling participants from knowing upcoming treatment assignments. When concealment is inadequate, recruiters can selectively enroll sicker patients into the treatment arm or healthier patients into the control arm, or vice versa, introducing selection bias after randomization.

Adequate methods: Central telephone randomization, sequentially numbered sealed opaque envelopes, pharmacy-controlled allocation
Inadequate methods: Open random-number tables, unsealed envelopes, alternation by day of week or bed number

Blinding (masking) prevents participants, clinicians, or outcome assessors from knowing group assignment. Lack of blinding introduces several biases:

Performance bias: Clinicians may provide differential co-interventions to unblinded groups
Reporting bias: Participants may report outcomes differently based on perceived treatment allocation
Detection bias: Assessors may interpret ambiguous outcomes differently when they know group assignment

Empirical evidence of bias magnitude:

Methodological Flaw	Average Effect Inflation	Outcome Type Most Affected
Inadequate allocation concealment	30–40% exaggerated OR	All outcomes
Lack of double-blinding	15–25% exaggerated OR	Subjective outcomes (pain, function)
Both flaws combined	Up to 50% exaggerated OR	Subjective outcomes

Allocation concealment and blinding address mechanical and observer-driven biases. The next family of trial-specific biases comes from the participants themselves: from what they expect to feel, how being studied changes their behaviour, and whether they actually take the treatment they were assigned.

Placebo Effects

The placebo effect refers to measurable improvements in participants who receive an inert treatment, driven by expectation, conditioning, and the therapeutic context (catalogued among the classical biases by Sackett, 1979). Placebo effects are particularly pronounced in trials involving pain and depression, where substantial improvement is routinely observed in placebo arms.

Placebo in Pain TrialsClick to explore

Placebo in Depression TrialsClick to explore

Nocebo EffectsClick to explore

Hawthorne Effect

The Hawthorne effect (Wikipedia) describes the phenomenon in which participants modify their behavior simply because they know they are being observed or studied. This effect is named after the famous Western Electric studies in the 1920s–1930s, where factory workers increased productivity regardless of which workplace change was introduced, apparently because they were being monitored. McCambridge, Witton, & Elbourne (2014) systematically reviewed evidence for the effect and emphasized that “research participation effects” are heterogeneous and design-dependent.

Case Study: Hawthorne Effect in Hand Hygiene Studies

Srigley et al. (2014) measured hand hygiene compliance among healthcare workers using both direct observation (where workers knew they were being watched) and electronic monitoring (covert). Compliance rates were nearly three times higher when workers knew they were observed (estimated 70%+ vs. ~25% with electronic monitoring). This finding has profound implications for infection control studies: if the Hawthorne effect inflates compliance in all trial arms, the true baseline behavior is obscured, and interventions may appear less effective than they would be in unmonitored settings.

Compliance and Adherence Bias

Even in well-designed RCTs, not all participants adhere to their assigned treatment. Compliance bias arises when adherent participants differ systematically from non-adherent participants in ways that affect outcomes, a phenomenon sometimes called the “healthy adherer effect.”

Per-Protocol vs. Intention-to-Treat Analysis

Intention-to-treat (ITT) analysis includes all randomized participants in their assigned groups regardless of compliance. Per-protocol (PP) analysis includes only participants who adhered to the study protocol. PP analyses can introduce bias because compliant participants are systematically different from non-compliant ones (healthier, more motivated, higher socioeconomic status). The Coronary Drug Project Research Group (1980) demonstrated this dramatically: placebo-adherent patients had roughly half the five-year mortality of placebo non-adherent patients (about 15% versus 28%), confirming that adherence itself is a marker of overall health behavior.

The Healthy Adherer Effect

Simpson et al. (2006) conducted a meta-analysis showing that good adherence to placebo was associated with lower mortality (pooled OR = 0.56, 95% CI: 0.43–0.74). This means that adherence is a proxy for a constellation of health-promoting behaviors. Per-protocol analyses that compare “adherers to drug” vs. “all controls” conflate drug effects with the healthy adherer effect, inflating apparent treatment benefits.

Contamination in Community Trials

Contamination occurs when control group participants partially receive the intervention. This is particularly common in community intervention trials and pragmatic trials. For example, in the COMMIT (Community Intervention Trial for Smoking Cessation; COMMIT Research Group, 1995), control communities were exposed to national anti-smoking campaigns occurring simultaneously, which diluted the contrast between intervention and control and made the community-level intervention appear ineffective. Contamination biases results toward the null, reducing the apparent effect of the intervention.

Summary: RCT-Specific Biases

Bias	Mechanism	Likely Direction	Primary Safeguard
Allocation concealment failure	Selective enrollment post-randomization	Away from null	Centralized randomization
Lack of blinding	Differential co-interventions, reporting, detection	Away from null	Double-blind, placebo control
Placebo effect	Expectation and conditioning	Inflates control improvement	Placebo-controlled design
Hawthorne effect	Behavior change from observation	Toward null (if equal in arms)	Covert measurement when ethical
Healthy adherer effect	Adherence confounded with health behaviors	Away from null (PP analysis)	ITT analysis
Contamination	Controls exposed to intervention	Toward null	Cluster randomization, geographic separation

Section 2 of 4

Immortal Time & Lead-Time Bias

⏱ Estimated reading time: 20 minutes

Section 2 of 4

Immortal Time & Lead-Time Bias

Four biases that haunt drug studies and screening evaluation, all driven by how time enters the comparison.

The most common temporal bias

Immortal time bias: the logic

To be counted as exposed, a patient must survive until the first prescription. That pre-prescription window is immortal: no death can occur there without removing the patient from the exposed group.

Crediting that window to the exposed group inflates their apparent person-time and suppresses their apparent event rate. The result is a fake protective association for the drug.

Simulation

The R demonstration: bias out of thin air

Wrong analysis

Classify by ever-filled prescription. Mean survival: unexposed about 1.4 years, exposed about 4.3 years. Looks like a dramatic drug benefit. True drug effect: zero.

Right analysis

Split person-time at the prescription date. Unexposed rate about 0.337 per year, exposed rate about 0.343. Both sit at the simulated truth of one in three. The fake benefit disappears entirely.

The fix: time-dependent exposure

\[ \color{#1D4ED8}{\text{Rate}_{\text{unexposed}}} = \frac{\color{#C2410C}{\text{events before Rx}}}{\color{#6D28D9}{\text{person-time before Rx}}}, \quad \color{#0B7B6B}{\text{Rate}_{\text{exposed}}} = \frac{\color{#C2410C}{\text{events after Rx}}}{\color{#BE185D}{\text{person-time after Rx}}} \]

Rate_unexposed rate before the first prescriptionRate_exposed rate after the first prescriptionperson-time before Rx credited to unexposed, not exposedperson-time after Rx credited to exposed

Screening evaluation

Lead-time bias: the same death, moved earlier on the clock

Both patients die at 70. The 7-year “survival benefit” is lead time, not a real treatment gain. Mortality rates, not survival rates, detect the difference.

The speed-trap idea

Length-biased sampling and overdiagnosis

Length bias

Slow tumors with a long detectable window are more likely to be caught by a screening test. Screen-detected cancers are enriched for indolent disease. Their prognosis looks better because of which tumors were selected, not because of treatment.

Overdiagnosis

Tumors that would never have caused symptoms or death are found and treated. Welch & Black (2010): up to 60% of screen-detected prostate cancers and 15 to 25% of breast cancers. Real treatment burden, no survival benefit.

A single screen samples the long-window cases preferentially. Screen-detected cancers are therefore enriched for indolent disease, which is length bias.

Carry forward

What to take into the next section

Immortal time bias: pre-exposure survival time filed under the exposed group. Fix: time-dependent exposure or a landmark design.
Lead-time bias: earlier detection inflates survival from diagnosis without changing age at death. Fix: use mortality, not survival.
Length bias and overdiagnosis: screening loads the detected pool with indolent disease. Fix: randomized screening trials with mortality endpoints.

Introduction and Overview

An earlier section covered biases that arise inside the controlled environment of an RCT. This section turns to a family of biases that haunt the messier observational designs, especially pharmacoepidemiology and screening evaluations, where time is handled poorly. All four biases in this section share a common engine: a comparison group that has somehow been guaranteed extra survival or extra opportunity to be exposed, by virtue of how the data were assembled rather than anything about the underlying biology. We start with the most consequential of them in observational drug research.

Learning Objectives

Define immortal time bias and identify it in pharmacoepidemiologic study designs.
Apply time-dependent exposure classification and landmark designs to remove immortal time bias.
Distinguish lead-time bias, length-biased sampling, and overdiagnosis in screening studies, and explain why mortality is the only outcome immune to all three.
Critique an observational drug or screening study for the role of time-related biases in its reported effect.

Immortal Time Bias

Immortal time bias occurs when a period of follow-up during which the outcome cannot occur is misclassified or improperly handled in the analysis (Lévesque, Hanley, Kezouh, & Suissa, 2010). The term “immortal” refers to the fact that participants must survive (remain event-free) long enough to be classified as exposed. When this survival requirement is not properly accounted for, the exposed group appears to have artificially better outcomes.

▸ INTERACTIVE STORY: THE IMMORTAL TIME GHOST
Open full screen ↗

Walk through how a survival window before treatment can fake a treatment benefit. Next ▶ advances scenes.

A 6-scene visualization of immortal time: a patient timeline from diagnosis to death, the misclassified pre-treatment window, the spuriously divergent survival curves, the ghost of immortal time itself, and the time-varying-exposure fix.

Why “Immortal” Time?

Consider a study of statin use and mortality. To be classified as a “statin user,” a patient must survive long enough after hospital discharge to fill a prescription. The time between cohort entry and the first prescription is “immortal time”: by definition, the patient could not have died during this period and still been counted as exposed. If this time is credited to the exposed group (or excluded from the unexposed group), the result is a spurious survival advantage for statin users.

Case Study: Statins and Mortality After Acute Myocardial Infarction

Suissa (2008) demonstrated that several widely cited observational studies reporting dramatic mortality reductions from post-MI statin use (reductions of 25–50%) contained immortal time bias. These studies classified patients as “statin users” based on prescriptions filled after hospital discharge. Patients who died before filling a prescription were automatically classified as “non-users,” and the survival time before the first prescription was either misattributed to the exposed group or excluded entirely. When Suissa reanalyzed the data using time-dependent exposure classification, correctly assigning person-time before the first prescription to the unexposed period, the dramatic survival benefit was substantially reduced or eliminated.

R Immortal time bias in 30 lines of code

What you'll do: simulate a 2,000-patient cohort in which the drug has zero survival benefit, then run two analyses: one that ignores immortal time and one that handles it correctly. What to take away: the dramatic mortality reduction reported in some early observational drug studies is exactly the kind of artefact this code reproduces, and the fix is bookkeeping rather than biology.

Build a cohort where statins truly have no survival benefit, then misclassify person-time the wrong way. Watch a fake survival benefit appear out of nothing. Because the simulated death rate is set to 1/3 per year, the true average survival is about three years, and a correct analysis should recover a death rate close to 1/3 in each group.

set.seed(230)
N <- 2000

# Each patient survives Exp(rate=1/3) years; about 60% eventually fill a Rx.
# If they die before filling, they go in the unexposed group (no benefit).
death_t <- rexp(N, rate = 1/3)
rx_t    <- rexp(N, rate = 1/2)
ever_rx <- rx_t < death_t

## (1) WRONG analysis: classify by ever_rx, ignore immortal time
mean_surv_wrong <- tapply(death_t, ever_rx, mean)
mean_surv_wrong   # exposed group looks much better

## (2) RIGHT analysis: split person-time at the prescription date
unexp_pt <- pmin(death_t, rx_t)              # every pt's unexposed person-time
exp_pt   <- pmax(death_t - rx_t, 0) * ever_rx # only after Rx, only if they got it

unexp_events <- sum(!ever_rx)
exp_events   <- sum(ever_rx)
c(unexp_rate = unexp_events / sum(unexp_pt),
  exp_rate   = exp_events   / sum(exp_pt))

Console output

FALSE TRUE 1.43 4.27 # WRONG: huge fake "benefit" unexp_rate exp_rate 0.337 0.343 # RIGHT: rates almost identical

The bias is engineered, not measured. Wrong person-time bookkeeping, not a real treatment effect, produced the apparent benefit. Time-dependent exposure assignment is the fix.

R Reflect on what you just ran

Use the questions below to interpret the output you produced. Look at your console before answering.

1. In the WRONG analysis, mean survival times printed as ~1.43 for the unexposed (FALSE) and ~4.27 for the exposed (TRUE). The simulation built in zero treatment effect, so where did the apparent 3-year survival advantage come from? Be specific about which person-time is being miscounted.

Model answerThe 3-year ‘survival advantage’ comes from the immortal person-time between cohort entry and treatment initiation being credited to the exposed group. In the WRONG analysis, anyone who would later become exposed contributes their pre-exposure survival to the exposed column, but by definition they cannot die during that window without losing the exposed label, so that pre-exposure time is guaranteed event-free. The mean survival time of 4.27 (exposed) reflects truncation at the right end (people who die before being treated never get counted as exposed) plus left-counting of pre-exposure time. The lesson: any analysis where exposure status is assigned with information that occurs after time-zero will manufacture an apparent benefit out of thin air.

2. In the RIGHT analysis, the unexposed rate (0.337) and exposed rate (0.343) are nearly identical. Why is the rate of ~0.33 close to the simulated true death-rate parameter 1/3, and what does that confirm about the time-dependent classification you implemented?

Model answerThe simulation generated death times from rexp(n, rate = 1/3), so the true death rate is 1/3 per unit time ≈ 0.333. Both the corrected exposed rate (0.343) and unexposed rate (0.337) sit right at that value, confirming that the time-dependent classification, assigning person-time to unexposed until the moment of treatment, and to exposed only afterward, has eliminated the artefact. The agreement of both groups with the simulated truth is the cleanest test that the analytic fix works; the IRR is 1.02 instead of 3.0.

3. Suissa's reanalysis of post-MI statin studies showed the same pattern. If you were peer-reviewing an observational drug study reporting a 25-50% mortality reduction, what specific feature of the methods section would you check first, based on what this simulation showed?

Model answerFirst check in the methods: the time-zero alignment, specifically how exposed and unexposed person-time accrues. Look for any of (a) classifying patients by ever-use rather than time-varying exposure; (b) defining exposure based on prescriptions filled during follow-up but counting follow-up from cohort entry; (c) lack of a clear ‘new-user’ design; (d) absence of risk-set sampling for nested case-control. The Suissa diagnostic is: if patients had to survive long enough to fill a prescription before being labelled exposed, the analysis has built in immortal time, and the reported 25–50% mortality reduction is partly or entirely artefactual.

Saved.

Immortal time bias introduces a systematic distortion through three related mechanisms:

Survival requirement: Exposed participants must survive long enough to receive the exposure (e.g., fill a prescription, receive a transplant, attend a rehabilitation program)
Time misclassification: The pre-exposure survival time is either credited to the exposed group (inflating their person-time denominator) or excluded from analysis
Asymmetric comparison: The unexposed group includes all person-time from cohort entry, including deaths that occurred before exposed participants had the opportunity to be classified

The result is a comparison between a group with guaranteed survival during one period vs. a group with no such guarantee, producing a spurious protective effect.

Nobel Prize longevity study: A widely reported analysis suggested Nobel Prize winners live longer than nominees. However, the “exposure” (winning the prize) requires surviving to the date of the award. Nominees who died before the award ceremony contribute person-time only to the “non-winner” group. Correcting for immortal time eliminated the longevity advantage.

Metformin and cancer: Early observational studies reported that metformin users had dramatically reduced cancer risk compared to users of other diabetes drugs. Suissa and Azoulay (2012) showed that time-related biases, including immortal time bias, explained much of this apparent benefit.

Correcting immortal time bias:

Time-dependent exposure classification: Model exposure as a time-varying covariate, so person-time before the first prescription is correctly assigned to the unexposed period
Landmark analysis: Define a fixed time point after cohort entry and classify exposure status at that landmark; exclude patients who die or are lost before the landmark
Nested case-control or case-crossover designs: Match on time at risk to ensure comparable exposure windows
Target trial emulation: Design the analysis to mimic a hypothetical RCT, specifying time zero, eligibility, and treatment assignment simultaneously (Hernán & Robins, 2016)

Immortal time bias is the most common time-related bias in observational drug studies. The closely related family of biases that affect the evaluation of screening programs share the same fundamental issue, the way time enters the comparison, but they manifest differently. The next three subsections take them in turn.

Lead-Time Bias

Lead-time bias occurs when screening or early detection appears to improve survival simply because the diagnosis is made earlier, without actually changing the time of death. The “lead time” is the interval between screen detection and the time the disease would have been diagnosed clinically. If survival is measured from diagnosis, screened patients will appear to live longer even if screening does not change mortality.

Key Concept: Lead-Time Bias Illustrated

Imagine two identical patients with the same cancer developing at age 55 and causing death at age 70. Patient A is diagnosed by screening at age 58 (12-year survival from diagnosis). Patient B is diagnosed clinically at age 65 (5-year survival from diagnosis). Screening “added” 7 years of survival, but both patients lived to exactly 70. The 7-year difference is entirely lead time, not a genuine survival benefit. Only mortality-based outcomes (not survival from diagnosis) can distinguish true benefit from lead-time bias.

Case Study: Prostate Cancer Screening with PSA

Following the introduction of PSA (prostate-specific antigen) screening in the late 1980s, 5-year survival rates for prostate cancer in the U.S. rose from approximately 75% to over 99%. However, prostate cancer mortality rates declined only modestly. Etzioni et al. (2002) estimated that lead-time bias accounted for the majority of the apparent survival improvement. Many screen-detected prostate cancers are slow-growing tumors that would never have caused symptoms or death (overdiagnosis), further inflating apparent survival statistics.

Lead-time bias inflates apparent survival without changing the time of death. The next bias goes one step further: it changes which tumors are even available to be detected.

Length-Biased Sampling

Length-biased sampling (or length bias) occurs when screening programs preferentially detect slower-growing, less aggressive tumors. Because slow-growing tumors have a longer detectable preclinical phase, they are more likely to be present during a screening examination. Fast-growing tumors, which have a shorter preclinical window, are more likely to present as “interval cancers” between screenings.

Length Bias MechanismClick to explore

Breast Cancer ExampleClick to explore

OverdiagnosisClick to explore

Summary: Temporal Biases in Screening and Pharmacoepidemiology

Bias	Setting	Mechanism	Correction Strategy
Immortal time bias	Drug/exposure studies	Pre-exposure survival time misclassified	Time-dependent analysis, landmark analysis
Lead-time bias	Screening evaluation	Earlier diagnosis inflates survival from diagnosis	Use mortality, not survival, as endpoint
Length-biased sampling	Screening evaluation	Screening detects slower-growing tumors	Randomized screening trials, mortality endpoints
Overdiagnosis	Screening programs	Detection of nonprogressive disease	Compare incidence in screened vs. unscreened populations

Reflection

A pharmaceutical company publishes an observational study showing that patients who start their new cholesterol medication within 30 days of a heart attack have 45% lower mortality than those who do not. The study classifies patients based on whether they filled a prescription within 30 days. Identify at least two temporal biases that could explain this finding, and propose an analytical approach that would produce a less biased estimate.

Model answerTwo temporal biases drive the 45% mortality reduction. (1) Immortal time bias: classifying patients by whether they filled a prescription within 30 days means anyone who died between days 0–30 without filling cannot be labelled exposed; the exposed group is guaranteed to have lived at least until the day of the fill, while the unexposed group includes those early deaths. (2) Healthy-adherer / depletion-of-susceptibles bias: patients who actually fill a prescription within 30 days are systematically healthier and more engaged with care than those who don't; pharmacy fill is a marker of capacity rather than of intent-to-treat. Less-biased analytic approach: target-trial emulation with a new-user, active-comparator design, time-zero aligned at the day of the first prescription, comparing new initiators of the new drug to new initiators of an established cholesterol drug. Cox model with proportional hazards on time since initiation. Where feasible, supplement with an instrumental-variable analysis using prescriber preference.

✓ Reflection saved

Section 3 of 4

Time-Window, Period & Cohort Effects

⏱ Estimated reading time: 15 minutes

Section 3 of 4

Time-Window, Period & Cohort Effects

Unequal exposure windows in case-control studies, and the age-period-cohort identification problem in time-trend analysis.

Case-control designs

Time-window bias: unequal opportunity

Cases who died had shorter pharmacy histories than surviving controls. Controls accumulated more prescriptions simply by having more time to fill them.

Suissa et al. (2006): apparent statin protection against cancer disappeared when the exposure windows were equalized.

Fix: incidence density sampling; fixed exposure windows identical for cases and controls.

The mathematical constraint

Age-period-cohort identification problem

The collinearity identity

\[ \color{#0B7B6B}{\text{Cohort}} = \color{#C2410C}{\text{Period}} - \color{#6D28D9}{\text{Age}} \]

Cohort birth yearPeriod calendar year of observationAge age at observation

This identity is exact. Any model with all three as straight-line terms has no unique solution. Researchers must impose a constraint: fix one effect, use curved terms, or bring in outside information.

Period effect

Anti-smoking campaigns affect all birth cohorts in a given calendar year. Rates drop across all ages at once.

Cohort effect

People born after 1970 grew up with smoking already stigmatized. Their lifetime peak was lower at every age, not just after campaigns began.

Applied example

Lung cancer trends: period or cohort?

Men's rates decline; women's rise. A period explanation needs campaigns to affect the sexes differently. A cohort explanation needs only that men and women took up smoking in different eras, with cancer lagging exposure by 30 to 40 years.

Carry forward

What to take into the Final Assessment

Time-window bias: unequal exposure opportunity between cases and controls manufactures spurious associations. Align the observation windows.
Age-period-cohort collinearity: Cohort = Period − Age is an exact identity. The three effects cannot be estimated as straight lines all at once. Theory and constraints are required.
Both problems are matters of design and logic, not data problems that more observations can solve.

Introduction and Overview

Earlier sections covered biases tied to specific designs (RCTs and observational pharmacoepidemiology / screening). This section covers two final time-related issues that can affect almost any observational analysis: time-window bias in case-control designs (a close cousin of immortal time bias from an earlier section) and the age-period-cohort identification problem that arises whenever we try to interpret time trends in disease rates.

Learning Objectives

Recognize time-window bias in case-control studies where exposure ascertainment periods differ between cases and controls.
Distinguish age, period, and cohort effects and explain why the three cannot be separately identified without additional constraints.
Interpret a time-trend graph using birth-cohort versus calendar-period framings, and articulate the assumptions each framing imposes.
Propose design or analytic strategies (matching on exposure-window length, restriction, theory-driven APC constraints) to address temporal biases.

Time-Window Bias

Time-window bias occurs in case-control studies when the exposure opportunity period differs between cases and controls. If cases have a systematically shorter (or longer) time window during which exposure could be ascertained, comparisons of exposure prevalence will be distorted.

Key Concept: Time-Window Bias

In a case-control study of medication use and an acute outcome (e.g., myocardial infarction), cases who die shortly after the event have a truncated exposure window compared to surviving controls. If exposure is defined as “any use in the prior year,” controls have a full year to accumulate medication use while cases who die within months may not. Alternatively, cases with longer pre-event periods may accumulate more exposure. The mismatch in observation time creates differential exposure opportunity, biasing the association.

Case Study: Statins and Cancer in Case-Control Studies

Suissa et al. (2011) showed that several case-control studies reporting protective effects of statins on cancer risk were affected by time-window bias. Controls were matched to cases on calendar date but had systematically longer exposure opportunity periods. Because statin use increased rapidly over time, controls, who were observed over longer and more recent periods, had a higher probability of statin exposure than cases. This differential created an artificial protective association. When exposure windows were properly aligned between cases and controls, the protective effect was substantially attenuated or disappeared.

How Time-Window Bias Differs from Immortal Time Bias

Both biases involve time-related distortions, but they operate through different mechanisms:

Immortal time bias occurs in cohort studies when pre-exposure person-time is misclassified, giving the exposed group a guaranteed survival advantage
Time-window bias occurs in case-control studies when cases and controls have unequal opportunities to be classified as exposed, biasing exposure prevalence comparisons

Both can produce spurious associations, but the correction strategies differ: immortal time bias requires time-dependent analysis, while time-window bias requires matching or restricting on exposure opportunity period.

Correcting Time-Window Bias

Strategies for addressing time-window bias include:

Match on exposure opportunity: Ensure cases and controls have the same length of observation time for exposure assessment
Use incidence density sampling: Select controls from the risk set at the time of each case’s event, ensuring comparable exposure windows
Define fixed exposure windows: Restrict exposure assessment to a fixed period (e.g., 90 days before the index date) that is identical for cases and controls
Sensitivity analyses: Vary the exposure window length and assess stability of results

Time-window bias is about the design of a single study. The last topic of the lesson zooms out to time-trend analyses themselves, such as surveillance data and cross-sectional surveys repeated across decades, and the famous identification problem that arises whenever we try to attribute change to a particular cause.

Period and Cohort Effects

When analyzing trends in disease rates or health behaviors over time, it is essential to distinguish between age effects, period effects, and cohort effects, the classic age-period-cohort (APC) problem.

Age EffectsClick to explore

Period EffectsClick to explore

Cohort EffectsClick to explore

Case Study: Smoking Prevalence Trends, Period vs. Cohort Effects

Repeated cross-sectional surveys of smoking in the U.S. and U.K. show that overall smoking prevalence has declined steadily since the 1960s. However, examining trends by birth cohort reveals a more complex picture. Cohorts born in the 1920s–1940s had peak smoking rates above 50% among men and experienced gradual declines as anti-smoking campaigns took effect (a period effect). In contrast, cohorts born after 1970 never reached such high peak prevalence, reflecting a cohort effect of growing up in an environment where smoking was increasingly stigmatized and regulated. Failing to distinguish these effects leads to incorrect projections: a simple period model might predict convergence of all cohorts toward the same low rate, while a cohort model reveals that some older cohorts will maintain elevated rates until they exit the population through mortality.

The APC Identification Problem

Age, period, and cohort are mathematically collinear: Cohort = Period − Age. This means that given any two of these factors, the third is determined. As a concrete case, a person observed in 2020 at age 60 must have been born in 1960, so once two of these coordinates are fixed the third has no room left to vary on its own. This linear dependency creates a fundamental identification problem: it is impossible to simultaneously estimate all three effects without making additional assumptions or imposing constraints. Researchers must use theory, external data, or constrained models to disentangle these intertwined effects.

Summary: Time-Related Study Design Biases

Bias / Effect	Study Design	Core Issue	Solution
Time-window bias	Case-control	Unequal exposure opportunity	Match on observation time, incidence density sampling
Period effect	Cross-sectional, ecological	Calendar-time changes affecting all ages	Age-period-cohort modeling
Cohort effect	Cross-sectional, ecological	Birth-year-specific risk trajectories	Age-period-cohort modeling
APC confounding	Any time-trend analysis	Mathematical collinearity of age, period, cohort	Theory-driven constraints, external validation

Reflection

A researcher examines lung cancer rates over time and observes declining rates in men but rising rates in women. They attribute this to a period effect of anti-smoking campaigns being more effective for men. Using your knowledge of cohort effects, propose an alternative explanation and describe what data analysis you would need to distinguish between these two interpretations.

Model answerThe pattern is more naturally explained by cohort effects than a period effect of campaigns. Smoking uptake in women rose dramatically through the mid-20th century (post-WWII marketing) while uptake in men peaked earlier and has been declining for decades; lung-cancer rates lag uptake by 30–40 years, so women born 1940–60 are now in the high-risk window while men born 1900–30 are aging out of it. To distinguish period from cohort effects, fit an age-period-cohort (APC) model on age-specific incidence: if the male decline and female rise are both concentrated in birth cohorts (parallel curves across periods for a fixed birth-year, with different cohort-specific intercepts), the cohort interpretation wins. Supplement with historical smoking-prevalence data by birth cohort to verify the lag. Standard problem: APC models suffer from identifiability between the three time scales; resolve by constraining one (typically dropping the linear period trend) and reporting sensitivity across constraint choices.

✓ Reflection saved

Section 4 of 4

Final Assessment

⏱ Estimated time: 20 minutes

Bringing It All Together

This lesson covered design-specific and temporal biases in three groups: the biases that operate inside RCTs even when randomization succeeds (an earlier section), the biases that haunt observational pharmacoepidemiology and screening evaluations (an earlier section), and the biases that arise from how time itself is handled in case-control comparisons and time-trend analyses (an earlier section). Each bias here typically combines elements from the three classical categories of earlier lessons. For example, immortal time bias is partly a measurement problem (how exposure status is assigned over time) and partly a selection problem (who survives long enough to be classified as exposed).

The unifying thread is that design choices and the way time enters the analysis can manufacture associations that look causal but are not. The corresponding repairs are also tied to design: blinding and allocation concealment for trials; time-dependent exposure classification and landmark designs for pharmacoepidemiology; mortality endpoints for screening evaluation; matched exposure windows and theory-driven constraints for time-trend analyses. The final reflection asks you to put the full inventory to work on a single hypothetical screening claim; the assessment then tests the conceptual material across all three sections before a later lesson closes the loop on confounding and a later lesson integrates the entire course.

Key Takeaways from this lesson

RCT biases: Inadequate allocation concealment and blinding inflate effect estimates by 30–50%. Placebo effects, Hawthorne effects, compliance bias, and contamination further distort trial results.
Immortal time bias: Misclassifying pre-exposure survival time creates spurious protective associations in pharmacoepidemiologic studies. Time-dependent analysis and landmark designs correct this bias.
Lead-time bias: Earlier detection through screening inflates survival time from diagnosis without necessarily reducing mortality. Only mortality-based endpoints avoid this artifact.
Length-biased sampling: Screening preferentially detects indolent tumors, making screen-detected cancers appear to have better prognosis regardless of treatment.
Time-window bias: Unequal exposure ascertainment periods in case-control studies create differential exposure opportunity, biasing association estimates.
Period vs. cohort effects: Changes in disease rates over time may reflect calendar-period influences on all ages or birth-cohort-specific risk trajectories. The APC identification problem requires theory-driven constraints to disentangle these effects.

R Activity: Simulating Immortal Time Bias

The companion R script r-activities/HSCI_230_Lesson_10_Design_Specific_and_Temporal_Biases.R constructs a cohort in which a prescription has no real effect on survival, then contrasts a naive analysis (classify everyone by whether they ever filled the Rx, and compare mean survival) against a correct analysis that splits person-time at the prescription date. The wrong analysis manufactures a large survival advantage out of thin air; the right analysis recovers the truth that the two rates are essentially equal.

set.seed(230)
N <- 2000

# Each patient survives Exp(rate=1/3) years; about 60% eventually fill a Rx.
# If they die before filling, they end up classified as unexposed.
death_t <- rexp(N, rate = 1/3)
rx_t    <- rexp(N, rate = 1/2)
ever_rx <- rx_t < death_t

## (1) WRONG analysis: classify by ever_rx, ignore immortal time
mean_surv_wrong <- tapply(death_t, ever_rx, mean)
mean_surv_wrong   # exposed group looks much better

## (2) RIGHT analysis: split person-time at the prescription date
unexp_pt <- pmin(death_t, rx_t)              # unexposed person-time
exp_pt   <- pmax(death_t - rx_t, 0) * ever_rx # exposed person-time

unexp_events <- sum(!ever_rx)
exp_events   <- sum( ever_rx)
c(unexp_rate = unexp_events / sum(unexp_pt),
  exp_rate   = exp_events   / sum(exp_pt))

Reflection

Consider the following scenario: A national screening program reports that 5-year survival for a particular cancer has improved from 50% to 85% over 15 years, and attributes this improvement to the screening program. Drawing on everything you have learned in this lesson about design-specific and temporal biases, identify all the biases that could contribute to this apparent improvement, explain how each one operates, and describe what evidence you would need to determine whether the screening program truly reduced mortality.

Model answerMultiple biases can inflate 5-y survival even when screening adds no real benefit. (a) Lead-time bias: screening detects cancers earlier in their natural history, so the clock for 5-y survival starts sooner; survival is longer simply because the diagnosis date moved earlier, not because the date of death did. (b) Length-time bias: screening preferentially picks up slow-growing, indolent cancers (they spend more time in the detectable preclinical phase), so the screen-detected pool is enriched for tumours that would have had good prognosis anyway. (c) Overdiagnosis: subclinical cancers that would never have caused symptoms are now ‘detected’ and counted as cured, the extreme of length-time bias. (d) Stage-migration / Will Rogers effect (Feinstein, Sosin, & Wells, 1985): improved imaging upstages borderline tumours, making both stage-specific groups look better on average. (e) Improvements in treatment over 15 years that have nothing to do with screening (better surgery, targeted therapy) inflate survival regardless of detection mode. The decisive evidence is mortality, not survival: a properly designed randomised screening trial (or a population-level interrupted time-series with mortality endpoints in screened vs. unscreened populations) is what distinguishes a real effect from artefact. Without mortality data and a non-screened comparator, the 50→85% figure is uninformative about whether the programme works.

✓ Reflection saved

Final Knowledge Assessment

This 15-question assessment covers all sections of this lesson. You must score 100% to complete the lesson. Review the explanations for any questions you miss, then retry.

This lesson Complete!

Congratulations! You have successfully completed the lesson on Design-Specific and Temporal Biases. Your responses have been downloaded automatically.

A later lesson, Confounding and Statistical Inference, finishes the bias material. Confounding is the third leg of the canonical bias triad alongside selection (an earlier lesson) and information (an earlier lesson), and the lesson also brings statistical inference back into focus: sampling distributions, confidence intervals, p-values, and the difference between random error and systematic error. By the time you reach a later lesson's integrated appraisal, the full toolkit of this course will be in place.

HSCI 230, Lesson 10

Evaluating Epidemiological Research

Design-Specific &Temporal Biases

Learning objectives for this lesson:

Glossary: Key Terms, People & Concepts

Randomized Trial Biases

Randomized Trial Biases

How does study architecture create spurious associations?

Allocation concealment and blinding

Placebo and nocebo effects

Placebo

Nocebo

The Hawthorne effect

Compliance bias and its mirror

Healthy adherer effect

Contamination

What to take into the next section

Effect inflation

Direction matters

Introduction and Overview

Learning Objectives

Biases in Randomized Controlled Trials

Key Concept: Why RCTs Can Still Be Biased

Allocation Concealment and Blinding

Placebo Effects

Hawthorne Effect

Compliance and Adherence Bias

Summary: RCT-Specific Biases

Immortal Time & Lead-Time Bias

Immortal Time & Lead-Time Bias

Immortal time bias: the logic

The R demonstration: bias out of thin air

Wrong analysis

Right analysis

Lead-time bias: the same death, moved earlier on the clock

Length-biased sampling and overdiagnosis

Length bias

Overdiagnosis

What to take into the next section

Introduction and Overview

Learning Objectives

Immortal Time Bias

Why “Immortal” Time?

R Reflect on what you just ran

Lead-Time Bias

Key Concept: Lead-Time Bias Illustrated

Length-Biased Sampling

Summary: Temporal Biases in Screening and Pharmacoepidemiology

Reflection

Time-Window, Period & Cohort Effects

Time-Window, Period & Cohort Effects

Time-window bias: unequal opportunity

Age-period-cohort identification problem

Period effect

Cohort effect

Lung cancer trends: period or cohort?

What to take into the Final Assessment

Introduction and Overview

Learning Objectives

Time-Window Bias

Key Concept: Time-Window Bias

Period and Cohort Effects

The APC Identification Problem

Summary: Time-Related Study Design Biases

Reflection

Final Assessment

Bringing It All Together

Key Takeaways from this lesson

Reflection

Final Knowledge Assessment

This lesson Complete!

Design-Specific &
Temporal Biases