HSCI 230 — Lesson 10

Design-Specific &
Temporal Biases

Evaluating Epidemiological Research — HSCI 230

Dr. Kiffer G. Card, Faculty of Health Sciences, Simon Fraser University

Learning objectives for this lesson:

  • Identify biases unique to randomized controlled trials, including placebo effects, Hawthorne effects, and contamination
  • Explain how inadequate allocation concealment and blinding inflate treatment effect estimates
  • Define immortal time bias and describe how misclassifying person-time produces spurious protective associations
  • Distinguish lead-time bias from genuine survival benefit in cancer screening evaluations
  • Recognize time-window bias in case-control studies of medication effects
  • Differentiate period effects from cohort effects in repeated cross-sectional data
  • Critically evaluate whether epidemiological studies have adequately addressed design-specific and temporal bias threats
Reference

Glossary — Key Terms, People & Concepts

📚 Reference page — available throughout the lesson

This glossary collects the key concepts, people, and ideas you will meet in this lesson. Use it as a reference while you work through the material, or as a review before assessments. Type in the search box to filter entries.

Trial Design Concepts
Randomization Random assignment of participants to treatment groups so that, in expectation, measured and unmeasured confounders are balanced. The defining feature of an RCT.
Allocation Concealment Procedures (sealed envelopes, central randomization) preventing those enrolling participants from knowing the next assignment. Distinct from blinding; protects against selection bias at entry.
Blinding (Masking) Concealing assignment from participants, providers, outcome assessors, or analysts to prevent expectations from influencing outcomes or reporting. Single, double, and triple blinding refer to who is masked.
Intention-to-Treat (ITT) Analyzing trial participants in the group to which they were randomized, regardless of adherence or crossover. Preserves the benefits of randomization at the cost of estimating effectiveness rather than efficacy.
Per-Protocol Analysis An analysis restricted to participants who adhered to the assigned intervention. Estimates a different parameter than ITT and is vulnerable to selection bias.
Person-Time The accumulated follow-up time contributed by participants, used as the denominator for incidence rates. Misallocating person-time across exposure categories produces immortal time bias.
Temporal Ambiguity Uncertainty about whether the exposure preceded the outcome—a fundamental weakness of cross-sectional studies and a precondition for many temporal biases.
Design-Specific & Temporal Biases
Placebo Effect A response to receiving an inert treatment driven by expectation, conditioning, and clinical context. Why trials require placebo or active comparator arms.
Hawthorne Effect A behaviour change driven simply by being observed or studied, independent of any specific intervention. Threatens both arms of a trial unless the comparison is well chosen.
Contamination When participants in the control arm receive elements of the intervention (or vice versa), pulling estimated treatment effects toward the null.
Compliance / Adherence Bias Systematic differences in adherence between trial arms that distort treatment effect estimates, particularly in per-protocol analyses.
Immortal Time Bias A bias in observational drug studies arising when person-time during which the outcome could not occur is misclassified as exposed. Produces spurious protective associations for treatments.
Lead-Time Bias Apparent improvement in survival in screened populations that arises only because disease is detected earlier—not because death is delayed. Survival is measured from diagnosis, so earlier diagnosis lengthens it artefactually.
Length-Biased Sampling Screening preferentially detects slower-progressing disease (which spends more time in a detectable preclinical phase), inflating apparent survival benefits of screening.
Prevalence–Incidence (Neyman) Bias A bias in cross-sectional or prevalent-case studies whereby rapidly fatal cases are underrepresented, distorting exposure–outcome estimates.
Healthy Worker Effect An apparent occupational health advantage caused by selection: people too unwell to work are excluded from worker cohorts, biasing comparisons with the general population toward the null.
Ascertainment / Detection Bias Differential effort to detect outcomes across exposure groups. Screened or closely monitored participants accumulate diagnoses faster, even if true incidence is identical.
Time-Window Bias A pharmacoepidemiology bias arising in case-control studies when controls have shorter exposure-eligible windows than cases, exaggerating apparent drug effects.
Period Effect A change in outcomes that affects all age groups simultaneously at a given calendar time (e.g., the launch of a new vaccine, an economic shock).
Cohort Effect A change in outcomes specific to people born or exposed in a particular era (e.g., the smoking-prevalence cohort of mid-20th-century men). Distinct from period and age effects but easily confused with them.
Key People
Samy Suissa Pharmacoepidemiologist whose work named and dissected immortal time bias and time-window bias in observational drug studies.
Archie Cochrane (1909–1988) Scottish epidemiologist whose advocacy for randomized trials and synthesis of evidence inspired the Cochrane Collaboration.
Sir Austin Bradford Hill (1897–1991) Designed the first modern double-blind RCT (streptomycin for tuberculosis, 1948) and articulated the “viewpoints” for causal inference still used today.
Miguel Hernán Epidemiologist whose target-trial framework operationalizes how observational pharmacoepidemiology can avoid immortal-time and other temporal biases.
No matching entries. Try a different search term.
Section 1 of 4

Randomized Trial Biases

⏱ Estimated reading time: 20 minutes

Introduction and Overview

Lessons 7–9 catalogued the three classical sources of bias one at a time — causal-specification, selection, and information bias. Lesson 10 picks up where Lesson 9 left off: rather than introducing a fourth category, it works through biases that are specific to particular study designs or that arise from the way time is handled in the analysis. These usually combine elements from the three classical categories in characteristic ways. The three content sections move from RCT-specific biases (Section 1: allocation concealment, blinding, placebo, Hawthorne, adherence) to time-related biases that haunt observational pharmacoepidemiology and screening evaluation (Section 2: immortal time bias, lead-time bias, length-biased sampling, overdiagnosis) to time-window and age–period–cohort effects that arise in any analysis of trends (Section 3). Lesson 11 will then bring the lesson on confounding to a close, and Lesson 12 will integrate the entire course.

Learning Objectives

  • Explain why even properly randomized trials can yield biased effect estimates without adequate allocation concealment and blinding.
  • Distinguish performance, detection, and reporting biases that operate after randomization.
  • Describe how placebo effects, Hawthorne effects, contamination, and noncompliance distort intention-to-treat versus per-protocol estimates.
  • Use meta-epidemiologic evidence to predict the direction and magnitude of bias from specific RCT design flaws.

Biases in Randomized Controlled Trials

Randomized controlled trials (RCTs) are often described as the “gold standard” of causal inference. However, RCTs are not immune to bias. Design-specific biases can systematically distort treatment effect estimates even within a randomized framework. Understanding these biases is critical for interpreting trial results and for designing robust studies.

Key Concept: Why RCTs Can Still Be Biased

Randomization addresses confounding by balancing known and unknown prognostic factors between groups at baseline. However, biases can arise after randomization through inadequate allocation concealment, lack of blinding, differential compliance, contamination, or the psychological effects of being in a trial. These biases affect internal validity even when randomization itself is successful.

Allocation Concealment and Blinding

Meta-epidemiologic studies—studies of studies—have documented that trials without adequate allocation concealment or blinding report systematically larger treatment effect sizes than trials with these safeguards. The Cochrane Collaboration has repeatedly confirmed these findings across hundreds of meta-analyses.

Case Study: Cochrane Meta-Epidemiologic Evidence

Schulz et al. (1995) analyzed 250 controlled trials from 33 meta-analyses and found that trials with inadequate allocation concealment yielded odds ratios that were exaggerated by an average of 30–40% compared to adequately concealed trials. Trials that were not double-blinded showed similar inflation. Wood et al. (2008) extended these findings across 146 meta-analyses, confirming that lack of blinding particularly inflated subjective outcome estimates (e.g., pain, functional status) while having less impact on objective outcomes like mortality.

Allocation concealment refers to procedures that prevent those enrolling participants from knowing upcoming treatment assignments. When concealment is inadequate, recruiters can selectively enroll sicker patients into the treatment arm or healthier patients into the control arm—or vice versa—introducing selection bias after randomization.

  • Adequate methods: Central telephone randomization, sequentially numbered sealed opaque envelopes, pharmacy-controlled allocation
  • Inadequate methods: Open random-number tables, unsealed envelopes, alternation by day of week or bed number

Blinding (masking) prevents participants, clinicians, or outcome assessors from knowing group assignment. Lack of blinding introduces several biases:

  • Performance bias: Clinicians may provide differential co-interventions to unblinded groups
  • Reporting bias: Participants may report outcomes differently based on perceived treatment allocation
  • Detection bias: Assessors may interpret ambiguous outcomes differently when they know group assignment

Empirical evidence of bias magnitude:

Methodological FlawAverage Effect InflationOutcome Type Most Affected
Inadequate allocation concealment30–40% exaggerated ORAll outcomes
Lack of double-blinding15–25% exaggerated ORSubjective outcomes (pain, function)
Both flaws combinedUp to 50% exaggerated ORSubjective outcomes

Allocation concealment and blinding address mechanical and observer-driven biases. The next family of trial-specific biases comes from the participants themselves — from what they expect to feel, how being studied changes their behaviour, and whether they actually take the treatment they were assigned.

Placebo Effects

The placebo effect refers to measurable improvements in participants who receive an inert treatment, driven by expectation, conditioning, and the therapeutic context (catalogued among the classical biases by Sackett, 1979). Placebo effects are particularly pronounced in trials involving pain and depression, where substantial improvement is routinely observed in placebo arms.

2004) used fMRI to demonstrate that placebo analgesia activates endogenous opioid pathways in the brain, producing measurable neurochemical changes. This means placebo responses in pain trials are not merely “imagined” but involve genuine physiological mechanisms.

The magnitude of placebo response in pain trials has been increasing over time, particularly in U.S.-based trials, making it progressively harder to demonstrate superiority of active treatments over placebo.

')">
💊
Placebo in Pain Trials
Click to learn more
2008) conducted a meta-analysis of FDA-submitted antidepressant trials and found that approximately 80% of the improvement in the drug arms was duplicated in the placebo arms. The drug-placebo difference was clinically meaningful only for patients with the most severe depression.

This does not mean antidepressants are ineffective—but it demonstrates the enormous contribution of placebo response in psychiatric trials and the importance of placebo-controlled designs for establishing true drug efficacy.

')">
🧠
Placebo in Depression Trials
Click to learn more
2017).

Nocebo effects complicate the interpretation of adverse event reporting in trials and may inflate discontinuation rates in active treatment arms when participants know the expected side-effect profile.

')">
Nocebo Effects
Click to learn more

Hawthorne Effect

The Hawthorne effect (Wikipedia) describes the phenomenon in which participants modify their behavior simply because they know they are being observed or studied. This effect is named after the famous Western Electric studies in the 1920s–1930s, where factory workers increased productivity regardless of which workplace change was introduced, apparently because they were being monitored. McCambridge, Witton, & Elbourne (2014) systematically reviewed evidence for the effect and emphasized that “research participation effects” are heterogeneous and design-dependent.

Case Study: Hawthorne Effect in Hand Hygiene Studies

Srigley et al. (2014) measured hand hygiene compliance among healthcare workers using both direct observation (where workers knew they were being watched) and electronic monitoring (covert). Compliance rates were nearly three times higher when workers knew they were observed (estimated 70%+ vs. ~25% with electronic monitoring). This finding has profound implications for infection control studies: if the Hawthorne effect inflates compliance in all trial arms, the true baseline behavior is obscured, and interventions may appear less effective than they would be in unmonitored settings.

Compliance and Adherence Bias

Even in well-designed RCTs, not all participants adhere to their assigned treatment. Compliance bias arises when adherent participants differ systematically from non-adherent participants in ways that affect outcomes—a phenomenon sometimes called the “healthy adherer effect.”

Per-Protocol vs. Intention-to-Treat Analysis

Intention-to-treat (ITT) analysis includes all randomized participants in their assigned groups regardless of compliance. Per-protocol (PP) analysis includes only participants who adhered to the study protocol. PP analyses can introduce bias because compliant participants are systematically different from non-compliant ones (healthier, more motivated, higher socioeconomic status). The Coronary Drug Project Research Group (1980) demonstrated this dramatically: placebo adherent patients had 15% lower mortality than placebo non-adherent patients, confirming that adherence itself is a marker of overall health behavior.

The Healthy Adherer Effect

Simpson et al. (2006) conducted a meta-analysis showing that good adherence to placebo was associated with lower mortality (pooled OR = 0.56, 95% CI: 0.43–0.74). This means that adherence is a proxy for a constellation of health-promoting behaviors. Per-protocol analyses that compare “adherers to drug” vs. “all controls” conflate drug effects with the healthy adherer effect, inflating apparent treatment benefits.

Contamination in Community Trials

Contamination occurs when control group participants partially receive the intervention. This is particularly common in community intervention trials and pragmatic trials. For example, in the COMMIT (Community Intervention Trial for Smoking Cessation; COMMIT Research Group, 1995), control communities were exposed to national anti-smoking campaigns occurring simultaneously, which diluted the contrast between intervention and control and made the community-level intervention appear ineffective. Contamination biases results toward the null, reducing the apparent effect of the intervention.

Summary: RCT-Specific Biases

BiasMechanismLikely DirectionPrimary Safeguard
Allocation concealment failureSelective enrollment post-randomizationAway from nullCentralized randomization
Lack of blindingDifferential co-interventions, reporting, detectionAway from nullDouble-blind, placebo control
Placebo effectExpectation and conditioningInflates control improvementPlacebo-controlled design
Hawthorne effectBehavior change from observationToward null (if equal in arms)Covert measurement when ethical
Healthy adherer effectAdherence confounded with health behaviorsAway from null (PP analysis)ITT analysis
ContaminationControls exposed to interventionToward nullCluster randomization, geographic separation
Knowledge Check — Section 1

1. Meta-epidemiologic studies by Schulz et al. and Wood et al. found that trials without adequate allocation concealment report treatment effects that are:

Meta-epidemiologic evidence consistently shows that inadequate allocation concealment inflates effect estimates by 30–40%. This occurs because recruiters can subvert randomization by selectively enrolling patients. Lack of blinding further inflates subjective outcomes by 15–25%. These are systematic, directional biases, not random variation.

2. In the Coronary Drug Project, participants who adhered to the placebo regimen had substantially lower mortality than placebo non-adherers. This finding demonstrates:

The survival benefit among placebo adherers cannot be attributed to the placebo itself (an inert substance). It reflects the healthy adherer effect: people who follow medical advice tend to engage in many other health-promoting behaviors. This confounding makes per-protocol analyses unreliable and is why intention-to-treat analysis is preferred.

3. In the COMMIT smoking cessation trial, control communities were simultaneously exposed to national anti-smoking campaigns. This is an example of:

Contamination occurs when control participants are partially exposed to the intervention or its equivalent. National anti-smoking campaigns reached control communities, reducing the contrast between arms. This biases the effect estimate toward the null, making the intervention appear less effective than it truly is.
Section 2 of 4

Immortal Time & Lead-Time Bias

⏱ Estimated reading time: 20 minutes

Introduction and Overview

Section 1 covered biases that arise inside the controlled environment of an RCT. This section turns to a family of biases that haunt the messier observational designs — especially pharmacoepidemiology and screening evaluations — where time is handled poorly. All four biases in this section share a common engine: a comparison group that has somehow been guaranteed extra survival or extra opportunity to be exposed, by virtue of how the data were assembled rather than anything about the underlying biology. We start with the most consequential of them in observational drug research.

Learning Objectives

  • Define immortal time bias and identify it in pharmacoepidemiologic study designs.
  • Apply time-dependent exposure classification and landmark designs to remove immortal time bias.
  • Distinguish lead-time bias, length-biased sampling, and overdiagnosis in screening studies, and explain why mortality is the only outcome immune to all three.
  • Critique an observational drug or screening study for the role of time-related biases in its reported effect.

Immortal Time Bias

Immortal time bias occurs when a period of follow-up during which the outcome cannot occur is misclassified or improperly handled in the analysis (Lévesque, Hanley, Kezouh, & Suissa, 2010). The term “immortal” refers to the fact that participants must survive (remain event-free) long enough to be classified as exposed. When this survival requirement is not properly accounted for, the exposed group appears to have artificially better outcomes.

▸ INTERACTIVE STORY — THE IMMORTAL TIME GHOST Open full screen ↗

Walk through how a survival window before treatment can fake a treatment benefit. Next ▶ advances scenes.

A 6-scene visualization of immortal time: a patient timeline from diagnosis to death, the misclassified pre-treatment window, the spuriously divergent survival curves, the ghost of immortal time itself, and the time-varying-exposure fix.

Why “Immortal” Time?

Consider a study of statin use and mortality. To be classified as a “statin user,” a patient must survive long enough after hospital discharge to fill a prescription. The time between cohort entry and the first prescription is “immortal time”—by definition, the patient could not have died during this period and still been counted as exposed. If this time is credited to the exposed group (or excluded from the unexposed group), the result is a spurious survival advantage for statin users.

Case Study: Statins and Mortality After Acute Myocardial Infarction

Suissa (2008) demonstrated that several widely cited observational studies reporting dramatic mortality reductions from post-MI statin use (reductions of 25–50%) contained immortal time bias. These studies classified patients as “statin users” based on prescriptions filled after hospital discharge. Patients who died before filling a prescription were automatically classified as “non-users,” and the survival time before the first prescription was either misattributed to the exposed group or excluded entirely. When Suissa reanalyzed the data using time-dependent exposure classification—correctly assigning person-time before the first prescription to the unexposed period—the dramatic survival benefit was substantially reduced or eliminated.

R Immortal time bias in 30 lines of code

What you'll do: simulate a 2,000-patient cohort in which the drug has zero survival benefit, then run two analyses — one that ignores immortal time and one that handles it correctly. What to take away: the dramatic mortality reduction reported in some early observational drug studies is exactly the kind of artefact this code reproduces, and the fix is bookkeeping rather than biology.

Build a cohort where statins truly have no survival benefit, then misclassify person-time the wrong way. Watch a fake survival benefit appear out of nothing.

set.seed(230)
N <- 2000

# Each patient survives Exp(rate=1/3) years; 25% eventually fill a Rx.
# If they die before filling, they go in the unexposed group (no benefit).
death_t <- rexp(N, rate = 1/3)
rx_t    <- rexp(N, rate = 1/2)
ever_rx <- rx_t < death_t

## (1) WRONG analysis: classify by ever_rx, ignore immortal time
mean_surv_wrong <- tapply(death_t, ever_rx, mean)
mean_surv_wrong   # exposed group looks much better

## (2) RIGHT analysis: split person-time at the prescription date
unexp_pt <- pmin(death_t, rx_t)              # every pt's unexposed person-time
exp_pt   <- pmax(death_t - rx_t, 0) * ever_rx # only after Rx, only if they got it

unexp_events <- sum(!ever_rx)
exp_events   <- sum(ever_rx)
c(unexp_rate = unexp_events / sum(unexp_pt),
  exp_rate   = exp_events   / sum(exp_pt))
Console output
FALSE TRUE 1.43 4.27 # WRONG: huge fake "benefit" unexp_rate exp_rate 0.337 0.343 # RIGHT: rates almost identical

The bias is engineered, not measured. Wrong person-time bookkeeping — not a real treatment effect — produced the apparent benefit. Time-dependent exposure assignment is the fix.

R Reflect on what you just ran

Use the questions below to interpret the output you produced. Look at your console before answering.

1. In the WRONG analysis, mean survival times printed as ~1.43 for the unexposed (FALSE) and ~4.27 for the exposed (TRUE). The simulation built in zero treatment effect — so where did the apparent 3-year survival advantage come from? Be specific about which person-time is being miscounted.

Model answerThe 3-year ‘survival advantage’ comes from the immortal person-time between cohort entry and treatment initiation being credited to the exposed group. In the WRONG analysis, anyone who would later become exposed contributes their pre-exposure survival to the exposed column — but by definition they cannot die during that window without losing the exposed label, so that pre-exposure time is guaranteed event-free. The mean survival time of 4.27 (exposed) reflects truncation at the right end (people who die before being treated never get counted as exposed) plus left-counting of pre-exposure time. The lesson: any analysis where exposure status is assigned with information that occurs after time-zero will manufacture an apparent benefit out of thin air.

2. In the RIGHT analysis, the unexposed rate (0.337) and exposed rate (0.343) are nearly identical. Why is the rate of ~0.33 close to the simulated true death-rate parameter 1/3, and what does that confirm about the time-dependent classification you implemented?

Model answerThe simulation generated death times from rexp(n, rate = 1/3), so the true death rate is 1/3 per unit time ≈ 0.333. Both the corrected exposed rate (0.343) and unexposed rate (0.337) sit right at that value, confirming that the time-dependent classification — assigning person-time to unexposed until the moment of treatment, and to exposed only after — has eliminated the artefact. The agreement of both groups with the simulated truth is the cleanest test that the analytic fix works; the IRR is 1.02 instead of 3.0.

3. Suissa's reanalysis of post-MI statin studies showed the same pattern. If you were peer-reviewing an observational drug study reporting a 25-50% mortality reduction, what specific feature of the methods section would you check first, based on what this simulation showed?

Model answerFirst check in the methods: the time-zero alignment — specifically how exposed and unexposed person-time accrues. Look for any of (a) classifying patients by ever-use rather than time-varying exposure; (b) defining exposure based on prescriptions filled during follow-up but counting follow-up from cohort entry; (c) lack of a clear ‘new-user’ design; (d) absence of risk-set sampling for nested case-control. The Suissa diagnostic is: if patients had to survive long enough to fill a prescription before being labelled exposed, the analysis has built in immortal time, and the reported 25–50% mortality reduction is partly or entirely artefactual.
Saved.

Immortal time bias introduces a systematic distortion through three related mechanisms:

  1. Survival requirement: Exposed participants must survive long enough to receive the exposure (e.g., fill a prescription, receive a transplant, attend a rehabilitation program)
  2. Time misclassification: The pre-exposure survival time is either credited to the exposed group (inflating their person-time denominator) or excluded from analysis
  3. Asymmetric comparison: The unexposed group includes all person-time from cohort entry, including deaths that occurred before exposed participants had the opportunity to be classified

The result is a comparison between a group with guaranteed survival during one period vs. a group with no such guarantee, producing a spurious protective effect.

Nobel Prize longevity study: A widely reported analysis suggested Nobel Prize winners live longer than nominees. However, the “exposure” (winning the prize) requires surviving to the date of the award. Nominees who died before the award ceremony contribute person-time only to the “non-winner” group. Correcting for immortal time eliminated the longevity advantage.

Metformin and cancer: Early observational studies reported that metformin users had dramatically reduced cancer risk compared to users of other diabetes drugs. Suissa and Azoulay (2012) showed that time-related biases, including immortal time bias, explained much of this apparent benefit.

Correcting immortal time bias:

  • Time-dependent exposure classification: Model exposure as a time-varying covariate, so person-time before the first prescription is correctly assigned to the unexposed period
  • Landmark analysis: Define a fixed time point after cohort entry and classify exposure status at that landmark; exclude patients who die or are lost before the landmark
  • Nested case-control or case-crossover designs: Match on time at risk to ensure comparable exposure windows
  • Target trial emulation: Design the analysis to mimic a hypothetical RCT, specifying time zero, eligibility, and treatment assignment simultaneously (Hernán & Robins, 2016)

Immortal time bias is the most common time-related bias in observational drug studies. The closely related family of biases that affect the evaluation of screening programs share the same fundamental issue — the way time enters the comparison — but they manifest differently. The next three subsections take them in turn.

Lead-Time Bias

Lead-time bias occurs when screening or early detection appears to improve survival simply because the diagnosis is made earlier, without actually changing the time of death. The “lead time” is the interval between screen detection and the time the disease would have been diagnosed clinically. If survival is measured from diagnosis, screened patients will appear to live longer even if screening does not change mortality.

Key Concept: Lead-Time Bias Illustrated

Imagine two identical patients with the same cancer developing at age 55 and causing death at age 70. Patient A is diagnosed by screening at age 58 (15-year survival from diagnosis). Patient B is diagnosed clinically at age 65 (5-year survival from diagnosis). Screening “added” 10 years of survival—but both patients lived to exactly 70. The 10-year difference is entirely lead time, not a genuine survival benefit. Only mortality-based outcomes (not survival from diagnosis) can distinguish true benefit from lead-time bias.

Case Study: Prostate Cancer Screening with PSA

Following the introduction of PSA (prostate-specific antigen) screening in the late 1980s, 5-year survival rates for prostate cancer in the U.S. rose from approximately 75% to over 99%. However, prostate cancer mortality rates declined only modestly. Etzioni et al. (2002) estimated that lead-time bias accounted for the majority of the apparent survival improvement. Many screen-detected prostate cancers are slow-growing tumors that would never have caused symptoms or death (overdiagnosis), further inflating apparent survival statistics.

Lead-time bias inflates apparent survival without changing the time of death. The next bias goes one step further: it changes which tumors are even available to be detected.

Length-Biased Sampling

Length-biased sampling (or length bias) occurs when screening programs preferentially detect slower-growing, less aggressive tumors. Because slow-growing tumors have a longer detectable preclinical phase, they are more likely to be present during a screening examination. Fast-growing tumors, which have a shorter preclinical window, are more likely to present as “interval cancers” between screenings.

🔍
Length Bias Mechanism
Click to learn more
🏹
Breast Cancer Example
Click to learn more
2010) estimated that 15–25% of breast cancers and up to 60% of screen-detected prostate cancers represent overdiagnosis. These patients undergo treatment (surgery, radiation, hormonal therapy) for a disease that would not have harmed them, experiencing side effects without benefit.')">
Overdiagnosis
Click to learn more

Summary: Temporal Biases in Screening and Pharmacoepidemiology

BiasSettingMechanismCorrection Strategy
Immortal time biasDrug/exposure studiesPre-exposure survival time misclassifiedTime-dependent analysis, landmark analysis
Lead-time biasScreening evaluationEarlier diagnosis inflates survival from diagnosisUse mortality, not survival, as endpoint
Length-biased samplingScreening evaluationScreening detects slower-growing tumorsRandomized screening trials, mortality endpoints
OverdiagnosisScreening programsDetection of nonprogressive diseaseCompare incidence in screened vs. unscreened populations

Reflection

A pharmaceutical company publishes an observational study showing that patients who start their new cholesterol medication within 30 days of a heart attack have 45% lower mortality than those who do not. The study classifies patients based on whether they filled a prescription within 30 days. Identify at least two temporal biases that could explain this finding, and propose an analytical approach that would produce a less biased estimate.

Model answerTwo temporal biases drive the 45% mortality reduction. (1) Immortal time bias: classifying patients by whether they filled a prescription within 30 days means anyone who died between days 0–30 without filling cannot be labelled exposed — the exposed group is guaranteed to have lived at least until the day of the fill, while the unexposed group includes those early deaths. (2) Healthy-adherer / depletion-of-susceptibles bias: patients who actually fill a prescription within 30 days are systematically healthier and more engaged with care than those who don't; pharmacy fill is a marker of capacity, not just intent-to-treat. Less-biased analytic approach: target-trial emulation with a new-user, active-comparator design, time-zero aligned at the day of the first prescription, comparing new initiators of the new drug to new initiators of an established cholesterol drug. Cox model with proportional hazards on time since initiation. Where feasible, supplement with an instrumental-variable analysis using prescriber preference.
✓ Reflection saved
Knowledge Check — Section 2

1. In observational studies of post-MI statin use and mortality, immortal time bias produces a spurious protective effect because:

Immortal time bias is specifically about the misclassification of person-time. The period between cohort entry and the first prescription is “immortal” because the patient must be alive to fill the prescription. When this time is credited to the exposed group or excluded from the unexposed group, it creates a spurious survival advantage. Options C and D describe real confounders but are distinct from immortal time bias.

2. After PSA screening was introduced, prostate cancer 5-year survival rates rose from ~75% to >99%, yet mortality rates declined only modestly. The most important explanation for this discrepancy is:

The dramatic improvement in 5-year survival with minimal mortality reduction is the classic signature of lead-time bias combined with overdiagnosis. Diagnosing tumors years earlier automatically extends measured survival from diagnosis. Additionally, PSA screening detects many indolent cancers (overdiagnosis) that would never cause death, adding long-surviving patients to the numerator. Only mortality endpoints can distinguish true screening benefit from these artifacts.

3. Length-biased sampling in cancer screening means that:

Length-biased sampling occurs because tumors with a longer preclinical detectable phase (i.e., slow-growing tumors) are more likely to be “present” during any given screening test. Aggressive, fast-growing tumors pass through the detectable phase quickly and are more likely to present between screenings as interval cancers. This means screen-detected cancers are enriched for indolent disease, inflating apparent screening benefit when survival (rather than mortality) is used as the outcome.
Section 3 of 4

Time-Window, Period & Cohort Effects

⏱ Estimated reading time: 15 minutes

Introduction and Overview

Sections 1 and 2 covered biases tied to specific designs (RCTs and observational pharmacoepidemiology / screening). This section covers two final time-related issues that can affect almost any observational analysis: time-window bias in case-control designs (a close cousin of immortal time bias from Section 2) and the age-period-cohort identification problem that arises whenever we try to interpret time trends in disease rates.

Learning Objectives

  • Recognize time-window bias in case-control studies where exposure ascertainment periods differ between cases and controls.
  • Distinguish age, period, and cohort effects and explain why the three cannot be separately identified without additional constraints.
  • Interpret a time-trend graph using birth-cohort versus calendar-period framings, and articulate the assumptions each framing imposes.
  • Propose design or analytic strategies (matching on exposure-window length, restriction, theory-driven APC constraints) to address temporal biases.

Time-Window Bias

Time-window bias occurs in case-control studies when the exposure opportunity period differs between cases and controls. If cases have a systematically shorter (or longer) time window during which exposure could be ascertained, comparisons of exposure prevalence will be distorted.

Key Concept: Time-Window Bias

In a case-control study of medication use and an acute outcome (e.g., myocardial infarction), cases who die shortly after the event have a truncated exposure window compared to surviving controls. If exposure is defined as “any use in the prior year,” controls have a full year to accumulate medication use while cases who die within months may not. Alternatively, cases with longer pre-event periods may accumulate more exposure. The mismatch in observation time creates differential exposure opportunity, biasing the association.

Case Study: Statins and Cancer in Case-Control Studies

Suissa et al. (2006) showed that several case-control studies reporting protective effects of statins on cancer risk were affected by time-window bias. Controls were matched to cases on calendar date but had systematically longer exposure opportunity periods. Because statin use increased rapidly over time, controls—who were observed over longer and more recent periods—had a higher probability of statin exposure than cases. This differential created an artificial protective association. When exposure windows were properly aligned between cases and controls, the protective effect was substantially attenuated or disappeared.

How Time-Window Bias Differs from Immortal Time Bias

Both biases involve time-related distortions, but they operate through different mechanisms:

  • Immortal time bias occurs in cohort studies when pre-exposure person-time is misclassified, giving the exposed group a guaranteed survival advantage
  • Time-window bias occurs in case-control studies when cases and controls have unequal opportunities to be classified as exposed, biasing exposure prevalence comparisons

Both can produce spurious associations, but the correction strategies differ: immortal time bias requires time-dependent analysis, while time-window bias requires matching or restricting on exposure opportunity period.

Correcting Time-Window Bias

Strategies for addressing time-window bias include:

  • Match on exposure opportunity: Ensure cases and controls have the same length of observation time for exposure assessment
  • Use incidence density sampling: Select controls from the risk set at the time of each case’s event, ensuring comparable exposure windows
  • Define fixed exposure windows: Restrict exposure assessment to a fixed period (e.g., 90 days before the index date) that is identical for cases and controls
  • Sensitivity analyses: Vary the exposure window length and assess stability of results

Time-window bias is about the design of a single study. The last topic of the lesson zooms out to time-trend analyses themselves — surveillance data, cross-sectional surveys repeated across decades — and the famous identification problem that arises whenever we try to attribute change to a particular cause.

Period and Cohort Effects

When analyzing trends in disease rates or health behaviors over time, it is essential to distinguish between age effects, period effects, and cohort effects—the classic age-period-cohort (APC) problem.

👨‍🦳
Age Effects
Click to learn more
📅
Period Effects
Click to learn more
👪
Cohort Effects
Click to learn more
Case Study: Smoking Prevalence Trends—Period vs. Cohort Effects

Repeated cross-sectional surveys of smoking in the U.S. and U.K. show that overall smoking prevalence has declined steadily since the 1960s. However, examining trends by birth cohort reveals a more complex picture. Cohorts born in the 1920s–1940s had peak smoking rates above 50% among men and experienced gradual declines as anti-smoking campaigns took effect (a period effect). In contrast, cohorts born after 1970 never reached such high peak prevalence—reflecting a cohort effect of growing up in an environment where smoking was increasingly stigmatized and regulated. Failing to distinguish these effects leads to incorrect projections: a simple period model might predict convergence of all cohorts toward the same low rate, while a cohort model reveals that some older cohorts will maintain elevated rates until they exit the population through mortality.

The APC Identification Problem

Age, period, and cohort are mathematically collinear: Cohort = Period − Age. This means that given any two of these factors, the third is determined. This linear dependency creates a fundamental identification problem—it is impossible to simultaneously estimate all three effects without making additional assumptions or imposing constraints. Researchers must use theory, external data, or constrained models to disentangle these intertwined effects.

Summary: Time-Related Study Design Biases

Bias / EffectStudy DesignCore IssueSolution
Time-window biasCase-controlUnequal exposure opportunityMatch on observation time, incidence density sampling
Period effectCross-sectional, ecologicalCalendar-time changes affecting all agesAge-period-cohort modeling
Cohort effectCross-sectional, ecologicalBirth-year-specific risk trajectoriesAge-period-cohort modeling
APC confoundingAny time-trend analysisMathematical collinearity of age, period, cohortTheory-driven constraints, external validation

Reflection

A researcher examines lung cancer rates over time and observes declining rates in men but rising rates in women. They attribute this to a period effect of anti-smoking campaigns being more effective for men. Using your knowledge of cohort effects, propose an alternative explanation and describe what data analysis you would need to distinguish between these two interpretations.

Model answerThe pattern is more naturally explained by cohort effects than a period effect of campaigns. Smoking uptake in women rose dramatically through the mid-20th century (post-WWII marketing) while uptake in men peaked earlier and has been declining for decades; lung-cancer rates lag uptake by 30–40 years, so women born 1940–60 are now in the high-risk window while men born 1900–30 are aging out of it. To distinguish period from cohort effects, fit an age-period-cohort (APC) model on age-specific incidence: if the male decline and female rise are both concentrated in birth cohorts (parallel curves across periods for a fixed birth-year, with different cohort-specific intercepts), the cohort interpretation wins. Supplement with historical smoking-prevalence data by birth cohort to verify the lag. Standard problem: APC models suffer from identifiability between the three time scales; resolve by constraining one (typically dropping the linear period trend) and reporting sensitivity across constraint choices.
✓ Reflection saved
Knowledge Check — Section 3

1. Time-window bias in case-control studies of medication use occurs when:

Time-window bias arises specifically from a mismatch in the time available for exposure to be ascertained between cases and controls. When controls have longer observation windows, they have more opportunity to accumulate medication use, inflating exposure prevalence relative to cases (or vice versa). The solution is to ensure equal exposure assessment windows.

2. In repeated cross-sectional surveys, smoking prevalence among 50-year-olds has declined from 40% in 1980 to 15% in 2020. Which statement best distinguishes between period and cohort explanations?

Both period and cohort effects could produce the observed trend. A period effect would show similar declines across all birth cohorts at the same calendar time. A cohort effect would show that successive birth cohorts had lower smoking prevalence at every age. Distinguishing them requires age-period-cohort analysis—examining rates stratified by both birth year and calendar year. While the APC identification problem creates analytical challenges, the effects can often be partially disentangled with appropriate modeling and theory.

3. The age-period-cohort identification problem exists because:

The APC identification problem is a mathematical fact: because Cohort = Period − Age, any model including all three as linear terms is unidentifiable—there are infinite solutions. Researchers must impose constraints (e.g., assuming one effect is negligible, using non-linear functions, or incorporating external information) to disentangle the effects. This is not a data collection problem but a fundamental structural issue in time-trend analysis.

4. A researcher finds that statin use appears protective against cancer in a case-control study. However, controls had an average of 4.2 years of pharmacy records while cases (who died) had an average of 2.1 years. This discrepancy most likely introduces:

This is time-window bias in a case-control framework. Controls had twice the pharmacy record duration, giving them more time to accumulate statin prescriptions. This means higher statin exposure prevalence among controls is partly an artifact of longer observation, not a true protective association. The solution is to ensure equal exposure ascertainment windows for cases and controls.
Section 4 of 4

Final Assessment

⏱ Estimated time: 20 minutes

Bringing It All Together

This lesson covered design-specific and temporal biases in three groups: the biases that operate inside RCTs even when randomization succeeds (Section 1), the biases that haunt observational pharmacoepidemiology and screening evaluations (Section 2), and the biases that arise from how time itself is handled in case-control comparisons and time-trend analyses (Section 3). Each bias here typically combines elements from the three classical categories of Lessons 7–9 — for example, immortal time bias is partly a measurement problem (how exposure status is assigned over time) and partly a selection problem (who survives long enough to be classified as exposed).

The unifying thread is that design choices and the way time enters the analysis can manufacture associations that look causal but are not. The corresponding repairs are also tied to design: blinding and allocation concealment for trials; time-dependent exposure classification and landmark designs for pharmacoepidemiology; mortality endpoints for screening evaluation; matched exposure windows and theory-driven constraints for time-trend analyses. The final reflection asks you to put the full inventory to work on a single hypothetical screening claim; the assessment then tests the conceptual material across all three sections before Lesson 11 closes the loop on confounding and Lesson 12 integrates the entire course.

Key Takeaways from Lesson 10

  • RCT biases: Inadequate allocation concealment and blinding inflate effect estimates by 30–50%. Placebo effects, Hawthorne effects, compliance bias, and contamination further distort trial results.
  • Immortal time bias: Misclassifying pre-exposure survival time creates spurious protective associations in pharmacoepidemiologic studies. Time-dependent analysis and landmark designs correct this bias.
  • Lead-time bias: Earlier detection through screening inflates survival time from diagnosis without necessarily reducing mortality. Only mortality-based endpoints avoid this artifact.
  • Length-biased sampling: Screening preferentially detects indolent tumors, making screen-detected cancers appear to have better prognosis regardless of treatment.
  • Time-window bias: Unequal exposure ascertainment periods in case-control studies create differential exposure opportunity, biasing association estimates.
  • Period vs. cohort effects: Changes in disease rates over time may reflect calendar-period influences on all ages or birth-cohort-specific risk trajectories. The APC identification problem requires theory-driven constraints to disentangle these effects.
R Activity — Simulating Immortal Time Bias

The companion R script r-activities/HSCI_230_Lesson_10_Design_Specific_and_Temporal_Biases.R constructs a cohort in which a prescription has no real effect on survival, then contrasts a naive analysis (classify everyone by whether they ever filled the Rx, and compare mean survival) against a correct analysis that splits person-time at the prescription date. The wrong analysis manufactures a large survival advantage out of thin air; the right analysis recovers the truth that the two rates are essentially equal.

set.seed(230)
N <- 2000

# Each patient survives Exp(rate=1/3) years; 25% eventually fill a Rx.
# If they die before filling, they end up classified as unexposed.
death_t <- rexp(N, rate = 1/3)
rx_t    <- rexp(N, rate = 1/2)
ever_rx <- rx_t < death_t

## (1) WRONG analysis: classify by ever_rx, ignore immortal time
mean_surv_wrong <- tapply(death_t, ever_rx, mean)
mean_surv_wrong   # exposed group looks much better

## (2) RIGHT analysis: split person-time at the prescription date
unexp_pt <- pmin(death_t, rx_t)              # unexposed person-time
exp_pt   <- pmax(death_t - rx_t, 0) * ever_rx # exposed person-time

unexp_events <- sum(!ever_rx)
exp_events   <- sum( ever_rx)
c(unexp_rate = unexp_events / sum(unexp_pt),
  exp_rate   = exp_events   / sum(exp_pt))

Final Reflection

Consider the following scenario: A national screening program reports that 5-year survival for a particular cancer has improved from 50% to 85% over 15 years, and attributes this improvement to the screening program. Drawing on everything you have learned in this lesson about design-specific and temporal biases, identify all the biases that could contribute to this apparent improvement, explain how each one operates, and describe what evidence you would need to determine whether the screening program truly reduced mortality.

Model answerMultiple biases can inflate 5-y survival even when screening adds no real benefit. (a) Lead-time bias: screening detects cancers earlier in their natural history, so the clock for 5-y survival starts sooner; survival is longer simply because the diagnosis date moved earlier, not because the date of death did. (b) Length-time bias: screening preferentially picks up slow-growing, indolent cancers (they spend more time in the detectable preclinical phase), so the screen-detected pool is enriched for tumours that would have had good prognosis anyway. (c) Overdiagnosis: subclinical cancers that would never have caused symptoms are now ‘detected’ and counted as cured — the extreme of length-time bias. (d) Stage-migration / Will Rogers effect (Feinstein, Sosin, & Wells, 1985): improved imaging upstages borderline tumours, making both stage-specific groups look better on average. (e) Improvements in treatment over 15 years that have nothing to do with screening (better surgery, targeted therapy) inflate survival regardless of detection mode. The decisive evidence is mortality, not survival: a properly designed randomised screening trial (or a population-level interrupted time-series with mortality endpoints in screened vs. unscreened populations) is what distinguishes a real effect from artefact. Without mortality data and a non-screened comparator, the 50→85% figure is uninformative about whether the programme works.
✓ Reflection saved

Final Quiz

This 15-question assessment covers all sections of Lesson 10. You must score 100% to complete the lesson. Review the explanations for any questions you miss, then retry.

Final Assessment — Lesson 10 (15 Questions)

1. A meta-analysis combines results from 20 RCTs of a surgical procedure. Trials that used sham surgery controls showed a pooled OR of 1.1, while trials using usual care controls (no blinding) showed a pooled OR of 2.4. This pattern is best explained by:

When trials lack blinding (usual care controls), multiple biases inflate effect estimates: participants expect benefit and report better outcomes (placebo/expectation), clinicians provide differential co-interventions (performance bias), and assessors interpret outcomes favorably (detection bias). Sham surgery controls these biases by maintaining blinding, yielding an unbiased estimate close to the null.

2. In a workplace wellness RCT, the intervention group receives health coaching while the control group receives no contact. Both groups show improved health behaviors. The most likely explanation for improvement in the control group is:

The Hawthorne effect describes behavior modification due to awareness of being observed. Control participants who know they are in a study and will be monitored may change their health behaviors even without receiving the intervention. This is distinct from contamination (where controls actually receive intervention elements) and from social desirability (which affects reporting but not necessarily behavior).

3. The Coronary Drug Project found that placebo-adherent patients had 15% lower 5-year mortality than placebo non-adherent patients. This finding implies that:

Since an inert placebo cannot reduce mortality, the survival advantage of adherent participants must reflect confounding—adherent people tend to be healthier in many unmeasured ways. This is the healthy adherer effect, and it demonstrates why per-protocol analyses can be biased even in randomized trials. Intention-to-treat analysis preserves the benefits of randomization.

4. A community-randomized trial of a dietary intervention finds no significant difference in cardiovascular events between intervention and control communities. However, process data show that 40% of control community residents adopted the dietary changes through media coverage of the trial. The true intervention effect is most likely:

Contamination occurs when control participants adopt the intervention. This reduces the contrast between groups, biasing the effect estimate toward the null. The true effect of the dietary intervention is likely larger than observed. While contamination complicates interpretation, instrumental variable approaches or as-treated analyses (with appropriate caution) may help estimate the effect in the compliant subgroup.

5. An observational study reports that patients who complete cardiac rehabilitation after MI have 50% lower mortality than those who do not. This dramatic benefit could be partly explained by immortal time bias because:

Completing a multi-week rehabilitation program requires surviving long enough to finish it. If a patient dies during week 2 of a 12-week program, they would be classified as “did not complete rehabilitation.” The weeks of guaranteed survival for completers represent immortal time. Options B and D describe real confounders but are not immortal time bias specifically.

6. A landmark analysis of statin use and mortality sets the landmark at 6 months post-MI and classifies patients based on statin use by that point. What is the primary advantage of this approach?

Landmark analysis addresses immortal time bias by establishing a fixed time point (the landmark) at which exposure is assessed. All patients must have survived to the landmark, so there is no differential immortal time. Patients who die before the landmark are excluded entirely. Follow-up begins at the landmark, ensuring comparable starting points for exposed and unexposed groups.

7. A screening program for neuroblastoma in infants detects many tumors, but a randomized trial shows no mortality reduction. The most complete explanation involves:

Neuroblastoma screening in infancy is a classic example of overdiagnosis—many infant neuroblastomas spontaneously regress. Screening detected these self-resolving tumors (inflating incidence) without catching the aggressive forms early enough to change mortality. The combination of lead-time bias, length-biased sampling, and overdiagnosis explains the discrepancy between increased detection and unchanged mortality.

8. In a case-control study of NSAIDs and colorectal cancer, cases had pharmacy records covering a mean of 3.5 years while controls had records covering 6.2 years. If NSAIDs are truly unrelated to colorectal cancer, this study would most likely find:

This is time-window bias. Controls have nearly twice the pharmacy record duration, giving them more time to fill NSAID prescriptions. Even if the true association is null, controls will appear to have higher NSAID exposure, producing a spurious protective OR. The correction is to equalize the exposure ascertainment window between cases and controls.

9. Lung cancer rates among women aged 55–64 are higher in 2020 than in 1980, while rates among men of the same age declined. A researcher claims this reflects a period effect of anti-smoking campaigns being less effective for women. An alternative cohort-based explanation is:

The cohort explanation recognizes that women born in the mid-20th century adopted smoking later and reached peak prevalence later than men. Women aged 55–64 in 2020 (born ~1956–1965) came of age during the period of highest female smoking, while their male counterparts were already part of declining-smoking cohorts. The resulting lung cancer rates reflect these different cohort-specific smoking trajectories, not differential effectiveness of anti-smoking campaigns.

10. A researcher wants to study whether a new diabetes drug reduces cancer risk using a healthcare database. Which study design feature most directly prevents immortal time bias?

Immortal time bias is caused by misclassification of person-time. The direct solution is to treat drug exposure as time-varying: before the first prescription, the person contributes time to the unexposed group; after the prescription, they contribute to the exposed group. Matching (option A) addresses confounding but not time misclassification. Restricting follow-up (option D) could actually worsen the bias by requiring survival to the restriction point.

11. In a trial of acupuncture for chronic pain, patients receiving real acupuncture improve by 40%, while patients receiving sham acupuncture (needles placed at non-acupuncture points) improve by 35%. Patients on a waitlist improve by 10%. These results suggest:

This three-arm design separates the total effect into components. The difference between sham and waitlist (35% - 10% = 25%) represents the placebo/context effect (expectation, therapeutic relationship, ritual). The difference between real and sham (40% - 35% = 5%) represents the specific needle-placement effect. Most of the apparent “benefit” of acupuncture over no treatment is attributable to non-specific placebo effects.

12. A cancer screening study reports that patients whose cancers were detected by screening have a 10-year survival rate of 90%, while patients whose cancers were detected clinically have a 10-year survival rate of 40%. This comparison is invalid for evaluating screening effectiveness because:

Comparing screen-detected to clinically detected cancers is subject to three interrelated biases: (1) lead-time bias adds years of survival from diagnosis without changing time of death; (2) length-biased sampling means screen-detected cancers are enriched for slow-growing tumors with inherently better prognosis; and (3) overdiagnosis means some screen-detected cancers would never cause death. Together, these biases make screening appear far more beneficial than it may actually be. Only randomized trials comparing mortality between screened and unscreened populations can provide valid estimates.

13. An investigator studying the effect of beta-blockers on heart failure hospitalization in a case-control study ensures that both cases and controls have exactly 365 days of pharmacy data before the index date. This design feature primarily addresses:

Ensuring equal pharmacy data duration for cases and controls directly addresses time-window bias. Both groups have exactly the same opportunity to accumulate medication exposure. Without this restriction, differential record lengths would create differential exposure opportunity, biasing the association. This does not address confounding by indication (which requires adjustment for disease severity) or immortal time bias (a cohort study issue).

14. Obesity rates among 40-year-olds in 2020 are much higher than among 40-year-olds in 1980. If this is primarily a cohort effect rather than a period effect, you would expect to find:

A cohort effect means that the 1980 birth cohort carries higher obesity risk throughout their lives compared to the 1940 cohort, due to different formative exposures (processed food environment, sedentary lifestyles, in utero conditions). This would be visible at every age point, not just at age 40. A period effect, by contrast, would affect all age groups simultaneously at the same calendar time. Option A describes a period effect pattern.

15. A systematic review finds that observational studies of a medication report a 45% risk reduction, while the single large RCT reports only a 12% risk reduction. Considering design-specific biases, which combination of biases most likely explains the discrepancy?

Observational pharmacoepidemiologic studies are vulnerable to multiple biases that inflate treatment effects: immortal time bias (misclassified person-time), the healthy adherer effect (medication users are systematically healthier), and confounding by indication (healthier patients receive certain medications). The RCT, with randomization and ITT analysis, controls for these biases. The consistent finding that RCTs show smaller effects than observational studies for the same medication supports this explanation.

Lesson 10 Complete!

Congratulations! You have successfully completed the lesson on Design-Specific and Temporal Biases. Your responses have been downloaded automatically.

Lesson 11 — Confounding and Statistical Inference — finishes the bias material. Confounding is the third leg of the canonical bias triad alongside selection (Lesson 8) and information (Lesson 9), and the lesson also brings statistical inference back into focus — sampling distributions, confidence intervals, p-values, and the difference between random error and systematic error. By the time you reach Lesson 12's integrated appraisal, the full toolkit of HSCI 230 will be in place.