HSCI 230 — Lesson 3

Introduction to Observational Studies

Evaluating Epidemiological Research

Kiffer G. Card, PhD, Faculty of Health Sciences, Simon Fraser University

Learning objectives for this lesson:

  • Differentiate between descriptive and explanatory studies
  • Differentiate between experimental and observational studies
  • Describe the three main elements of the unified approach to observational study design
  • Describe the advantages and limitations of case reports, case-series reports, and surveys
  • Design a cross-sectional study accounting for its strengths and weaknesses
  • Identify circumstances where a cross-sectional study is appropriate
  • List three approaches for obtaining incidence estimates from cross-sectional prevalence data
  • Differentiate between repeated cross-sectional studies and following a cohort in a longitudinal study
  • Apply the STROBE checklist to reporting a cross-sectional study

Dohoo, I. R., Martin, S. W., & Stryhn, H. (2012). Methods in Epidemiologic Research. VER Inc.

Reference

Glossary — Key Terms, People & Concepts

📚 Reference page — available throughout the lesson

This glossary collects the key concepts, people, and ideas you will meet in this lesson. Use it as a reference while you work through the material, or as a review before assessments. Type in the search box to filter entries.

Key Concepts & Ideas
Observational Study A study in which the researcher does not assign exposures — participants “assign themselves” through their behaviors, environments, or characteristics. The dominant design family in epidemiology.
Experimental Study A study in which the researcher assigns exposure (ideally at random). Includes randomized controlled trials. Generally provides stronger causal inference but is often infeasible or unethical for many public-health questions.
Descriptive Study A study aimed at describing the distribution of disease or other health outcomes by person, place, and time — without testing a specific causal hypothesis.
Explanatory (Analytic) Study A study designed to test associations between specific exposures and outcomes, typically with a hypothesis about causation in mind.
Exposure A factor whose effect on a health outcome is being investigated — e.g., a behavior, environmental contaminant, drug, or social condition.
Outcome The health state or event whose occurrence the study is trying to explain — disease, death, recovery, lab measurement, behavior change.
Prevalence The proportion of a population that has a condition at a particular point (point prevalence) or over a window (period prevalence). The natural measure delivered by cross-sectional studies.
Incidence The rate at which new cases of a condition arise over time in a population. Captured cleanly by cohort designs; only indirectly by cross-sectional surveys.
Cumulative Incidence (Risk) The proportion of an initially disease-free population that develops the outcome over a defined follow-up period.
Incidence Rate New cases divided by total person-time at risk. Has units of 1/time and accommodates loss to follow-up and varying observation windows.
Study Base The population and time period from which cases (and, in case-control studies, controls) arise. Defining the study base clearly is essential to all observational designs.
Sampling Frame The list or mechanism from which study participants are actually drawn. The closer the sampling frame matches the target population, the less selection bias you should expect.
Selection Bias Systematic error introduced when the people who end up in the study differ in relevant ways from the population the study claims to represent. A primary threat to all observational designs.
Information (Measurement) Bias Error introduced when exposure or outcome is measured inaccurately — e.g., faulty instruments, recall errors, misclassification.
Confounding A distortion of the exposure–outcome association due to a third variable that is associated with both. The central methodological problem of observational epidemiology; for a foundational catalogue of biases see Sackett (1979).
STROBE Checklist Strengthening the Reporting of Observational Studies in Epidemiology — a 22-item checklist that observational studies should follow when reporting cohort, case-control, and cross-sectional designs (von Elm et al., 2007).
Three Elements of Observational Design Dohoo, Martin & Stryhn's unifying framework: (1) selection of the study sample, (2) measurement of exposure and outcome, and (3) the temporal relationship between them. Every observational design is a different recipe combining these three elements.
Methods & Study Designs
Case Report A detailed clinical or epidemiological description of a single patient or event. Useful for hypothesis generation and signaling new conditions; cannot estimate frequency or test causal hypotheses.
Case Series A description of a small group of cases sharing a feature. Like case reports but offering modest pattern-recognition power; still cannot estimate population frequencies without a denominator.
Survey A study that collects information from a sample of a population through standardized questions or measurements. The data-gathering engine behind cross-sectional and many descriptive studies.
Cross-Sectional Study A study that measures exposure and outcome at the same point in time in a defined population. Estimates prevalence; useful for hypothesis generation but generally weak for cause–effect inference because exposure and outcome are simultaneous.
Repeated Cross-Sectional Studies A series of separate cross-sectional samples taken from the same target population at different times. Tracks population-level change but cannot follow individuals.
Longitudinal Cohort Follows the same individuals over time, allowing within-person change and incidence to be estimated. Contrast with repeated cross-sectional designs.
Ecological Study An observational study in which the unit of analysis is a group (region, country, school) rather than an individual. Treated more fully in Lesson 6.
No matching entries. Try a different search term.
Section 1

Study Classification & Design Framework

Sections 7.1–7.2 of Dohoo, Martin & Stryhn

Introduction and Overview

Lessons 1 and 2 worked at the level of the discipline: where epidemiology came from, what counts as knowledge, and how the published record can fail and be appraised. Lesson 3 takes the first step from those framing concerns into the working tools of the field. Almost every bias the manufactured-doubt strategy weaponizes, every analytic error the integrity-and-reform material in Lesson 1 catalogued, and every pre-analytic discipline EGAP demands — all of them play out concretely once you have a study design in front of you. So this lesson is about the design.

The four content sections proceed from general to specific. Section 1 sets up a classification scheme for every study you will encounter for the rest of HSCI 230. Section 2 covers the simplest of those study types — descriptive studies, where a comparison group is absent. Section 3 introduces the first true analytic observational design: the cross-sectional study. Section 4 pushes on its limitations and shows what reporting it well requires. By the end of the lesson, the language you'll need for case-control studies (Lesson 4) and cohort studies (Lesson 5) will already be in place.

Learning Objectives

  • Distinguish descriptive from explanatory (analytic) studies by their purpose and inferential claims.
  • Distinguish experimental from observational studies in terms of investigator control and confounding.
  • Apply Hernan’s and Rubin’s “unified approach” (thought experiment, design before data, forward projection) to a research question.
  • Locate any study type within the hierarchy of evidence and explain what causal weight that placement does and does not imply.

Descriptive vs. Explanatory Studies

Epidemiologic studies can be classified into two major categories: descriptive and explanatory (analytic). This classification reflects both the study’s objectives and its ability to support causal inference. The two tabs below define each category in turn; click between them and notice that the difference is not about technique but about purpose — whether the study is built to characterize disease occurrence or to compare groups.

Descriptive studies include case reports, case-series reports, and surveys. They are designed solely to describe the nature and distribution of outcome events such as health-related phenomena. They describe the who, what, when, and where of disease occurrence.

Although a descriptive survey is not designed to assess hypotheses about manipulatable causes of the outcome event, the frequency of the outcome is usually described in categories of age, race, sex, season, and space.

Explanatory studies (also called analytic studies) are designed to make comparisons and contrasts between subgroups of study subjects based on exposure or outcome status. They allow the investigator to identify statistical associations between exposures and outcomes.

Explanatory studies can be further subdivided into experimental and observational studies, depending on whether the investigator controls the allocation of study subjects to exposure groups.

The descriptive–explanatory split asks what a study is trying to do. The next split — experimental versus observational — asks how the explanatory studies do it, and is the distinction this entire lesson hinges on.

Experimental vs. Observational Studies

In experimental studies, the investigator controls (usually through randomisation) the allocation of study subjects to exposure groups. In contrast, in observational studies, the investigators try not to influence the natural course of events for the study subjects.

Key Distinction

In experimental studies, we try to reduce variation from all sources through selection and control of the experimental setting. In observational studies, we embrace the presence of natural variation in order to identify important interactions among key variables and the exposure–disease association.

The price paid through the use of observational studies is that considerable efforts are required to prevent confounding (bias) of the exposure–disease association. Experiments are the preferred choice when the treatment is straightforward and easily manipulated, such as a vaccine or a specific therapeutic agent. The major advantage of the experimental approach is the ability to control potential confounders through the process of randomisation.

If observational studies have to fight confounding the way experiments avoid it, the natural question is how to design them so that the fight is winnable. Two well-known methodologists — Miguel Hernan and Donald Rubin — offer the same answer in slightly different language: borrow the discipline of the experiment, even when you cannot run one. That answer has three parts, summarized in the accordion below.

A Unified Approach to Study Design

Hernan (2005) stressed that when considering an observational study design, we should think about the design of a field experiment to accomplish the same objective. This approach, reinforced by Rubin (2007), emphasises that ‘design trumps analysis’ and that all elements of the study design should be completed before seeing any outcome data. Expand each of the three steps below in order — the order matters, because each step constrains the next.

1. The ‘Thought Experiment’

As a first step in considering an epidemiological study, a ‘thought experiment’ can be accomplished and should specify the key elements of study group, its selection, assignment to exposure, procedures for follow-up, and detecting the outcome. The important part is that formal randomisation would ensure ‘exchangeability’—the groups being compared are so similar that it does not matter which group was assigned to exposure.

2. Design Features Before Seeing Data

All design features are completed before anyone has seen the outcome data. This includes subject exclusion, selection criteria, and control of confounding. Rubin formalises the process through propensity scores (the probability of exposure given the covariates) in the exposed and non-exposed groups. Unless these are virtually equal, some degree of confounding is possible.

3. Forward Projection (Critical Appraisal)

After completing the initial design, we project forward to the presentation of study results under 3 different scenarios: (1) the exposure appears to increase risk; (2) the exposure appears to decrease risk; or (3) the exposure does not appear to be associated. For each scenario, we must defend the proposed design. This process helps identify potential weaknesses.

The unified approach gives you the discipline that any single study should have. Stepping back further, we can also rank study types by how much causal weight any one well-designed instance of them can carry. That ranking — the so-called hierarchy of evidence — is the last piece of the classification map for this section.

Hierarchy of Evidence for Causal Inference

From the perspective of drawing causal inferences, experimental studies are generally referred to as the gold standard (hierarchy of evidence). The hierarchy of causal evidence (from strongest to weakest) is typically as shown in the table below. The Lesson 1 caveat is worth holding onto here: the hierarchy describes causal weight, not absolute quality. A well-conducted cross-sectional study can be far more useful than a poorly-conducted RCT for many real-world questions.

Study TypeDifficultyInvestigator ControlCausal EvidenceRelevance
Laboratory trialModerateVery highVery highLow
Controlled field trial (RCT)ModerateHighVery highHigh
Cohort studyDifficultHighHighHigh
Case-control studyModerateModerateModerateHigh
Cross-sectional studyModerateLowLowModerate
SurveyModerateModerateNot applicableHigh
Case seriesEasyVery lowNot applicableLow to high
Case reportVery easyVery lowNot applicableLow to high

The reflection below asks you to use the classification scheme we just built to evaluate a research question of your own. The knowledge check that follows tests the conceptual content directly. Once you have worked through both, Section 2 takes the bottom of the hierarchy — the descriptive designs — and asks what useful work they can do despite their lack of a comparison group.

Reflection

Think about a health outcome you are interested in studying. Would an experimental or observational approach be more appropriate, and why? Consider ethical, practical, and scientific factors in your answer.

Model answerA strong answer names a concrete outcome (e.g., second-hand-smoke exposure and asthma in children) and walks through trade-offs. Experimental approaches (RCTs) yield the cleanest causal inference but are ethically and practically constrained: you cannot randomise people to harmful exposures, often cannot blind, may face refusal, and the artificial setting can hurt external validity. Observational designs are feasible for most exposures of public-health interest (diet, occupation, neighbourhood, structural racism), preserve real-world variation, and allow long follow-up — but they trade away guaranteed exchangeability between groups and require explicit confounder control. A useful framing: if randomisation is feasible and ethical and trial conditions resemble real practice, randomise; otherwise design the strongest possible observational study (large prospective cohort with rich covariates, ideally complemented by quasi-experimental analyses such as instrumental variables, regression discontinuity, or difference-in-differences).
Reflection saved!
Knowledge Check — Section 1

1. Which of the following BEST distinguishes explanatory from descriptive studies?

Explanatory (analytic) studies are specifically designed to compare subgroups of subjects based on exposure or outcome status, whereas descriptive studies only characterise the distribution of disease.

2. What is the major advantage of experimental studies over observational studies for causal inference?

The major advantage of the experimental approach is the ability to control potential confounders, both measured and unmeasured, through the process of randomisation.

3. The ‘unified approach’ to observational study design includes the thought experiment, completing design features before seeing data, and:

The third component of the unified approach is forward projection, where the researcher projects forward to three scenarios (increased risk, decreased risk, no association) and defends the proposed design under each.
Complete the quiz and reflection to continue.
Section 2

Descriptive Studies: Case Reports, Case Series & Surveys

Section 7.3 of Dohoo, Martin & Stryhn

Introduction and Overview

The hierarchy at the end of Section 1 placed descriptive studies at the bottom for causal inference, and that ranking is fair — but it can give the wrong impression about their usefulness. Descriptive studies are where many real epidemiologic investigations start — from James Lind's 1753 treatise on scurvy to John Snow's 1854 mapping of the Broad Street cholera outbreak. They generate the hypotheses that the analytic designs in later sections then test.

Descriptive studies are used to describe the main features of a disease or health-related outcome. Although they are not designed to evaluate associations between exposures and outcomes, the observations made in a descriptive study can form the basis of hypotheses which are then further investigated in analytic studies. Three forms of descriptive studies are case reports, case-series reports, and surveys. The three flip cards below introduce each in turn; click through them and notice how each one expands the scope of observation — from a single unusual case, to a documented group of cases, to a full population sample.

Learning Objectives

  • Define case reports, case-series, and surveys and explain what each contributes to the evidence base.
  • Explain why descriptive designs cannot establish causal associations but are essential for hypothesis generation.
  • Identify situations in public-health practice where descriptive studies are the right tool (emerging outbreaks, rare events, signal detection).
  • Read the historical case-series literature with appropriate caveats about generalisability and selection.
📄
Case Reports
Click to explore
📋
Case-Series Reports
Click to explore
📊
Surveys
Click to explore

Key Characteristics of Study Types

Descriptive studies differ from analytic observational studies in important ways. The following comparison highlights these differences:

Important Limitation

A common feature of both case reports and case-series reports is the absence of a comparison group. Without a comparison group, it is impossible to draw valid conclusions about causal associations. This is why descriptive studies are considered hypothesis-generating rather than hypothesis-testing.

The third descriptive type — the survey — is also the bridge into the rest of the lesson. Once a survey starts collecting risk-factor information alongside disease information, it stops being purely descriptive and becomes the design Section 3 is about.

From Survey to Analytic Study

Kalsbeek and Heiss (2000), and Speybroeck et al (2003) have described the appropriate analysis of surveys bearing in mind the study design. If the survey is designed to collect information about both an outcome of interest and potential exposures (risk factors) beyond the categories of people, place, and time, it then becomes a cross-sectional analytic study and as such, can be used to evaluate associations between exposures and outcomes. The Ontario Hypertension Survey scenario below is a good example of this transition in action; we will return to its design choices in Section 3.

Scenario: The Ontario Hypertension Survey

Leenen et al (2008) conducted a survey of the prevalence of hypertension in Ontario. The sampling frame consisted of municipalities and dissemination areas. From 6,436 eligible dwellings, contact was made with 4,559 potential participants. Hypertension prevalence was found to be 21.3% of the population overall. This survey combined both prevalence estimation and risk factor analysis, making it a cross-sectional analytic study.

The reflection below puts the descriptive–analytic boundary to work. Once you have answered it and the knowledge check, Section 3 turns the question around: given that we now have a true analytic observational design (the cross-sectional study), how should we actually build one?

Reflection

Can you think of a disease or health condition for which a case-series report might be the most appropriate initial study design? What hypothesis might it generate for future analytic studies?

Model answerCase series are most useful when (a) the outcome is novel or extremely rare, so cohort or case-control logistics are infeasible, and (b) you want to generate hypotheses rather than test them. Classic example: the 1981 MMWR report of five cases of Pneumocystis carinii pneumonia in previously healthy young men in Los Angeles — the founding observation of the AIDS epidemic. Modern analogues: the early Wuhan case series of viral pneumonia of unknown cause (Dec 2019), clusters of e-cigarette-associated lung injury (EVALI, 2019), or VITT-like thromboses after early COVID vaccines. The hypothesis these reports generate is structural: this cluster looks unlike background; an analytic cohort or case-control study should now test for a shared exposure, risk factor, or pathogen.
Reflection saved!
Knowledge Check — Section 2

1. What is the primary limitation shared by both case reports and case-series reports?

Both case reports and case-series reports include only cases; they lack an explicit comparison group, making it impossible to draw valid conclusions about causal associations.

2. A survey becomes a cross-sectional analytic study when it:

When a survey is designed to collect information about both an outcome of interest and potential exposures (risk factors) beyond the basic categories of people, place, and time, it becomes a cross-sectional analytic study.

3. A case-series report documenting 50 patients with a rare autoimmune condition would be classified as:

Case-series reports are descriptive studies. They describe the characteristics of a group of cases but do not make formal comparisons with a control or unexposed group.
Complete the quiz and reflection to continue.
Section 3

Cross-Sectional Studies: Design & Implementation

Sections 7.4–7.5 of Dohoo, Martin & Stryhn

Introduction and Overview

Section 1 placed observational analytic studies in the larger classification map, and Section 2 walked through the descriptive designs that lead up to them. This section is the first time we sit inside an analytic observational design and look at how it actually works. We start with two classification distinctions that apply to every observational study you will encounter (prospective vs retrospective; the three sampling approaches), and then narrow in on the design that sits at the centre of this lesson: the cross-sectional study.

Learning Objectives

  • Distinguish prospective and retrospective observational designs and explain why cross-sectional studies are inherently retrospective.
  • Compare the three sampling approaches (cross-sectional, cohort, case-control) and identify the question each is best suited to answer.
  • Walk through the four design steps of a cross-sectional study (study group, exposure, outcome, confounding) and apply them to a real example.
  • Explain why prevalence — not incidence — is the natural outcome measure for a cross-sectional design.
  • Identify situations in which matching is and is not available as a confounding-control strategy.

Observational Studies Overview

Observational studies (a subgroup of analytic or explanatory studies) have an explicit formal contrast as part of their design: the prevalence of the outcome by exposure category groups is the central foundation. They differ from descriptive studies in that the comparison of two or more groups is central, and from experiments in that the researcher has no control over the allocation of study subjects to the exposure groups.

Prospective vs. Retrospective Designs

Observational studies can also be classified as prospective or retrospective. In prospective studies, the disease or outcome has not occurred at the time the study starts. In retrospective studies, both the exposure and the outcome have occurred when the study begins—hence cross-sectional studies are inherently retrospective in nature.

Sampling Drives the Design

Three Main Approaches

The choices of observational analytic study design have traditionally been among 3 approaches based on how study subjects are selected:

  • Cross-sectional study: A sample is obtained from the source population, and the prevalence of both disease and exposure is determined at the time of subject selection.
  • Cohort study: A sample of study subjects from a source population with heterogeneous exposure levels is obtained, and the incidence of the outcome in the follow-up period is determined. Landmark examples include the Framingham Heart Study (Dawber, Meadors, & Moore, 1951) and the Whitehall II study of British civil servants (Marmot et al., 1991).
  • Case-control study: Subjects with the outcome (cases) are identified and their exposure history is contrasted with the exposure history of a sample of non-case subjects (controls). Doll & Hill's (1950) case-control study of smoking and lung cancer, and Herbst, Ulfelder & Poskanzer's (1971) investigation of DES and vaginal adenocarcinoma, are the canonical examples.

Of the three sampling approaches above, the cross-sectional study is the one this lesson will fully dissect. Lessons 5 and 6 will pick up case-control and cohort designs in the same way. So the rest of this section is the architectural tour of one design, but the questions it asks — how to obtain study subjects, how to assess exposure, how to define the outcome, how to control confounding — are the same questions you will be asking of every observational design from now on.

Cross-Sectional Study Design

The defining feature of a cross-sectional study is that it is an observational study whose outcome frequency measure is prevalence. The basis of the cross-sectional design is that a sample, or census, of subjects is obtained from the source population and the presence or absence of the outcome is ascertained at that point. The accordion below walks through the four design steps in the order a researcher would take them: define the study group, measure exposure, ascertain outcome, and design in protections against confounding from the start.

Obtaining the Study Group

If the researcher wants to make inferences about the frequency of the outcome in a target population, then study subjects should be obtained by a formal random sampling procedure. The source population is the listing (real or implied) of potential study subjects from which the study group is obtained. The study group is that set of subjects who agree to take part in the study.

Assessing Exposure

Exposure and other covariate status, such as demographic data, are obtained at the time of study subject selection or first contact/examination. Because the outcome measure is prevalence, it is sometimes difficult to know the appropriate time frame in which the exposure, if time-varying, might cause the outcome. Studying currently (prevalent) exposed subjects can also lead to bias when interpreting the impact of these exposures.

Assessing the Outcome of Interest

It is important to clearly define the outcome/disease of interest. In general, great care should be used if the outcome is a surrogate for a clinically important event. It is also important that widely accepted diagnostic criteria be used to identify the disease or outcome of interest.

Ensuring Comparability

The two main approaches used to prevent bias from factors associated with the outcome and whose distribution differs between exposure groups (confounders) are exclusion (restricted sampling) and analytic (statistical) control. Matching to prevent confounding cannot be applied in cross-sectional studies. Analytic control requires the use of a multivariable model.

The scenario below is a real Canadian example that uses every step of the accordion above. As you read it, ask yourself what the researchers did at each step — how the source population was defined, how exposure was measured, how the outcome was ascertained — and where you can already see the design being constrained by the cross-sectional choice. The reflection that follows is built around exactly that question.

Scenario: Postpartum Depression in Canadian Women

Lanes et al (2011) conducted a cross-sectional study of postpartum depression (PPDS) among Canadian women. The survey used the Edinburgh Postnatal Depression Scale (EPDS) as the outcome measure. Potential risk factors included socioeconomic status, demographic factors, and maternal characteristics. Of 8,542 selected women, 6,421 responded. The national prevalence of minor/major and major PPDS was found to be 8.46% and 8.69% respectively. The mother’s stress level during pregnancy and prior depression had the strongest associations.

Reflection

In the postpartum depression study described above, the exposure ‘stress during pregnancy’ was measured retrospectively at the same time as the outcome. What challenges does this create for causal inference? How might you address these challenges?

Model answerRetrospective measurement of stress at the same time as the depression outcome creates two intertwined threats. (a) Temporality is unverifiable: standard causal inference requires the exposure to precede the outcome, but here both are reported simultaneously, so reverse causation (depression colouring how stress is remembered) cannot be ruled out. (b) Recall bias: depressed mothers may systematically over-report stressors, inflating the apparent association — a form of differential misclassification. Addressing them: (i) use prospectively measured exposures (clinical records of pregnancy stress, biomarkers, contemporaneous diaries) rather than retrospective interview; (ii) anchor stress to objective events (job loss, bereavement) verifiable from records; (iii) use a nested design with baseline measurement during pregnancy; (iv) include a comparison cohort without the outcome to assess differential recall; (v) where prospective design is impossible, run sensitivity analyses for differential misclassification.
Reflection saved!
Knowledge Check — Section 3

1. The defining feature of a cross-sectional study is that its outcome frequency measure is:

The defining feature of a cross-sectional study is that its outcome frequency measure is prevalence, based on the number of existing cases at the time of the study.

2. Cross-sectional studies are inherently:

In cross-sectional studies, both the exposure and the outcome have already occurred when the study begins. The exposure and outcome are assessed at the same point in time, making them inherently retrospective.

3. Which approach to controlling confounding CANNOT be applied in cross-sectional studies?

Matching to prevent confounding cannot be applied in cross-sectional studies because subjects are sampled from the population without regard to their exposure or outcome status, unlike case-control studies where matching is feasible.
Complete the quiz and reflection to continue.
Section 4

Limitations, Incidence Estimation & Reporting

Sections 7.6–7.9 of Dohoo, Martin & Stryhn

Introduction and Overview

Section 3 introduced the cross-sectional design and walked through how to build one. This section is its honest counterweight. Once a study is in the field, three practical questions tend to arrive in sequence: What can this design not tell us? Can we squeeze incidence information out of it? And how should we report the whole thing so the next reader can judge it? Those three questions are the three subsections that follow.

Learning Objectives

  • Explain why cross-sectional studies confound prevalence with disease duration and how that produces the reverse-causation problem.
  • Identify the conditions under which a cross-sectional design can support a defensible causal inference (time-invariant exposures, plausible temporality).
  • Describe the conditions under which incidence can be estimated from cross-sectional data, and the assumptions required.
  • Apply the STROBE checklist as a reporting standard for observational research and explain why each section matters to the next reader.

Inferential Limitations of Cross-Sectional Studies

By its nature, a cross-sectional study design measures prevalence, which is a function of both incidence and duration of the disease. Consequently, it is often difficult to disentangle factors associated with persistence of the outcome from factors associated with developing the outcome in the first instance (i.e., becoming a new incident case).

The Reverse Causation Problem

When the exposure factors are time-varying, it is often very difficult to differentiate cause and effect. For example, if one is studying the relationship between dog ownership and blood pressure, and the association is negative, one cannot differentiate between people that obtained a dog because they had low blood pressure from those whose lifestyle changed, consequently lowering their blood pressure after obtaining a dog. The more changeable the exposure, the worse this issue becomes.

Cross-sectional studies are best suited for time-invariant exposures such as race or sex. In these instances, the investigator can be certain that the exposure preceded, or at least was not caused by, the outcome.

Reverse causation and the prevalence/duration confound are the conceptual limits of the design. The next thing to learn is what the design does let you do, computationally — namely, the cross-tabulation that almost every analytic move in this course is built on.

R Build a 2×2 contingency table from raw cross-sectional data

What you'll do: simulate 200 people with a known exposure-disease relationship, build the 2×2 contingency table, and read prevalence off each row. What to take away: the 2×2 table is the unit of analysis the rest of the course is built on; every measure of association you will meet in Lessons 5 and 6 starts from a structure that looks exactly like this.

Most observational studies start the same way: cross-tabulate exposure against outcome. Below we simulate a small cross-sectional dataset, build the 2×2 table, and read off the prevalence of disease in exposed vs unexposed.

# Simulate 200 people: half exposed, with higher disease prevalence among exposed.
set.seed(230)
n <- 200
exposure <- rep(c("exposed", "unexposed"), each = n/2)
disease  <- c(rbinom(100, 1, 0.30), rbinom(100, 1, 0.10))

dat <- data.frame(exposure = exposure, disease = disease)

# 2x2 cross-tabulation -- the workhorse of descriptive epi.
tab <- table(dat$exposure, dat$disease,
             dnn = c("Exposure", "Disease"))
tab

# Prevalence (proportion with disease=1) within each exposure group.
prop.table(tab, margin = 1)   # margin=1 -> divide each row by its row total
Console output
Disease Exposure 0 1 exposed 71 29 unexposed 92 8 Disease Exposure 0 1 exposed 0.71 0.29 unexposed 0.92 0.08

Reading the table. 29% of the exposed group have disease vs 8% of the unexposed. That contrast — and the rows that produced it — is the launchpad for every measure of association you will learn in this course.

R Reflect on what you just ran

Use the questions below to interpret the output you produced. Look at your console table before answering.

1. Look at the raw tab output. How many exposed individuals had disease? How many unexposed had disease? Write the four cell counts (a, b, c, d) in the standard 2×2 layout (exposed-diseased, exposed-non-diseased, unexposed-diseased, unexposed-non-diseased).

Model answerReading the printed 2×2 from tab: exposed with disease ≈ 29, exposed without disease ≈ 71; unexposed with disease ≈ 8, unexposed without disease ≈ 92. In standard layout: a = 29 (exposed/diseased), b = 71 (exposed/non-diseased), c = 8 (unexposed/diseased), d = 92 (unexposed/non-diseased). Totals: 100 exposed and 100 unexposed, 37 diseased and 163 not, n = 200. From these you can read off prevalence (a/(a+b)) for the exposed group, the prevalence ratio (PR), and the odds ratio (OR) — the three numbers that drive interpretation of any cross-sectional dataset.

2. Using prop.table(tab, margin = 1), the prevalence of disease was 0.29 in the exposed group and 0.08 in the unexposed group. Why is this a measure of prevalence and not incidence, given the design that generated the data?

Model answerBoth numerator ("how many have the disease right now") and denominator ("how many people are in the group right now") are evaluated at the same snapshot — the simulation generates a single binary outcome per person with no follow-up time. That is the defining mark of prevalence: existing cases divided by population at a point in time. Incidence requires a denominator of person-time (or at minimum a baseline disease-free cohort followed forward), so that ‘new’ cases can be distinguished from ‘old’ ones. Without a temporal element, no rate can be computed; what you have is a snapshot.

3. The simulation used rbinom(100, 1, 0.30) for the exposed and 0.10 for the unexposed. What would happen to the observed exposed-group prevalence if you changed the seed or doubled n to 400 per group? Would the contrast between groups become more or less reliable?

Model answerChanging the seed would shuffle which 100 random draws fall above the 0.30 threshold, so the observed exposed-group prevalence would jitter around 0.30 — sometimes 0.24, sometimes 0.34 — while preserving the long-run expectation. Doubling n to 400 per group would cut the standard error of the observed proportion roughly in half (SE = √(p(1-p)/n)), so the prevalence estimates would settle much closer to their generating values (0.30 and 0.10), the CI on each would narrow, and the test of the exposed-vs-unexposed contrast would become more powerful. Larger n doesn't fix bias — only sampling variability.
Saved.

Estimating Incidence from Cross-Sectional Studies

Although cross-sectional studies directly measure prevalence, there are approaches for estimating incidence from prevalence data. This is often desirable because incidence data are more useful for causal inference. The three tabs below introduce the most common approaches in increasing order of analytical sophistication. Each one trades a different practical cost (run two surveys, develop two assays, build a mathematical model) for a different epistemic gain.

A simple way to obtain population-level incidence data is to perform two cross-sectional studies, one before and one after an event of interest. For example, Miller et al (2010) performed two cross-sectional studies before and after the 2009 H1N1 epidemic in England, giving a population-based estimate of incidence.

Other approaches include using two different tests—one that detects early immune response and one that detects long-lasting immunity. People who test negatively to the less sensitive test are followed forward for a defined time period to ascertain how many become positive. This approach has been refined for HIV studies.

Rajan and Sokal (2011) describe how to estimate age-specific incidence from prevalence data. Their general approach uses two prevalence estimates at different time points. The incidence rate at year ‘a’ is:

Incidence from Prevalence (Eq 7.1)
Ia = 1 − [1 − (Pa+n − Pa) / (1 − Pa)]1/n

where ‘n’ is the time between the two prevalence estimates (Pa and Pa+n) in the cross-sectional survey.

The first incidence-estimation approach — running two cross-sections at different time points — raises a question of its own. If we are willing to track the same population twice, why not just track the same individuals through time? That is the difference between a repeated cross-section and a cohort.

Repeated Cross-Sectional vs. Cohort Studies

Sometimes it is desirable to follow a population over time. Two options exist: repeated cross-sectional samplings of the population, or a longitudinal study of the initial study subjects (a cohort approach). Each has distinct advantages, and the choice between them is a real one. The two cards below put their strengths and limitations side by side — cohorts return in detail in Lesson 5.

🔄
Repeated Cross-Sectional
Click to compare
Cohort Studies
Click to compare

The previous subsections covered what cross-sectional studies cannot do, what they can do, and how they relate to longitudinal designs. The last piece of the puzzle is reporting — how to write up an observational study so the next reader can judge it without having to reverse-engineer the methods.

Reporting Observational Studies: The STROBE Statement

In 2004, a network of methodologists, researchers, and journal editors established what we now know as the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement (von Elm et al., 2007; explained in detail by Vandenbroucke et al., 2007) — one of the EQUATOR-network reporting guidelines we previewed in Lesson 1. It provides a checklist of 22 items considered essential for good reporting of observational studies.

STROBE Checklist Key Sections

The STROBE checklist covers: Title & Abstract (indicate study design), Introduction (background, objectives, hypotheses), Methods (study design, setting, participants, variables, data sources, bias, sample size, statistical methods), Results (participants, descriptive data, outcome data, main results), and Discussion (key results, limitations, interpretation, generalisability).

The reflection below is the practical exit ticket for this section — a textbook reverse-causation case dressed up in plausible language. After working through it and the knowledge check, the lesson moves to its comprehensive final assessment, which integrates the classification map (Section 1), the descriptive designs (Section 2), the cross-sectional design itself (Section 3), and the limitations and reporting standards covered here.

Reflection

Consider a cross-sectional study that finds an association between pet ownership and lower blood pressure. Explain why this finding cannot be interpreted as causal evidence that pet ownership lowers blood pressure. What study design would be more appropriate?

Model answerCross-sectional findings cannot support causal claims because the snapshot makes the direction of effect ambiguous (does owning a pet lower BP, or do people with lower BP have the energy and capacity to keep a pet?) and provides no exposure-to-outcome temporal ordering. Confounding is also likely: pet owners differ from non-owners in income, housing, marital status, age, physical activity, and social network — all of which independently affect BP. Selection on health may also be at work (sicker, frailer people give up pets). Better designs: a prospective cohort that enrolls non-owners and follows them through pet acquisition, measuring BP before and after; a quasi-experimental study exploiting a natural shock (e.g., a pet-adoption programme rollout) with difference-in-differences; or, where feasible, an RCT of dog-walking interventions in matched non-owners.
Reflection saved!
Knowledge Check — Section 4

1. The primary reason cross-sectional studies have limited ability to support causal inference is:

Cross-sectional studies measure prevalence, which is a function of both incidence and duration. This makes it difficult to distinguish factors that cause disease from factors that affect disease duration or survival.

2. Cross-sectional studies are best suited for studying exposures that are:

Cross-sectional studies are best suited for time-invariant exposures such as race or sex, where the investigator can be certain that the exposure preceded the outcome and was not affected by it.

3. The STROBE statement provides:

The STROBE (Strengthening the Reporting of Observational Studies in Epidemiology) statement provides a checklist of 22 items considered essential for good reporting of observational studies.
R Activity — Building a 2×2 table and computing prevalence

The companion R script r-activities/HSCI_230_Lesson_3_Introduction_to_Observational_Studies.R simulates a cross-sectional dataset of 200 people (half exposed, half unexposed) with a true disease prevalence of 30% in the exposed and 10% in the unexposed, then constructs the 2×2 cross-tabulation that anchors nearly every observational analysis. You will compute row-wise prevalence with prop.table(..., margin = 1) and, as a stretch, derive a prevalence ratio — a first taste of the measures-of-association we revisit in later lessons.

# Simulate 200 people: half exposed, with higher disease prevalence among exposed
set.seed(230)
n        <- 200
exposure <- rep(c("exposed", "unexposed"), each = n/2)
disease  <- c(rbinom(100, 1, 0.30), rbinom(100, 1, 0.10))

dat <- data.frame(exposure = exposure, disease = disease)

# 2x2 cross-tabulation -- the workhorse of descriptive epi
tab <- table(dat$exposure, dat$disease,
             dnn = c("Exposure", "Disease"))
tab

# Prevalence (proportion with disease=1) within each exposure group
prop.table(tab, margin = 1)   # margin=1 -> divide each row by its row total

## -----------------------------------------------------------------------------
## Stretch: prevalence ratio (a quick preview of measures-of-association)
## -----------------------------------------------------------------------------
prev_exposed   <- prop.table(tab, margin = 1)["exposed",   "1"]
prev_unexposed <- prop.table(tab, margin = 1)["unexposed", "1"]
prev_ratio     <- prev_exposed / prev_unexposed
cat("Prevalence ratio:", round(prev_ratio, 2), "\n")
Complete the quiz and reflection to continue.
Final Assessment

Lesson 3 — Final Review & Assessment

Bringing It All Together

This lesson started with a classification map (Section 1), walked down it to the descriptive designs (Section 2), then up into the first true analytic observational design (Section 3), and finished by interrogating that design’s honest limits and reporting standards (Section 4). The arc was deliberate. The vocabulary you now have — descriptive vs. explanatory, experimental vs. observational, prospective vs. retrospective, the three sampling approaches, prevalence vs. incidence, the unified approach to design, the STROBE reporting framework — is the working vocabulary of every observational study you will read for the rest of HSCI 230.

The deeper move was the one Hernan and Rubin pushed: design trumps analysis. An observational study cannot randomise its way out of confounding, but it can be built so that the analytic decisions are committed in advance, the source population is clearly defined, and the confounding strategy is not an afterthought. That is what the unified approach (the thought experiment, design before seeing data, forward projection) makes operational. Applied seriously, it converts an observational study from something the manufactured-doubt strategy can attack into something a critical reader can actually evaluate.

The two designs that come next — the case-control study (Lesson 4) and the cohort study (Lesson 5) — do not introduce a new framework. They are two more sampling approaches inside the one you already have, each making different trade-offs around how subjects are selected, how outcomes and exposures are ascertained, and what measure of association the design naturally produces. Read those lessons through the lens of this one and the through-line will be clear.

Key Takeaways from Lesson 3

  • Descriptive vs. explanatory is a distinction of purpose: descriptive studies characterise disease occurrence and generate hypotheses; explanatory studies compare groups to test them.
  • Experimental vs. observational is a distinction of investigator control: experiments randomise to manage confounding; observational studies must build in confounding control through design.
  • The unified approach — thought experiment, design before data, forward projection — is how an observational study borrows the discipline of an experiment without being one.
  • Cross-sectional studies measure prevalence, not incidence, which makes them useful for time-invariant exposures and unreliable for time-varying ones because of the reverse-causation problem.
  • Matching is not available in cross-sectional studies; restriction and statistical adjustment are.
  • STROBE-compliant reporting is part of an observational study’s integrity, not an administrative add-on — it is what allows the next reader to judge whether the design carries the inference the authors claim.

Final Reflection

Reflect on the full range of study designs discussed in this lesson. If you were asked to investigate the relationship between a novel environmental exposure and a chronic health outcome, what type of study would you begin with and why? How might your study design evolve as evidence accumulates?

Model answerFor a novel environmental exposure and a chronic outcome you rarely have the population, registry, or budget for a large prospective cohort on day one. The right opening move is descriptive: identify the affected community via clinical surveillance or community-reported clusters, then publish a case series establishing that the cluster exists and is unlikely to be background. As suspicion grows, run a case-control study — cheaper, faster, and well-suited to rare chronic outcomes — comparing exposure histories of cases and matched controls; this gives an OR and rough effect size. If signals are strong, transition to a cohort study (ideally prospective, with biomarker measurement at baseline) for incidence, dose-response, and latency. Where the exposure is policy-determined or geographically clustered, complement with ecological / quasi-experimental designs (interrupted time-series, regression discontinuity) that exploit natural variation. The design evolution mirrors the strength of the evidence: hypothesis-generating → hypothesis-testing → magnitude-estimating → causal-mechanistic.
Reflection saved!

Lesson 3 Comprehensive Assessment

This assessment covers all sections of Lesson 3. You must answer all 15 questions correctly to complete the lesson. Review the feedback after each attempt.

Final Assessment — Lesson 3 (15 Questions)
Final Assessment — Lesson 3: Introduction to Observational Studies (15 Questions)

1. Which study type is designed to make comparisons between subgroups based on exposure or outcome status?

Explanatory (analytic) studies are specifically designed to make comparisons and contrasts between subgroups of study subjects based on exposure or outcome status.

2. The key difference between experimental and observational studies is:

In experimental studies, the investigator controls (usually through randomisation) the allocation of study subjects to study groups. In observational studies, the investigator does not control the allocation.

3. The ‘thought experiment’ in the unified approach to study design involves:

The thought experiment involves imagining a field experiment that would accomplish the same objective as the proposed observational study, specifying study group selection, assignment to exposure, follow-up, and outcome detection.

4. Case reports are considered useful primarily because they:

Case reports describe unusual observations that can help researchers generate useful hypotheses to be investigated in future studies. They do not include comparison groups and cannot support causal claims.

5. A case-series report should document which of the following?

A case-series report documents the who (affected subjects), when (temporal aspects), and where (geographic aspects) of the condition in a group of subjects.

6. In the study classification hierarchy, which observational study design provides the strongest evidence for causal inference?

Among observational study designs, cohort studies are generally considered to provide the strongest evidence for causal inference because they can establish temporal sequence and measure incidence directly.

7. In a cross-sectional study, subjects are selected from the source population based on:

In a cross-sectional study, a sample or census of subjects is obtained from the source population, and the presence or absence of both exposure and outcome are determined at the same time.

8. The natural measure of association in a cross-sectional study of a binary outcome is:

The main comparison in a cross-sectional study is between the prevalence in the exposed and non-exposed subjects, and the natural measure of association is the prevalence risk ratio.

9. A major limitation of cross-sectional studies when studying time-varying exposures is:

When exposure factors are time-varying, it is very difficult to differentiate cause and effect because exposure and outcome are measured simultaneously. This reverse causation problem is a major limitation.

10. Prevalence is a function of:

Prevalence is a function of both the incidence (rate of new cases) and the duration of disease, which is why cross-sectional studies have limited ability to support causal inference about disease occurrence.

11. Repeated cross-sectional studies are preferred over cohort studies when:

Repeated cross-sectional studies are preferred when the research objective relates to events and associations within the full population at different periods of time, rather than tracking specific individuals.

12. Which of the following is NOT a component of the STROBE checklist?

STROBE is specifically for reporting observational studies, which do not involve randomisation. Specifying randomisation procedures would be part of the CONSORT statement for reporting randomised trials.

13. Propensity scores in observational study design are used to:

Rubin formalises the assessment of comparability through propensity scores, which estimate the probability of exposure given the covariates. If propensity scores differ substantially between groups, confounding is likely present.

14. A purposive non-random sample in a cross-sectional study primarily threatens:

Using a purposive non-random sample limits the external validity (ability to extrapolate results beyond the source population). While internal validity may be maintained if the study group is representative of the source population, generalisability is compromised.

15. A study that samples subjects from the general population, measures their current blood pressure and dietary habits simultaneously, and compares blood pressure between high-salt and low-salt diet groups is:

This study measures both exposure (dietary habits) and outcome (blood pressure) simultaneously in a sample from the population, making it a cross-sectional analytic study. It measures prevalence and makes comparisons between exposure groups.

Congratulations!

You have completed Lesson 3: Introduction to Observational Studies. You now understand the classification of epidemiological study designs, the unified approach to observational study design, the characteristics of descriptive studies, the design and implementation of cross-sectional studies, their inferential limitations, and the STROBE reporting guidelines.

Lesson 4 picks up the second of the three sampling approaches you met in Section 3: the case-control study. Where cross-sectional studies sample without regard to disease, case-control studies start from the disease and look backward at exposure — an inversion that solves some of the limits we surfaced in Section 4 and introduces new ones (notably, selection bias and the choice of controls) that the next lesson is largely about.