Introduction to Observational Studies

Evaluating Epidemiological Research

Learning objectives for this lesson:

Differentiate between descriptive and explanatory studies
Differentiate between experimental and observational studies
Describe the three main elements of the unified approach to observational study design
Describe the advantages and limitations of case reports, case-series reports, and surveys
Design a cross-sectional study accounting for its strengths and weaknesses
Identify circumstances where a cross-sectional study is appropriate
List three approaches for obtaining incidence estimates from cross-sectional prevalence data
Differentiate between repeated cross-sectional studies and following a cohort in a longitudinal study
Apply the STROBE checklist to reporting a cross-sectional study

This course was developed by Dr. Kiffer G. Card, Faculty of Health Sciences, Simon Fraser University.

Reference

Glossary: Key Terms, People & Concepts

This glossary collects the key concepts, people, and ideas you will meet in this lesson. Use it as a reference while you work through the material, or as a review before assessments.

Key Concepts & Ideas

Observational Study A study in which the researcher does not assign exposures; participants “assign themselves” through their behaviors, environments, or characteristics. The dominant design family in epidemiology.

Experimental Study A study in which the researcher assigns exposure (ideally at random). Includes randomized controlled trials. Generally provides stronger causal inference but is often infeasible or unethical for many public-health questions.

Descriptive Study A study aimed at describing the distribution of disease or other health outcomes by person, place, and time, without testing a specific causal hypothesis.

Explanatory (Analytic) Study A study designed to test associations between specific exposures and outcomes, typically with a hypothesis about causation in mind.

Exposure A factor whose effect on a health outcome is being investigated, e.g., a behavior, environmental contaminant, drug, or social condition.

Outcome The health state or event whose occurrence the study is trying to explain: disease, death, recovery, lab measurement, or behavior change.

Prevalence The proportion of a population that has a condition at a particular point (point prevalence) or over a window (period prevalence). The natural measure delivered by cross-sectional studies.

Incidence The rate at which new cases of a condition arise over time in a population. Captured cleanly by cohort designs; only indirectly by cross-sectional surveys.

Cumulative Incidence (Risk) The proportion of an initially disease-free population that develops the outcome over a defined follow-up period.

Incidence Rate New cases divided by total person-time at risk. Has units of 1/time and accommodates loss to follow-up and varying observation windows.

Study Base The population and time period from which cases (and, in case-control studies, controls) arise. Defining the study base clearly is essential to all observational designs.

Sampling Frame The list or mechanism from which study participants are actually drawn. The closer the sampling frame matches the target population, the less selection bias you should expect.

Selection Bias Systematic error introduced when the people who end up in the study differ in relevant ways from the population the study claims to represent. A primary threat to all observational designs.

Information (Measurement) Bias Error introduced when exposure or outcome is measured inaccurately, e.g., faulty instruments, recall errors, misclassification.

Confounding A distortion of the exposure–outcome association due to a third variable that is associated with both. The central methodological problem of observational epidemiology; for a foundational catalogue of biases see Sackett (1979).

STROBE Checklist Strengthening the Reporting of Observational Studies in Epidemiology, a 22-item checklist that observational studies should follow when reporting cohort, case-control, and cross-sectional designs (von Elm et al., 2007).

Three Elements of Observational Design Dohoo, Martin & Stryhn's unifying framework: (1) selection of the study sample, (2) measurement of exposure and outcome, and (3) the temporal relationship between them. Every observational design is a different recipe combining these three elements.

Methods & Study Designs

Case Report A detailed clinical or epidemiological description of a single patient or event. Useful for hypothesis generation and signaling new conditions; cannot estimate frequency or test causal hypotheses.

Case Series A description of a small group of cases sharing a feature. Like case reports but offering modest pattern-recognition power; still cannot estimate population frequencies without a denominator.

Survey A study that collects information from a sample of a population through standardized questions or measurements. The data-gathering engine behind cross-sectional and many descriptive studies.

Cross-Sectional Study A study that measures exposure and outcome at the same point in time in a defined population. Estimates prevalence; useful for hypothesis generation but generally weak for cause–effect inference because exposure and outcome are simultaneous.

Repeated Cross-Sectional Studies A series of separate cross-sectional samples taken from the same target population at different times. Tracks population-level change but cannot follow individuals.

Longitudinal Cohort Follows the same individuals over time, allowing within-person change and incidence to be estimated. Contrast with repeated cross-sectional designs.

Ecological Study An observational study in which the unit of analysis is a group (region, country, school) rather than an individual. Treated more fully in a later lesson.

Section 1

Study Classification & Design Framework

Sections 7.1–7.2 of Dohoo, Martin & Stryhn

Two cuts through study design, and a hierarchy of causal evidence.

Why start here

From framing to design

Earlier lessons set up the intellectual scaffolding. This section is where study design becomes concrete.

Every bias, every design trade-off, and every inferential claim you will evaluate later plays out in the structure of a specific study. The classification map built here is the shared vocabulary for everything that follows.

First cut

Descriptive vs. explanatory

Descriptive

Characterises who, what, when, and where of disease occurrence. No comparison group. Cannot test causal hypotheses.

Explanatory (analytic)

Designed around a formal comparison between subgroups based on exposure or outcome status. Can test hypotheses.

The distinction is one of purpose, not technique or sample size.

Second cut

Experimental vs. observational

Experimental

Investigator controls exposure allocation, usually by randomisation. Randomisation makes the groups exchangeable in expectation, so measured and unmeasured confounders are balanced on average.

Observational

Investigator observes natural variation. No randomisation. Confounding must be managed through design and statistical adjustment.

Unified approach

Three steps that borrow from the experiment

1 · Thought experiment

Specify the study as if designing a randomised trial: study group, exposure, follow-up, outcome.

2 · Design before data

Lock in all design elements before seeing outcome data: exclusion criteria, confounding strategy, propensity scores.

3 · Forward projection

Project forward under three result scenarios. If you cannot defend the design under each, revise it.

Hierarchy of evidence

Causal weight, not absolute quality

Carry forward

What to take into the next section

Descriptive vs. explanatory is a distinction of purpose, not method.
Experimental vs. observational turns on who controls exposure assignment.
The unified approach lets observational studies borrow experimental discipline.
The hierarchy ranks causal weight, not overall quality or usefulness.

Introduction and Overview

Earlier lessons worked at the level of the discipline: where epidemiology came from, what counts as knowledge, and how the published record can fail and be appraised. This lesson takes the first step from those framing concerns into the working tools of the field. Almost every bias the manufactured-doubt strategy weaponizes, every analytic error catalogued in the earlier integrity-and-reform material, and every pre-analytic discipline EGAP demands plays out concretely once you have a study design in front of you. So this lesson is about design.

The four content sections proceed from general to specific. Section 1 (this section) sets up a classification scheme for every study you will encounter for the rest of this course. Section 2 covers the simplest of those study types, descriptive studies, where a comparison group is absent. Section 3 introduces the first true analytic observational design: the cross-sectional study. Section 4 pushes on its limitations and shows what reporting it well requires. By the end of the lesson, the language you'll need for case-control studies (a later lesson) and cohort studies (a later lesson) will already be in place.

Learning Objectives

Distinguish descriptive from explanatory (analytic) studies by their purpose and inferential claims.
Distinguish experimental from observational studies in terms of investigator control and confounding.
Apply Hernan’s and Rubin’s “unified approach” (thought experiment, design before data, forward projection) to a research question.
Locate any study type within the hierarchy of evidence and explain what causal weight that placement does and does not imply.

Descriptive vs. Explanatory Studies

Epidemiologic studies can be classified into two major categories: descriptive and explanatory (analytic). This classification reflects both the study’s objectives and its ability to support causal inference. The difference is not about technique but about purpose: whether the study is built to characterize disease occurrence or to compare groups.

Descriptive studies include case reports, case-series reports, and surveys. They are designed solely to describe the nature and distribution of outcome events such as health-related phenomena. They describe the who, what, when, and where of disease occurrence.

Although a descriptive survey is not designed to assess hypotheses about manipulatable causes of the outcome event, the frequency of the outcome is usually described in categories of age, race, sex, season, and space.

Explanatory studies (also called analytic studies) are designed to make comparisons and contrasts between subgroups of study subjects based on exposure or outcome status. They allow the investigator to identify statistical associations between exposures and outcomes.

Explanatory studies can be further subdivided into experimental and observational studies, depending on whether the investigator controls the allocation of study subjects to exposure groups.

The descriptive–explanatory split asks what a study is trying to do. The next split (experimental versus observational) asks how the explanatory studies do it, and is the distinction this entire lesson hinges on.

Experimental vs. Observational Studies

In experimental studies, the investigator controls (usually through randomisation) the allocation of study subjects to exposure groups. In contrast, in observational studies, the investigators try not to influence the natural course of events for the study subjects.

Key Distinction

In experimental studies, we try to reduce variation from all sources through selection and control of the experimental setting. In observational studies, we embrace the presence of natural variation in order to identify important interactions among key variables and the exposure–disease association.

The price paid through the use of observational studies is that considerable efforts are required to prevent confounding (bias) of the exposure–disease association. Experiments are the preferred choice when the treatment is straightforward and easily manipulated, such as a vaccine or a specific therapeutic agent. The major advantage of the experimental approach is the ability to control potential confounders through the process of randomisation.
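The balancing effect of randomisation can be illustrated with a small simulation. The sketch below is not drawn from the course text: the confounder (age), the exposure probabilities, and the function names are all hypothetical. It compares the mean age gap between exposed and unexposed groups when exposure is assigned by a coin flip versus when older people are more likely to end up exposed.

```python
import random


def mean(values):
    return sum(values) / len(values)


def simulate(n=20000, seed=42):
    """Return the exposed-minus-unexposed mean age gap under two allocation schemes."""
    rng = random.Random(seed)
    # Hypothetical confounder: age in years, uniform between 20 and 80.
    ages = [rng.uniform(20, 80) for _ in range(n)]

    # Experimental allocation: a fair coin decides exposure, ignoring age.
    randomised = [rng.random() < 0.5 for _ in ages]

    # Observational "allocation": the older the person, the more likely exposed.
    self_selected = [rng.random() < (age - 20) / 60 for age in ages]

    def age_gap(exposed_flags):
        exposed = [a for a, e in zip(ages, exposed_flags) if e]
        unexposed = [a for a, e in zip(ages, exposed_flags) if not e]
        return mean(exposed) - mean(unexposed)

    return age_gap(randomised), age_gap(self_selected)


gap_randomised, gap_self_selected = simulate()
print(f"Mean age gap under randomisation:  {gap_randomised:.2f} years")
print(f"Mean age gap under self-selection: {gap_self_selected:.2f} years")
```

Under these assumptions the randomised groups differ in mean age only by chance, while the self-selected groups differ by roughly 20 years. If age also affects the outcome, that gap is the confounding an observational design has to manage.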

If observational studies have to fight the confounding that experiments avoid, the natural question is how to design them so that the fight is winnable. Two well-known methodologists, Miguel Hernan and Donald Rubin, offer the same answer in slightly different language: borrow the discipline of the experiment, even when you cannot run one. That answer has three parts, summarized below.

A Unified Approach to Study Design

Hernan (2005) stressed that when considering an observational study design, we should think about how a field experiment would be designed to accomplish the same objective. This approach, reinforced by Rubin (2007), emphasises that ‘design trumps analysis’ and that all elements of the study design should be completed before seeing any outcome data. Work through the three steps below in order; the order matters because each step constrains the next.

1. The ‘Thought Experiment’

As a first step in designing an epidemiological study, carry out a ‘thought experiment’ that specifies the key elements: the study group and its selection, assignment to exposure, procedures for follow-up, and detection of the outcome. The important point is that formal randomisation would ensure ‘exchangeability’: the groups being compared are so similar that it does not matter which group was assigned to exposure.

2. Design Features Before Seeing Data

All design features are completed before anyone has seen the outcome data. This includes subject exclusion, selection criteria, and control of confounding. Rubin formalises the process through propensity scores (the probability of exposure given the covariates) in the exposed and non-exposed groups. In plain terms, a propensity score rolls up everything you measured about a person into one number: how likely someone with those characteristics was to end up exposed. Unless the exposed and unexposed groups have virtually the same spread of propensity scores, some degree of confounding remains possible.
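To make the propensity-score idea concrete, here is a minimal sketch under simplifying assumptions. With only a few categorical covariates, the proportion exposed within each covariate pattern can stand in for the propensity score; in practice the score is usually estimated with a regression model, which this sketch does not attempt. The records, covariate names, and function names are hypothetical and invented for illustration.

```python
from collections import defaultdict

# Hypothetical records, invented for illustration: (age_group, smoker, exposed).
records = [
    ("young", False, False), ("young", False, False), ("young", False, True),
    ("young", True, False), ("young", True, True),
    ("old", False, False), ("old", False, True), ("old", False, True),
    ("old", True, True), ("old", True, True), ("old", True, True),
]


def propensity_by_stratum(rows):
    """Return {covariate pattern: proportion exposed} for each observed pattern."""
    counts = defaultdict(lambda: [0, 0])  # pattern -> [exposed count, total count]
    for *covariates, exposed in rows:
        key = tuple(covariates)
        counts[key][0] += int(exposed)
        counts[key][1] += 1
    return {key: n_exposed / total for key, (n_exposed, total) in counts.items()}


def score_range(rows, scores, exposed_value):
    """Smallest and largest propensity score within one exposure group."""
    group = [scores[tuple(cov)] for *cov, exposed in rows if exposed == exposed_value]
    return min(group), max(group)


scores = propensity_by_stratum(records)
exposed_range = score_range(records, scores, True)
unexposed_range = score_range(records, scores, False)

# Patterns where everyone (or no one) is exposed have no comparison subjects.
no_overlap = sorted(key for key, s in scores.items() if s in (0.0, 1.0))

print("Propensity scores by pattern:", scores)
print("Score range, exposed:  ", exposed_range)
print("Score range, unexposed:", unexposed_range)
print("Patterns without overlap:", no_overlap)
```

In this invented data set, every older smoker is exposed, so the exposed group's scores reach 1.0 while the unexposed group's scores stop at about 0.67. That mismatch in spread is the kind of warning sign the paragraph above describes: for older smokers there is nobody unexposed to compare against.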

3. Forward Projection (Critical Appraisal)

After completing the initial design, we project forward to the presentation of study results under 3 different scenarios: (1) the exposure appears to increase risk; (2) the exposure appears to decrease risk; or (3) the exposure does not appear to be associated. For each scenario, we must defend the proposed design. This process helps identify potential weaknesses.

The unified approach gives you the discipline that any single study should have. Stepping back further, we can also rank study types by how much causal weight any one well-designed instance of them can carry. That ranking (the so-called hierarchy of evidence) is the last piece of the classification map for this section.

Hierarchy of Evidence for Causal Inference

From the perspective of drawing causal inferences, experimental studies are generally regarded as the gold standard. The hierarchy of causal evidence, from strongest to weakest, is typically as shown in the table below. A caveat from an earlier lesson is worth holding onto here: the hierarchy describes causal weight, not absolute quality. A well-conducted cross-sectional study can be far more useful than a poorly conducted RCT for many real-world questions.

Study Type	Difficulty	Investigator Control	Causal Evidence	Relevance
Laboratory trial	Moderate	Very high	Very high	Low
Controlled field trial (RCT)	Moderate	High	Very high	High
Cohort study	Difficult	High	High	High
Case-control study	Moderate	Moderate	Moderate	High
Cross-sectional study	Moderate	Low	Low	Moderate
Survey	Moderate	Moderate	Not applicable	High
Case series	Easy	Very low	Not applicable	Low to high
Case report	Very easy	Very low	Not applicable	Low to high

The reflection below asks you to use the classification scheme we just built to evaluate a research question of your own. The knowledge check that follows tests the conceptual content directly. Once you have worked through both, a later section takes the bottom of the hierarchy (the descriptive designs) and asks what useful work they can do despite their lack of a comparison group.

Reflection

Think about a health outcome you are interested in studying. Would an experimental or observational approach be more appropriate, and why? Consider ethical, practical, and scientific factors in your answer.

Model answer: A strong answer names a concrete outcome (e.g., second-hand-smoke exposure and asthma in children) and walks through trade-offs. Experimental approaches (RCTs) yield the cleanest causal inference but are ethically and practically constrained: you cannot randomise people to harmful exposures, often cannot blind, may face refusal, and the artificial setting can hurt external validity. Observational designs are feasible for most exposures of public-health interest (diet, occupation, neighbourhood, structural racism), preserve real-world variation, and allow long follow-up; but they trade away guaranteed exchangeability between groups and require explicit confounder control. A useful framing: if randomisation is feasible and ethical and trial conditions resemble real practice, randomise; otherwise design the strongest possible observational study (large prospective cohort with rich covariates, ideally complemented by quasi-experimental analyses such as instrumental variables, regression discontinuity, or difference-in-differences).

Knowledge Check: this section

1. Which of the following BEST distinguishes explanatory from descriptive studies?

Explanatory studies use larger sample sizes
Explanatory studies make comparisons between subgroups based on exposure or outcome status
Explanatory studies are always experimental

Explanatory (analytic) studies are specifically designed to compare subgroups of subjects based on exposure or outcome status, whereas descriptive studies only characterise the distribution of disease.

2. What is the major advantage of experimental studies over observational studies for causal inference?

They are less expensive to conduct
Randomisation controls for both measured and unmeasured confounders
They always have larger sample sizes

The major advantage of the experimental approach is the ability to control potential confounders, both measured and unmeasured, through the process of randomisation.

3. The ‘unified approach’ to observational study design includes the thought experiment, completing design features before seeing data, and:

Conducting a meta-analysis
Obtaining ethics approval
Forward projection under three different result scenarios

The third component of the unified approach is forward projection, where the researcher projects forward to three scenarios (increased risk, decreased risk, no association) and defends the proposed design under each.

Section 2

Descriptive Studies: Case Reports, Case Series & Surveys

Section 7.3 of Dohoo, Martin & Stryhn

Hypothesis-generating designs, and the bridge to analytic studies.

Three types

Case reports, case series, and surveys

Case report

A single unusual case. Flags a finding, cannot test it. Very easy to conduct; very low causal weight.

Case-series report

A documented group of cases sharing a feature, without a comparison group. Essential for rare or novel conditions.

Survey

A sample of a defined population; describes distribution of outcomes and characteristics across people, place, and time.

Key limitation

No comparison group

Descriptive studies are hypothesis-generating, not hypothesis-testing.

Case reports and case-series reports observe only people with the condition of interest. Without a group of people without it, no comparison of exposure prevalence is possible, and therefore no causal claim can be made.

The 1981 MMWR case series of five cases of Pneumocystis carinii pneumonia in previously healthy young men in Los Angeles is the canonical example of a case series done well: it observed, documented, and generated the hypothesis that launched the investigation of what became known as AIDS.

The bridge

From survey to cross-sectional analytic study

A survey that asks only about people, place, and time remains descriptive.

Once it collects data on potential exposures alongside the outcome, it becomes a cross-sectional analytic study and can evaluate associations.

Ontario Hypertension Survey

Leenen et al. (2008): 4,559 potential participants contacted, hypertension prevalence 21.3%. Collected both prevalence and risk factor data, making it a cross-sectional analytic study.

Carry forward

What to take into the next section

Descriptive studies are indispensable for hypothesis generation, especially for rare, novel, or emerging conditions.
The absence of a comparison group is a feature of purpose, not a flaw of execution.
Adding exposure measurement to a survey converts it into a cross-sectional analytic study.

Introduction and Overview

The hierarchy at the end of an earlier section placed descriptive studies at the bottom for causal inference, and that ranking is fair; but it can give the wrong impression about their usefulness. Descriptive studies are where many real epidemiologic investigations start, from James Lind's 1753 treatise on scurvy to John Snow's 1854 mapping of the Broad Street cholera outbreak. They generate the hypotheses that the analytic designs in later sections then test.

Descriptive studies are used to describe the main features of a disease or health-related outcome. Although they are not designed to evaluate associations between exposures and outcomes, the observations made in a descriptive study can form the basis of hypotheses which are then further investigated in analytic studies. Three forms of descriptive studies are case reports, case-series reports, and surveys. Each one expands the scope of observation: from a single unusual case, to a documented group of cases, to a full population sample.

Learning Objectives

Define case reports, case-series, and surveys and explain what each contributes to the evidence base.
Explain why descriptive designs cannot establish causal associations but are essential for hypothesis generation.
Identify situations in public-health practice where descriptive studies are the right tool (emerging outbreaks, rare events, signal detection).
Read the historical case-series literature with appropriate caveats about generalisability and selection.

Key Characteristics of Study Types

Descriptive studies differ from analytic observational studies in important ways. The following comparison highlights these differences:

Important Limitation

A common feature of both case reports and case-series reports is the absence of a comparison group. Without a comparison group, it is impossible to draw valid conclusions about causal associations. This is why descriptive studies are considered hypothesis-generating rather than hypothesis-testing.

The third descriptive type (the survey) is also the bridge into the rest of the lesson. Once a survey starts collecting risk-factor information alongside disease information, it stops being purely descriptive and becomes the design a later section is about.

From Survey to Analytic Study

Kalsbeek and Heiss (2000) and Speybroeck et al. (2003) have described the appropriate analysis of surveys, bearing in mind the study design. If the survey is designed to collect information about both an outcome of interest and potential exposures (risk factors) beyond the categories of people, place, and time, it becomes a cross-sectional analytic study and, as such, can be used to evaluate associations between exposures and outcomes. The Ontario Hypertension Survey scenario below is a good example of this transition in action; we will return to its design choices in a later section.

Scenario: The Ontario Hypertension Survey

Leenen et al. (2008) conducted a survey of the prevalence of hypertension in Ontario. The sampling frame consisted of municipalities and dissemination areas. From 6,436 eligible dwellings, contact was made with 4,559 potential participants. Overall hypertension prevalence was 21.3%. This survey combined both prevalence estimation and risk factor analysis, making it a cross-sectional analytic study.

The reflection below puts the descriptive–analytic boundary to work. Once you have answered it and the knowledge check, a later section turns the question around: given that we now have a true analytic observational design (the cross-sectional study), how should we actually build one?

Reflection

Can you think of a disease or health condition for which a case-series report might be the most appropriate initial study design? What hypothesis might it generate for future analytic studies?

Model answer: Case series are most useful when (a) the outcome is novel or extremely rare, so cohort or case-control logistics are infeasible, and (b) you want to generate hypotheses rather than test them. Classic example: the 1981 MMWR report of five cases of Pneumocystis carinii pneumonia in previously healthy young men in Los Angeles, the founding observation of the AIDS epidemic. Modern analogues: the early Wuhan case series of viral pneumonia of unknown cause (Dec 2019), clusters of e-cigarette-associated lung injury (EVALI, 2019), or VITT-like thromboses after early COVID vaccines. The hypothesis these reports generate is structural: this cluster looks unlike background; an analytic cohort or case-control study should now test for a shared exposure, risk factor, or pathogen.

Knowledge Check: this section

1. What is the primary limitation shared by both case reports and case-series reports?

They cannot describe disease occurrence
They require very large sample sizes
They lack a comparison group for evaluating causal associations

Both case reports and case-series reports include only cases; they lack an explicit comparison group, making it impossible to draw valid conclusions about causal associations.

2. A survey becomes a cross-sectional analytic study when it:

Includes more than 1,000 participants
Collects data on both an outcome and potential exposures beyond person, place, and time
Uses random sampling exclusively

When a survey is designed to collect information about both an outcome of interest and potential exposures (risk factors) beyond the basic categories of people, place, and time, it becomes a cross-sectional analytic study.

3. A case-series report documenting 50 patients with a rare autoimmune condition would be classified as:

A descriptive study
An explanatory observational study
An experimental study

Case-series reports are descriptive studies. They describe the characteristics of a group of cases but do not make formal comparisons with a control or unexposed group.

Section 3

Cross-Sectional Studies: Design & Implementation

Sections 7.4–7.5 of Dohoo, Martin & Stryhn

Architecture and trade-offs of the first analytic observational design.

Temporal classification

Prospective vs. retrospective

Prospective

Outcome has not occurred at study start. Subjects followed forward in time. Incidence can be measured directly.

Retrospective

Both exposure and outcome have already occurred. Data are a snapshot of existing status. Cross-sectional studies are inherently retrospective.

Three approaches

How subjects are selected shapes the design

Four design steps

Building a cross-sectional study

1 · Obtain the study group

Random sampling from a defined source population supports population inference.

2 · Assess exposure

Measured at first contact. Time-varying exposures create ambiguity about when the exposure occurred relative to the outcome.

3 · Ascertain the outcome

Clearly defined using validated criteria. Surrogate outcomes need particular care.

4 · Ensure comparability

Matching is unavailable. Use restriction or multivariable statistical adjustment to control confounding.

Case example

Postpartum depression in Canadian women

Lanes et al. (2011) conducted a cross-sectional study of postpartum depression among Canadian women.

Source population: women across Canada.
Study group: 6,421 of 8,542 selected women.
Outcome: Edinburgh Postnatal Depression Scale (EPDS).
Prevalence: 8.46% minor/major depression; 8.69% major.

Strongest associations: stress during pregnancy and prior depression history.
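The study-group figures above can be turned into a participation proportion, which is one quick way to gauge how much room there is for selection bias. This is a small arithmetic sketch; the variable names and the label "participation proportion" are ours, not the study's.

```python
# Counts quoted in the case example above.
selected_women = 8542
study_group = 6421

# Share of selected women who ended up in the study group.
participation = study_group / selected_women
non_participation = 1 - participation

print(f"Participation proportion: {participation:.1%}")
print(f"Not in the study group:   {non_participation:.1%}")
```

Roughly three in four selected women ended up in the study group. Whether the remaining quarter differ in ways related to depression is the question a reader should ask of any survey with incomplete participation.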

Design mapping

Random sample → baseline exposure → validated outcome → multivariable adjustment. All four steps illustrated in a single study.

Carry forward

What to take into the next section

Cross-sectional studies measure prevalence, not incidence. That is a design consequence, not an error.
Simultaneous exposure and outcome measurement creates temporal ambiguity for time-varying exposures.
Matching is unavailable: confounding control requires restriction or statistical adjustment.

Introduction and Overview

An earlier section placed observational analytic studies in the larger classification map, and an earlier section walked through the descriptive designs that lead up to them. This section is the first time we sit inside an analytic observational design and look at how it actually works. We start with two classification distinctions that apply to every observational study you will encounter (prospective vs retrospective; the three sampling approaches), and then narrow in on the design that sits at the centre of this lesson: the cross-sectional study.

Learning Objectives

Distinguish prospective and retrospective observational designs and explain why cross-sectional studies are inherently retrospective.
Compare the three sampling approaches (cross-sectional, cohort, case-control) and identify the question each is best suited to answer.
Walk through the four design steps of a cross-sectional study (study group, exposure, outcome, confounding) and apply them to a real example.
Explain why prevalence (not incidence) is the natural outcome measure for a cross-sectional design.
Identify situations in which matching is and is not available as a confounding-control strategy.

Observational Studies Overview

Observational studies (a subgroup of analytic or explanatory studies) have an explicit formal contrast built into their design: the comparison of outcome frequency (for example, prevalence) across exposure categories is the foundation of the analysis. They differ from descriptive studies in that the comparison of two or more groups is central, and from experiments in that the researcher has no control over the allocation of study subjects to the exposure groups.
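The formal contrast described above can be sketched with a two-by-two layout of counts. The numbers below are invented for illustration, and the prevalence difference and prevalence ratio are shown only as two common ways of expressing the contrast; the lesson text does not prescribe either.

```python
def prevalence(cases, total):
    """Proportion of a group that has the outcome at the time of measurement."""
    if total <= 0:
        raise ValueError("group size must be positive")
    return cases / total


# Hypothetical cross-sectional counts (not from any study in this lesson).
exposed_cases, exposed_total = 30, 200
unexposed_cases, unexposed_total = 20, 400

prev_exposed = prevalence(exposed_cases, exposed_total)
prev_unexposed = prevalence(unexposed_cases, unexposed_total)

# Two ways of expressing the contrast between exposure groups.
prevalence_difference = prev_exposed - prev_unexposed
prevalence_ratio = prev_exposed / prev_unexposed

print(f"Prevalence, exposed:   {prev_exposed:.2f}")
print(f"Prevalence, unexposed: {prev_unexposed:.2f}")
print(f"Prevalence difference: {prevalence_difference:.2f}")
print(f"Prevalence ratio:      {prevalence_ratio:.2f}")
```

A descriptive survey would stop at an overall prevalence. The analytic step is the comparison between the two exposure groups, and any confounding shows up as distortion of that comparison.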

Prospective vs. Retrospective Designs

Observational studies can also be classified as prospective or retrospective. In prospective studies, the disease or outcome has not occurred at the time the study starts. In retrospective studies, both the exposure and the outcome have occurred when the study begins; hence cross-sectional studies are inherently retrospective in nature.

Sampling Drives the Design

Three Main Approaches

The choices of observational analytic study design have traditionally been among 3 approaches based on how study subjects are selected:

Cross-sectional study: A sample is obtained from the source population, and the prevalence of both disease and exposure is determined at the time of subject selection.
Cohort study: A sample of study subjects from a source population with heterogeneous exposure levels is obtained, and the incidence of the outcome in the follow-up period is determined. Landmark examples include the Framingham Heart Study (Dawber, Meadors, & Moore, 1951) and the Whitehall II study of British civil servants (Marmot et al., 1991).
Case-control study: Subjects with the outcome (cases) are identified and their exposure history is contrasted with the exposure history of a sample of non-case subjects (controls). Doll & Hill's (1950) case-control study of smoking and lung cancer, and Herbst, Ulfelder & Poskanzer's (1971) investigation of DES and vaginal adenocarcinoma, are the canonical examples.

The difference between the two outcome measures named above, prevalence and incidence, matters more than it first looks. Prevalence is a snapshot: the share of people who have the condition at the moment you look. Incidence is a rate of new cases: how quickly people who were healthy develop the condition over a stretch of time. A cross-sectional study photographs a population once, so it can count who currently has the outcome but cannot watch anyone cross over from healthy to sick, which is why the measure it yields is prevalence.
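The contrast can be made concrete with a few lines of R. This is a minimal sketch using invented counts (none of the numbers come from a real study); the variable names are illustrative only.

```r
# Hypothetical counts, invented for illustration only.
n_surveyed     <- 1000   # people examined in a one-time survey
existing_cases <- 50     # already have the condition at the survey

# Prevalence: existing cases divided by everyone examined at that moment.
prevalence <- existing_cases / n_surveyed

# Incidence needs follow-up of people who start out disease-free.
at_risk      <- n_surveyed - existing_cases  # disease-free at baseline
new_cases    <- 38                           # develop the condition during follow-up (assumed)
person_years <- 1850                         # total disease-free time observed (assumed)

incidence_rate <- new_cases / person_years   # new cases per person-year

cat("Prevalence:", prevalence, "\n")
cat("At risk at baseline:", at_risk, "\n")
cat("Incidence rate per 1000 person-years:", round(1000 * incidence_rate, 1), "\n")
```

Only the first calculation is possible with a single cross-sectional survey; the second needs follow-up time that the snapshot does not contain.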

Of the three sampling approaches above, the cross-sectional study is the one this lesson will fully dissect. Later lessons will pick up case-control and cohort designs in the same way. So the rest of this section is the architectural tour of one design, but the questions it asks (how to obtain study subjects, how to assess exposure, how to define the outcome, how to control confounding) are the same questions you will be asking of every observational design from now on.

Cross-Sectional Study Design

The defining feature of a cross-sectional study is that it is an observational study whose outcome frequency measure is prevalence. The basis of the cross-sectional design is that a sample, or census, of subjects is obtained from the source population and the presence or absence of the outcome is ascertained at that point. The four design steps below follow the order a researcher would take them: define the study group, measure exposure, ascertain outcome, and design in protections against confounding from the start.

Obtaining the Study Group

If the researcher wants to make inferences about the frequency of the outcome in a target population, then study subjects should be obtained by a formal random sampling procedure. The source population is the listing (real or implied) of potential study subjects from which the study group is obtained. The study group is that set of subjects who agree to take part in the study.
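A minimal sketch of this step in R, assuming a simple random sample is the chosen procedure. The sampling frame, sample size, and participation probability are all hypothetical.

```r
set.seed(42)  # arbitrary seed, for reproducibility only

# Hypothetical sampling frame: the listing that defines the source population.
source_population <- sprintf("ID%05d", 1:10000)

# Formal random sampling: a simple random sample without replacement.
selected <- sample(source_population, size = 500)

# Hypothetical: each selected person agrees to take part with probability 0.75.
agreed      <- rbinom(length(selected), 1, 0.75) == 1
study_group <- selected[agreed]   # those who agree form the study group

# Proportion of selected subjects who ended up in the study group.
length(study_group) / length(selected)
```

The gap between `selected` and `study_group` is where non-response can erode the representativeness that random sampling was meant to provide.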

Assessing Exposure

Exposure and other covariate status, such as demographic data, are obtained at the time of study subject selection or first contact/examination. Because the outcome measure is prevalence, it is sometimes difficult to know the appropriate time frame in which the exposure, if time-varying, might cause the outcome. Studying currently (prevalent) exposed subjects can also lead to bias when interpreting the impact of these exposures.

Assessing the Outcome of Interest

It is important to clearly define the outcome/disease of interest. In general, great care should be used if the outcome is a surrogate for a clinically important event. It is also important that widely accepted diagnostic criteria be used to identify the disease or outcome of interest.

Ensuring Comparability

The two main approaches used to prevent bias from factors associated with the outcome and whose distribution differs between exposure groups (confounders) are exclusion (restricted sampling) and analytic (statistical) control. Matching to prevent confounding cannot be applied in cross-sectional studies. Analytic control requires the use of a multivariable model.
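The two available strategies can be sketched on simulated data. This is an illustrative example under stated assumptions, not an analysis from the lesson: the confounder (`older`), the probabilities, and the seed are all invented, and the simulation is built so that exposure has no true effect on the outcome.

```r
set.seed(1)   # arbitrary seed, for reproducibility only
n <- 5000

# Hypothetical confounder: older age raises both exposure and outcome probability.
older    <- rbinom(n, 1, 0.5)
exposure <- rbinom(n, 1, ifelse(older == 1, 0.6, 0.2))
# The outcome depends on age only; exposure has no true effect here.
outcome  <- rbinom(n, 1, ifelse(older == 1, 0.30, 0.10))
dat <- data.frame(older, exposure, outcome)

# Crude model: ignores the confounder.
crude <- glm(outcome ~ exposure, family = binomial, data = dat)

# Analytic control: add the confounder to a multivariable model.
adjusted <- glm(outcome ~ exposure + older, family = binomial, data = dat)

# Restriction: analyse a single stratum of the confounder.
restricted <- glm(outcome ~ exposure, family = binomial,
                  data = subset(dat, older == 0))

# Odds ratios for exposure under each strategy.
round(exp(c(crude      = unname(coef(crude)["exposure"]),
            adjusted   = unname(coef(adjusted)["exposure"]),
            restricted = unname(coef(restricted)["exposure"]))), 2)
```

Under these assumptions the crude odds ratio should sit well above 1, while the adjusted and restricted estimates should fall near 1, which is the true value in the simulation.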

The scenario below is a real Canadian example that uses every one of the four design steps above. As you read it, ask yourself what the researchers did at each step: how the source population was defined, how exposure was measured, how the outcome was ascertained, and where you can already see the design being constrained by the cross-sectional choice. The reflection that follows is built around exactly that question.

Scenario: Postpartum Depression in Canadian Women

Lanes et al. (2011) conducted a cross-sectional study of postpartum depressive symptomatology (PPDS) among Canadian women. The survey used the Edinburgh Postnatal Depression Scale (EPDS) as the outcome measure. Potential risk factors included socioeconomic status, demographic factors, and maternal characteristics. Of 8,542 selected women, 6,421 responded. The national prevalence of minor/major and major PPDS was found to be 8.46% and 8.69% respectively (these were two separate severity bands that the study counted apart, not nested categories, so together about 17% of women screened positive for postpartum depressive symptoms). The mother’s stress level during pregnancy and prior depression had the strongest associations.

Reflection

In the postpartum depression study described above, the exposure ‘stress during pregnancy’ was measured retrospectively at the same time as the outcome. What challenges does this create for causal inference? How might you address these challenges?

Model answer: Retrospective measurement of stress at the same time as the depression outcome creates two intertwined threats. (a) Temporality is unverifiable: standard causal inference requires the exposure to precede the outcome, but here both are reported simultaneously, so reverse causation (depression colouring how stress is remembered) cannot be ruled out. (b) Recall bias: depressed mothers may systematically over-report stressors, inflating the apparent association, a form of differential misclassification. Addressing them: (i) use prospectively measured exposures (clinical records of pregnancy stress, biomarkers, contemporaneous diaries) rather than retrospective interview; (ii) anchor stress to objective events (job loss, bereavement) verifiable from records; (iii) use a nested design with baseline measurement during pregnancy; (iv) include a comparison cohort without the outcome to assess differential recall; (v) where prospective design is impossible, run sensitivity analyses for differential misclassification.

Knowledge Check: this section

1. The defining feature of a cross-sectional study is that its outcome frequency measure is:

Prevalence
Incidence rate
Cumulative incidence

The defining feature of a cross-sectional study is that its outcome frequency measure is prevalence, based on the number of existing cases at the time of the study.

2. Cross-sectional studies are inherently:

Prospective
Retrospective
Experimental

In cross-sectional studies, both the exposure and the outcome have already occurred when the study begins. Because exposure and outcome are assessed at the same point in time, the design is inherently retrospective.

3. Which approach to controlling confounding CANNOT be applied in cross-sectional studies?

Restriction (exclusion criteria)
Statistical control via multivariable models
Matching

Matching to prevent confounding cannot be applied in cross-sectional studies because subjects are sampled from the population without regard to their exposure or outcome status, unlike case-control studies where matching is feasible.

Section 4

Limitations, Incidence Estimation & Reporting

Sections 7.6–7.9 of Dohoo, Martin & Stryhn

What the cross-sectional design cannot do, what it can, and how to report it properly.

Core limits

The prevalence-duration confound and reverse causation

Prevalence-duration confound

Prevalence = incidence × duration. Long-lasting cases accumulate. Factors that prolong disease can appear to be risk factors for onset.

Reverse causation

With time-varying exposures, cross-sectional data cannot establish whether the exposure preceded the outcome. Cause and consequence are indistinguishable in a snapshot.

Example: pet ownership and blood pressure. Does owning a dog lower blood pressure, or do people with lower blood pressure tend to acquire pets?

Where it works

Time-invariant exposures

Cross-sectional studies are well-suited to exposures that do not change over time, because there the direction of effect is not ambiguous.

Biological sex

Genetic variants

Race/ethnicity

Blood type

Congenital conditions

For these, the exposure unambiguously preceded any outcome. The snapshot design is then a defensible choice, not a concession.

Incidence from prevalence

Three estimation approaches

Repeated cross-sections

Run two surveys at different times. Compare prevalence estimates to infer population-level incidence. Miller et al. (2010), H1N1 in England.

Two-test approach

One test for early response, one for lasting immunity. Follow those negative on the less-sensitive test. Refined for HIV surveillance.

Mathematical estimation

Rajan & Sokal (2011). Derive age-specific incidence from two prevalence estimates.

Eq 7.1 · Incidence from prevalence (Rajan & Sokal, 2011)

\[ \color{#C2410C}{I_a} = 1 - \left[1 - \frac{\color{#0B7B6B}{P_{a+n}} - \color{#6D28D9}{P_a}}{1 - \color{#6D28D9}{P_a}}\right]^{1/\color{#1D4ED8}{n}} \]

I_a: incidence over the interval
P_{a+n}: prevalence at the end of the interval
P_a: prevalence at the start of the interval
n: interval length

Design choice

Repeated cross-sections vs. cohort studies

Repeated cross-sections

Fresh sample each wave. Avoids aging cohort problem. Stays representative over time. Cannot track individual change.

Cohort (longitudinal)

Same individuals followed forward. Captures individual trajectories. Establishes temporal sequence. Survivor bias grows over time.

Reporting standard

STROBE: 22 items for observational studies

The STROBE statement (von Elm et al., Lancet, 2007) provides a 22-item checklist for reporting observational studies.

It covers: title and abstract, introduction, methods (design, setting, participants, variables, bias, sample size, statistical methods), results, and discussion.

Part of the EQUATOR network of reporting guidelines introduced in an earlier lesson.

Why every item matters

Each omission makes it harder for the next reader to judge whether the design carries the inference the authors claim. Reporting well is part of scientific integrity.

Introduction and Overview

An earlier section introduced the cross-sectional design and walked through how to build one. This section is its honest counterweight. Once a study is in the field, three practical questions tend to arrive in sequence: What can this design not tell us? Can we squeeze incidence information out of it? And how should we report the whole thing so the next reader can judge it? Those three questions are the three subsections that follow.

Learning Objectives

Explain why cross-sectional studies confound prevalence with disease duration and how that produces the reverse-causation problem.
Identify the conditions under which a cross-sectional design can support a defensible causal inference (time-invariant exposures, plausible temporality).
Describe the conditions under which incidence can be estimated from cross-sectional data, and the assumptions required.
Apply the STROBE checklist as a reporting standard for observational research and explain why each section matters to the next reader.

Inferential Limitations of Cross-Sectional Studies

By its nature, a cross-sectional study design measures prevalence, which is a function of both incidence and duration of the disease. Consequently, it is often difficult to disentangle factors associated with persistence of the outcome from factors associated with developing the outcome in the first instance (i.e., becoming a new incident case).

A concrete example shows the problem. Imagine a treatment that helps people survive longer with a disease without preventing anyone from getting it in the first place. Incidence is untouched, yet more people are living with the disease at any given moment, so prevalence rises. In a one-time snapshot that survival-prolonging factor sits alongside the cases and can be mistaken for a cause of the disease, when all it really changed was how long cases last.
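The same example can be put into numbers. The sketch below assumes the steady-state approximation prevalence ≈ incidence × duration (reasonable only when the condition is rare and the population is stable); the incidence and duration values are invented.

```r
# Steady-state approximation for a rare condition: prevalence ~ incidence x duration.
incidence <- 0.002       # hypothetical: 2 new cases per 1000 person-years

duration_before <- 5     # hypothetical mean years lived with the disease
duration_after  <- 10    # the treatment doubles time lived with the disease

prev_before <- incidence * duration_before
prev_after  <- incidence * duration_after

cat("Prevalence before treatment:", prev_before, "\n")
cat("Prevalence after treatment: ", prev_after,  "\n")
cat("Ratio of prevalences:       ", prev_after / prev_before, "\n")
```

Incidence is identical in both scenarios, yet prevalence doubles. A snapshot that compared treated and untreated groups would see more disease alongside the treatment.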

The Reverse Causation Problem

When the exposure factors are time-varying, it is often very difficult to differentiate cause and effect. For example, if one is studying the relationship between dog ownership and blood pressure, and the association is negative, one cannot differentiate people who obtained a dog because they had low blood pressure from those whose lifestyle changed after obtaining a dog, consequently lowering their blood pressure. The more changeable the exposure, the worse this issue becomes.

Cross-sectional studies are best suited for time-invariant exposures such as race or sex. In these instances, the investigator can be certain that the exposure preceded, or at least was not caused by, the outcome.

Reverse causation and the prevalence/duration confound are the conceptual limits of the design. The next thing to learn is what the design does let you do computationally: the cross-tabulation that almost every analytic move in this course is built on.

R Build a 2×2 contingency table from raw cross-sectional data

What you'll do: simulate 200 people with a known exposure-disease relationship, build the 2×2 contingency table, and read prevalence off each row. What to take away: the 2×2 table is the unit of analysis the rest of the course is built on; every measure of association you will meet in later lessons starts from a structure that looks exactly like this.

Most observational studies start the same way: cross-tabulate exposure against outcome. Below we simulate a small cross-sectional dataset, build the 2×2 table, and read off the prevalence of disease in exposed vs unexposed.

# Simulate 200 people: half exposed, with higher disease prevalence among exposed.
set.seed(230)
n <- 200
exposure <- rep(c("exposed", "unexposed"), each = n/2)
disease  <- c(rbinom(100, 1, 0.30), rbinom(100, 1, 0.10))

dat <- data.frame(exposure = exposure, disease = disease)

# 2x2 cross-tabulation -- the workhorse of descriptive epi.
tab <- table(dat$exposure, dat$disease,
             dnn = c("Exposure", "Disease"))
tab

# Prevalence (proportion with disease=1) within each exposure group.
prop.table(tab, margin = 1)   # margin=1 -> divide each row by its row total

Console output

           Disease
Exposure     0  1
  exposed   71 29
  unexposed 92  8

           Disease
Exposure       0    1
  exposed   0.71 0.29
  unexposed 0.92 0.08

Reading the table. 29% of the exposed group have disease vs 8% of the unexposed. That contrast (and the rows that produced it) is the launchpad for every measure of association you will learn in this course.
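As a brief preview of where that launchpad leads, the sketch below recomputes the two prevalences from the cell counts printed in the console output and derives a prevalence ratio and an odds ratio. The variable names are illustrative; the formulas are the standard 2×2 definitions.

```r
# Cell counts copied from the console output above.
exp_dis     <- 29   # exposed, diseased
exp_nodis   <- 71   # exposed, not diseased
unexp_dis   <- 8    # unexposed, diseased
unexp_nodis <- 92   # unexposed, not diseased

# Prevalence within each exposure group.
prev_exposed   <- exp_dis   / (exp_dis   + exp_nodis)
prev_unexposed <- unexp_dis / (unexp_dis + unexp_nodis)

# Prevalence ratio: how many times more common disease is among the exposed.
prevalence_ratio <- prev_exposed / prev_unexposed

# Odds ratio: cross-product of the 2x2 table.
odds_ratio <- (exp_dis * unexp_nodis) / (exp_nodis * unexp_dis)

round(c(PR = prevalence_ratio, OR = odds_ratio), 2)
```

The odds ratio is further from 1 than the prevalence ratio here because the outcome is not rare in the exposed group.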

R Reflect on what you just ran

Use the questions below to interpret the output you produced. Look at your console table before answering.

1. Look at the raw tab output. How many exposed individuals had disease? How many unexposed had disease? Write the four cell counts (a, b, c, d) in the standard 2×2 layout (exposed-diseased, exposed-non-diseased, unexposed-diseased, unexposed-non-diseased).

Model answer: Reading the printed 2×2 from tab: exposed with disease = 29, exposed without disease = 71; unexposed with disease = 8, unexposed without disease = 92. In standard layout: a = 29 (exposed/diseased), b = 71 (exposed/non-diseased), c = 8 (unexposed/diseased), d = 92 (unexposed/non-diseased). Totals: 100 exposed and 100 unexposed, 37 diseased and 163 not, n = 200. From these you can read off prevalence (a/(a+b)) for the exposed group, the prevalence ratio (PR), and the odds ratio (OR), the three numbers that drive interpretation of any cross-sectional dataset.

2. Using prop.table(tab, margin = 1), the prevalence of disease was 0.29 in the exposed group and 0.08 in the unexposed group. Why is this a measure of prevalence and not incidence, given the design that generated the data?

Model answer: Both numerator ("how many have the disease right now") and denominator ("how many people are in the group right now") are evaluated at the same snapshot: the simulation generates a single binary outcome per person with no follow-up time. That is the defining mark of prevalence: existing cases divided by population at a point in time. Incidence requires a denominator of person-time (or at minimum a baseline disease-free cohort followed forward), so that ‘new’ cases can be distinguished from ‘old’ ones. Without a temporal element, no rate can be computed; what you have is a snapshot.

3. The simulation used rbinom(100, 1, 0.30) for the exposed and 0.10 for the unexposed. What would happen to the observed exposed-group prevalence if you changed the seed, or increased the sample to 400 per group (four times the original)? Would the contrast between groups become more or less reliable?

Model answer: Changing the seed would produce a different set of 100 random draws, so the observed exposed-group prevalence would jitter around 0.30 (sometimes 0.24, sometimes 0.34) while the long-run expectation stays the same. Going to 400 per group (four times the original 100) would cut the standard error of the observed proportion in half, because SE = √(p(1-p)/n) shrinks in proportion to 1/√n. The prevalence estimates would then settle much closer to their generating values (0.30 and 0.10), the CI on each would narrow, and the test of the exposed-vs-unexposed contrast would be better powered. Larger n doesn't fix bias, only sampling variability.
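The standard-error claim in the last answer can be checked directly. This is a small sketch using the usual formula for the standard error of a sample proportion; `se_prop` is a helper name introduced here for illustration.

```r
# Standard error of a sample proportion: sqrt(p * (1 - p) / n).
se_prop <- function(p, n) sqrt(p * (1 - p) / n)

p_exposed <- 0.30                  # generating prevalence used in the simulation

se_100 <- se_prop(p_exposed, 100)  # original group size
se_400 <- se_prop(p_exposed, 400)  # four times the original

round(c(se_100 = se_100, se_400 = se_400, ratio = se_400 / se_100), 4)
```

Quadrupling the group size halves the standard error, so halving it again would take 1,600 per group.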

Estimating Incidence from Cross-Sectional Studies

Although cross-sectional studies directly measure prevalence, there are approaches for estimating incidence from prevalence data. This is often desirable because incidence data are more useful for causal inference. The three approaches below are presented in increasing order of analytical sophistication. Each one trades a different practical cost (run two surveys, develop two assays, build a mathematical model) for a different epistemic gain.

A simple way to obtain population-level incidence data is to perform two cross-sectional studies, one before and one after an event of interest. For example, Miller et al. (2010) performed two cross-sectional studies before and after the 2009 H1N1 epidemic in England, giving a population-based estimate of incidence.

Another approach uses two different tests: one that detects early immune response and one that detects long-lasting immunity. People who test negative on the less sensitive test are followed forward for a defined time period to ascertain how many become positive. This approach has been refined for HIV studies.

Rajan and Sokal (2011) describe how to estimate age-specific incidence from prevalence data. Their general approach uses two prevalence estimates at different time points. The incidence rate at year ‘a’ is:

Incidence from prevalence (Eq 7.1)

\[ \color{#C2410C}{I_a} = 1 - \left[1 - \frac{\color{#0B7B6B}{P_{a+n}} - \color{#6D28D9}{P_a}}{1 - \color{#6D28D9}{P_a}}\right]^{1/\color{#1D4ED8}{n}} \]

The incidence over an age interval is recovered from two prevalence snapshots: prevalence at the end of the interval and prevalence at its start, scaled by the interval length. It converts a static cross-sectional measure into an estimate of new cases.

where ‘n’ is the time between the two prevalence estimates (P_a and P_a+n) in the cross-sectional survey.

The intuition is simpler than the algebra. The fraction of people who are still disease-free shrinks from (1 - P_a) at the start of the interval to (1 - P_a+n) at the end. Whatever share of that healthy group crossed into disease during the interval is the new-case activity we call incidence, and the equation converts that shrinkage into an average rate per unit of time.
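Eq 7.1 translates into a short R function. This is a sketch of the formula as written above, not code from Rajan and Sokal; the function name, argument names, and input values are hypothetical.

```r
# Eq 7.1: average incidence per unit time from two prevalence estimates.
incidence_from_prevalence <- function(p_start, p_end, n) {
  stopifnot(p_start >= 0, p_start < 1, p_end >= p_start, p_end <= 1, n > 0)
  # Share of the initially disease-free group that became cases over the interval.
  newly_affected <- (p_end - p_start) / (1 - p_start)
  # Convert the whole-interval share into an average per-unit-time incidence.
  1 - (1 - newly_affected)^(1 / n)
}

# Hypothetical inputs: prevalence 10% at age a and 19% five years later.
i_a <- incidence_from_prevalence(p_start = 0.10, p_end = 0.19, n = 5)
i_a

# Consistency check: applying i_a for 5 years to the disease-free share
# at the start should reproduce the disease-free share at the end.
(1 - 0.10) * (1 - i_a)^5
```

With these inputs the estimate is roughly 2% per year, and the check returns 0.81, which is the disease-free share implied by a 19% prevalence at the end of the interval.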

The first incidence-estimation approach (running two cross-sections at different time points) raises a question of its own. If we are willing to track the same population twice, why not just track the same individuals through time? That is the difference between a repeated cross-section and a cohort.

Repeated Cross-Sectional vs. Cohort Studies

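Sometimes it is desirable to follow a population over time. Two options exist: repeated cross-sectional samplings of the population, or a longitudinal study of the initial study subjects (a cohort approach). Each has distinct advantages, and the choice between them is a real one. Their strengths and limitations are set side by side below; cohorts return in detail in a later lesson.

Repeated Cross-Sectional

A fresh sample is drawn at each wave, which avoids the aging-cohort problem and keeps the sample representative over time. The design cannot track change within individuals.

Cohort Studies

The same individuals are followed forward, which captures individual trajectories and establishes temporal sequence. Survivor bias grows over time.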

The previous subsections covered what cross-sectional studies cannot do, what they can do, and how they relate to longitudinal designs. The last piece of the puzzle is reporting, how to write up an observational study so the next reader can judge it without having to reverse-engineer the methods.

Reporting Observational Studies: The STROBE Statement

In 2004, a network of methodologists, researchers, and journal editors established what we now know as the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) statement (von Elm et al., 2007; explained in detail by Vandenbroucke et al., 2007), one of the EQUATOR-network reporting guidelines we previewed in an earlier lesson. It provides a checklist of 22 items considered essential for good reporting of observational studies.

STROBE Checklist Key Sections

The STROBE checklist covers: Title & Abstract (indicate study design), Introduction (background, objectives, hypotheses), Methods (study design, setting, participants, variables, data sources, bias, sample size, statistical methods), Results (participants, descriptive data, outcome data, main results), and Discussion (key results, limitations, interpretation, generalisability).

The reflection below is the practical exit ticket for this section: a textbook reverse-causation case dressed up in plausible language. After working through it and the knowledge check, the lesson moves to its comprehensive final assessment, which integrates the classification map, the descriptive designs, and the cross-sectional design from earlier sections with the limitations and reporting standards covered here.

Reflection

Consider a cross-sectional study that finds an association between pet ownership and lower blood pressure. Explain why this finding cannot be interpreted as causal evidence that pet ownership lowers blood pressure. What study design would be more appropriate?

Model answer: Cross-sectional findings cannot support causal claims because the snapshot makes the direction of effect ambiguous (does owning a pet lower BP, or do people with lower BP have the energy and capacity to keep a pet?) and provides no exposure-to-outcome temporal ordering. Confounding is also likely: pet owners differ from non-owners in income, housing, marital status, age, physical activity, and social network, all of which independently affect BP. Selection on health may also be at work (sicker, frailer people give up pets). Better designs: a prospective cohort that enrolls non-owners and follows them through pet acquisition, measuring BP before and after; a quasi-experimental study exploiting a natural shock (e.g., a pet-adoption programme rollout) with difference-in-differences; or, where feasible, an RCT of dog-walking interventions in matched non-owners.

Knowledge Check: this section

1. The primary reason cross-sectional studies have limited ability to support causal inference is:

They cannot include large sample sizes
Prevalence reflects both incidence and duration, making it difficult to distinguish cause from consequence
They always have selection bias

Cross-sectional studies measure prevalence, which is a function of both incidence and duration. This makes it difficult to distinguish factors that cause disease from factors that affect disease duration or survival.

2. Cross-sectional studies are best suited for studying exposures that are:

Time-invariant (e.g., sex, race, genetic factors)
Easily modified by treatment
Measured only in laboratory settings

Cross-sectional studies are best suited for time-invariant exposures such as race or sex, where the investigator can be certain that the exposure preceded the outcome and was not affected by it.

3. The STROBE statement provides:

A method for calculating sample size in cross-sectional studies
A ranking of the quality of different study designs
A checklist of 22 items for reporting observational studies

The STROBE (Strengthening the Reporting of Observational Studies in Epidemiology) statement provides a checklist of 22 items considered essential for good reporting of observational studies.

R Activity: Building a 2×2 table and computing prevalence

The companion R script r-activities/HSCI_230_Lesson_3_Introduction_to_Observational_Studies.R simulates a cross-sectional dataset of 200 people (half exposed, half unexposed) with a true disease prevalence of 30% in the exposed and 10% in the unexposed, then constructs the 2×2 cross-tabulation that anchors nearly every observational analysis. You will compute row-wise prevalence with prop.table(..., margin = 1) and, as a stretch, derive a prevalence ratio, a first taste of the measures-of-association we revisit in later lessons.

# Simulate 200 people: half exposed, with higher disease prevalence among exposed
set.seed(230)
n        <- 200
exposure <- rep(c("exposed", "unexposed"), each = n/2)
disease  <- c(rbinom(100, 1, 0.30), rbinom(100, 1, 0.10))

dat <- data.frame(exposure = exposure, disease = disease)

# 2x2 cross-tabulation -- the workhorse of descriptive epi
tab <- table(dat$exposure, dat$disease,
             dnn = c("Exposure", "Disease"))
tab

# Prevalence (proportion with disease=1) within each exposure group
prop.table(tab, margin = 1)   # margin=1 -> divide each row by its row total

## -----------------------------------------------------------------------------
## Stretch: prevalence ratio (a quick preview of measures-of-association)
## -----------------------------------------------------------------------------
prev_exposed   <- prop.table(tab, margin = 1)["exposed",   "1"]
prev_unexposed <- prop.table(tab, margin = 1)["unexposed", "1"]
prev_ratio     <- prev_exposed / prev_unexposed
cat("Prevalence ratio:", round(prev_ratio, 2), "\n")

HSCI 230, Lesson 3
