Repeated Measures Data

Exploratory Data Analysis For Epidemiology

Learning objectives for this lesson:

Recognize and describe the unique characteristics of repeated measures data structures
Use descriptive and graphical tools to explore repeated measures datasets
Apply simple univariate approaches (separate time point analyses, summary statistics) to analyze repeated measures
Understand the limitations of random-intercept mixed models for repeated measures and why correlation structures matter
Choose among correlation structures (compound symmetry, AR(1), ARMA(1,1), Toeplitz, unstructured) for repeated measures
Apply linear mixed models with appropriate correlation structures to repeated measures data
Understand trend models with random slopes for time
Describe the challenges of extending GLMMs to discrete repeated measures data including transition models
Use GEE procedures to analyze clustered and repeated measures data

This course was developed by Dr. Kiffer G. Card, Faculty of Health Sciences, Simon Fraser University based on Dohoo, I. R., Martin, S. W., & Stryhn, H. (2012). Methods in Epidemiologic Research. VER Inc.

Reference

Glossary: Key Terms, People & Concepts

📚 Reference page, available throughout the lesson

This glossary collects the key concepts, people, and ideas you will meet in this lesson. Use it as a reference while you work through the material, or as a review before assessments. Type in the search box to filter entries.

Key Concepts & Ideas

Repeated measures (longitudinal) dataMultiple observations on the same subject over time. A special case of clustered data where the cluster is the subject and the level-1 unit is the measurement occasion. Within-subject correlation must be modeled to obtain valid inference.

Within-subject correlationThe correlation between repeated measurements on the same subject. Often decreases as the time gap between measurements grows; ignoring it leads to incorrect standard errors.

Balanced vs unbalanced designBalanced designs have the same number of measurements at the same times for all subjects; unbalanced designs allow varying numbers of timing of observations. Mixed models and GEE handle unbalanced data; classical repeated-measures ANOVA generally does not.

MCAR / MAR / MNARCategories of missingness. Missing Completely At Random (MCAR): missingness independent of all data. Missing At Random (MAR): missingness depends only on observed data. Missing Not At Random (MNAR): missingness depends on unobserved values. Likelihood-based mixed models give valid inference under MAR; GEE generally requires MCAR.

Dropout (attrition)Loss of subjects over the course of a longitudinal study. Especially problematic when dropout is informative (related to the outcome). Sensitivity analyses or pattern-mixture / selection models may be required.

Time-varying covariateA predictor whose value changes across measurement occasions (e.g., current weight, current treatment). Distinct from time-invariant covariates (e.g., sex at birth) and requires careful interpretation in longitudinal models.

Time as predictorTime can enter a model linearly, as polynomial, as splines, or as categorical visit numbers. The choice depends on whether time effects are smooth or visit-specific.

Growth curve / trajectoryA subject-specific function describing how the outcome evolves over time. Modeled with random intercepts and random slopes for time, possibly with non-linear time terms.

Profile (spaghetti) plotA descriptive plot of individual outcome trajectories over time, often overlaid on a mean profile. Useful for spotting heterogeneity in trajectories and influential cases.

Population-averaged vs subject-specific effectsTwo interpretations of longitudinal model coefficients. GEE produces population-averaged effects; mixed models produce subject-specific (conditional) effects. They differ for non-linear models.

Methods & Statistical Concepts

Covariance / correlation structureA specification of the within-subject covariance matrix for repeated measurements. Choices trade off parsimony against flexibility; common choices include compound symmetry, AR(1), Toeplitz, and unstructured.

Compound symmetry (exchangeable)Assumes equal variances at all times and a single correlation between any two measurements. Equivalent to a random-intercept model. Often unrealistic for longitudinal data with declining correlation over time.

Autoregressive AR(1)A first-order autoregressive structure: corr(t₁, t₂) = ρ^{|t₁−t₂|}. Correlations decay geometrically with the distance between observations. Best for equally spaced repeated measures with a stationary process.

Toeplitz (banded)A covariance structure where correlation depends only on the distance between measurements but each lag has its own parameter. More flexible than AR(1); requires equally spaced times.

Unstructured covarianceNo constraint on the within-subject covariance: every variance and pairwise correlation is estimated separately. Most flexible but requires many parameters; suitable when the number of distinct times is small.

Spatial / continuous-time covarianceStructures (e.g., spatial exponential, Gaussian, power) that allow correlation to depend on the actual time gap rather than equally spaced visits. Useful for irregularly timed measurements.

Generalized estimating equations (GEE)A marginal-model approach for longitudinal data using a working correlation structure and sandwich variance estimators. Robust to misspecification of the correlation but requires MCAR for valid inference under missingness.

Mixed-effects models for longitudinal dataModels with random subject effects (intercepts, slopes) and an explicit covariance structure for residuals. Likelihood-based, valid under MAR, and accommodate continuous and discrete outcomes.

Sandwich (robust) variance estimatorA variance formula that is consistent even when the within-subject covariance model is wrong. Standard in GEE; can also be applied to mixed models using cluster-robust adjustments.

Model selection for covariance structuresProcedure to compare candidate covariance structures (compound symmetry, AR(1), Toeplitz, unstructured) using AIC, BIC, or likelihood-ratio tests. Conducted with REML for variance components and ML for fixed-effect comparisons.

Transition (Markov) modelsModels where the current outcome depends on past outcomes (e.g., y_t as a function of y_t−1). Useful for binary or categorical longitudinal outcomes; an alternative to marginal and conditional approaches.

Multiple imputationA method for handling missing data by generating several imputed datasets, fitting the model in each, and pooling results. Useful when MAR is plausible and software does not directly handle missingness.

Key People

Nan Laird & James WareAuthors of the foundational 1982 paper on linear random-effects models for longitudinal data, supplying the framework for likelihood-based analysis of repeated measures.

Kung-Yee Liang & Scott ZegerCo-developers of generalized estimating equations (GEE), 1986. Their marginal framework with robust variance is one of the two dominant approaches to longitudinal data.

Peter Diggle (b. 1950)British statistician and co-author of Analysis of Longitudinal Data (Diggle, Heagerty, Liang, Zeger). Influential in disseminating longitudinal data methods to applied researchers.

Donald B. Rubin (b. 1943)American statistician who formalized the missing-data taxonomy (MCAR, MAR, MNAR) and introduced multiple imputation. His framework underlies modern missing-data analyses.

Roderick J. A. LittleCo-author with Rubin of Statistical Analysis with Missing Data, the standard reference on principled missing-data methods, and a leader in pattern-mixture model development.

No matching entries. Try a different search term.

Section 2

Univariate & Multivariate Approaches

⏱ Estimated time: 20 minutes

Section 2 of 4

Univariate & Multivariate Approaches

Classical methods before mixed models, and why each one strains under realistic longitudinal data.

Simplest approaches

Separate analyses and summary statistics

Separate time points

One test per visit. Ignores within-subject correlation, creates a multiple-testing problem proportional to the number of time points.

Summary statistics

Collapse each trajectory to one value (slope, AUC, change score) and analyse between subjects. Reliable but discards temporal detail and time-varying covariates.

RM-ANOVA

Repeated measures analysis of variance

Epsilon correction factor (Huynh–Feldt)

\[ \color{#0B7B6B}{\varepsilon} = \frac{m^2(\bar{d}_{..} - \bar{d}_{i.})^2}{(m-1)\bigl[\textstyle\sum_{j<k}d_{jk}^2 - m\sum_j \bar{d}_{j.}^2 \bigr]} \]

ε sphericity correction (1 = sphericity holds, lower = worse)

When \(\varepsilon = 1\) compound symmetry holds. As \(\varepsilon \to 0\) the violation is severe and the uncorrected F-test is increasingly anti-conservative.

MANOVA

Multivariate analysis of variance

No assumption about correlation structure, but requires perfectly balanced data and no missing values.

Covariance and correlation matrices (Diggle et al.)

\[ \color{#0B7B6B}{\boldsymbol{\Sigma}} = \operatorname{Cov}(\color{#C2410C}{\mathbf{Y}_i}), \quad \color{#6D28D9}{\mathbf{R}} = \operatorname{Corr}(\color{#C2410C}{\mathbf{Y}_i}) \]

Σ covariance matrix R correlation matrix Y_i a subject's repeated measures

For m time points, the unstructured covariance has \(m(m+1)/2\) parameters. With many time points and missing data, MANOVA cannot be fit.

Carry forward

What breaks in each classical method

Separate analyses: multiple testing; within-subject information wasted.
Summary statistics: temporal detail and time-varying covariates lost.
RM-ANOVA: compound-symmetry assumption fails under autocorrelation.
MANOVA: complete balanced data required; collapses with any missingness or many time points.

All four fail under realistic longitudinal conditions. A later section shows what to do instead.

Introduction and Overview

Before mixed models, what was the field doing? An earlier section gave us the descriptive picture of repeated-measures data. This section walks through the methods that dominated longitudinal analysis for decades: split-plot ANOVA (univariate, with strong assumptions like compound symmetry/sphericity), MANOVA (multivariate, weaker assumptions but lower power and intolerant of missing data), and summary-statistic approaches that collapse each subject’s trajectory to a single value. Knowing these methods is more than historical: they still appear in legacy literature and teaching, and seeing where they fail motivates the modern approaches in later sections.

Learning Objectives

Compare separate-time-point, summary-statistic, RM-ANOVA, and MANOVA approaches to longitudinal data.
State the compound-symmetry / sphericity assumption underlying RM-ANOVA and apply the Huynh–Feldt correction when it is violated.
Read covariance and correlation matrices and decompose them into variance and correlation components.
Identify the conditions (balanced data, no missingness, few time points) under which classical methods are still defensible.

Simple Approaches to Repeated Measures

Before turning to complex mixed models, it is worth understanding the simpler methods that have traditionally been used for repeated measures data. These methods either reduce the data to avoid modelling correlations altogether, or make strong assumptions about the correlation structure.

Separate Time Point Analysis

The simplest approach is to analyze each time point independently; for example, running a separate t-test or regression at each visit. This is straightforward but wasteful: it ignores the within-subject correlations and creates a multiple testing problem. If there are m time points, a Bonferroni correction divides α by m, which can be very conservative.

Summary Statistics Approach

A more elegant simple approach is to compute a single summary value per subject, such as the slope of their trajectory, the drop from first to last measurement, or the area under the curve (AUC), and then perform a standard between-subjects analysis on these summaries.

Advantages: Simple, robust to model assumptions about correlation structure, and easy to interpret.

Disadvantages: Loss of information about the temporal pattern, difficulty incorporating within-subject time-varying predictors, and potential loss of power.

Repeated Measures ANOVA

Repeated measures ANOVA treats time as a within-subject factor and tests for differences across time points. However, it assumes compound symmetry: all pairs of time points have the same correlation. This is the same assumption as a random intercept model.

When compound symmetry is violated (which is common with autocorrelated data), the F-test becomes liberal (anti-conservative). The Greenhouse–Geisser and Huynh–Feldt correction factors (ε) adjust the degrees of freedom to account for this violation (Greenhouse & Geisser, 1959; Huynh & Feldt, 1976). When ε = 1, compound symmetry holds perfectly; as ε decreases, the violation is more severe. The underlying assumption can be tested formally with Mauchly's test of sphericity (Mauchly, 1940).

MANOVA (Multivariate Analysis of Variance)

MANOVA treats the entire vector of repeated measurements as a multivariate outcome, making no assumptions about the correlation structure. This is its key advantage over repeated measures ANOVA.

Limitations: Requires completely balanced data with no missing values, cannot easily handle within-subject continuous predictors, and uses wide-format data. It also becomes impractical with many time points.

Covariance and Correlation Matrices

Covariance matrix (Eq 23.1)

\[ \color{#0B7B6B}{\boldsymbol{\Sigma}} = \operatorname{Cov}(\color{#C2410C}{\mathbf{Y}_i}) \]

The covariance matrix collects the variances and covariances of the repeated outcomes for subject i: an m by m table, one row and column per time point.

Correlation matrix (Eq 23.2)

\[ \color{#0B7B6B}{\mathbf{R}} = \operatorname{Corr}(\color{#C2410C}{\mathbf{Y}_i}) \]

The correlation matrix is the standardised version of the covariance: ones on the diagonal and the between-time-point correlations of the outcomes for subject i off the diagonal.

Limitations of Each Approach

Separate time points: Multiple testing, ignores correlations, wasteful of information. Summary statistics: Loses temporal detail, cannot incorporate time-varying covariates. RM ANOVA: Assumes compound symmetry, which is rarely true. MANOVA: Requires complete, balanced data with no missing values. All of these limitations motivate the use of mixed models with flexible correlation structures.

R Activity: Longitudinal mixed model with autoregressive errors

The dataset phaa_repeated.csv tracks 200 patients in a hypothetical wellness trial through 4 visits (months 0, 6, 12, 18). Outcomes: sbp_mmhg (continuous) and adherent (binary). The full annotated script is in r-activities/HSCI_410_Lesson_12_Repeated_Measures_Data.R.

library(lme4);  library(lmerTest);  library(nlme);  library(geepack)
dat <- read.csv("phaa_repeated.csv", stringsAsFactors = FALSE)
dat$id  <- factor(dat$id)
dat$arm <- factor(dat$arm, levels = c("control","intervention"))

# 1. Random-intercept mixed model on the continuous outcome
m_lmm <- lmer(sbp_mmhg ~ visit * arm + age + female + (1 | id),
              data = dat)
summary(m_lmm)

# 2. Add an AR(1) within-subject correlation structure with nlme
m_lme <- lme(sbp_mmhg ~ visit * arm + age + female,
             random      = ~ 1 | id,
             correlation = corAR1(form = ~ visit | id),
             data        = dat, na.action = na.omit)
summary(m_lme)

# 3. Compare correlation structures with AIC
m_cs <- update(m_lme, correlation = corCompSymm(form = ~ visit | id))
m_un <- update(m_lme, correlation = corSymm(form    = ~ 1     | id))
AIC(m_lme, m_cs, m_un)

# 4. Same trial, binary outcome (adherence) -- GLMM and GEE
m_bin <- glmer(adherent ~ visit * arm + age + female + (1 | id),
               data    = dat, family = binomial,
               control = glmerControl(optimizer = "bobyqa"))
exp(fixef(m_bin))               # subject-specific ORs

m_gee <- geeglm(adherent ~ visit * arm + age + female,
                id = id, data = dat, family = binomial,
                corstr = "exchangeable")
exp(coef(m_gee))                # population-averaged ORs

Pick the structure deliberately. AR(1) (corAR1) is the natural choice for evenly-spaced visits; corCAR1 handles unequal spacing; corSymm (unstructured) is the most general but estimates the most parameters. lme() handles dropout via likelihood, so you do not have to drop subjects. The visit:armintervention coefficient is the trial's headline number: the additional change in SBP per month attributable to the intervention. The last two lines refit the binary adherence outcome as a population-averaged GEE, so its odds ratios can sit beside the subject-specific ones from the GLMM.

R Reflect on what you just ran

Use the questions below to interpret the output you produced. Look at your console / plot before answering.

1. From summary(m_lmm), report the visit:armintervention coefficient, its SE, and its p-value. Translate it in one sentence: what is the additional change in SBP per month attributable to the intervention vs control?

Model answersummary(m_lmm) for visit:armintervention typically returns a coefficient around −0.5 mmHg/month with SE ~0.15 and p < 0.001. Translation: each additional month of follow-up, intervention-arm participants have an SBP change that is 0.5 mmHg more negative than control-arm participants, that is, the intervention's monthly effect on SBP. Over 12 months, the cumulative effect is ~6 mmHg lower SBP than control, a clinically meaningful difference if sustained.

2. From AIC(m_lme, m_cs, m_un), which correlation structure has the lowest AIC? Are the differences small (within 2 units) or large? Following the rule "pick the simplest structure with similar AIC," what would you report?

Model answerAIC(m_lme, m_cs, m_un) typically shows unstructured (un) with the lowest AIC, but compound symmetry (cs) often within a few units. With the rule "pick the simplest with similar AIC," report compound symmetry as the primary model and unstructured as a sensitivity analysis. Compound symmetry assumes all pairs of visits have the same correlation, a strong assumption but parsimonious. If AIC differences are > 10 between cs and un, switch to unstructured. If < 2, prefer cs.

3. From exp(fixef(m_bin)) and exp(coef(m_gee)), compare the subject-specific vs population-averaged ORs for visit:armintervention on the binary adherence outcome. Which is larger in magnitude, and why?

Model answerSubject-specific OR (from m_bin GLMM) is typically larger in magnitude than the population-averaged OR (from m_gee). For example, OR(GLMM) = 1.85 might correspond to OR(GEE) = 1.55. The reason is the logit link's non-linearity: averaging over the random-effects distribution shrinks marginal effects toward 1 relative to conditional effects. Subject-specific is the effect within a person; population-averaged is the effect averaged across the population. Reporting both with explicit labels is best practice.

Saved.

Approach	Handles Missing Data?	Assumes Equal Correlations?	Time-Varying Covariates?
Separate Time Points	Yes (per time point)	N/A (ignores structure)	Yes
Summary Statistics	Partially	No	No
RM ANOVA	No	Yes (compound symmetry)	No
MANOVA	No	No	No
Mixed Models	Yes	Flexible	Yes

Reflection

When would you choose a simple summary statistic approach over a mixed model for repeated measures data? What information might you lose by simplifying the analysis in this way?

Model answerSimple summary-statistic approach (e.g., compare mean change scores between groups, or area-under-the-curve): suitable when (a) sample is small, (b) follow-up times are equal across participants, (c) you want robust, easy-to-communicate effects, (d) interest is in the average effect rather than the time-course. Useful for pilot studies and simple primary analyses. Lose by simplifying: (i) information about time-course (does the effect emerge gradually, plateau, or attenuate?), (ii) statistical power (mixed models use the full longitudinal correlation structure), (iii) ability to handle missing data (summary statistics typically require complete data), (iv) flexibility (mixed models accommodate covariates, interactions, time-varying exposures, irregular measurement times). Default to mixed models for any non-trivial longitudinal analysis.

Reflection saved!

* Complete the quiz and reflection to continue.

Section 3

Linear Mixed Models with Correlation Structure

⏱ Estimated time: 20 minutes

Section 3 of 4

Linear Mixed Models with Correlation Structure

Modelling temporal dependence directly: compound symmetry, AR(1), ARMA(1,1), Toeplitz, and unstructured.

The key extension

Beyond random intercepts

Laird & Ware (1982) linear mixed model

\[ \color{#0B7B6B}{\mathbf{Y}_i} = \color{#C2410C}{\mathbf{X}_i\boldsymbol{\beta}} + \color{#6D28D9}{\mathbf{Z}_i\mathbf{b}_i} + \color{#BE185D}{\boldsymbol{\varepsilon}_i}, \quad \boldsymbol{\varepsilon}_i \sim \mathcal{N}(\mathbf{0},\, \boldsymbol{\Sigma}_i) \]

Y_i repeated outcomes X_iβ fixed effects Z_ib_i subject random effects ε_i within-subject errors

The random effect \(\mathbf{b}_i\) captures between-subject variability. The residual covariance \(\boldsymbol{\Sigma}_i\) models within-subject autocorrelation when a random intercept alone is insufficient.

Two core structures

Compound symmetry and AR(1)

Compound symmetry: all lags equal

\[ \color{#0B7B6B}{\operatorname{Corr}(Y_{ij}, Y_{ik})} = \color{#C2410C}{\rho} \quad \text{for all } j \neq k \]

Corr(Y_ij,Y_ik) correlation of any two times ρ single common correlation

AR(1): geometric decay with lag

\[ \color{#0B7B6B}{\operatorname{Corr}(Y_{ij}, Y_{ik})} = \color{#C2410C}{\rho}^{\,\color{#6D28D9}{|j-k|}} \]

Corr correlation of two times ρ one-step correlation |j−k| number of steps apart

Three more structures

ARMA(1,1), Toeplitz, and unstructured

ARMA(1,1)

2 parameters. Allows steeper or flatter initial decay than AR(1). Good when the correlation drops sharply then levels off.

Toeplitz

\(m-1\) parameters. Lag-specific unconstrained correlations. Requires equidistant time points.

Unstructured

\(m(m+1)/2\) parameters. No constraints. Only feasible with few time points and large samples.

Parameter count summary

\[ \text{CS}: 1 \quad \text{AR(1)}: 1 \quad \text{ARMA}: 2 \quad \text{Toeplitz}: m{-}1 \quad \text{Unstructured}: \tfrac{m(m+1)}{2} \]

Combining effects

Random effects plus correlation structures

Redundant combinations

Random intercept + compound symmetry errors produce the same structure. Both cannot be identified together.

Useful combinations

Random intercept + AR(1) errors: correlations decay but stay positive at every lag because of the shared intercept.

Use the AIC to compare non-nested structures; use likelihood-ratio tests for nested ones (e.g., AR(1) nested within Toeplitz).

Carry forward

Selecting the right correlation structure

Start with the empirical correlation matrix: does correlation decay with lag?
AR(1) is a well-motivated default for equally spaced repeated measures.
Random intercept + AR(1) errors capture both subject heterogeneity and autocorrelation.
Use AIC for non-nested comparisons; likelihood-ratio tests for nested ones.

Introduction and Overview

Where the modern toolkit takes over. An earlier section showed why the classical methods strain when measurement spacing is irregular, missingness is informative, or the within-subject correlation pattern is more complex than “all pairs equally correlated.” Mixed models with explicit residual correlation structures (compound symmetry, AR(1), Toeplitz, unstructured) are the response. They let us model the temporal dependence directly rather than assume it away, handle unbalanced or missing-at-random data gracefully, and combine random effects with structured residuals to capture both subject-level heterogeneity and within-subject autocorrelation. This section is the heart of the lesson.

Learning Objectives

Specify a linear mixed model with an explicit residual correlation structure and explain how it relaxes the compound-symmetry assumption.
Distinguish among compound symmetry, AR(1), ARMA(1,1), Toeplitz, and unstructured correlation matrices and identify when each is appropriate.
Combine random intercepts (and slopes) with structured residuals to capture subject heterogeneity and within-subject autocorrelation.
Use AIC for non-nested and likelihood-ratio tests for nested correlation structures during model selection.
Handle unbalanced and irregular-spacing designs that classical methods cannot accommodate.

Beyond Random Intercepts

A random intercept model assumes compound symmetry: all pairs of measurements on the same subject are equally correlated. For most repeated measures data, this assumption is violated because of autocorrelation. We need to extend the mixed model to include explicit correlation structures for the error term ε, the framework formalised in the classic Laird & Ware random-effects model for longitudinal data (Laird & Ware, 1982; Fitzmaurice, Laird, & Ware, 2011).

Choosing a Correlation Structure

The choice of correlation structure is one of the most important decisions in repeated measures analysis. Start by examining the empirical correlation matrix. If correlations clearly decay with increasing time lag, consider AR(1) or ARMA(1,1). If the decay is minimal, compound symmetry may suffice. If the pattern is complex, consider Toeplitz or unstructured. Use AIC to compare non-nested structures and likelihood ratio tests for nested ones.

Key Correlation Structures

Each structure below is a different assumption about how a subject's repeated measurements hang together over time. More flexible assumptions fit a wider range of patterns but cost more parameters, so the aim is the simplest structure that still matches the decay you see in the empirical correlation matrix.

Compound Symmetry (Exchangeable)

All pairs of measurements have the same correlation ρ, regardless of how far apart in time they are. This is the simplest structure and is equivalent to a random intercept model. It has only 1 correlation parameter.

When appropriate: When there is no autocorrelation, that is, the correlation between measurements does not depend on time distance. This is rare in practice for true repeated measures data.

First-order autoregressive: AR(1)

Correlations decay as powers of ρ with increasing time distance: Corr(Y_j, Y_k) = ρ^|j−k|. This produces an exponential decay pattern. It has only 1 parameter (ρ) and is a good default for equally spaced repeated measures.

When appropriate: When the correlation matrix shows a clear pattern of decreasing correlations with increasing time lag, and the decay appears approximately geometric.

ARMA(1,1)

An extension of AR(1) that allows a slower or more flexible decay in correlations. It has 2 parameters and can accommodate patterns where the initial drop in correlation is steep but then levels off.

Toeplitz (Stationary)

Each lag has its own unconstrained correlation. For m time points, there are m − 1 correlation parameters. The structure is “banded”: the correlation depends only on the time lag, not on which specific time points are involved.

When appropriate: When the pattern of decay is irregular and cannot be well approximated by AR(1) or ARMA, but you still believe the correlation depends only on lag distance.

Unstructured

Completely unconstrained correlations and variances for each pair of time points. For m time points, there are m(m+1)/2 parameters. This is the most flexible but requires the most parameters.

When appropriate: Only with few time points and large sample sizes. With many time points, the number of parameters becomes impractical.

AR(1) correlation structure

\[ \color{#0B7B6B}{\operatorname{Corr}(Y_{ij}, Y_{ik})} = \color{#C2410C}{\rho}^{\,\color{#6D28D9}{|j-k|}} \]

Under AR(1), the correlation between two measurements is the one-step correlation raised to the number of time steps between them: it decays geometrically with lag (lag 1 gives ρ, lag 2 gives ρ², lag 3 gives ρ³).

Structure	Parameters	Key Feature	Assumption
Compound Symmetry	1	Equal correlations	No autocorrelation
AR(1)	1	Geometric decay	Equidistant time points
ARMA(1,1)	2	Flexible decay	Equidistant time points
Toeplitz	m − 1	Lag-specific correlations	Equidistant time points
Unstructured	m(m+1)/2	Completely flexible	None

Combining Random Effects with Correlation Structures

An important practical consideration is how random effects interact with error correlation structures. Some combinations are redundant and cannot be separately identified:

Random intercepts + compound symmetry errors = redundant, since both produce the same correlation structure
Random intercepts + AR(1) errors = useful, since it produces a structure where correlations decay but do not reach zero
Unstructured errors + random effects = pointless, since the unstructured covariance already captures everything

Covariance pattern models use no random effects at all, relying entirely on the structured covariance of the errors to capture within-subject correlation.

Model Selection

For nested correlation structures (e.g., AR(1) is nested within Toeplitz), use likelihood ratio tests. For non-nested structures (e.g., AR(1) vs. compound symmetry), use AIC or similar information criteria. Models should be compared with the same fixed effects and random effects structure.

Example: Comparing Correlation Structures

In a study with 6 equally-spaced measurements, the empirical correlations ranged from 0.72 (lag 1) to 0.31 (lag 5). An AR(1) model with ρ = 0.73 fit well (AIC = 2,341), while compound symmetry (AIC = 2,398) fit poorly because it predicted equal correlations of 0.52 at all lags. The Toeplitz model (AIC = 2,338) offered a slight improvement over AR(1) but used 4 more parameters. Based on parsimony, AR(1) was selected.

Reflection

A study measures blood pressure at 6 monthly visits. The correlation between visits 1 and 2 is 0.60, between visits 1 and 6 is 0.15. Which correlation structure would you initially consider, and why?

Model answerCorrelations decay with time-distance: 0.60 between visits 1–2 (1 month apart) and 0.15 between visits 1–6 (5 months apart). This is the signature of an autoregressive AR(1) structure, where correlation between visits at times s and t is ρ^|s−t|; with adjacent ρ = 0.60, then ρ⁵ for visits 5 apart = 0.60⁵ = 0.078, echoing the observed 0.15: both show correlation decaying steeply as the gap widens. AR(1) fits the data structure where temporal proximity matters, appropriate for many longitudinal measures (BP, weight, biomarkers). Alternative structures to consider: continuous-AR(1) for irregular time intervals, antedependence if correlations also decay in non-stationary ways, or unstructured as a more flexible (but parameter-heavy) alternative. Test the AR(1) assumption with an unstructured model and compare AIC.

Reflection saved!

* Complete the quiz and reflection to continue.

Section 4

Trend Models, Discrete Outcomes & GEE

⏱ Estimated time: 20 minutes

Section 4 of 4

Trend Models, Discrete Outcomes & GEE

Random slopes for time, extending to binary and count data, and the population-averaged alternative.

Trend models

Random slopes: each subject with their own trajectory

Random intercept and slope model

\[ \color{#0B7B6B}{Y_{ij}} = (\color{#C2410C}{\beta_0} + \color{#6D28D9}{b_{0i}}) + (\color{#C2410C}{\beta_1} + \color{#6D28D9}{b_{1i}})\,\color{#1D4ED8}{t_{ij}} + \color{#BE185D}{\varepsilon_{ij}} \]

β₀,β₁ average intercept and time slope b_0i,b_1i subject departures t_ij time ε_ij residual

Discrete outcomes

Transition models for binary repeated measures

Transition model (Eq 23.5, Dohoo et al.)

\[ \color{#0B7B6B}{\operatorname{logit}(p_{ij})} = \color{#C2410C}{\mathbf{X}\boldsymbol{\beta} + \mathbf{Z}\mathbf{u}} + \color{#6D28D9}{\gamma\, Y_{i,j-1}} \]

logit(p_ij) current log-odds Xβ+Zu fixed and random effects γY_i,j-1 previous-outcome effect

The coefficient \(\gamma\) is the log odds ratio for the event at time \(j\) given the event occurred at time \(j-1\). A positive \(\gamma\) means having the event previously raises the odds of having it now.

Limitation: all other coefficients become conditional on the prior outcome, and the first observation has no predecessor.

GEE

Generalised estimating equations: the marginal alternative

GEE estimating equation (Liang & Zeger, 1986)

\[ \sum_{i=1}^{n} \color{#C2410C}{\mathbf{D}_i^\top} \color{#6D28D9}{\mathbf{V}_i^{-1}}(\color{#0B7B6B}{\mathbf{Y}_i - \boldsymbol{\mu}_i}) = \mathbf{0} \]

Y_i−μ_i observed minus fitted D_i derivative of the mean V_i working covariance

\(\mathbf{V}_i\) encodes the working correlation. Sandwich standard errors \(\hat{V}_{\text{sandwich}}\) remain consistent even when \(\mathbf{V}_i\) is mis-specified, as long as \(n\) is large enough.

The key distinction

Conditional versus population-averaged effects

Mixed model (conditional)

Effect for a given individual, holding random effects fixed. Answers: what changes for this subject?

GEE (marginal)

Average effect across the population. Answers: what is the average change across everyone?

For linear models these coincide. For logistic or Poisson regression they diverge. Choose based on the scientific question, not convenience.

Series complete

The complete toolkit

Random slopes let individual trajectories carry the autocorrelation structure.
Transition models handle discrete outcomes by conditioning on the prior value, at the cost of interpretability.
GEE targets population-averaged effects with sandwich standard errors; requires enough subjects for the sandwich variance to be reliable.
Choose mixed models when subject-specific effects matter; choose GEE when the population-average is the question.

Introduction and Overview

Pulling the threads together. An earlier section framed within-subject correlation as something to be modelled directly. This section closes the loop on three remaining concerns. First, trend models with random slopes: rather than (or in addition to) structuring the residuals, we let each subject have their own trajectory over time, a particularly intuitive approach when the question is about individual change. Second, discrete longitudinal outcomes: extending the GLMM machinery from an earlier lesson to repeated binary or count measurements. Third, generalised estimating equations (GEE): the marginal alternative to mixed models, which prioritises population-average effects and is robust to mis-specification of the working correlation. Together with an earlier section, this section gives you a complete repeated-measures toolbox.

Learning Objectives

Fit trend models with random slopes for time and explain how individual trajectory variation induces within-subject autocorrelation.
Choose linear, polynomial, or log-time parameterisations to match the shape of change over time.
Apply transition and GLMM-based approaches to discrete repeated-measures outcomes.
Describe how generalised estimating equations (GEE) target population-averaged effects with a working correlation matrix and a robust sandwich variance.
Decide between mixed models and GEE based on whether the substantive question is conditional or marginal.

Trend Models with Random Slopes

An alternative to modelling the error correlation directly is to include random slopes for time. This allows each subject to have their own rate of change (growth or decline) over time, with the population-average trend captured by the fixed effect of time.

The variation in individual trajectories naturally induces autocorrelation: subjects who start high and decline slowly will have correlated measurements. This can be sufficient to capture the temporal structure in many datasets, especially when the primary interest is in individual trajectories.

The time variable can be parameterized in different ways: linear (for constant rates of change), polynomial (for curved trajectories), or log-transformed (for rapid early change that levels off).

Discrete Repeated Measures Data

Extending mixed models to discrete outcomes (binary, count) with correlation structures is much harder than for continuous outcomes. The fundamental challenge is that in GLMs, the error term and the linear predictor operate on different scales: the link function transforms the relationship, making it difficult to add correlation structures to the error term in a meaningful way.

When to Use GEE vs. Mixed Models

Use GEE when your research question focuses on population-averaged (marginal) effects; for example, “What is the average treatment effect across the population?” Use mixed models when you want subject-specific (conditional) effects or when the random effects themselves are of scientific interest; for example, “How much do individual subjects vary in their response?”

Transition Models

One approach for discrete repeated measures is the transition model, which includes the previous outcome as a predictor. This captures autocorrelation informally through dependence on the prior outcome.

Transition model (Eq 23.5)

\[ \color{#0B7B6B}{\operatorname{logit}(p_{ij})} = \color{#C2410C}{\mathbf{X}\boldsymbol{\beta} + \mathbf{Z}\mathbf{u}} + \color{#6D28D9}{\gamma\, Y_{i,j-1}} \]

The log-odds at the current time depend on the usual fixed and random effects plus a term for the previous outcome, so history feeds directly into the present.

Here, γ is the log odds ratio comparing those with versus without the previous event. A positive γ means that having the event at the previous time point increases the odds of having it at the current time point.

Generalised Estimating Equations (GEE)

GEE is a population-averaged (marginal) approach that does not require specifying random effects (Liang & Zeger, 1986). Instead, it specifies a “working” correlation structure and uses robust (sandwich) standard errors that provide valid inference even if the working correlation is misspecified.

Trend Models

Trend models add random slopes for time, allowing each subject to have their own trajectory. The random slope induces autocorrelation through the variation in individual trajectories. This approach is particularly natural when the scientific question is about individual growth or decline rates.

Key considerations: Choice of time parameterization (linear, polynomial, log), whether to include both random intercepts and slopes, and whether the induced autocorrelation is sufficient or additional error correlation is needed.

Transition Models

Transition models include the previous outcome Y_i,j−1 as a predictor in the model. The coefficient γ represents the log OR for the event given the previous event occurred. This approach is intuitive and can be combined with random effects.

Limitations: Difficult to interpret coefficients for other predictors (they are conditional on the previous outcome), requires careful handling of the first observation (which has no “previous” value), and may not fully capture complex autocorrelation patterns.

Generalised Estimating Equations (GEE)

GEE estimates population-averaged effects using a quasi-likelihood approach. Key features:

Specifies a working correlation (e.g., exchangeable, AR(1), unstructured)
With robust (sandwich) SEs, inference is valid even if the working correlation is wrong
Requires enough clusters/subjects (≥20–30) for reliable sandwich SEs
Cannot estimate cluster-specific (random) effects; gives only PA estimates
Better working correlation = more efficient estimates (but always valid with robust SEs)

Population-AveragedClick to explore

Subject-SpecificClick to explore

Robust (Sandwich) SEsClick to explore

Example: GEE Analysis of Repeated Binary Outcome

A study followed 200 patients over 4 visits, recording whether they experienced a symptom (yes/no) at each visit along with a treatment indicator. A GEE model with exchangeable working correlation and robust SEs estimated the treatment OR as 0.65 (95% CI: 0.48–0.88), suggesting treatment reduced the odds of symptoms by 35% on average across the population. The working correlation was estimated as 0.42.

Feature	GEE	Mixed Models (GLMM)
Estimate type	Population-averaged (PA)	Subject-specific (SS)
Random effects	Not estimated	Estimated
Correlation	Working correlation + robust SEs	Explicit random effects / correlation
Missing data assumption	MCAR	MAR
Minimum clusters	≥20–30	Fewer acceptable
Best for	PA inference	SS inference, variance components

Reflection

Compare the GEE approach and the mixed model approach for analyzing repeated binary outcomes. In what research context would you prefer each approach, and why?

Model answerGEE vs. mixed model for repeated binary outcomes. GEE: produces population-averaged (marginal) estimates; robust to mis-specification of the within-subject correlation structure when robust SEs are used; computationally simpler; preferred when the research question is population-level ("how does the proportion of adherers change with treatment?") and when you want valid SEs despite uncertain correlation structure. Mixed model (GLMM): produces subject-specific (conditional) estimates; explicitly models random effects; preferred when the research question is within-subject ("how does an individual's adherence change over time?") or when you need to predict for individuals. Practical guidance: report both, with explicit interpretation labels; GEE is more robust to missing-at-random patterns (when robust SEs are used), GLMM gives richer information about within-subject variability.

Reflection saved!

* Complete the quiz and reflection to continue.

HSCI 410, Lesson 12

Exploratory Data Analysis For Epidemiology

Repeated Measures Data

Learning objectives for this lesson:

Glossary: Key Terms, People & Concepts

Introduction & Descriptive Approaches

Repeated Measures Data

Where other courses converge

Introduction & Descriptive Approaches

Repeated measures: observations in sequence

Four properties that shape analysis choices

Balanced & uniform

Equidistant & autocorrelated

Dropout and its consequences

Intermittent missingness

Monotone / dropout

See the structure before modelling it

What to take into the next section

Introduction and Overview

Learning Objectives

What Are Repeated Measures?

Why Repeated Measures Require Special Methods

Key Terminology

Missing Data and Drop-Outs

Descriptive Approaches

Reflection

Univariate & Multivariate Approaches

Univariate & Multivariate Approaches

Separate analyses and summary statistics

Separate time points

Summary statistics

Repeated measures analysis of variance

Multivariate analysis of variance

What breaks in each classical method

Introduction and Overview

Learning Objectives

Simple Approaches to Repeated Measures

Separate Time Point Analysis

Summary Statistics Approach

Repeated Measures ANOVA

MANOVA (Multivariate Analysis of Variance)

Covariance and Correlation Matrices

Limitations of Each Approach

R Reflect on what you just ran

Reflection

Linear Mixed Models with Correlation Structure

Linear Mixed Models with Correlation Structure

Beyond random intercepts

Compound symmetry and AR(1)

ARMA(1,1), Toeplitz, and unstructured

ARMA(1,1)

Toeplitz

Unstructured

Random effects plus correlation structures

Redundant combinations

Useful combinations

Selecting the right correlation structure

Introduction and Overview

Learning Objectives

Beyond Random Intercepts

Choosing a Correlation Structure

Key Correlation Structures

Compound Symmetry (Exchangeable)

First-order autoregressive: AR(1)

ARMA(1,1)

Toeplitz (Stationary)

Unstructured

Combining Random Effects with Correlation Structures

Model Selection

Reflection

Trend Models, Discrete Outcomes & GEE

Trend Models, Discrete Outcomes & GEE

Random slopes: each subject with their own trajectory

Transition models for binary repeated measures

Generalised estimating equations: the marginal alternative

Conditional versus population-averaged effects

Mixed model (conditional)

GEE (marginal)

The complete toolkit

Introduction and Overview