Modelling Ordinal & Multinomial Data

Exploratory Data Analysis For Epidemiology

Kiffer G. Card, PhD, Faculty of Health Sciences, Simon Fraser University

Learning objectives for this lesson:

Select an appropriate model (multinomial, proportional-odds, adjacent-category, or continuation-ratio) based on study objectives and data
Fit all of the models listed above
Evaluate the assumptions on which each model is based
Interpret OR estimates from each model
Compute predicted probabilities from each model

This course was developed by Kiffer G. Card, PhD, as a companion to Dohoo, I. R., Martin, S. W., & Stryhn, H. (2012). Methods in Epidemiologic Research. VER Inc.

Section 1

Introduction & Overview of Models

⏱ Estimated time: 20 minutes

When Outcomes Have More Than Two Categories

In many epidemiological studies the outcome variable has more than two categories. These outcomes fall into two broad types: nominal data, where the categories have no natural ordering (e.g., type of disease, preferred clinic), and ordinal data, where the categories are ordered (e.g., pain severity: none, mild, moderate, severe).

The choice of model depends on whether the outcome is nominal or ordinal. Nominal data require multinomial logistic regression or log-linear models. Ordinal data can be analysed with the same multinomial model (ignoring the ordering), but more efficient approaches exploit the ordering: proportional-odds, adjacent-category, and continuation-ratio models.

📈

Nominal Data

Click to learn more

📊

Ordinal Data

Click to learn more

💡

Choosing the Right Model

Click to learn more

The Apgar Score Example

Throughout Chapter 17, the authors use Apgar scores as a running example. Apgar scores (measured at birth) are recoded into four ordinal categories. The research question is whether the number of prenatal visits is associated with Apgar score category.

Apgar Category	Code	Prenatal Visits < 6	Prenatal Visits ≥ 6	Total
1–6 (Low)	0	47	25	72
7	1	48	42	90
8	2	59	72	131
9–10 (High)	3	134	227	361
Total		288	366	654

Overview of the Four Models

Each of the four models for multi-category outcomes uses a different formulation of the logit (log-odds). Understanding the logit structure is the key to understanding each model.

Multinomial Logistic Regression (Eq 17.1)

Compares each outcome category to a baseline category. For J categories, the model estimates J−1 sets of coefficients. Each set describes how predictors relate to the log-odds of being in category j versus the baseline.

Equation 17.1

ln[p(Y = j) / p(Y = 1)] = β₀^(j) + β₁^(j)X

No assumptions about ordering are made, so this model is appropriate for both nominal and ordinal outcomes (though it is less efficient for ordinal data).

Proportional-Odds Model (Eq 17.2)

Based on cumulative probabilities. The logit compares the probability of being at or above category j versus below it. A single coefficient per predictor applies at every cutpoint—the proportional-odds assumption.

Equation 17.2

ln[p(Y ≥ j) / p(Y < j)] = β₀^(j) + β₁X

This is the most common ordinal logistic model and is more parsimonious than the multinomial model.

Adjacent-Category Model (Eq 17.3)

Compares each category to the adjacent (next lower) category. This model is a constrained version of the multinomial model where the coefficient for categories n levels apart equals n times the coefficient for adjacent categories.

Equation 17.3

ln[p(Y = j) / p(Y = j−1)] = β₀^(j) + β₁X

Like the proportional-odds model, it estimates a single β₁ per predictor.

Continuation-Ratio Model (Eq 17.4)

Compares each category to all lower categories combined. This model is especially appropriate when the outcome represents sequential stages that must be “passed through” to reach higher levels (e.g., number of attempts to achieve certification).

Equation 17.4

ln[p(Y = j) / p(Y < j)] = β₀^(j) + β₁X

Can be fit as a series of separate binary logistic regressions with appropriately recoded outcome variables.

✎ Reflection

Think of an ordinal outcome variable from your own field of study. What are the categories, and which of the four models introduced here do you think would be most appropriate? Why?

✓ Reflection saved!

● Complete the quiz and reflection to continue.

Section 3

Proportional-Odds Model

⏱ Estimated time: 25 minutes

The Most Common Ordinal Model

The proportional-odds model (also called the cumulative logit model or ordinal logistic regression) is the most widely used model for ordinal outcomes. It is based on the idea of an underlying continuous latent variable that is divided into the observed ordinal categories by a series of cutpoints.

Equation 17.7 — Latent Variable

S_i = β₁X_1i + β₂X_2i + … + β_kX_ki + ε_i

The latent variable S_i is divided by cutpoints (τ₁, τ₂, …, τ_J−1) into J observed categories. If S_i falls between τ_j−1 and τ_j, the observation is classified into category j.

The Proportional-Odds Logit

The model takes the form of a cumulative logit: logit(p(Y ≥ j)) = β_0j + βX. The key feature is that the intercept varies across cutpoints (giving parallel lines on a logit scale) but the slope coefficients are the same for every cutpoint. This means a single OR summarises the effect of each predictor across all levels of the outcome.

Equation 17.9 — Predicted Probability from Latent Variable

p(Y = j) = p(S ≤ τ_j) − p(S ≤ τ_j−1)

📋 Example: Apgar Scores — Proportional-Odds Model

In the Apgar score example, the proportional-odds model yields an OR of 1.59 for prenatal visits (≥6 vs <6). This means that individuals with 6 or more prenatal visits have 1.59 times the odds of being at or above any given Apgar category, compared to those with fewer visits. This single OR applies at every cutpoint (0 vs 1+, 0–1 vs 2+, and 0–2 vs 3).

Testing the Proportional-Odds Assumption

The proportional-odds assumption is that the effect of each predictor is the same at every cutpoint. If violated, the model may give misleading results. Several tests are available:

Testing Proportional Odds

Three main approaches exist:

Approximate LRT: Compare the log-likelihoods of the proportional-odds model and the multinomial model. A significant difference suggests the proportional-odds assumption is violated.
Wolfe-Gould approximate LRT: Based on J−1 separate binary logistic models at each cutpoint. Sum the log-likelihoods and compare to the proportional-odds model.
Brant (Wald) test: Provides both an overall test and individual tests for each predictor, showing which specific variables violate the assumption.

Generalised Ordinal Logistic Regression

If the proportional-odds assumption is violated, a generalised ordinal logistic regression model allows separate coefficients at each cutpoint. This model is equivalent to fitting J−1 separate binary logistic regressions simultaneously. It is more flexible but less parsimonious than the proportional-odds model.

Partial Proportional-Odds Model

A compromise approach is the partial proportional-odds model, which relaxes the proportional-odds assumption for selected predictors only (those that fail the Brant test) while maintaining it for the rest. This provides a good balance between flexibility and parsimony. Other alternatives include the stereotype logistic model and the heterogeneous choice logistic model.

⚠ The Proportional-Odds Assumption in Practice

The proportional-odds assumption is often violated in practice, especially with many predictors or when the outcome categories represent very different phenomena. Always test this assumption before reporting results from a proportional-odds model. If violated, consider a partial proportional-odds model or generalised ordinal logistic regression.

Brant Test Results Example

The Brant test provides both an overall test and predictor-specific tests. Here is an example of how results might be presented:

Predictor	χ²	df	P-value	Assumption Holds?
Prenatal visits	2.14	2	0.343	Yes
Maternal age	8.92	2	0.012	No
Parity	1.03	2	0.598	Yes
Overall	12.45	6	0.053	Borderline

In this example, only maternal age violates the assumption. A partial proportional-odds model that allows maternal age to have different effects at each cutpoint (while constraining prenatal visits and parity) would be appropriate.

Regression Diagnostics

Regression diagnostics for the proportional-odds model can be conducted by fitting binary logistic models at each cutpoint and applying the diagnostic techniques from Chapter 16 (residual analysis, influence measures, goodness-of-fit tests).

✎ Reflection

Why do you think the proportional-odds assumption is so often violated in practice? Can you think of a scenario in your own research where you would expect the effect of a predictor to differ across cutpoints?

✓ Reflection saved!

● Complete the quiz and reflection to continue.

Section 4

Adjacent-Category & Continuation-Ratio Models

⏱ Estimated time: 20 minutes

Adjacent-Category Model

The adjacent-category model compares the probability of being in category j versus category j−1 (the next lower category). It is a constrained version of the multinomial logistic model: the constraint is that the coefficient for categories n levels apart equals n times the coefficient for adjacent categories.

Like the proportional-odds model, the adjacent-category model estimates a single β₁ per predictor, making it more parsimonious than the unconstrained multinomial model. The validity of this constraint can be tested by comparing the adjacent-category model to the unconstrained multinomial model using a likelihood-ratio test (LRT).

📋 Example: Apgar Scores — Adjacent-Category Model

For the Apgar score data, the LRT comparing the adjacent-category model to the unconstrained multinomial model yielded χ² = 6.76, df = 5, P = 0.239. Since this is not significant, the adjacent-category model is a valid simplification of the multinomial model for these data.

Continuation-Ratio Model

The continuation-ratio model compares the probability of being in category j versus all lower categories combined. It is particularly useful when the outcome represents sequential stages that must be “passed through” to reach higher levels.

When Is the Continuation-Ratio Model Appropriate?

The continuation-ratio model is ideal for outcomes where each level must be reached before the next can be attained. Examples include: number of attempts to pass an exam, stages of disease progression where remission must occur before relapse, or sequential rounds of a selection process. It is NOT appropriate when movements between categories are not sequential (e.g., Apgar scores, where a baby does not “pass through” each score level).

Fitting the Continuation-Ratio Model

The continuation-ratio model can be fit as a series of separate binary logistic regressions with a recoded outcome variable. For each comparison:

Y = 1 for the level of interest
Y = 0 for all lower levels
Observations at higher levels are excluded (treated as missing)

Consider an example with 4 categories representing the number of attempts to gain admission to medical school (1, 2, 3, 4+):

Original Category	Y₁ (1 vs 0)	Y₂ (2 vs 0–1)	Y₃ (3 vs 0–2)
0 (1 attempt)	0	0	0
1 (2 attempts)	1	0	0
2 (3 attempts)	—	1	0
3 (4+ attempts)	—	—	1

You can fit either a constrained version (equal ORs across levels, tested by LRT) or an unconstrained version (separate ORs at each level). The constrained version is more parsimonious and can be compared to the unconstrained version using a likelihood-ratio test.

↔

Adjacent-Category Model

Click to learn more

↗

Continuation-Ratio Model

Click to learn more

🔧

Choosing Between Models

Click to learn more

When to Use the Adjacent-Category Model

The adjacent-category model is appropriate when the comparison of interest is between neighbouring categories of an ordinal outcome. It is a natural choice when you believe the effect of a predictor operates by shifting individuals one category at a time. The model can be validated by comparing it to the unconstrained multinomial model via LRT.

When to Use the Continuation-Ratio Model

The continuation-ratio model is most appropriate when the outcome represents sequential stages that must be passed through in order. Each category must be reached before the next can be attained. Examples include: successive attempts at an exam, sequential rounds of treatment, or stages of career advancement. If categories can be reached without passing through lower levels, this model is not appropriate.

Comparing Models with LRT

When one model is a constrained (nested) version of another, the likelihood-ratio test can be used to compare them. The test statistic is −2(lnL_constrained − lnL_{unconstrained}), which follows a χ² distribution with degrees of freedom equal to the difference in the number of parameters. A significant result suggests the constraint is not valid and the more complex model is needed.

Decision Guide: Choosing Among the Four Models

Step 1: Is the outcome nominal or ordinal? If nominal, use multinomial logistic regression.
Step 2: If ordinal, does the outcome represent sequential stages? If yes, consider the continuation-ratio model.
Step 3: If not sequential, fit the proportional-odds model and test the assumption. If it holds, use proportional-odds.
Step 4: If the proportional-odds assumption fails, consider the adjacent-category model, partial proportional-odds, or generalised ordinal logistic regression.
Step 5: Compare nested models using LRT to select the most parsimonious adequate model.

✎ Reflection

Can you think of an example from public health or epidemiology where a continuation-ratio model would be more appropriate than a proportional-odds model? What makes the outcome sequential in your example?

✓ Reflection saved!

● Complete the quiz and reflection to continue.

HSCI 410 — Lesson 5

Exploratory Data Analysis For Epidemiology

Modelling Ordinal & Multinomial Data

Learning objectives for this lesson:

Introduction & Overview of Models

When Outcomes Have More Than Two Categories

The Apgar Score Example

Overview of the Four Models

✔ Check Your Understanding

✎ Reflection

Multinomial Logistic Regression

The Multinomial Logistic Model

Predicted Probabilities

Interpreting Odds Ratios

Predicted Probabilities

Independence of Irrelevant Alternatives (IIA)

Alternative-Specific Data

✔ Check Your Understanding

✎ Reflection

Proportional-Odds Model

The Most Common Ordinal Model

The Proportional-Odds Logit

Testing the Proportional-Odds Assumption

Brant Test Results Example

Regression Diagnostics

✔ Check Your Understanding

✎ Reflection

Adjacent-Category & Continuation-Ratio Models

Adjacent-Category Model

Continuation-Ratio Model

Fitting the Continuation-Ratio Model

✔ Check Your Understanding

✎ Reflection

Lesson 5 — Final Assessment

✎ Final Reflection

✔ Final Assessment

🏆 Congratulations!