A Copula Based Framework for Modeling Dependence and Growth in Educational Assessment
National Center for the Improvement of Educational Assessment
NCIEA Board Meeting · April 2026 · Boston, Massachusetts
This manuscript represents the collaborative intellectual work of the authors with substantive and continuous assistance from AI-based systems (OpenAI GPT, Anthropic Claude), integrated natively into all phases of the research.
The use of AI in this paper is not incidental or peripheral — it is essential, embedded, and reflective of a methodological shift toward AI-native scholarship. This statement is offered in the spirit of methodological clarity and epistemic honesty.
Theoretical development: The foundational idea to extend TAMP using copulas and Sklar’s Theorem emerged in human–AI collaboration.
Mathematical formalization: AI supported the expression of key relationships — the Fréchet–Hoeffding bounds reinterpreting TAMP as a degenerate copula, and the clarification that SGPc is a scaled evaluation of the transition kernel associated with the copula (the conditional CDF that carries prior-year ranks to current-year ranks)
Computational modules: AI contributed to the design of R code for copula estimation, simulation, and visualization — forming part of the SGPc R package
Authorship & responsibility: Authors retain full responsibility for all claims (ICMJE/AERA norms). AI is not listed as author but its instrumental role is explicitly acknowledged
In order to perform longitudinal data analysis, one needs longitudinal data.
Under certain circumstances — precisely those that hold for many large-scale assessments in education — cross-sectional data can support credible growth inference.
What those circumstances look like:
If we can model how ranks move together, we can reconstruct a valid joint distribution — and therefore growth — from cross-sectional data alone. Copulas are the machinery that makes this possible.
The central claim of this work:
Group-level growth inference does not require student-level longitudinal data.
On first impression, this seems impossible. The intuition that longitudinal questions require longitudinal data is deeply ingrained. Support of our claim requires we make two complementary arguments:
A theoretical argument. Sklar’s theorem and the copula decomposition establish that group-level growth is mathematically recoverable from marginals combined with an appropriate dependence structure — the claim is at least possible.
An empirical argument. Across 966 longitudinal conditions, four datasets, and four content areas, the dependence structure in educational data turns out to be remarkably stable, well-approximated by a small family of copulas, and yields growth inferences that are accurate enough to trust — the claim is also practical.
Neither argument stands alone. Theory without evidence is speculation; evidence without theory is coincidence. Together, they make a counter-intuitive claim defensible — which is what the remainder of this presentation will demonstrate.
Extends Student Growth Percentiles (SGP). SGP is an NCIEA-developed methodology (Betebenner, 2009) now used in dozens of state accountability systems. This copula based framework retains longitudinal SGP when linkage exists while extending SGPc-style growth inference to settings where student-level linkage does not exist.
Unlocks growth inference for NAEP, TIMSS, PISA, and cross-state comparisons. These are programs NCIEA advises on where longitudinal linkage is often impossible by design; the copula framework opens population- and subpopulation-level growth questions that were previously out of reach.
Strengthens the measurement foundation for fair score use. Copulas provide an ordinally defensible basis for growth comparisons, aligning with NCIEA’s long-standing commitment to valid, equitable, and defensible assessment interpretation.
Improves practical monitoring and decision support through a Statistical Process Control (SPC) lens. Combining SGPc with SPC frameworks for process monitoring and signal detection helps distinguish real process change from other forms of change, such as scale drift — supporting clearer, more actionable signals for policy and practice.
A single methodological shift — separating dependence from measurement — strengthens SGP practice, expands to support NAEP-class growth inference, reinforces measurement validity, and sharpens SPC-style monitoring signals.
The Methodology
How Does It Work?
A brief look under the hood.
With longitudinal data, we observe linked pairs — the same student measured at two time points.
The joint density encodes everything — the marginals and the dependence between them.
The joint density can be decomposed into three mathematical pieces: the marginal density (left), the marginal density (right), and the bivariate scatter encoding dependence between them (center).
With cross-sectional data like TIMSS or NAEP, the marginals and are observed — but the joint density connecting them is unknown. The center panel is missing. We need a mathematical bridge to reconstruct the dependence.
The copula is that bridge. It models the dependence structure independently of the marginals, allowing us to reconstruct a valid joint distribution from cross-sectional data alone.
Sklar’s Theorem (1959)
Any joint distribution with continuous marginals can be decomposed uniquely as:
where is a copula — a function that couples the marginals into a valid joint distribution.
Sklar’s decomposition turns the “missing data” problem into a “missing dependence” problem — and copulas provide a principled way to fill that gap.
Copulas provide a flexible toolkit for modeling dependence that goes far beyond correlation:
The copula is not just a decomposition tool — it is a modular design choice. Swap in a different copula, and you get a different joint distribution with the same marginals.
Braun (1988) introduced Trajectory Analysis of Matched Percentiles (TAMP) to investigate how growth might be measured on assessments like NAEP where no longitudinal student data exist.
AI Breakthrough: TAMP = Comonotonic Copula
TAMP implicitly assumes perfect rank preservation — a student at the -th percentile at time 1 is assumed to remain at the -th percentile at time 2. The equipercentile rule is an identity mapping in percentile space.
Once you see TAMP as a copula, you see how to generalize it. Any copula supports a pseudo-longitudinal model. The comonotonic case is just one — the most extreme/degenerate — choice.
Copula CDF surface. The t-copula is the joint CDF of . The green slice fixes . The tangent slope in the -direction at equals
Conditional CDF. For fixed , the partial derivative is a one-dimensional CDF in . At the height , so
the 27th conditional percentile.
Conditional PDF. Differentiating once more in yields the conditional density . The shaded area satisfies
matching the conditional CDF value and .
The copula encodes the Student Growth Percentile of any pair of pseudo-observations as a tangent-line slope to the copula surface.
Two cases arise in practice:
Longitudinal data exist. Fit an empirical copula directly from paired observations.
Only cross-sectional data. Can we substitute an “off-the-shelf” parametric copula and still recover meaningful growth inferences?
The probability-integral transform strips all metric structure, mapping every score distribution onto . In this uniform space, educational data exhibit remarkably similar dependence structures regardless of content area, grade span, or subgroup.
Our bet: Once the PIT removes metric structure from the scales, a simple parametric copula is a good-enough proxy for the unknown empirical copula — especially for group-level summaries with .
966 longitudinal conditions across 4 datasets, 4 content areas, and year spans 1–4. Six copula families fitted to each; model selected by AIC with bootstrap goodness-of-fit.
| Family | Best (AIC) | Share |
|---|---|---|
| t | 614 | 63.6% |
| frank | 297 | 30.7% |
| gumbel | 35 | 3.6% |
| gaussian | 20 | 2.1% |
| comonotonic | 0 | 0.0% |
The -copula dominates every year span (58–65%) and content area. It beats the Gaussian in 98% of head-to-head comparisons (mean AIC = 169–241).
A single -copula family covers >94% of observed educational dependence structures — the canonical copula is not an arbitrary choice but an empirically justified default.
Observe unlinked marginals. We have the prior-year PDF and current-year PDF — but no student-level pairs . The PIT yields marginal pseudo-observations and whose joint dependence is unknown.
Estimate the growth regime. Search the Beta family for the regime that minimises the Wasserstein distance between the observed and the predicted current-year CDF induced by the canonical copula:
Verify the fit. The forward CDF check compares the observed current-year CDF (solid) against the CDF predicted by the estimated regime passed through the canonical copula kernel (dashed). Close agreement confirms the regime recovers the marginal.
From marginals to growth: the canonical copula provides the dependence kernel; the minimum-distance regime recovers mean SGPc — all without student-level linkage.
239 subgroups across multiple conditions, datasets, content areas, year spans, and pool types.
The canonical copula approach recovers subgroup growth with median absolute error under 0.5 percentile points across the full validation battery.
At the individual student level (37.4M observations):
What do we lose without student-level linkage?
For subgroups of (typical NAEP/TIMSS state samples). All values in SGPc units (0–100):
| Metric | Paired (linked) | Independent (unlinked) | Ratio |
|---|---|---|---|
| SE (mean SGPc) | 0.60 | 1.48 | 2.5 |
| CI width (mean) | 2.16 | 5.65 | 2.6 |
| SE (median SGPc) | 0.79 | 1.96 | 2.5 |
| CI width (median) | 2.91 | 7.48 | 2.6 |
Linkage buys a precision gain. But even the unlinked mean SGPc CI width ( pts) is actionable for policy comparisons at scale. The price of cross-sectionality is quantifiable and manageable.
Copula parameter stability:
The copula is recommended across all year spans:
| Year span | (median) | (median) | (median) |
|---|---|---|---|
| 1 | 0.638 | 0.842 | 25.6 |
| 2 | 0.601 | 0.810 | 28.7 |
| 3 | 0.574 | 0.784 | 28.2 |
| 4 | 0.553 | 0.764 | 27.4 |
Dependence attenuates with time span as expected, but the structure remains remarkably stable.
Cross-sectional data are sufficient for group-level growth inference. The canonical -copula supplies the dependence kernel; the minimum-distance regime recovers mean SGPc from marginals alone — no student-level linkage required.
The canonical copula is empirically grounded. Across 966 conditions, 4 datasets, and 4 content areas, the -copula is selected in 64% of cases, and together with the Frank covers >94%. Dependence parameters attenuate smoothly with year span (: 0.64 0.55).
Accuracy is high; the precision cost is quantifiable. Inferred vs true mean SGPc: , MAE pts. Without linkage, confidence intervals widen — a manageable price for programs (NAEP, TIMSS, PISA) where paired data simply do not exist.
Growth measurement no longer requires longitudinal data — it requires marginals, a canonical copula, and a minimum-distance fit.
In the next few weeks.
Over the coming year.
These steps walk the Four NCIEA Connections in sequence: extending SGP (1), unlocking growth inference for NAEP- and TIMSS-class assessments (2), exercising the ordinally defensible measurement foundation (3), and opening the door to SPC-style monitoring of tail behavior (4). The work sits squarely within the Center’s mission — both within its near- and long-term agendas.
Longitudinal Inference Without Longitudinal Data
A Copula Based Framework for Modeling Dependence and Growth in Educational Assessment
Damian W. Betebenner
National Center for the Improvement of Educational Assessment
dbetebenner@nciea.org
Thank you
Appendix
Longitudinal Inference Without Longitudinal Data
Technical Background & Supplemental Material
The development is distribution-function first — CDFs are the primitive objects, not densities.
Key notation:
| Symbol | Meaning |
|---|---|
| CDF of | |
| Density of | |
| Quantile function | |
| Copula | |
| Pseudo-observations on | |
| Latent conditional percentile | |
| Subgroup growth regime CDF |
Why CDF-first?
If a score is defensible only up to strictly increasing transformations (ordinal meaning), then:
The copula is the maximal invariant of dependence available under that measurement regime.
Two joint distributions are related by coordinatewise strictly increasing transformations if and only if they share the same copula.
This places the classical “permissible statistics” debate (Stevens, Luce & Suppes) into direct contact with copula theory:
Copulas are not merely compatible with rank-based analysis — they are the dependence-theoretic core of it.
— Schweizer & Wolff (1981)
For educational measurement:
Omitted variable bias (OVB) is the core problem of causal inference in non-experimental research. Standard remedies — covariate adjustment, instrumental variables — require external information that is often unavailable or unreliable.
Park & Gupta (2012) introduced a copula correction that requires neither covariates nor instruments:
The key identification assumption is mild: must be non-normally distributed — a condition routinely satisfied by educational assessment data.
The copula decomposition that separates marginals from dependence is the same machinery used here for growth inference — now applied to the endogeneity problem.
Why this matters for us:
The copula is not just a tool for modeling dependence — it is a tool for protecting causal claims from unobserved confounding.
SPC and copula based growth are complements, not competitors:
A regime departure can arise through:
If scale score diagnostics move while copula/SGPc diagnostics remain stable, the alarm is living in the units, not the order.
If the copula itself moves, the problem is no longer mere relabeling of the scale.
The copula–SPC lens:
Betebenner, D. W. (2009). Norm- and criterion-referenced student growth. Educational Measurement: Issues and Practice, 28(4), 42–51.
Braun, H. I. (1988). A new approach to avoiding problems of scale in interpreting trends in mental measurement data. Journal of Educational Measurement, 25(3), 171–191.
Falkenström, F., Park, S., & McIntosh, C. N. (2022). Using copulas to enable causal inference from non-experimental data: Tutorial and simulation studies. Psychological Methods, 27(6), 1009–1039.
Nelsen, R. B. (2006). An Introduction to Copulas (2nd ed.). Springer.
Park, S., & Gupta, S. (2012). Handling endogenous regressors by joint estimation using copulas. Marketing Science, 31(4), 567–586.
Schweizer, B., & Wolff, E. F. (1981). On nonparametric measures of dependence for random variables. Annals of Statistics, 9(4), 879–885.
Sklar, A. (1959). Fonctions de répartition à dimensions et leurs marges. Publications de l’Institut de Statistique de l’Université de Paris, 8, 229–231.