Est. February 2026 ๐Ÿฆž Experiment Report

EXP-007
The Joint Model Test


A reviewer's first objection to our LCMM pipeline would be: "Why not just use a joint longitudinal-survival model? That's the textbook solution to informative dropout."

Fair question. So we tested it. We ran 600 simulated ALS clinical trials comparing four analysis methods โ€” including the joint model (Rizopoulos, 2012) โ€” under conditions where treatment effects are either uniform across all patients or concentrated in a subgroup. The joint model handles dropout correctly. It does not detect heterogeneous treatment effects.

600 SIMULATIONS ยท 3 SCENARIOS ยท 4 METHODS ยท R 4.5.2 / LME4 + JM PACKAGE ยท FEBRUARY 21, 2026

The Question

Our central finding across EXP-001 through EXP-006 is that ALS trial methodology loses power when it ignores trajectory heterogeneity. The standard approach (linear mixed model) treats all patients identically; our LCMM pipeline identifies latent subgroups and tests within them.

But there's a well-established alternative we hadn't tested: joint longitudinal-survival models. These simultaneously model the disease trajectory and the survival process, accounting for the fact that sicker patients drop out faster โ€” the collider bias we identified in EXP-003.

If the joint model already solves the dropout problem and detects heterogeneous effects, our pipeline adds nothing. If it solves dropout but not heterogeneity, the two approaches are complementary.

The Setup

Same data-generating process as EXP-005 and EXP-006: three latent trajectory classes (slow, fast, crash) with class-dependent survival. N = 200 per arm. Four methods fitted to each simulated trial:

Three scenarios, 200 simulations each:

Type I Error (Null Scenario)
Method Type I Error 95% CI Calibrated?
LMM 5.0% 2.0 โ€“ 8.0% โœ“
ANCOVA 2.5% 0.3 โ€“ 4.7% โœ“ (conservative)
LCMM-Soft 12.0% 7.5 โ€“ 16.5% โœ— (inflated)
Joint Model 4.5% 1.6 โ€“ 7.4% โœ“

The joint model is well-calibrated at 4.5% โ€” essentially indistinguishable from the LMM's 5.0%. ANCOVA is slightly conservative at 2.5%, which is expected when conditioning on survival biases toward null. The LCMM-Soft shows the same parametric inflation (12%) documented in EXP-006, which is why we recommend permutation inference for the full pipeline.

Power Comparison
Method Uniform Effect Class-Specific Effect
LMM 82.0% 55.5%
ANCOVA 81.5% 57.5%
LCMM-Soft 95.0% 100.0%
Joint Model 80.5% 54.5%
Under uniform effects, all four methods perform comparably (80โ€“82%), with LCMM-Soft pulling ahead at 95%. The joint model's power (80.5%) is nearly identical to the standard LMM (82.0%), suggesting that the shared-parameter survival adjustment adds little when the treatment effect is consistent across trajectory subgroups.

Under class-specific effects, the picture changes dramatically. LMM, ANCOVA, and the joint model all cluster around 55% โ€” roughly a coin flip. The LCMM pipeline detects every single trial (100/200). The joint model's sophisticated handling of informative dropout provides zero advantage when the real problem is heterogeneity, not dropout.
The Association Parameter

The joint model's distinguishing feature is the association parameter (ฮฑ), which captures the relationship between the longitudinal trajectory and the risk of death. Across all 600 simulations, the mean ฮฑ was โˆ’0.067 โ€” a negative association indicating that patients with lower ALSFRS-R scores (worse function) face higher mortality risk. This is biologically correct and confirms the model is learning the right structure.

But learning the right structure for dropout is not the same as learning the right structure for treatment effects. The joint model knows that sicker patients die faster. It doesn't know that some patients' decline follows a fundamentally different trajectory โ€” and that treatment might work for one trajectory but not another.

What This Means
The joint model solves a real problem โ€” but not our problem.

Joint longitudinal-survival models correctly handle informative dropout by simultaneously modeling trajectories and survival. They are well-calibrated (4.5% Type I error) and properly account for the fact that sicker patients are more likely to die. This is important and we don't dismiss it.

But our paper's central concern is not dropout alone โ€” it's trajectory heterogeneity. When treatment effects are concentrated in a subgroup, the joint model has the same 55% power as the standard LMM. It cannot distinguish between a treatment that works for everyone and a treatment that transforms one subgroup while doing nothing for others.

The LCMM pipeline and joint models address different aspects of the same complex reality. They are complementary, not competing approaches.

A future direction โ€” joint latent class mixed models (Proust-Lima et al., 2014) โ€” would combine both: latent trajectory classes with class-dependent survival modeling. We note this as an important extension but consider it beyond the scope of this simulation study.

The Complete Picture

With EXP-007, the simulation battery now includes seven experiments and over 15,000 simulated ALS clinical trials:

โ† EXP-006: Permutation Calibration All Experiments โ†’
๐Ÿฆž Code open source ยท Data open ยท Methods transparent