>

Monitoring morphometric drift in lifelong learning segmentation of the spinal cord

Enamundram Naga Karthik; Sandrine Bédard; Jan Valošek; Christoph S. Aigner; Elise Bannier; Josef Bednařík; Virginie Callot; Anna Combes; Armin Curt; Gergely David; Falk Eippert; Lynn Farner; Michael G. Fehlings; Patrick Freund; Tobias Granberg; Cristina Granziera; Ulrike Horn; Tomáš Horák; Suzanne Humphreys; Markus Hupp; Anne Kerbrat; Nawal Kinany; Shannon Kolind; Petr Kudlička; Anna Lebret; Lisa Eunyoung Lee; Caterina Mainero; Allan R. Martin; Megan McGrath; Govind Nair; Kristin P. O’Grady; Jiwon Oh; Russell Ouellette; Nikolai Pfender; Dario Pfyffer; Pierre-François Pradat; Alexandre Prat; Emanuele Pravatà; Daniel S. Reich; Ilaria Ricchi; Naama Rotem-Kohavi; Simon Schading-Sassenhausen; Maryam Seif; Andrew Smith; Seth A. Smith; Grace Sweeney; Roger Tam; Anthony Traboulsee; Constantina Andrada Treaba; Charidimos Tsagkas; Zachary Vavasour; Dimitri Van De Ville; Kenneth Arnold Weber II; Sarath Chandar; Julien Cohen-Adad (2026)..Imaging Neuroscience, 4, Article a.1105.

This study looks at how measurements of the spinal cord—such as its cross-sectional area (the size of the cord when viewed in a slice)—can be used as important indicators (biomarkers) for diagnosing and tracking neurological diseases like multiple sclerosis or spinal cord compression. Modern artificial intelligence methods can automatically identify and outline (segment) the spinal cord in MRI scans, but it is unclear whether these measurements stay consistent as models are updated with new data over time. This consistency is especially important when building “normal” reference values from healthy individuals.

To address this, the researchers developed a spinal cord segmentation model trained on a large and diverse dataset collected from 75 sites and over 1,600 participants, covering different MRI types and various spinal cord conditions. They also created a “lifelong learning” system that continuously monitors changes in measurements (called morphometric drift) whenever the model is updated. This system automatically runs through a workflow (via GitHub Actions, an automated coding tool) to track how measurements evolve over time.

The results showed that the new model performs very well, accurately identifying the spinal cord even in challenging cases such as severe compression or tissue damage, with a high Dice score (a measure of how closely the model’s segmentation matches the true anatomy) of 0.95. The monitoring system also proved useful for quickly detecting any changes in measurements between model versions. Importantly, the study found that updates to the model caused only minimal shifts in spinal cord measurements, meaning the results remain stable and reliable. This allowed the researchers to safely update an existing database of normal spinal cord measurements. Overall, this work provides a reliable and transparent way to maintain consistency in AI-based medical measurements as models evolve.

Fig 1

Overview of the dataset and image characteristics. Representative axial slices of nine contrasts and the total of images used for each contrast in brackets, the orientation (axial/sagittal) along with the median resolution of images. The respective doughnut chart illustrates the proportion of clinical status among the scanned participants, including healthy controls (HC), patients with radiologically isolated syndrome (RIS), patients with multiple sclerosis (MS), and their different phenotypes, including primary progressive (PPMS) and relapsing-remitting (RRMS), patients with amyotrophic lateral sclerosis (ALS), patients with neuromyelitis optica spectrum disorder (NMOSD), pre-decompression acute traumatic SCI (AcuteSCI), post-decompression traumatic spinal cord injury (SCI), degenerative cervical myelopathy (DCM), and syringomyelia (SYR; not shown). Labels indicate the phenotype associated with the patient, with their respective colors shared across contrast sets.

Explore Story Topics