Research

Paper

AI LLM March 04, 2026

Understanding Sources of Demographic Predictability in Brain MRI via Disentangling Anatomy and Contrast

Authors

Mehmet Yigit Avci, Akshit Achara, Andrew King, Jorge Cardoso

Abstract

Demographic attributes such as age, sex, and race can be predicted from medical images, raising concerns about bias in clinical AI systems. In brain MRI, this signal may arise from anatomical variation, acquisition-dependent contrast differences, or both, yet these sources remain entangled in conventional analyses. Without disentangling them, mitigation strategies risk failing to address the underlying causes. We propose a controlled framework based on disentangled representation learning, decomposing brain MRI into anatomy-focused representations that suppress acquisition influence and contrast embeddings that capture acquisition-dependent characteristics. Training predictive models for age, sex, and race on full images, anatomical representations, and contrast-only embeddings allows us to quantify the relative contributions of structure and acquisition to the demographic signal. Across three datasets and multiple MRI sequences, we find that demographic predictability is primarily rooted in anatomical variation: anatomy-focused representations largely preserve the performance of models trained on raw images. Contrast-only embeddings retain a weaker but systematic signal that is dataset-specific and does not generalise across sites. These findings suggest that effective mitigation must explicitly account for the distinct anatomical and acquisition-dependent origins of the demographic signal, ensuring that any bias reduction generalizes robustly across domains.

Metadata

arXiv ID: 2603.04113

Provider: ARXIV

Primary Category: cs.CV

Published: 2026-03-04

Fetched: 2026-03-05 06:06

Related papers

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Kaituo Feng, Manyuan Zhang, Shuang Chen, Yunlong Lin, Kaixuan Fan, Yilei Jian... • 2026-03-30

On-the-fly Repulsion in the Contextual Space for Rich Diversity in Diffusion Transformers

Omer Dahary, Benaya Koren, Daniel Garibi, Daniel Cohen-Or • 2026-03-30

Graphilosophy: Graph-Based Digital Humanities Computing with The Four Books

Minh-Thu Do, Quynh-Chau Le-Tran, Duc-Duy Nguyen-Mai, Thien-Trang Nguyen, Khan... • 2026-03-30

ParaSpeechCLAP: A Dual-Encoder Speech-Text Model for Rich Stylistic Language-Audio Pretraining

Anuj Diwan, Eunsol Choi, David Harwath • 2026-03-30

RAD-AI: Rethinking Architecture Documentation for AI-Augmented Ecosystems

Oliver Aleksander Larsen, Mahyar T. Moghaddam • 2026-03-30

Raw Data (Debug)

{
  "raw_xml": "<entry>\n    <id>http://arxiv.org/abs/2603.04113v1</id>\n    <title>Understanding Sources of Demographic Predictability in Brain MRI via Disentangling Anatomy and Contrast</title>\n    <updated>2026-03-04T14:33:07Z</updated>\n    <link href='https://arxiv.org/abs/2603.04113v1' rel='alternate' type='text/html'/>\n    <link href='https://arxiv.org/pdf/2603.04113v1' rel='related' title='pdf' type='application/pdf'/>\n    <summary>Demographic attributes such as age, sex, and race can be predicted from medical images, raising concerns about bias in clinical AI systems. In brain MRI, this signal may arise from anatomical variation, acquisition-dependent contrast differences, or both, yet these sources remain entangled in conventional analyses. Without disentangling them, mitigation strategies risk failing to address the underlying causes. We propose a controlled framework based on disentangled representation learning, decomposing brain MRI into anatomy-focused representations that suppress acquisition influence and contrast embeddings that capture acquisition-dependent characteristics. Training predictive models for age, sex, and race on full images, anatomical representations, and contrast-only embeddings allows us to quantify the relative contributions of structure and acquisition to the demographic signal. Across three datasets and multiple MRI sequences, we find that demographic predictability is primarily rooted in anatomical variation: anatomy-focused representations largely preserve the performance of models trained on raw images. Contrast-only embeddings retain a weaker but systematic signal that is dataset-specific and does not generalise across sites. These findings suggest that effective mitigation must explicitly account for the distinct anatomical and acquisition-dependent origins of the demographic signal, ensuring that any bias reduction generalizes robustly across domains.</summary>\n    <category scheme='http://arxiv.org/schemas/atom' term='cs.CV'/>\n    <category scheme='http://arxiv.org/schemas/atom' term='cs.AI'/>\n    <published>2026-03-04T14:33:07Z</published>\n    <arxiv:primary_category term='cs.CV'/>\n    <author>\n      <name>Mehmet Yigit Avci</name>\n      <arxiv:affiliation>and for the Alzheimer's Disease Neuroimaging Initiative</arxiv:affiliation>\n    </author>\n    <author>\n      <name>Akshit Achara</name>\n      <arxiv:affiliation>and for the Alzheimer's Disease Neuroimaging Initiative</arxiv:affiliation>\n    </author>\n    <author>\n      <name>Andrew King</name>\n      <arxiv:affiliation>and for the Alzheimer's Disease Neuroimaging Initiative</arxiv:affiliation>\n    </author>\n    <author>\n      <name>Jorge Cardoso</name>\n      <arxiv:affiliation>and for the Alzheimer's Disease Neuroimaging Initiative</arxiv:affiliation>\n    </author>\n  </entry>"
}