Paper
A Generalist Model Including Evolved Star Mass and Age
Authors
Mengmeng Zhang, Yude Bu, Siqi Wang, Shanshan Li, Jiangchuan Zhang, Jingzhen Sun, Yuhang Zhang, Ke Wang, Jian Liu, Hongliang Yan, Zhenping Yi, Meng Liu, Xiaoming Kong
Abstract
Determining precise stellar ages and masses for evolved giants is crucial for Galactic archaeology but challenged by spectral degeneracies. Gaia's low-resolution XP spectra offer a unique opportunity to infer these parameters on a massive scale using data-driven methods. We extend a transformer-based astronomical foundation model to evolved stars, establishing a unified framework to simultaneously predict atmospheric parameters ($T_{\mathrm{eff}}$, $\log g$, $[\mathrm{M}/\mathrm{H}]$) and evolutionary labels (mass, age) with physical consistency. Treating spectra as token sequences, we integrated mass and age into the model's vocabulary. The model is trained on Gaia XP spectra cross-matched with the APOGEE DR17 DistMass catalog. Our generative approach enables flexible input handling, including spectral inpainting and parameter-to-spectrum generation. On an independent test set, the model achieves a prediction scatter of $σ\approx 0.114 \, M_{\odot}$ for mass and $σ\approx 1.334$ Gyr for age. Beyond numerical accuracy, it successfully reproduces the giant branch's mass-luminosity relation and autonomously disentangles interstellar extinction from intrinsic temperature variations without explicit physical priors. It also robustly recovers missing spectral data and estimates reliable uncertainties. Validating that foundation models can internalize stellar physics from data, this physically-aware, probabilistic framework offers a powerful tool for unraveling Milky Way history using large-scale spectroscopic surveys.
Metadata
Related papers
Fractal universe and quantum gravity made simple
Fabio Briscese, Gianluca Calcagni • 2026-03-25
POLY-SIM: Polyglot Speaker Identification with Missing Modality Grand Challenge 2026 Evaluation Plan
Marta Moscati, Muhammad Saad Saeed, Marina Zanoni, Mubashir Noman, Rohan Kuma... • 2026-03-25
LensWalk: Agentic Video Understanding by Planning How You See in Videos
Keliang Li, Yansong Li, Hongze Shen, Mengdi Liu, Hong Chang, Shiguang Shan • 2026-03-25
Orientation Reconstruction of Proteins using Coulomb Explosions
Tomas André, Alfredo Bellisario, Nicusor Timneanu, Carl Caleman • 2026-03-25
The role of spatial context and multitask learning in the detection of organic and conventional farming systems based on Sentinel-2 time series
Jan Hemmerling, Marcel Schwieder, Philippe Rufin, Leon-Friedrich Thomas, Mire... • 2026-03-25
Raw Data (Debug)
{
"raw_xml": "<entry>\n <id>http://arxiv.org/abs/2603.03732v1</id>\n <title>A Generalist Model Including Evolved Star Mass and Age</title>\n <updated>2026-03-04T05:04:42Z</updated>\n <link href='https://arxiv.org/abs/2603.03732v1' rel='alternate' type='text/html'/>\n <link href='https://arxiv.org/pdf/2603.03732v1' rel='related' title='pdf' type='application/pdf'/>\n <summary>Determining precise stellar ages and masses for evolved giants is crucial for Galactic archaeology but challenged by spectral degeneracies. Gaia's low-resolution XP spectra offer a unique opportunity to infer these parameters on a massive scale using data-driven methods. We extend a transformer-based astronomical foundation model to evolved stars, establishing a unified framework to simultaneously predict atmospheric parameters ($T_{\\mathrm{eff}}$, $\\log g$, $[\\mathrm{M}/\\mathrm{H}]$) and evolutionary labels (mass, age) with physical consistency. Treating spectra as token sequences, we integrated mass and age into the model's vocabulary. The model is trained on Gaia XP spectra cross-matched with the APOGEE DR17 DistMass catalog. Our generative approach enables flexible input handling, including spectral inpainting and parameter-to-spectrum generation. On an independent test set, the model achieves a prediction scatter of $σ\\approx 0.114 \\, M_{\\odot}$ for mass and $σ\\approx 1.334$ Gyr for age. Beyond numerical accuracy, it successfully reproduces the giant branch's mass-luminosity relation and autonomously disentangles interstellar extinction from intrinsic temperature variations without explicit physical priors. It also robustly recovers missing spectral data and estimates reliable uncertainties. Validating that foundation models can internalize stellar physics from data, this physically-aware, probabilistic framework offers a powerful tool for unraveling Milky Way history using large-scale spectroscopic surveys.</summary>\n <category scheme='http://arxiv.org/schemas/atom' term='astro-ph.SR'/>\n <category scheme='http://arxiv.org/schemas/atom' term='astro-ph.GA'/>\n <category scheme='http://arxiv.org/schemas/atom' term='astro-ph.IM'/>\n <published>2026-03-04T05:04:42Z</published>\n <arxiv:primary_category term='astro-ph.SR'/>\n <author>\n <name>Mengmeng Zhang</name>\n </author>\n <author>\n <name>Yude Bu</name>\n </author>\n <author>\n <name>Siqi Wang</name>\n </author>\n <author>\n <name>Shanshan Li</name>\n </author>\n <author>\n <name>Jiangchuan Zhang</name>\n </author>\n <author>\n <name>Jingzhen Sun</name>\n </author>\n <author>\n <name>Yuhang Zhang</name>\n </author>\n <author>\n <name>Ke Wang</name>\n </author>\n <author>\n <name>Jian Liu</name>\n </author>\n <author>\n <name>Hongliang Yan</name>\n </author>\n <author>\n <name>Zhenping Yi</name>\n </author>\n <author>\n <name>Meng Liu</name>\n </author>\n <author>\n <name>Xiaoming Kong</name>\n </author>\n </entry>"
}