Paper
Simulation-based inference from the Lyman-alpha forest 1D power spectrum with CAMELS
Authors
Francesco Sinigaglia, Patricia Iglesias-Navarro, Matteo Viel
Abstract
We perform, for the first time, full simulation-based inference on the Lyman-$α$ forest 1D power spectrum. In particular, we consider the prediction of the Lyman-$α$ forest $P_{\rm 1D}(k)$ at $2.0<z<3.5$ from the CAMELS cosmological hydrodynamic simulations run with the IllustrisTNG and SIMBA galaxy formation models. We train a normalizing flow to perform neural posterior estimation of two cosmological parameters ($Ω_m$ and $σ_8$) and four astrophysical parameters describing supernova and AGN feedback. When the neural network is trained and tested on the same baryon physics model, the posterior distributions of the cosmological parameters are in excellent agreement with the true parameter values (within $10\%$ deviations in $\gtrsim 75\%$ and $\gtrsim 90\%$ of the cases for $Ω_m$ and $σ_8$, respectively, and with a precision better than $10\%$ in both), while the astrophysical parameters are generally unconstrained due to the limited probed volume. When the network is trained on one model and tested on the other (e.g., trained on IllustrisTNG and tested on SIMBA, or vice versa), the performance is significantly worse in both accuracy and precision, resulting in a $\sim 10\%$ positive bias on the predicted values of $σ_8$. We show that multi-domain training on a combination of simulations from both models recovers unbiased constraints, offering an effective way to cope with the lack of convergence in the predictions of different galaxy formation models. This study represents a promising way forward to constrain cosmology and fundamental physics from the Lyman-$α$ forest using artificial intelligence.
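The abstract quotes accuracy as the fraction of test cases within $10\%$ of the true value and precision as a relative uncertainty below $10\%$, but it does not say how these are computed. The following is a minimal sketch of one plausible reading, assuming the posterior mean is used as the point estimate and the posterior standard deviation as the uncertainty. The function names and the toy samples are hypothetical and are not taken from the paper.

```python
from statistics import mean, stdev

def relative_deviation(samples, truth):
    """|posterior mean - truth| / |truth| for one test simulation (assumed definition)."""
    return abs(mean(samples) - truth) / abs(truth)

def relative_precision(samples):
    """Posterior standard deviation divided by |posterior mean| (assumed definition)."""
    return stdev(samples) / abs(mean(samples))

def fraction_within(posteriors, truths, tol=0.10):
    """Fraction of test cases whose relative deviation is below tol."""
    hits = [relative_deviation(s, t) < tol for s, t in zip(posteriors, truths)]
    return sum(hits) / len(hits)

# Toy posterior samples for sigma_8: made-up numbers, not CAMELS results.
posteriors = [
    [0.78, 0.80, 0.82],
    [0.88, 0.90, 0.92],
    [0.98, 1.00, 1.02],
]
truths = [0.80, 0.85, 0.80]

frac = fraction_within(posteriors, truths)
print(f"fraction within 10%: {frac:.2f}")
print(f"relative precision of first case: {relative_precision(posteriors[0]):.3f}")
```

In practice the samples would be drawn from the trained normalizing flow for each held-out simulation. Other point estimates, such as the posterior median, would fit the same structure.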
Metadata
arXiv record
arXiv ID: 2603.13011v1
Abstract page: https://arxiv.org/abs/2603.13011v1
PDF: https://arxiv.org/pdf/2603.13011v1
Published: 2026-03-13T14:17:40Z
Updated: 2026-03-13T14:17:40Z
Primary category: astro-ph.CO
Comment: 40 pages, 22 figures. Submitted to JCAP. Comments welcome