Papers
Research papers from arXiv and related sources
PDE foundation model-accelerated inverse estimation of system parameters in inertial confinement fusion
PDE foundation models are typically pretrained on large, diverse corpora of PDE datasets and can be adapted to new settings with limited task-specific data. However, most downstream evaluations foc...
Mahindra Rautela, Alexander Scheinker, Bradley Love, Diane Oyen, Nathan DeBardeleben, Earl Lawren...
Vibe Code Bench: Evaluating AI Models on End-to-End Web Application Development
Code generation has emerged as one of AI's highest-impact use cases, yet existing benchmarks measure isolated tasks rather than the complete "zero-to-one" process of building a working application ...
Hung Tran, Langston Nashold, Rayan Krishnan, Antoine Bigeard, Alex Gu
PinPoint: Evaluation of Composed Image Retrieval with Explicit Negatives, Multi-Image Queries, and Paraphrase Testing
Composed Image Retrieval (CIR) has made significant progress, yet current benchmarks are limited to single ground-truth answers and lack the annotations needed to evaluate false positive avoidance,...
Rohan Mahadev, Joyce Yuan, Patrick Poirson, David Xue, Hao-Yu Wu, Dmitry Kislyuk
PulSKASim: A Pulsar Simulator for SKA-Scale Interferometric Observations
Accurate simulation of pulsar flux variability is critical for testing Square Kilometre Array (SKA) interferometric pipelines. However, most existing simulators neglect the effects of integration t...
X. Li, V. Stolyarov
Industrial Survey on Robustness Testing In Cyber Physical Systems
Cyber-Physical Systems (CPS) play a critical role in modern industrial domains, including manufacturing, energy, transportation, and healthcare, where they enable automation, optimization, and real...
Christophe Ponsard, Abiola Paterne Chokki, Jean-François Daune
Token Taxes: mitigating AGI's economic risks
The development of AGI threatens to erode government tax bases, lower living standards, and disempower citizens -- risks that make the 40-year stagnation of wages during the first industrial revolu...
Lucas Irwin, Tung-Yu Wu, Fazl Barez
EBLM XVII - Tidal Synchronization and Circularization in Tight Stellar Binaries
Tidal interactions in close stellar binaries are central to their orbital and rotational evolution, making observational tests of theoretical predictions essential for our understanding of the evol...
Ritika Sethi, David V. Martin, Adrian Barker, Pierre F. L. Maxted, Amaury H. M. J. Triaud, Vedad ...
Discovering mathematical concepts through a multi-agent system
Mathematical concepts emerge through an interplay of processes, including experimentation, efforts at proof, and counterexamples. In this paper, we present a new multi-agent model for computational...
Daattavya Aggarwal, Oisin Kim, Carl Henrik Ek, Challenger Mishra
Coexistence of Chromatic Flares and an Achromatic QPO in the Gamma-ray Blazar PG 1553+113
The physical origin of quasi-periodic oscillations (QPOs) in blazars remains debated, with geometric and plasma-driven scenarios as the main competing interpretations. Discriminating between them r...
Elena Madero, Alberto Domínguez
Towards Predictive Quantum Algorithmic Performance: Modeling Time-Correlated Noise at Scale
Combining tensor network techniques with quantum autoregressive moving average models, we quantify the effects of time-correlated noise on quantum algorithms and predict their performance at scale....
Amit Jamadagni, Gregory Quiroz, Eugene Dumitrescu
MXDFz4.4: A LyC emitter 250Myr after the epoch of reionization and a first test of Ly-alpha morphology as a tracer of LyC escape at high redshift
Assessing the contribution of ionizing sources to cosmic reionization is a central goal of extragalactic astrophysics. Understanding and quantifying ionizing escape remains challenging near the epo...
Ilias Goovaerts, Marc Rafelski, Alexander Beckett, Grecco Oyarzùn, Annalisa Citro, Farhanul Hasan...
The erasure of Galactic bar resonances by dark matter subhaloes
In the context of increasing appreciation for the coupling between the Galactic bar and the halo, we introduce a new framework using stars trapped in resonance with the bar to probe the Galactic da...
Elliot Y. Davies, Adam M. Dillamore, Vasily Belokurov, Lina Necib
NASA's Pandora SmallSat Mission: Simulated Modeling and Retrieval of Near-Infrared Exoplanet Transmission Spectra
Pandora is a SmallSat mission dedicated to understanding exoplanets and their host stars by disentangling the impact of stellar heterogeneity on exoplanet transmission spectra. Selected as a NASA A...
Yoav Rotman, Peter McGill, Luis Welbanks, Benjamin V. Rackham, Aishwarya Iyer, Daniel Apai, Micha...
HyQBench: A Benchmark Suite for Hybrid CV-DV Quantum Computing
Hybrid continuous-variable (CV)-discrete-variable (DV) quantum systems present a promising direction for quantum computing by combining the high dimensional encoding capabilities of qumodes with th...
Shubdeep Mohapatra, Yuan Liu, Eddy Z. Zhang, Huiyang Zhou
CLARC: C/C++ Benchmark for Robust Code Search
Efficient code retrieval is critical for developer productivity, yet existing benchmarks largely focus on Python and rarely stress-test robustness beyond superficial lexical cues. To address the ga...
Kaicheng Wang, Liyan Huang, Weike Fang, Weihang Wang
SELDON: Supernova Explosions Learned by Deep ODE Networks
The discovery rate of optical transients will explode to 10 million public alerts per night once the Vera C. Rubin Observatory's Legacy Survey of Space and Time comes online, overwhelming the tradi...
Jiezhong Wu, Jack O'Brien, Jennifer Li, M. S. Krafczyk, Ved G. Shah, Amanda R. Wasserman, Daniel ...
A Dual-Helix Governance Approach Towards Reliable Agentic AI for WebGIS Development
WebGIS development requires rigor, yet agentic AI frequently fails due to five large language model (LLM) limitations: context constraints, cross-session forgetting, stochasticity, instruction fail...
Boyuan, Guan, Wencong Cui, Levente Juhasz
ZipMap: Linear-Time Stateful 3D Reconstruction with Test-Time Training
Feed-forward transformer models have driven rapid progress in 3D vision, but state-of-the-art methods such as VGGT and $π^3$ have a computational cost that scales quadratically with the number of i...
Haian Jin, Rundi Wu, Tianyuan Zhang, Ruiqi Gao, Jonathan T. Barron, Noah Snavely, Aleksander Holy...
TaxonRL: Reinforcement Learning with Intermediate Rewards for Interpretable Fine-Grained Visual Reasoning
Traditional vision-language models struggle with contrastive fine-grained taxonomic reasoning, particularly when distinguishing between visually similar species within the same genus or family. We ...
Maximilian von Klinski, Maximilian Schall
Robustness of Agentic AI Systems via Adversarially-Aligned Jacobian Regularization
As Large Language Models (LLMs) transition into autonomous multi-agent ecosystems, robust minimax training becomes essential yet remains prone to instability when highly non-linear policies induce ...
Furkan Mumcu, Yasin Yilmaz