Research

Papers

Research papers from arXiv and related sources

Total: 4513 | AI/LLM: 2483 | Testing: 2030
TESTING

Forecasting Antimicrobial Resistance Trends Using Machine Learning on WHO GLASS Surveillance Data: A Retrieval-Augmented Generation Approach for Policy Decision Support

Antimicrobial resistance (AMR) is a growing global crisis projected to cause 10 million deaths per year by 2050. While the WHO Global Antimicrobial Resistance and Use Surveillance System (GLASS) pr...

Md Tanvir Hasan Turja

2602.22673 2026-02-26
TESTING

Does the testing environment matter? Carsickness across on-road, test-track, and driving simulator conditions

Carsickness has gained significant attention with the rise of automated vehicles, prompting extensive research across on-road, test-track, and driving simulator environments to understand its occur...

Georgios Papaioannou, Barys Shyrokau

2602.22671 2026-02-26
TESTING

Deepfake Word Detection by Next-token Prediction using Fine-tuned Whisper

Deepfake speech utterances can be forged by replacing one or more words in a bona fide utterance with semantically different words synthesized by speech generative models. While a dedicated synthet...

Hoan My Tran, Xin Wang, Wanying Ge, Xuechen Liu, Junichi Yamagishi

2602.22658 2026-02-26
TESTING

AHBid: An Adaptable Hierarchical Bidding Framework for Cross-Channel Advertising

In online advertising, the inherent complexity and dynamic nature of advertising environments necessitate the use of auto-bidding services to assist advertisers in bid optimization. This complexity...

Xinxin Yang, Yangyang Tang, Yikun Zhou, Yaolei Liu, Yun Li, Bo Yang

2602.22650 2026-02-26
TESTING

Tackling Privacy Heterogeneity in Differentially Private Federated Learning

Differentially private federated learning (DP-FL) enables clients to collaboratively train machine learning models while preserving the privacy of their local data. However, most existing DP-FL app...

Ruichen Xu, Ying-Jun Angela Zhang, Jianwei Huang

2602.22633 2026-02-26
TESTING

TorchLean: Formalizing Neural Networks in Lean

Neural networks are increasingly deployed in safety- and mission-critical pipelines, yet many verification and analysis results are produced outside the programming environment that defines and run...

Robert Joseph George, Jennifer Cruden, Xiangru Zhong, Huan Zhang, Anima Anandkumar

2602.22631 2026-02-26
TESTING

ContextRL: Enhancing MLLM's Knowledge Discovery Efficiency with Context-Augmented RL

We propose ContextRL, a novel framework that leverages context augmentation to overcome these bottlenecks. Specifically, to enhance Identifiability, we provide the reward model with full reference ...

Xingyu Lu, Jinpeng Wang, YiFan Zhang, Shijie Ma, Xiao Hu, Tianke Zhang, Haonan Fan, Kaiyu Jiang, ...

2602.22623 2026-02-26
TESTING

Mitigating Membership Inference in Intermediate Representations via Layer-wise MIA-risk-aware DP-SGD

In Embedding-as-an-Interface (EaaI) settings, pre-trained models are queried for Intermediate Representations (IRs). The distributional properties of IRs can leak training-set membership signals, e...

Jiayang Meng, Tao Huang, Chen Hou, Guolong Zheng, Hong Chen

2602.22611 2026-02-26
TESTING

EvolveGen: Algorithmic Level Hardware Model Checking Benchmark Generation through Reinforcement Learning

Progress in hardware model checking depends critically on high-quality benchmarks. However, the community faces a significant benchmark gap: existing suites are limited in number, often distributed...

Guangyu Hu, Xiaofeng Zhou, Wei Zhang, Hongce Zhang

2602.22609 2026-02-26
TESTING

Beyond Vintage Rotation: Bias-Free Sparse Representation Learning with Oracle Inference

Learning low-dimensional latent representations is a central topic in statistics and machine learning, and rotation methods have long been used to obtain sparse and interpretable representations. D...

Chengyu Cui, Yunxiao Chen, Jing Ouyang, Gongjun Xu

2602.22590 2026-02-26
TESTING

Towards Faithful Industrial RAG: A Reinforced Co-adaptation Framework for Advertising QA

Industrial advertising question answering (QA) is a high-stakes task in which hallucinated content, particularly fabricated URLs, can lead to financial loss, compliance violations, and legal risk. ...

Wenwei Li, Ming Xu, Tianle Xia, Lingxiang Hu, Yiding Sun, Linfang Shang, Liqun Liu, Peng Shu, Hua...

2602.22584 2026-02-26
TESTING

Strategy Executability in Mathematical Reasoning: Leveraging Human-Model Differences for Effective Guidance

Example-based guidance is widely used to improve mathematical reasoning at inference time, yet its effectiveness is highly unstable across problems and models, even when the guidance is correct and ...

Weida Liang, Yiyou Sun, Shuyuan Nan, Chuang Li, Dawn Song, Kenji Kawaguchi

2602.22583 2026-02-26
TESTING

Metamorphic Testing of Vision-Language Action-Enabled Robots

Vision-Language-Action (VLA) models are multimodal robotic task controllers that, given an instruction and visual inputs, produce a sequence of low-level control actions (or motor commands) enablin...

Pablo Valle, Sergio Segura, Shaukat Ali, Aitor Arrieta

2602.22579 2026-02-26
TESTING

GIFSplat: Generative Prior-Guided Iterative Feed-Forward 3D Gaussian Splatting from Sparse Views

Feed-forward 3D reconstruction offers substantial runtime advantages over per-scene optimization, which remains slow at inference and often fragile under sparse views. However, existing feed-forwar...

Tianyu Chen, Wei Xiang, Kang Han, Yu Lu, Di Wu, Gaowen Liu, Ramana Rao Kompella

2602.22571 2026-02-26
TESTING

Operationalizing Fairness: Post-Hoc Threshold Optimization Under Hard Resource Limits

The deployment of machine learning in high-stakes domains requires a balance between predictive safety and algorithmic fairness. However, existing fairness interventions often assume unconstraine...

Moirangthem Tiken Singh, Amit Kalita, Sapam Jitu Singh

2602.22560 2026-02-26
TESTING

RepoMod-Bench: A Benchmark for Code Repository Modernization via Implementation-Agnostic Testing

The evolution of AI coding agents has shifted the frontier from simple snippet completion to autonomous repository-level engineering. However, evaluating these agents remains ill-posed in general c...

Xuefeng Li, Nir Ben-Israel, Yotam Raz, Belal Ahmed, Doron Serebro, Antoine Raux

2602.22518 2026-02-26
TESTING

A Perfectoid Duality Between M-Theory and F-Theory

We present a non-singular, definition-level formulation of F-theory by replacing the traditional shrinking-fiber limit of M-theory with compactification on a tower-completed circle described using ...

Arshid Shabir, Bobby Eka Gunara, Mir Faizal

2602.22503 2026-02-26
TESTING

Small HVAC Control Demonstrations in Larger Buildings Often Overestimate Savings

How much energy, money, and emissions can advanced control of heating and cooling equipment save in real buildings? To address this question, researchers sometimes control a small number of thermal...

Arash J. Khabbazi, Kevin J. Kircher

2602.22499 2026-02-26
TESTING

Cosmic Environment as the Primary Driver of Dwarf Satellite Statistics

Context: Satellite dwarf galaxies provide key constraints on galaxy formation and evolution as their abundance and spatial distribution reflect both host properties and large-scale environment. Ai...

Saeed Tavasoli, Parsa Ghafour

2602.22485 2026-02-25
TESTING

Fused-Silica Activation Cherenkov Detector for Pulsed D--T Fusion Yields

We demonstrate a compact, non-toxic, low-cost neutron-yield diagnostic for pulsed D--T fusion systems using an undoped fused-silica (SiO$_2$) rod as both activation target and Cherenkov radiator. D-...

N. Kaneshige, S. Alawabdeh, W. Hennig, D. Cech, M. Hua, R. Grazioso

2602.22477 2026-02-25