Research

Papers

Research papers from arXiv and related sources

Total: 4694, AI/LLM: 2583, Testing: 2111
AI LLM

Surgical Post-Training: Cutting Errors, Keeping Knowledge

Enhancing the reasoning capabilities of Large Language Models (LLMs) via post-training is often constrained by the trade-off between efficiency and catastrophic forgetting. While prior research emp...

Wenye Lin, Kai Han

2603.01683 2026-03-02
TESTING

$\mathcal{H}$-EFTCAMB: A Cobaya-Integrated, Python-Wrapped Extension of EFTCAMB for Covariant Horndeski Gravity

We present $\mathcal{H}\mathtt{-EFTCAMB}$, the official successor to $\mathtt{EFTCAMB}$. The original $\mathtt{EFTCAMB}$ is designed as a consistent and numerically stable implementation of the eff...

Gen Ye, Shijie Lin, Jiaming Pan, Dani de Boe, Stan Verhoeve, Marco Raveri, Bin Hu, Noemi Fruscian...

2603.01662 2026-03-02
AI LLM

HeRo: Adaptive Orchestration of Agentic RAG on Heterogeneous Mobile SoC

With the increasing computational capability of mobile devices, deploying agentic retrieval-augmented generation (RAG) locally on heterogeneous System-on-Chips (SoCs) has become a promising way to ...

Maoliang Li, Jiayu Chen, Zihao Zheng, Ziqian Li, Xinhao Sun, Guojie Luo, Chenchen Liu, Xiang Chen

2603.01661 2026-03-02
TESTING

Transform-Invariant Generative Ray Path Sampling for Efficient Radio Propagation Modeling

Ray tracing has become a standard for accurate radio propagation modeling, but suffers from exponential computational complexity, as the number of candidate paths scales with the number of objects ...

Jérome Eertmans, Enrico M. Vitucci, Vittorio Degli-Esposti, Nicola Di Cicco, Laurent Jacques, Cla...

2603.01655 2026-03-02
AI LLM

CeProAgents: A Hierarchical Agents System for Automated Chemical Process Development

The development of chemical processes, a cornerstone of chemical engineering, presents formidable challenges due to its multi-faceted nature, integrating specialized knowledge, conceptual design, a...

Yuhang Yang, Ruikang Li, Jifei Ma, Kai Zhang, Qi Liu, Jianyu Han, Yonggan Bu, Jibin Zhou, Defu Li...

2603.01654 2026-03-02
AI LLM

LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence

Understanding and predicting judicial outcomes demands nuanced analysis of legal documents. Traditional approaches treat judgments and proceedings as unstructured text, limiting the effectiveness o...

Anka Chandrahas Tummepalli, Preethu Rose Anish

2603.01651 2026-03-02
AI LLM

PromptStereo: Zero-Shot Stereo Matching via Structure and Motion Prompts

Modern stereo matching methods have leveraged monocular depth foundation models to achieve superior zero-shot generalization performance. However, most existing methods primarily focus on extractin...

Xianqi Wang, Hao Yang, Hangtian Wang, Junda Cheng, Gangwei Xu, Min Lin, Xin Yang

2603.01650 2026-03-02
AI LLM

QCAgent: An agentic framework for quality-controllable pathology report generation from whole slide image

Recent methods for pathology report generation from whole-slide images (WSIs) are capable of producing slide-level diagnostic descriptions but fail to ground fine-grained statements in localized visu...

Rundong Wang, Wei Ba, Ying Zhou, Yingtai Li, Bowen Liu, Baizhi Wang, Yuhao Wang, Zhidong Yang, Ku...

2603.01647 2026-03-02
TESTING

An Investigation of the Relation Between Immersion and Learning Across Three Domains

We investigate the relationship between immersion and learning across three domains (cultural heritage, environmental awareness, and high school physics) through the lens of the Cognitive Affective...

Paolo Boffi, Alberto Gallace, Pier Luca Lanzi

2603.01644 2026-03-02
TESTING

Learning Structured Reasoning via Tractable Trajectory Control

Large language models can exhibit emergent reasoning behaviors, often manifested as recurring lexical patterns (e.g., "wait," indicating verification). However, complex reasoning trajectories remai...

Po-Nien Kung, Zhen Yang, Jeffrey Luo, Cheng-Fu Yang, Haikang Deng, Zi-Yi Dou, Yinfei Yang, Nanyun...

2603.01641 2026-03-02
AI LLM

Learning to Draft: Adaptive Speculative Decoding with Reinforcement Learning

Speculative decoding accelerates large language model (LLM) inference by using a small draft model to generate candidate tokens for a larger target model to verify. The efficacy of this technique h...

Jiebin Zhang, Zhenghan Yu, Liang Wang, Nan Yang, Eugene J. Yu, Zheng Li, Yifan Song, Dawei Zhu, X...

2603.01639 2026-03-02
AI LLM

Who Explains Privacy Policies to Me? Embodied and Textual LLM-Powered Privacy Assistants in Virtual Reality

Virtual Reality (VR) systems collect fine-grained behavioral and biometric data, yet privacy policies are rarely read or understood due to their complex language, length, and poor integration into ...

Vincent Freiberger, Moritz Dresch, Florian Alt, Arthur Fleig, Viktorija Paneva

2603.01638 2026-03-02
TESTING

Episode-wise spectro-polarimetry of GRB 220107A: Testing the hypothesis of evolving radiation mechanisms

We investigate the spectro-polarimetric properties of the long-duration GRB 220107A, which exhibited two distinct emission episodes separated by a 40 s quiescent gap, to test whether such multi-epi...

Rahul Gupta, Rushikesh Sonawane, Shabnam Iyyani, D. Frederiks, Judith Racusin, Tanmoy Chattopadha...

2603.01633 2026-03-02
AI LLM

DeLo: Dual Decomposed Low-Rank Experts Collaboration for Continual Missing Modality Learning

Adapting Large Multimodal Models (LMMs) to real-world scenarios poses the dual challenges of learning from sequential data streams while handling frequent modality incompleteness, a task known as C...

Xiwei Liu, Yulong Li, Feilong Tang, Imran Razzak

2603.01632 2026-03-02
TESTING

SEED-SET: Scalable Evolving Experimental Design for System-level Ethical Testing

As autonomous systems, such as drones, become increasingly deployed in high-stakes, human-centric domains, it is critical to evaluate the ethical alignment since failure to do so imposes imminent da...

Anjali Parashar, Yingke Li, Eric Yang Yu, Fei Chen, James Neidhoefer, Devesh Upadhyay, Chuchu Fan

2603.01630 2026-03-02
AI LLM

Assessing Crime Disclosure Patterns in a Large-Scale Cybercrime Forum

Cybercrime forums play a central role in the cybercrime ecosystem, serving as hubs for the exchange of illicit goods, services, and knowledge. Previous studies have explored the market and social s...

Raphael Hoheisel, Tom Meurs, Jai Wientjes, Marianne Junger, Abhishta Abhishta, Masarah Paquet-Clo...

2603.01624 2026-03-02
AI LLM

The Invisibility Hypothesis: Promises of AGI and the Future of the Global South

Discussions surrounding Artificial General Intelligence have largely focused on technical feasibility, timelines, and existential risk, often treating its social impact as being the same across dif...

L. Julian Lechuga Lopez, Luis Lara

2603.01616 2026-03-02
AI LLM

Closing the Gap Between Float and Posit Hardware Efficiency

The b-posit, or bounded posit, is a variation of the posit format designed for high performance computing (HPC) and AI applications. Unlike traditional floating-point formats (floats), posits use v...

Aditya Anirudh Jonnalagadda, Rishi Thotli, John L. Gustafson

2603.01615 2026-03-02
TESTING

Mapping properties of the $S$-operator

In this paper, we study the $\ell^p\to \ell^r$ estimates for the $S$-operator arising in restriction problems for spheres over finite fields. We establish a necessary and sufficient condition for t...

Hunseok Kang, Doowon Koh, Changhun Yang

2603.01614 2026-03-02
AI LLM

Evaluating and Understanding Scheming Propensity in LLM Agents

As frontier language models are increasingly deployed as autonomous agents pursuing complex, long-term objectives, there is increased risk of scheming: agents covertly pursuing misaligned goals. Pr...

Mia Hopman, Jannes Elstner, Maria Avramidou, Amritanshu Prasad, David Lindner

2603.01608 2026-03-02