Research

Papers

Research papers from arXiv and related sources

Total: 4513 (AI/LLM: 2483, Testing: 2030)
AI LLM

SignAgent: Agentic LLMs for Linguistically-Grounded Sign Language Annotation and Dataset Curation

This paper introduces SignAgent, a novel agentic framework that utilises Large Language Models (LLMs) for scalable, linguistically-grounded Sign Language (SL) annotation and dataset curation. Tradi...

Oliver Cory, Ozge Mercanoglu Sincan, Richard Bowden

2603.19059 2026-03-19
AI LLM

Mitigating the Bandwidth Wall via Data-Streaming System-Accelerator Co-Design

Transformers have revolutionized AI in natural language processing and computer vision, but their large computation and memory demands pose major challenges for hardware acceleration. In practice, ...

Qunyou Liu, Marina Zapater, David Atienza

2603.19057 2026-03-19
AI LLM

The Simplicity of the Hodge Bundle

This paper shows that the Hodge bundle over the moduli space of genus $g \geq 2$ curves does not contain any non-trivial sub-bundles. Notably, the mathematical content was generated by Aletheia, a ...

Anand Patel

2603.19052 2026-03-19
TESTING

Optimal Sample Size Calculation in Cost-Effectiveness Longitudinal Cluster Randomized Trials

Longitudinal cluster randomized trials (L-CRTs) are increasingly used to evaluate the cost-effectiveness of healthcare interventions across multiple assessment periods, yet design methods for power...

Hao Wang, Jingxia Liu, Drew B. Cameron, Jiaqi Tong, Donna Spiegelman, Daniella Meeker, Fan Li

2603.19051 2026-03-19
AI LLM

MoRI: Learning Motivation-Grounded Reasoning for Scientific Ideation in Large Language Models

Scientific ideation aims to propose novel solutions within a given scientific context. Existing LLM-based agentic approaches emulate human research workflows, yet inadequately model scientific reas...

Chenyang Gu, Jiahao Cheng, Meicong Zhang, Pujun Zheng, Jinquan Zheng, Guoxiu He

2603.19044 2026-03-19
AI LLM

Man and machine: artificial intelligence and judicial decision making

The integration of artificial intelligence (AI) technologies into judicial decision-making - particularly in pretrial, sentencing, and parole contexts - has generated substantial concerns about tra...

Arthur Dyevre, Ahmad Shahvaroughi

2603.19042 2026-03-19
TESTING

Non-Markovian Cosmic-Ray Pitch-Angle Transport from Mirror Interactions

Cosmic-ray pitch-angle transport in magnetohydrodynamic (MHD) turbulence is governed by the interplay between magnetic mirroring and gyroresonant scattering. We develop a guiding-center (GC) Langev...

Kai Yan, Huirong Yan, Parth Pavaskar, Chuanpeng Hou, Ruo-Yu Liu

2603.19037 2026-03-19
AI LLM

LLMs Aren't Human: A Critical Perspective on LLM Personality

A growing body of research examines personality traits in Large Language Models (LLMs), particularly in human-agent collaboration. Prior work has frequently applied the Big Five inventory to assess...

Kim Zierahn, Cristina Cachero, Anna Korhonen, Nuria Oliver

2603.19030 2026-03-19
AI LLM

SEM: Sparse Embedding Modulation for Post-Hoc Debiasing of Vision-Language Models

Models that bridge vision and language, such as CLIP, are key components of multimodal AI, yet their large-scale, uncurated training data introduce severe social and spurious biases. Existing post-...

Quentin Guimard, Federico Bartsch, Simone Caldarella, Rahaf Aljundi, Elisa Ricci, Massimiliano Ma...

2603.19028 2026-03-19
AI LLM

Rethinking MLLM Itself as a Segmenter with a Single Segmentation Token

Recent segmentation methods leveraging Multi-modal Large Language Models (MLLMs) have shown reliable object-level segmentation and enhanced spatial perception. However, almost all previous methods ...

Anqi Zhang, Xiaokang Ji, Guangyu Gao, Jianbo Jiao, Chi Harold Liu, Yunchao Wei

2603.19026 2026-03-19
AI LLM

Towards Verifiable AI with Lightweight Cryptographic Proofs of Inference

When large AI models are deployed as cloud-based services, clients have no guarantee that responses are correct or were produced by the intended model. Rerunning inference locally is infeasible for...

Pranay Anchuri, Matteo Campanelli, Paul Cesaretti, Rosario Gennaro, Tushar M. Jois, Hasan S. Kaym...

2603.19025 2026-03-19
AI LLM

Behavioral Fingerprints for LLM Endpoint Stability and Identity

The consistency of AI-native applications depends on the behavioral consistency of the model endpoints that power them. Traditional reliability metrics such as uptime, latency and throughput do not...

Jonah Leshin, Manish Shah, Ian Timmis, Daniel Kang

2603.19022 2026-03-19
TESTING

GWTC-4.0: Tests of General Relativity. III. Tests of the Remnants

This is the third paper of the set recording the results of the suite of tests of general relativity (GR) performed on the signals from the fourth Gravitational-Wave Transient Catalog (GWTC-4.0), w...

The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Aba...

2603.19021 2026-03-19
TESTING

GWTC-4.0: Tests of General Relativity. II. Parameterized Tests

In this second of three papers on tests of general relativity (GR) applied to the compact binary coalescence signals in the fourth Gravitational-Wave Transient Catalog (GWTC-4.0), we present the re...

The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Aba...

2603.19020 2026-03-19
TESTING

GWTC-4.0: Tests of General Relativity. I. Overview and General Tests

The worldwide LIGO-Virgo-KAGRA network of gravitational-wave (GW) detectors continues to increase in sensitivity, thus increasing the quantity and quality of the detected GW signals from compact bi...

The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Aba...

2603.19019 2026-03-19
AI LLM

What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?

We present MultiTempBench, a multilingual temporal reasoning benchmark spanning three tasks (date arithmetic, time zone conversion, and temporal relation extraction) across five languages (English, ...

Gagan Bhatia, Ahmad Muhammad Isa, Maxime Peyrard, Wei Zhao

2603.19017 2026-03-19
AI LLM

Generalized Hand-Object Pose Estimation with Occlusion Awareness

Generalized 3D hand-object pose estimation from a single RGB image remains challenging due to the large variations in object appearances and interaction patterns, especially under heavy occlusion. ...

Hui Yang, Wei Sun, Jian Liu, Jian Xiao, Tao Xie, Hossein Rahmani, Ajmal Saeed Mian, Nicu Sebe, Gim...

2603.19013 2026-03-19
TESTING

Progressive Integrality Outer-Inner Approximation for AC Unit Commitment with Conic Formulation

The alternating-current unit commitment (AC-UC) problem provides a realistic representation of power system operations, which is a nonconvex mixed-integer nonlinear programming problem and hence is...

Yongzheng Dai

2603.19012 2026-03-19
AI LLM

Security awareness in LLM agents: the NDAI zone case

NDAI zones let inventor and investor agents negotiate inside a Trusted Execution Environment (TEE) where any disclosed information is deleted if no deal is reached. This makes full IP disclosure th...

Enrico Bottazzi, Pia Park

2603.19011 2026-03-19
AI LLM

Hypothesis-Conditioned Query Rewriting for Decision-Useful Retrieval

Retrieval-Augmented Generation (RAG) improves Large Language Models (LLMs) by grounding generation in external, non-parametric knowledge. However, when a task requires choosing among competing opti...

Hangeol Chang, Changsun Lee, Seungjoon Rho, Junho Yeo, Jong Chul Ye

2603.19008 2026-03-19