Papers
Research papers from arXiv and related sources
SignAgent: Agentic LLMs for Linguistically-Grounded Sign Language Annotation and Dataset Curation
This paper introduces SignAgent, a novel agentic framework that utilises Large Language Models (LLMs) for scalable, linguistically-grounded Sign Language (SL) annotation and dataset curation. Tradi...
Oliver Cory, Ozge Mercanoglu Sincan, Richard Bowden
Mitigating the Bandwidth Wall via Data-Streaming System-Accelerator Co-Design
Transformers have revolutionized AI in natural language processing and computer vision, but their large computation and memory demands pose major challenges for hardware acceleration. In practice, ...
Qunyou Liu, Marina Zapater, David Atienza
The Simplicity of the Hodge Bundle
This paper shows that the Hodge bundle over the moduli space of genus $g \geq 2$ curves does not contain any non-trivial sub-bundles. Notably, the mathematical content was generated by Aletheia, a ...
Anand Patel
Optimal Sample Size Calculation in Cost-Effectiveness Longitudinal Cluster Randomized Trials
Longitudinal cluster randomized trials (L-CRTs) are increasingly used to evaluate the cost-effectiveness of healthcare interventions across multiple assessment periods, yet design methods for power...
Hao Wang, Jingxia Liu, Drew B. Cameron, Jiaqi Tong, Donna Spiegelman, Daniella Meeker, Fan Li
MoRI: Learning Motivation-Grounded Reasoning for Scientific Ideation in Large Language Models
Scientific ideation aims to propose novel solutions within a given scientific context. Existing LLM-based agentic approaches emulate human research workflows, yet inadequately model scientific reas...
Chenyang Gu, Jiahao Cheng, Meicong Zhang, Pujun Zheng, Jinquan Zheng, Guoxiu He
Man and machine: artificial intelligence and judicial decision making
The integration of artificial intelligence (AI) technologies into judicial decision-making - particularly in pretrial, sentencing, and parole contexts - has generated substantial concerns about tra...
Arthur Dyevre, Ahmad Shahvaroughi
Non-Markovian Cosmic-Ray Pitch-Angle Transport from Mirror Interactions
Cosmic-ray pitch-angle transport in magnetohydrodynamic (MHD) turbulence is governed by the interplay between magnetic mirroring and gyroresonant scattering. We develop a guiding-center (GC) Langev...
Kai Yan, Huirong Yan, Parth Pavaskar, Chuanpeng Hou, Ruo-Yu Liu
LLMs Aren't Human: A Critical Perspective on LLM Personality
A growing body of research examines personality traits in Large Language Models (LLMs), particularly in human-agent collaboration. Prior work has frequently applied the Big Five inventory to assess...
Kim Zierahn, Cristina Cachero, Anna Korhonen, Nuria Oliver
SEM: Sparse Embedding Modulation for Post-Hoc Debiasing of Vision-Language Models
Models that bridge vision and language, such as CLIP, are key components of multimodal AI, yet their large-scale, uncurated training data introduce severe social and spurious biases. Existing post-...
Quentin Guimard, Federico Bartsch, Simone Caldarella, Rahaf Aljundi, Elisa Ricci, Massimiliano Ma...
Rethinking MLLM Itself as a Segmenter with a Single Segmentation Token
Recent segmentation methods leveraging Multi-modal Large Language Models (MLLMs) have shown reliable object-level segmentation and enhanced spatial perception. However, almost all previous methods ...
Anqi Zhang, Xiaokang Ji, Guangyu Gao, Jianbo Jiao, Chi Harold Liu, Yunchao Wei
Towards Verifiable AI with Lightweight Cryptographic Proofs of Inference
When large AI models are deployed as cloud-based services, clients have no guarantee that responses are correct or were produced by the intended model. Rerunning inference locally is infeasible for...
Pranay Anchuri, Matteo Campanelli, Paul Cesaretti, Rosario Gennaro, Tushar M. Jois, Hasan S. Kaym...
Behavioral Fingerprints for LLM Endpoint Stability and Identity
The consistency of AI-native applications depends on the behavioral consistency of the model endpoints that power them. Traditional reliability metrics such as uptime, latency and throughput do not...
Jonah Leshin, Manish Shah, Ian Timmis, Daniel Kang
GWTC-4.0: Tests of General Relativity. III. Tests of the Remnants
This is the third paper of the set recording the results of the suite of tests of general relativity (GR) performed on the signals from the fourth Gravitational-Wave Transient Catalog (GWTC-4.0), w...
The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Aba...
GWTC-4.0: Tests of General Relativity. II. Parameterized Tests
In this second of three papers on tests of general relativity (GR) applied to the compact binary coalescence signals in the fourth Gravitational-Wave Transient Catalog (GWTC-4.0), we present the re...
The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Aba...
GWTC-4.0: Tests of General Relativity. I. Overview and General Tests
The worldwide LIGO-Virgo-KAGRA network of gravitational-wave (GW) detectors continues to increase in sensitivity, thus increasing the quantity and quality of the detected GW signals from compact bi...
The LIGO Scientific Collaboration, the Virgo Collaboration, the KAGRA Collaboration, A. G. Aba...
What Really Controls Temporal Reasoning in Large Language Models: Tokenisation or Representation of Time?
We present MultiTempBench, a multilingual temporal reasoning benchmark spanning three tasks, date arithmetic, time zone conversion, and temporal relation extraction across five languages (English, ...
Gagan Bhatia, Ahmad Muhammad Isa, Maxime Peyrard, Wei Zhao
Generalized Hand-Object Pose Estimation with Occlusion Awareness
Generalized 3D hand-object pose estimation from a single RGB image remains challenging due to the large variations in object appearances and interaction patterns, especially under heavy occlusion. ...
Hui Yang, Wei Sun, Jian Liu, Jian Xiao Tao Xie, Hossein Rahmani, Ajmal Saeed mian, Nicu Sebe, Gim...
Progressive Integrality Outer-Inner Approximation for AC Unit Commitment with Conic Formulation
The alternating-current unit commitment (AC-UC) problem provides a realistic representation of power system operations, which is a nonconvex mixed-integer nonlinear programming problem and hence is...
Yongzheng Dai
Security awareness in LLM agents: the NDAI zone case
NDAI zones let inventor and investor agents negotiate inside a Trusted Execution Environment (TEE) where any disclosed information is deleted if no deal is reached. This makes full IP disclosure th...
Enrico Bottazzi, Pia Park
Hypothesis-Conditioned Query Rewriting for Decision-Useful Retrieval
Retrieval-Augmented Generation (RAG) improves Large Language Models (LLMs) by grounding generation in external, non-parametric knowledge. However, when a task requires choosing among competing opti...
Hangeol Chang, Changsun Lee, Seungjoon Rho, Junho Yeo, Jong Chul Ye