Research

Papers

Research papers from arXiv and related sources

Total: 4513 AI/LLM: 2483 Testing: 2030
AI LLM

Can You Tell It's AI? Human Perception of Synthetic Voices in Vishing Scenarios

Large Language Models and commercial speech synthesis systems now enable highly realistic AI-generated voice scams (vishing), raising urgent concerns about deception at scale. Yet it remains unclea...

Zoha Hayat Bhatti, Bakhtawar Ahtisham, Seemal Tausif, Niklas George, Nida ul Habib Bajwa, Mobin J...

2602.20061 2026-02-23
TESTING

MeanFuser: Fast One-Step Multi-Modal Trajectory Generation and Adaptive Reconstruction via MeanFlow for End-to-End Autonomous Driving

Generative models have shown great potential in trajectory planning. Recent studies demonstrate that anchor-guided generative models are effective in modeling the uncertainty of driving behaviors a...

Junli Wang, Xueyi Liu, Yinan Zheng, Zebing Xing, Pengfei Li, Guang Li, Kun Ma, Guang Chen, Hangju...

2602.20060 2026-02-23
AI LLM

Interaction Theater: A case of LLM Agents Interacting at Scale

As multi-agent architectures and agent-to-agent protocols proliferate, a fundamental question arises: what actually happens when autonomous LLM agents interact at scale? We study this question empi...

Sarath Shekkizhar, Adam Earle

2602.20059 2026-02-23
AI LLM

To Move or Not to Move: Constraint-based Planning Enables Zero-Shot Generalization for Interactive Navigation

Visual navigation typically assumes the existence of at least one obstacle-free path between start and goal, which must be discovered/planned by the robot. However, in real-world scenarios, such as...

Apoorva Vashisth, Manav Kulshrestha, Pranav Bakshi, Damon Conover, Guillaume Sartoretti, Aniket Bera

2602.20055 2026-02-23
AI LLM

Entropy in Large Language Models

In this study, the output of large language models (LLM) is considered an information source generating an unlimited sequence of symbols drawn from a finite alphabet. Given the probabilistic nature...

Marco Scharringhausen

2602.20052 2026-02-23
TESTING

noDice: Inference for Discrete Probabilistic Programs with Nondeterminism and Conditioning

Probabilistic programming languages (PPLs) are an expressive and intuitive means of representing complex probability distributions. In that realm, languages like Dice target an important class of p...

Tobias Gürtler, Benjamin Lucien Kaminski

2602.20049 2026-02-23
AI LLM

CodeCompass: Navigating the Navigation Paradox in Agentic Code Intelligence

Modern code intelligence agents operate in contexts exceeding 1 million tokens--far beyond the scale where humans manually locate relevant files. Yet agents consistently fail to discover architectu...

Tarakanath Paipuru

2602.20048 2026-02-23
AI LLM

Let There Be Claws: An Early Social Network Analysis of AI Agents on Moltbook

Within twelve days of launch, an AI-native social platform exhibits extreme attention concentration, hierarchical role separation, and one-way attention flow, consistent with the hypothesis that st...

H. C. W. Price, H. AlMuhanna, P. M. Bassani, M. Ho, T. S. Evans

2602.20044 2026-02-23
AI LLM

AgenticSum: An Agentic Inference-Time Framework for Faithful Clinical Text Summarization

Large language models (LLMs) offer substantial promise for automating clinical text summarization, yet maintaining factual consistency remains challenging due to the length, noise, and heterogeneit...

Fahmida Liza Piya, Rahmatollah Beheshti

2602.20040 2026-02-23
AI LLM

Latent Introspection: Models Can Detect Prior Concept Injections

We uncover a latent capacity for introspection in a Qwen 32B model, demonstrating that the model can detect when concepts have been injected into its earlier context and identify which concept was ...

Theia Pearson-Vogel, Martin Vanek, Raymond Douglas, Jan Kulveit

2602.20031 2026-02-23
AI LLM

Agents of Chaos

We report an exploratory red-teaming study of autonomous language-model-powered agents deployed in a live laboratory environment with persistent memory, email accounts, Discord access, file systems...

Natalie Shapira, Chris Wendler, Avery Yen, Gabriele Sarti, Koyena Pal, Olivia Floody, Adam Belfki...

2602.20021 2026-02-23
TESTING

gencat: Generative computerized adaptive testing

Existing computerized Adaptive Testing (CAT) frameworks are typically built on predicting the correctness of a student response to a question. Although effective, this approach fails to leverage te...

Wanyong Feng, Andrew Lan

2602.20020 2026-02-23
TESTING

From High-Level Requirements to KPIs: Conformal Signal Temporal Logic Learning for Wireless Communications

Softwarized radio access networks (RANs), such as those based on the Open RAN (O-RAN) architecture, generate rich streams of key performance indicators (KPIs) that can be leveraged to extract actio...

Jiechen Chen, Michele Polese, Osvaldo Simeone

2602.20018 2026-02-23
TESTING

QUIETT: Query-Independent Table Transformation for Robust Reasoning

Real-world tables often exhibit irregular schemas, heterogeneous value formats, and implicit relational structure, which degrade the reliability of downstream table reasoning and question answering...

Gaurav Najpande, Tampu Ravi Kumar, Manan Roy Choudhury, Neha Valeti, Yanjie Fu, Vivek Gupta

2602.20017 2026-02-23
TESTING

Existence of weak solutions for incompressible fluid-Koiter shell interactions with Navier slip boundary condition

We study a three-dimensional fluid-structure interaction problem describing the motion of an incompressible, viscous fluid coupled with a deformable elastic shell of Koiter type that forms part of ...

Claudiu Mîndrilă, Arnab Roy

2602.20016 2026-02-23
AI LLM

Protecting and Promoting Human Agency in Education in the Age of Artificial Intelligence

Human agency is crucial in education and increasingly challenged by the use of generative AI. This meeting report synthesizes interdisciplinary insights and conceptualizes four aspects that delinea...

Olga Viberg, Mutlu Cukurova, Rene F. Kizilcec, Simon Buckingham Shum, Dorottya Demszky, Dragan Ga...

2602.20014 2026-02-23
TESTING

Change point analysis of high-dimensional data using random projections

This paper develops a novel change point identification method for high-dimensional data using random projections. By projecting high-dimensional time series into a one-dimensional space, we are ab...

Yi Xu, Yeonwoo Rho

2602.19988 2026-02-23
TESTING

Multivariate time-series forecasting of ASTRI-Horn monitoring data: A Normal Behavior Model

This study presents a Normal Behavior Model (NBM) developed to forecast monitoring time-series data from the ASTRI-Horn Cherenkov telescope under normal operating conditions. The analysis focused o...

Federico Incardona, Alessandro Costa, Farida Farsian, Francesco Franchina, Giuseppe Leto, Emilio ...

2602.19984 2026-02-23
AI LLM

SongEcho: Towards Cover Song Generation via Instance-Adaptive Element-wise Linear Modulation

Cover songs constitute a vital aspect of musical culture, preserving the core melody of an original composition while reinterpreting it to infuse novel emotional depth and thematic emphasis. Althou...

Sifei Li, Yang Li, Zizhou Wang, Yuxin Zhang, Fuzhang Wu, Oliver Deussen, Tong-Yee Lee, Weiming Dong

2602.19976 2026-02-23
AI LLM

RL-RIG: A Generative Spatial Reasoner via Intrinsic Reflection

Recent advancements in image generation have achieved impressive results in producing high-quality images. However, existing image generation models still generally struggle with a spatial reasonin...

Tianyu Wang, Zhiyuan Ma, Qian Wang, Xinyi Zhang, Xinwei Long, Bowen Zhou

2602.19974 2026-02-23