Research

Papers

Research papers from arXiv and related sources

Total: 4513, AI/LLM: 2483, Testing: 2030
AI LLM

ClawTrap: A MITM-Based Red-Teaming Framework for Real-World OpenClaw Security Evaluation

Autonomous web agents such as OpenClaw are rapidly moving into high-impact real-world workflows, but their security robustness under live network threats remains insufficiently evaluated. ...

Haochen Zhao, Shaoyang Cui

2603.18762 2026-03-19
TESTING

NeuroGame Transformer: Gibbs-Inspired Attention Driven by Game Theory and Statistical Physics

Standard attention mechanisms in transformers are limited by their pairwise formulation, which hinders the modeling of higher-order dependencies among tokens. We introduce the NeuroGame Transformer...

Djamel Bouchaffra, Fayçal Ykhlef, Hanene Azzag, Mustapha Lebbah, Bilal Faye

2603.18761 2026-03-19
AI LLM

Dual-Model Prediction of Affective Engagement and Vocal Attractiveness from Speaker Expressiveness in Video Learning

This paper outlines a machine learning-enabled speaker-centric Emotion AI approach capable of predicting audience-affective engagement and vocal attractiveness in asynchronous video-based learning,...

Hung-Yue Suen, Kuo-En Hung, Fan-Hsun Tseng

2603.18758 2026-03-19
AI LLM

Are complicated loss functions necessary for teaching LLMs to reason?

Recent advances in large language models (LLMs) highlight the importance of post training techniques for improving reasoning and mathematical ability. Group Relative Policy Optimization (GRPO) has ...

Gabriele Carrino, Andrea Sassella, Nicolo Brunello, Federico Toschi, Mark James Carman

2603.18756 2026-03-19
TESTING

Spreading of pathological proteins through brain networks: a case study for Alzheimer's disease

Mathematical modeling offers a valuable approach to understanding Alzheimer's disease (AD) given its complexity, unknown causes, and lack of effective treatments. Models, once validated, offer a pow...

G. Landi, A. Scaravelli, M. C. Tesi, C. Testa

2603.18755 2026-03-19
AI LLM

Automatic detection of Gen-AI texts: A comparative framework of neural models

The rapid proliferation of Large Language Models has significantly increased the difficulty of distinguishing between human-written and AI-generated texts, raising critical issues across academic, ...

Cristian Buttaro, Irene Amerini

2603.18750 2026-03-19
AI LLM

Memento-Skills: Let Agents Design Agents

We introduce Memento-Skills, a generalist, continually-learnable LLM agent system that functions as an agent-designing agent: it autonomously constructs, adapts, and improves task-spe...

Huichi Zhou, Siyuan Guo, Anjie Liu, Zhongwei Yu, Ziqin Gong, Bowen Zhao, Zhixun Chen, Menglong Zh...

2603.18743 2026-03-19
AI LLM

Measuring and Exploiting Confirmation Bias in LLM-Assisted Security Code Review

Security code reviews increasingly rely on systems integrating Large Language Models (LLMs), ranging from interactive assistants to autonomous agents in CI/CD pipelines. We study whether confirmati...

Dimitris Mitropoulos, Nikolaos Alexopoulos, Georgios Alexopoulos, Diomidis Spinellis

2603.18740 2026-03-19
AI LLM

EdgeCrafter: Compact ViTs for Edge Dense Prediction via Task-Specialized Distillation

Deploying high-performance dense prediction models on resource-constrained edge devices remains challenging due to strict limits on computation and memory. In practice, lightweight systems for obje...

Longfei Liu, Yongjie Hou, Yang Li, Qirui Wang, Youyang Sha, Yongjun Yu, Yinzhi Wang, Peizhe Ru, X...

2603.18739 2026-03-19
TESTING

Kinematic diagnostics for non-axisymmetry in the Milky Way's nuclear stellar disc

There is now strong evidence that the Milky Way (MW) hosts a nuclear stellar disc (NSD). However, whether the NSD is purely axisymmetric or contains a nuclear bar remains unresolved. Since approxim...

Karl Fiteni, Xingchen Li, Mattia C. Sormani, Victor P. Debattista, Arianna Vasini, Francisco Nogu...

2603.18738 2026-03-19
AI LLM

CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks

Despite the success of reinforcement learning from human feedback (RLHF) in aligning language models, current reward modeling heavily relies on experimental feedback data collected from human annot...

Hao Wang, Licheng Pan, Zhichao Chen, Chunyuan Zheng, Zhixuan Chu, Xiaoxi Li, Yuan Lu, Xinggao Liu...

2603.18736 2026-03-19
TESTING

SpaceTime Programming: Live and Omniscient Exploration of Code and Execution

Programming environments typically separate the world of static code from the dynamic execution of programs. Developers must switch between writing code and observing its execution, often with limi...

Jean-Baptiste Döderlein, Djamel Eddine Khelladi, Mathieu Acher, Benoit Combemale

2603.18735 2026-03-19
AI LLM

Green Architectural Tactics in ML-enabled Systems: An LLM-based Repository Mining Study

Context: The increasing adoption of machine learning (ML) and artificial intelligence (AI) technologies raises growing concerns about their environmental sustainability. Developing and deploying ML...

Vincenzo De Martino, Silverio Martínez-Fernández, Fabio Palomba

2603.18734 2026-03-19
AI LLM

Analysis Of Linguistic Stereotypes in Single and Multi-Agent Generative AI Architectures

Many works in the literature show that LLM outputs exhibit discriminatory behaviour, triggering stereotype-based inferences based on the dialect in which the inputs are written. This bias has been ...

Martina Ullasci, Marco Rondina, Riccardo Coppola, Flavio Giobergia, Riccardo Bellanca, Gabriele M...

2603.18729 2026-03-19
TESTING

Reconstructions of Single Pixel X-Ray Transforms with Applications in Nuclear-Disarmament Verification

In nuclear arms control and disarmament processes, it is crucial to determine whether an object is a nuclear weapon or not without revealing sensitive information about it. At the MIT: Laboratory f...

Christopher Fichtlscherer, R. Scott Kemp, Christina Brandt

2603.18728 2026-03-19
AI LLM

Ontology-Guided Diffusion for Zero-Shot Visual Sim2Real Transfer

Bridging the simulation-to-reality (sim2real) gap remains challenging as labelled real-world data is scarce. Existing diffusion-based approaches rely on unstructured prompts or statistical alignmen...

Mohamed Youssef, Mayar Elfares, Anna-Maria Meer, Matteo Bortoletto, Andreas Bulling

2603.18719 2026-03-19
AI LLM

MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ Self-Evolution

Memory-augmented LLM agents maintain external memory banks to support long-horizon interaction, yet most existing systems treat construction, retrieval, and utilization as isolated subroutines. Thi...

Minhua Lin, Zhiwei Zhang, Hanqing Lu, Hui Liu, Xianfeng Tang, Qi He, Xiang Zhang, Suhang Wang

2603.18718 2026-03-19
AI LLM

Holter-to-Sleep: AI-Enabled Repurposing of Single-Lead ECG for Sleep Phenotyping

Sleep disturbances are tightly linked to cardiovascular risk, yet polysomnography (PSG), the clinical reference standard, remains resource-intensive and poorly suited for multi-night, home-based, and...

Donglin Xie, Qingshuo Zhao, Jingyu Wang, Shijia Geng, Jiarui Jin, Jun Li, Rongrong Guo, Guangkun ...

2603.18714 2026-03-19
TESTING

Let's Play Tag: Linear Time Evaluation of Conjunctive Queries under TGD Constraints

We study the limits of linear time evaluation of conjunctive queries under constraints expressed as tuple-generating dependencies (TGDs), across several modes of query evaluation: single-testing, a...

Nofar Carmeli, Carsten Lutz, Marcin Przybyłko

2603.18709 2026-03-19
TESTING

High-Performance Portable GPU Primitives for Arbitrary Types and Operators in Julia

Portable GPU frameworks such as Kokkos and RAJA reduce the burden of cross-architecture development but typically incur measurable overhead on fundamental parallel primitives relative to vendor-opt...

Emmanuel Pilliat

2603.18695 2026-03-19