Research

Papers

Research papers from arXiv and related sources

Total: 4513 AI/LLM: 2483 Testing: 2030
AI LLM

AgentDS Technical Report: Benchmarking the Future of Human-AI Collaboration in Domain-Specific Data Science

Data science plays a critical role in transforming complex data into actionable insights across numerous domains. Recent developments in large language models (LLMs) and artificial intelligence (AI...

An Luo, Jin Du, Xun Xian, Robert Specht, Fangqiao Tian, Ganghua Wang, Xuan Bi, Charles Fleming, A...

2603.19005 2026-03-19
TESTING

Unleashing the Power of Simplicity: A Minimalist Strategy for State-of-the-Art Fingerprint Enhancement

Fingerprint recognition systems, which rely on the unique characteristics of human fingerprints, are essential in modern security and verification applications. Accurate minutiae extraction, a crit...

Raffaele Cappelli

2603.19004 2026-03-19
AI LLM

RADIUS: Ranking, Distribution, and Significance - A Comprehensive Alignment Suite for Survey Simulation

Simulation of surveys using LLMs is emerging as a powerful application for generating human-like responses at scale. Prior work evaluates survey simulation using metrics borrowed from other domains...

Weronika Łajewska, Paul Missault, George Davidson, Saab Mansour

2603.19002 2026-03-19
AI LLM

SVLAT: Scientific Visualization Literacy Assessment Test

Scientific visualization (SciVis) has become an essential means for exploring, understanding, and communicating complex scientific phenomena. However, the field still lacks a validated instrument a...

Patrick Phuoc Do, Kaiyuan Tang, Kuangshi Ai, Chaoli Wang

2603.19000 2026-03-19
AI LLM

Regret Bounds for Competitive Resource Allocation with Endogenous Costs

We study online resource allocation among N interacting modules over T rounds. Unlike standard online optimization, costs are endogenous: they depend on the full allocation vector through an intera...

Rui Chai

2603.18999 2026-03-19
TESTING

Computation of thermal entropy for the doped Hubbard Model

We develop a highly efficient framework for computing the thermal entropy in the doped Fermi-Hubbard model within the grand-canonical ensemble. The framework comprises four calculation schemes that...

Yu-Feng Song, Youjin Deng, Yuan-Yao He

2603.18998 2026-03-19
TESTING

Radar Detection through Rectified Flow Matching

Radar target detection in the presence of a mixture of non-Gaussian clutter and white thermal noise is a challenging problem. This paper proposes a Rectified Flow Matching-based method for radar de...

P. Meena, Y. A. Rouzoumka, J. Pinsolle, C. Ren, M. N. El Korso, J. -P. Ovarlez

2603.18995 2026-03-19
TESTING

A calibration-free null test from anisotropic BAO

Baryon acoustic oscillation (BAO) analyses usually report the anisotropic shift parameters $α_\perp(z)$ and $α_\parallel(z)$ relative to a fiducial cosmology, and these quantities are primarily use...

Domenico Sapone, Savvas Nesseris

2603.18986 2026-03-19
AI LLM

Book your room in the Turing Hotel! A symmetric and distributed Turing Test with multiple AIs and humans

In this paper, we report our experience with ``TuringHotel'', a novel extension of the Turing Test based on interactions within mixed communities of Large Language Models (LLMs) and human participa...

Christian Di Maio, Tommaso Guidi, Luigi Quarantiello, Jack Bell, Marco Gori, Stefano Melacci, Vin...

2603.18981 2026-03-19
TESTING

A bilinear inverse problem with forward operator inaccuracy applied to neonatal atlas-based diffuse optical tomography

Linear inverse problems are highly common in practical real-world applications from industry to medical imaging. The forward operator is often built on some approximations of the studied system. Ha...

Aada Hakula, Pauliina Hirvi, Nuutti Hyvönen

2603.18980 2026-03-19
AI LLM

Evaluating 5W3H Structured Prompting for Intent Alignment in Human-AI Interaction

Natural language prompts often suffer from intent transmission loss: the gap between what users actually need and what they communicate to AI systems. We evaluate PPS (Prompt Protocol Specification...

Peng Gang

2603.18976 2026-03-19
AI LLM

Terms of (Ab)Use: An Analysis of GenAI Services

Generative AI services like ChatGPT and Gemini are some of the fastest-growing consumer services. Individuals using such services must accept their terms of use before access, and conform to these ...

Harshvardhan J. Pandit, Dick A. H. Blankvoort, Dick A. H. Blankvoort, Sasha Luccioni, Abeba Birhane

2603.18964 2026-03-19
AI LLM

Sketch2Topo: Using Hand-Drawn Inputs for Diffusion-Based Topology Optimization

Topology optimization (TO) is employed in engineering to optimize structural performance while maximizing material efficiency. However, traditional TO methods incur significant computational and ti...

Shuyue Feng, Cedric Caremel, Yoshihiro Kawahara

2603.18960 2026-03-19
AI LLM

Balancing Performance and Fairness in Explainable AI for Anomaly Detection in Distributed Power Plants Monitoring

Reliable anomaly detection in distributed power plant monitoring systems is essential for ensuring operational continuity and reducing maintenance costs, particularly in regions where telecom opera...

Corneille Niyonkuru, Marcellin Atemkeng, Gabin Maxime Nguegnang, Arnaud Nguembang Fadja

2603.18954 2026-03-19
AI LLM

Context Bootstrapped Reinforcement Learning

Reinforcement Learning from Verifiable Rewards (RLVR) suffers from exploration inefficiency, where models struggle to generate successful rollouts, resulting in minimal learning signal. This challe...

Saaket Agashe, Jayanth Srinivasa, Gaowen Liu, Ramana Kompella, Xin Eric Wang

2603.18953 2026-03-19
AI LLM

Entropy trajectory shape predicts LLM reasoning reliability: A diagnostic study of uncertainty dynamics in chain-of-thought

Chain-of-thought (CoT) reasoning improves LLM accuracy, yet detecting failures cheaply remains elusive. We study whether the shape of uncertainty dynamics across reasoning steps--captured by sampli...

Xinghao Zhao

2603.18940 2026-03-19
TESTING

Controller Datapath Aware Verification of Masked Hardware Generated via High Level Synthesis

Masking is a countermeasure against Power Side Channel Attacks (PSCAs) in both software and hardware implementations of cryptographic algorithms. Compared to software masking, implementing masked h...

Nilotpola Sarma, Vaishali Ghanshyam Chaudhuri, Chandan Karfa

2603.18939 2026-03-19
TESTING

Organosulfur Chemistry on sub-Neptunes: Implications for hazes and biosignatures

The organosulfur biosignature gases dimethylsulfide (DMS) and dimethlydisulfide (DMDS) have recently been claimed to be present in the atmosphere of sub-Neptune exoplanet K2-18b, leading to the sug...

Sean Jordan, Shang-Min Tsai, Paul B. Rimmer, Oliver Shorttle

2603.18923 2026-03-19
AI LLM

Agentic Business Process Management: A Research Manifesto

This paper presents a manifesto that articulates the conceptual foundations of Agentic Business Process Management (APM), an extension of Business Process Management (BPM) for governing autonomous ...

Diego Calvanese, Angelo Casciani, Giuseppe De Giacomo, Marlon Dumas, Fabiana Fournier, Timotheus ...

2603.18916 2026-03-19
AI LLM

Security, privacy, and agentic AI in a regulatory view: From definitions and distinctions to provisions and reflections

The rapid proliferation of artificial intelligence (AI) technologies has led to a dynamic regulatory landscape, where legislative frameworks strive to keep pace with technical advancements. As AI p...

Shiliang Zhang, Sabita Maharjan

2603.18914 2026-03-19