Research

Papers

Research papers from arXiv and related sources

Total: 4513, AI/LLM: 2483, Testing: 2030
TESTING

A Robotic Testing Platform for Pipelined Discovery of Resilient Soft Actuators

Short lifetime under high electrical fields hinders the widespread robotic application of linear dielectric elastomer actuators (DEAs). Systematic scanning is difficult due to time-consuming per-sa...

Ang, Li, Alexander Yin, Alexander White, Sahib Sandhu, Matthew Francoeur, Victor Jimenez-Santia...

2602.20963 2026-02-24
TESTING

EKF-Based Depth Camera and Deep Learning Fusion for UAV-Person Distance Estimation and Following in SAR Operations

Search and rescue (SAR) operations require rapid responses to save lives or property. Unmanned Aerial Vehicles (UAVs) equipped with vision-based systems support these missions through prior terrain...

Luka Šiktar, Branimir Ćaran, Bojan Šekoranja, Marko Švaco

2602.20958 2026-02-24
AI LLM

See and Fix the Flaws: Enabling VLMs and Diffusion Models to Comprehend Visual Artifacts via Agentic Data Synthesis

Despite recent advances in diffusion models, AI-generated images still often contain visual artifacts that compromise realism. Although more thorough pre-training and bigger models might reduce art...

Jaehyun Park, Minyoung Ahn, Minkyu Kim, Jonghyun Lee, Jae-Gil Lee, Dongmin Park

2602.20951 2026-02-24
AI LLM

Some Simple Economics of AGI

For millennia, human cognition was the primary engine of progress on Earth. As AI decouples cognition from biology, the marginal cost of measurable execution falls to zero, absorbing any labor capt...

Christian Catalini, Xiang Hui, Jane Wu

2602.20946 2026-02-24
AI LLM

The Art of Efficient Reasoning: Data, Reward, and Optimization

Large Language Models (LLMs) consistently benefit from scaled Chain-of-Thought (CoT) reasoning, but also suffer from heavy computational overhead. To address this issue, efficient reasoning aims to...

Taiqiang Wu, Zenan Zu, Bo Zhou, Ngai Wong

2602.20945 2026-02-24
AI LLM

Architecting AgentOS: From Token-Level Context to Emergent System-Level Intelligence

The paradigm of Large Language Models is undergoing a fundamental transition from static inference engines to dynamic autonomous cognitive systems. While current research primarily focuses on scalin...

ChengYou Li, XiaoDong Liu, XiangBao Meng, XinYu Zhao

2602.20934 2026-02-24
AI LLM

HELP: HyperNode Expansion and Logical Path-Guided Evidence Localization for Accurate and Efficient GraphRAG

Large Language Models (LLMs) often struggle with inherent knowledge boundaries and hallucinations, limiting their reliability in knowledge-intensive tasks. While Retrieval-Augmented Generation (RAG...

Yuqi Huang, Ning Liao, Kai Yang, Anning Hu, Shengchao Hu, Xiaoxing Wang, Junchi Yan

2602.20926 2026-02-24
TESTING

Airavat: An Agentic Framework for Internet Measurement

Internet measurement faces twin challenges: complex analyses require expert-level orchestration of tools, yet even syntactically correct implementations can have methodological flaws and can be dif...

Alagappan Ramanathan, Eunju Kang, Dongsu Han, Sangeetha Abdu Jyothi

2602.20924 2026-02-24
AI LLM

Predicting Sentence Acceptability Judgments in Multimodal Contexts

Previous work has examined the capacity of deep neural networks (DNNs), particularly transformers, to predict human sentence acceptability judgments, both independently of context, and in document ...

Hyewon Jang, Nikolai Ilinykh, Sharid Loáiciga, Jey Han Lau, Shalom Lappin

2602.20918 2026-02-24
TESTING

A Corrected Welch Satterthwaite Equation. And: What You Always Wanted to Know About Kish's Effective Sample but Were Afraid to Ask

This article presents a corrected version of the Satterthwaite (1941, 1946) approximation for the degrees of freedom of a weighted sum of independent variance components. The original formula is kn...

Matthias von Davier

2602.20912 2026-02-24
TESTING

On Stein's test of uniformity on the hypersphere

We propose a new test of uniformity on the hypersphere based on a Stein characterization associated with the Laplace-Beltrami operator. We identify a sufficient class of test functions for this ch...

Paul Axmann, Bruno Ebner, Eduardo García-Portugués

2602.20896 2026-02-24
AI LLM

InterPilot: Exploring the Design Space of AI-assisted Job Interview Support for HR Professionals

Recruitment interviews are cognitively demanding interactions in which interviewers must simultaneously listen, evaluate candidates, take notes, and formulate follow-up questions. To better underst...

Zhengtao Xu, Zimo Xia, Zicheng Zhu, Nattapat Boonprakong, Yu-An Chen, Rabih Zbib, Casimiro Pio Ca...

2602.20891 2026-02-24
AI LLM

When LLMs Enter Everyday Feminism on Chinese Social Media: Opportunities and Risks for Women's Empowerment

Everyday digital feminism refers to the ordinary, often pragmatic ways women articulate lived experiences and cultivate solidarity in online spaces. In China, such practices flourish on RedNote thr...

Runhua Zhang, Ziqi Pan, Kangyu Yuan, Qiaoyi Chen, Yulin Tian, Huamin Qu, Xiaojuan Ma

2602.20876 2026-02-24
AI LLM

MUSE: Harnessing Precise and Diverse Semantics for Few-Shot Whole Slide Image Classification

In computational pathology, few-shot whole slide image classification is primarily driven by the extreme scarcity of expert-labeled slides. Recent vision-language methods incorporate textual semant...

Jiahao Xu, Sheng Huang, Xin Zhang, Zhixiong Nan, Jiajun Dong, Nankun Mu

2602.20873 2026-02-24
TESTING

FGFRFT: Fast Graph Fractional Fourier Transform via Fourier Series Approximation

The graph fractional Fourier transform (GFRFT) generalizes the graph Fourier transform (GFT) but suffers from a significant computational bottleneck: determining the optimal transform order require...

Ziqi Yan, Sen Shi, Feiyue Zhao, Manjun Cui, Yangfan He, Zhichao Zhang

2602.20870 2026-02-24
AI LLM

SoK: Agentic Skills -- Beyond Tool Use in LLM Agents

Agentic systems increasingly rely on reusable procedural capabilities, a.k.a. agentic skills, to execute long-horizon workflows reliably. These capabilities are callable modules that pack...

Yanna Jiang, Delong Li, Haiyu Deng, Baihe Ma, Xu Wang, Qin Wang, Guangsheng Yu

2602.20867 2026-02-24
AI LLM

FinAnchor: Aligned Multi-Model Representations for Financial Prediction

Financial prediction from long documents involves significant challenges, as actionable signals are often sparse and obscured by noise, and the optimal LLM for generating embeddings varies across t...

Zirui He, Huopu Zhang, Yanguang Liu, Sirui Wu, Mengnan Du

2602.20859 2026-02-24
TESTING

Maximum entropy based testing in network models: ERGMs and constrained optimization

Stochastic network models play a central role across a wide range of scientific disciplines, and questions of statistical inference arise naturally in this context. In this paper we investigate goo...

Subhrosekhar Ghosh, Rathindra Nath Karmakar, Samriddha Lahiry

2602.20844 2026-02-24
AI LLM

Training-Free Multi-Concept Image Editing

Editing images with diffusion models without training remains challenging. While recent optimisation-based methods achieve strong zero-shot edits from text, they struggle to preserve identity or ca...

Niki Foteinopoulou, Ignas Budvytis, Stephan Liwicki

2602.20839 2026-02-24
TESTING

DRESS: A Continuous Framework for Structural Graph Refinement

The Weisfeiler-Lehman (WL) hierarchy is a cornerstone framework for graph isomorphism testing and structural analysis. However, scaling beyond 1-WL to 3-WL and higher requires tensor-based operatio...

Eduar Castrillo Velilla

2602.20833 2026-02-24