Research

Papers

Research papers from arXiv and related sources

Total: 4694 AI/LLM: 2583 Testing: 2111
AI LLM

AoE: Always-on Egocentric Human Video Collection for Embodied AI

Embodied foundation models require large-scale, high-quality real-world interaction data for pre-training and scaling. However, existing data collection methods suffer from high infrastructure cost...

Bowen Yang, Zishuo Li, Yang Sun, Changtao Miao, Yifan Yang, Man Luo, Xiaotong Yan, Feng Jiang, Ji...

2602.23893 2026-02-27
TESTING

Optimization-Based Behavioral Modeling of Mixers for Frequency Comb OFDM Radar Processing

This paper presents an optimization-based behavioral model for mixers driven by multi-tone local oscillator (LO) signals, considered specifically for frequency comb orthogonal frequency-division mu...

Umut Utku Erdem, Henning Poensgen, Taewon Jeong, Lucas Giroto, Benjamin Nuss, Ibrahim Kagan Aksoy...

2602.23889 2026-02-27
TESTING

Characterization of Josephson Junction Aging and Annealing Under Different Environments

Understanding the aging behavior of Josephson junctions and the effect of annealing on junction resistances is important in building large-scale superconducting quantum processors. Here we study th...

Rangga P. Budoyo, Rasanayagam S. Kajen, Bing Wen Cheah, Long H. Nguyen, Rainer Dumke

2602.23888 2026-02-27
AI LLM

LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding

Speculative decoding accelerates autoregressive large language model (LLM) inference by using a lightweight draft model to propose candidate tokens that are then verified in parallel by the target ...

Alexander Samarin, Sergei Krutikov, Anton Shevtsov, Sergei Skvortsov, Filipp Fisin, Alexander Gol...

2602.23881 2026-02-27
AI LLM

RF-Agent: Automated Reward Function Design via Language Agent Tree Search

Designing efficient reward functions for low-level control tasks is a challenging problem. Recent research aims to reduce reliance on expert experience by using Large Language Models (LLMs) with ta...

Ning Gao, Xiuhui Zhang, Xingyu Jiang, Mukang You, Mohan Zhang, Yue Deng

2602.23876 2026-02-27
AI LLM

SWE-rebench V2: Language-Agnostic SWE Task Collection at Scale

Software engineering agents (SWE) are improving rapidly, with recent gains largely driven by reinforcement learning (RL). However, RL training is constrained by the scarcity of large-scale task col...

Ibragim Badertdinov, Maksim Nekrashevich, Anton Shevtsov, Alexander Golubev

2602.23866 2026-02-27
AI LLM

RUMAD: Reinforcement-Unifying Multi-Agent Debate

Multi-agent debate (MAD) systems leverage collective intelligence to enhance reasoning capabilities, yet existing approaches struggle to simultaneously optimize accuracy, consensus formation, and c...

Chao Wang, Han Lin, Huaze Tang, Huijing Lin, Wenbo Ding

2602.23864 2026-02-27
AI LLM

NAU-QMUL: Utilizing BERT and CLIP for Multi-modal AI-Generated Image Detection

With the aim of detecting AI-generated images and identifying the specific models responsible for their generation, we propose a multi-modal multi-task model. The model leverages pre-trained BERT a...

Xiaoyu Guo, Arkaitz Zubiaga

2602.23863 2026-02-27
TESTING

A Fast Heuristic for Stochastic Steiner Tree Problems

Network design under uncertainty arises in countless real-world settings and can be captured by the Stochastic Steiner Tree Problem (SSTP). Although there are a few approaches specifically tailored...

Berend Markhorst, Alessandro Zocca, Joost Berkhout, Rob van der Mei

2602.23858 2026-02-27
AI LLM

CLFEC: A New Task for Unified Linguistic and Factual Error Correction in paragraph-level Chinese Professional Writing

Chinese text correction has traditionally focused on spelling and grammar, while factual error correction is usually treated separately. However, in paragraph-level Chinese professional writing, li...

Jian Kai, Zidong Zhang, Jiwen Chen, Zhengxiang Wu, Songtao Sun, Fuyang Li, Yang Cao, Qiang Liu

2602.23845 2026-02-27
TESTING

OmniXtreme: Breaking the Generality Barrier in High-Dynamic Humanoid Control

High-fidelity motion tracking serves as the ultimate litmus test for generalizable, human-level motor skills. However, current policies often hit a "generality barrier": as motion libraries scale i...

Yunshen Wang, Shaohang Zhu, Peiyuan Zhi, Yuhan Li, Jiaxin Li, Yong-Lu Li, Yuchen Xiao, Xingxing W...

2602.23843 2026-02-27
TESTING

Self-Buckling of Pressurized Cylindrical Tubes

We investigate the buckling of hollow cylindrical tubes subject to their own weight and internal pressure, inspired by the columnar cells of the palisade mesophyll in dicotyledon leaves which resem...

Morten Opstrup Andersen, Nikolaj Tønner Osvald Olsen, Diksha Bhola, Aleca Borsuk, Craig Brodersen...

2602.23836 2026-02-27
AI LLM

Measurement of Born Cross Sections for $e^+e^-\toΣ^-\barΣ^+$ at $\sqrt{s}=3.51-4.95$ GeV and Observation of $ψ(3770)\toΣ^-\barΣ^+$

Using $e^+e^-$ collision data corresponding to an integrated luminosity of 44 fb$^{-1}$ collected with the BESIII detector at the BEPCII collider, we report the first measurement of Born cross sect...

BESIII Collaboration, M. Ablikim, M. N. Achasov, P. Adlarson, X. C. Ai, C. S. Akondi, R. Alibert...

2602.23835 2026-02-27
AI LLM

Enhancing Continual Learning for Software Vulnerability Prediction: Addressing Catastrophic Forgetting via Hybrid-Confidence-Aware Selective Replay for Temporal LLM Fine-Tuning

Recent work applies Large Language Models (LLMs) to source-code vulnerability detection, but most evaluations still rely on random train-test splits that ignore time and overestimate real-world per...

Xuhui Dou, Hayretdin Bahsi, Alejandro Guerra-Manzanares

2602.23834 2026-02-27
TESTING

OmniTrack: General Motion Tracking via Physics-Consistent Reference

Learning motion tracking from rich human motion data is a foundational task for achieving general control in humanoid robots, enabling them to perform diverse behaviors. However, discrepancies in m...

Yuhan Li, Peiyuan Zhi, Yunshen Wang, Tengyu Liu, Sixu Yan, Wenyu Liu, Xinggang Wang, Baoxiong Jia...

2602.23832 2026-02-27
TESTING

Efficient Tests for Testing in Two-way ANOVA under Heteroscedasticity

New tests are developed for two-way ANOVA models with heterogeneous error variances. The testing problems are considered for testing the significant interaction effects, simple effects, and treatme...

Anjana Mondal, Somesh Kumar

2602.23815 2026-02-27
AI LLM

ReasonX: Declarative Reasoning on Explanations

Explaining opaque Machine Learning (ML) models has become an increasingly important challenge. However, current eXplanation in AI (XAI) methods suffer several shortcomings, including insufficient a...

Laura State, Salvatore Ruggieri, Franco Turini

2602.23810 2026-02-27
AI LLM

GRAIL: Post-hoc Compensation by Linear Reconstruction for Compressed Networks

Structured deep model compression methods are hardware-friendly and substantially reduce memory and inference costs. However, under aggressive compression, the resulting accuracy degradation often ...

Wenwu Tang, Dong Wang, Lothar Thiele, Olga Saukh

2602.23795 2026-02-27
AI LLM

Divide and Conquer: Accelerating Diffusion-Based Large Language Models via Adaptive Parallel Decoding

Diffusion-based large language models (dLLMs) have shown promising performance across various reasoning tasks, establishing themselves as an alternative to autoregressive large language models (LLM...

Xiangzhong Luo, Yilin An, Zhicheng Yu, Weichen Liu, Xu Yang

2602.23792 2026-02-27
TESTING

Fourier Angle Alignment for Oriented Object Detection in Remote Sensing

In remote sensing rotated object detection, mainstream methods suffer from two bottlenecks, directional incoherence at detector neck and task conflict at detecting head. Ulitising fourier rotation ...

Changyu Gu, Linwei Chen, Lin Gu, Ying Fu

2602.23790 2026-02-27