Research

Papers

Research papers from arXiv and related sources

Total: 4694, AI/LLM: 2583, Testing: 2111
AI LLM

Explicit Logic Channel for Validation and Enhancement of MLLMs on Zero-Shot Tasks

Frontier Multimodal Large Language Models (MLLMs) exhibit remarkable capabilities in Visual-Language Comprehension (VLC) tasks. However, they are often deployed as a zero-shot solution to new tasks i...

Mei Chee Leong, Ying Gu, Hui Li Tan, Liyuan Li, Nancy Chen

2603.11689 2026-03-12
AI LLM

SemBench: A Universal Semantic Framework for LLM Evaluation

Recent progress in Natural Language Processing (NLP) has been driven by the emergence of Large Language Models (LLMs), which exhibit remarkable generative and reasoning capabilities. However, despi...

Mikel Zubillaga, Naiara Perez, Oscar Sainz, German Rigau

2603.11687 2026-03-12
TESTING

In the LLM era, Word Sense Induction remains unsolved

In the absence of sense-annotated data, word sense induction (WSI) is a compelling alternative to word sense disambiguation, particularly in low-resource or domain-specific settings. In this paper,...

Anna Mosolova, Marie Candito, Carlos Ramisch

2603.11686 2026-03-12
AI LLM

LLMs can construct powerful representations and streamline sample-efficient supervised learning

As real-world datasets become increasingly complex and heterogeneous, supervised learning is often bottlenecked by input representation design. Modeling multimodal data for downstream tasks, such a...

Ilker Demirel, Larry Shi, Zeshan Hussain, David Sontag

2603.11679 2026-03-12
AI LLM

From Control to Foresight: Simulation as a New Paradigm for Human-Agent Collaboration

Large Language Models (LLMs) are increasingly used to power autonomous agents for complex, multi-step tasks. However, human-agent interaction remains pointwise and reactive: users approve or correc...

Gaole He, Brian Y. Lim

2603.11677 2026-03-12
TESTING

Compact LABFM: a framework for meshless methods with spectral-like resolving power

Meshless methods are often used in numerical simulations of systems of partial differential equations (PDEs), particularly those which involve complex geometries or free surfaces. Here we present a...

Henry M. Broadley, Steven J. Lind, Jack R. C. King

2603.11668 2026-03-12
AI LLM

Machine Learning-Based Analysis of Critical Process Parameters Influencing Product Quality Defects: A Real-World Case Study in Manufacturing

Quality control is an essential operation in manufacturing, ensuring products meet the necessary standards of quality, safety, and reliability. Traditional methods, such as visual inspections, meas...

Sukumaran Rajasekaran, Ebru Turanoglu Bekar, Kanika Gandhi, Sabino Francesco Roselli, Mohan Rajas...

2603.11666 2026-03-12
AI LLM

Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge

Multimodal Large Language Models (MLLMs) have been widely adopted as MLLM-as-a-Judges due to their strong alignment with human judgment across various visual tasks. However, most existing judge mod...

Junjie Wu, Xuan Kan, Zihao He, Shunwen Tan, Bo Pan, Kaitai Zhang

2603.11665 2026-03-12
AI LLM

Resonate: Reinforcing Text-to-Audio Generation via Online Feedback from Large Audio Language Models

Reinforcement Learning (RL) has become an effective paradigm for enhancing Large Language Models (LLMs) and visual generative models. However, its application in text-to-audio (TTA) generation rema...

Xiquan Li, Junxi Liu, Wenxi Chen, Haina Zhu, Ziyang Ma, Xie Chen

2603.11661 2026-03-12
TESTING

Constraints on Axion-Photon Mixing from Fast Radio Burst Dispersion Measures

Fast radio bursts (FRBs) offer a powerful probe of the ionized Universe through their dispersion measures (DM). While a significant fraction of the DM arises from the intergalactic medium (IGM), th...

Gunalan Muthusami, Gopal Kashyap

2603.11657 2026-03-12
TESTING

Exploring the Viability of Fisher Discriminants in Galaxy Morphology Classification

One of the major challenges in astronomy involves accurately classifying galaxies, particularly distinguishing between different galaxy types. While many complex algorithms have shown strong perfor...

Sazatul Nadhilah Zakaria, Santtosh Muniyandy, John Y. H. Soo

2603.11652 2026-03-12
TESTING

A Hybrid Neural-Assisted Unscented Kalman Filter for Unmanned Ground Vehicle Navigation

Modern autonomous navigation for unmanned ground vehicles relies on different estimators to fuse inertial sensors and GNSS measurements. However, the constant noise covariance matrices often strugg...

Gal Versano, Itzik Klein

2603.11649 2026-03-12
TESTING

Beyond BFS: A Comparative Study of Rooted Spanning Tree Algorithms on GPUs

Rooted spanning trees (RSTs) are a core primitive in parallel graph analytics, underpinning algorithms such as biconnected components and planarity testing. On GPUs, RST construction has traditiona...

Abhijeet Sahu, Srikar Vilas Donur

2603.11645 2026-03-12
TESTING

Chunk-Boundary Artifact in Action-Chunked Generative Policies: A Noise-Sensitive Failure Mechanism

Action chunking has become a central design choice for generative visuomotor policies, yet the execution discontinuities that arise at chunk boundaries remain poorly understood. In a frozen pretrai...

Rui Wang

2603.11642 2026-03-12
AI LLM

Tokenization Allows Multimodal Large Language Models to Understand, Generate and Edit Architectural Floor Plans

Architectural floor plan design demands joint reasoning over geometry, semantics, and spatial hierarchy, which remains a major challenge for current AI systems. Although recent diffusion and langua...

Sizhong Qin, Ramon Elias Weber, Xinzheng Lu

2603.11640 2026-03-12
AI LLM

Learnable Template Matching Approach for Micro-Deformation Monitoring based on Integrated Sensing and Communication Platform

Existing integrated sensing and communication (ISAC) platforms fail to fully utilize the shared spectrum and aperture resources for sensing, resulting in poor sensing performance. Specifically, wea...

Zhuoyang Liu, Yixiang Luomei, Feng Xu

2603.11639 2026-03-12
AI LLM

Double-twisted surface spectrum from hybridized Majorana Kramers pairs and wallpaper fermions

We theoretically investigate the superconducting surface states of wallpaper fermions, which are surface quasiparticles of topological nonsymmorphic crystalline insulators protected by a wallpaper ...

Kaito Yoda, Ai Yamakage

2603.11637 2026-03-12
AI LLM

The Density of Cross-Persistence Diagrams and Its Applications

Topological Data Analysis (TDA) provides powerful tools to explore the shape and structure of data through topological features such as clusters, loops, and voids. Persistence diagrams are a corner...

Alexander Mironenko, Evgeny Burnaev, Serguei Barannikov

2603.11623 2026-03-12
AI LLM

Sema: A High-performance System for LLM-based Semantic Query Processing

The integration of Large Language Models (LLMs) into data analytics has unlocked powerful capabilities for reasoning over bulk structured and unstructured data. However, existing systems typically ...

Kangkang Qi, Dongyang Xie, Wenbo Li, Hao Zhang, Yuanyuan Zhu, Jeffrey Xu Yu, Kangfei Zhao

2603.11622 2026-03-12
TESTING

Taming OpenClaw: Security Analysis and Mitigation of Autonomous LLM Agent Threats

Autonomous Large Language Model (LLM) agents, exemplified by OpenClaw, demonstrate remarkable capabilities in executing complex, long-horizon tasks. However, their tightly coupled instant-messaging...

Xinhao Deng, Yixiang Zhang, Jiaqing Wu, Jiaqi Bai, Sibo Yi, Zhuoheng Zou, Yue Xiao, Rennai Qiu, J...

2603.11619 2026-03-12