Papers
Research papers from arXiv and related sources
Explicit Logic Channel for Validation and Enhancement of MLLMs on Zero-Shot Tasks
Frontier Multimodal Large Language Models (MLLMs) exhibit remarkable capabilities in Visual-Language Comprehension (VLC) tasks. However, they are often deployed as zero-shot solution to new tasks i...
Mei Chee Leong, Ying Gu, Hui Li Tan, Liyuan Li, Nancy Chen
SemBench: A Universal Semantic Framework for LLM Evaluation
Recent progress in Natural Language Processing (NLP) has been driven by the emergence of Large Language Models (LLMs), which exhibit remarkable generative and reasoning capabilities. However, despi...
Mikel Zubillaga, Naiara Perez, Oscar Sainz, German Rigau
In the LLM era, Word Sense Induction remains unsolved
In the absence of sense-annotated data, word sense induction (WSI) is a compelling alternative to word sense disambiguation, particularly in low-resource or domain-specific settings. In this paper,...
Anna Mosolova, Marie Candito, Carlos Ramisch
LLMs can construct powerful representations and streamline sample-efficient supervised learning
As real-world datasets become increasingly complex and heterogeneous, supervised learning is often bottlenecked by input representation design. Modeling multimodal data for downstream tasks, such a...
Ilker Demirel, Larry Shi, Zeshan Hussain, David Sontag
From Control to Foresight: Simulation as a New Paradigm for Human-Agent Collaboration
Large Language Models (LLMs) are increasingly used to power autonomous agents for complex, multi-step tasks. However, human-agent interaction remains pointwise and reactive: users approve or correc...
Gaole He, Brian Y. Lim
Compact LABFM: a framework for meshless methods with spectral-like resolving power
Meshless methods are often used in numerical simulations of systems of partial differential equations (PDEs), particularly those which involve complex geometries or free surfaces. Here we present a...
Henry M. Broadley, Steven J. Lind, Jack R. C. King
Machine Learning-Based Analysis of Critical Process Parameters Influencing Product Quality Defects: A Real-World Case Study in Manufacturing
Quality control is an essential operation in manufacturing, ensuring products meet the necessary standards of quality, safety, and reliability. Traditional methods, such as visual inspections, meas...
Sukumaran Rajasekaran, Ebru Turanoglu Bekar, Kanika Gandhi, Sabino Francesco Roselli, Mohan Rajas...
Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge
Multimodal Large Language Models (MLLMs) have been widely adopted as MLLM-as-a-Judges due to their strong alignment with human judgment across various visual tasks. However, most existing judge mod...
Junjie Wu, Xuan Kan, Zihao He, Shunwen Tan, Bo Pan, Kaitai Zhang
Resonate: Reinforcing Text-to-Audio Generation via Online Feedback from Large Audio Language Models
Reinforcement Learning (RL) has become an effective paradigm for enhancing Large Language Models (LLMs) and visual generative models. However, its application in text-to-audio (TTA) generation rema...
Xiquan Li, Junxi Liu, Wenxi Chen, Haina Zhu, Ziyang Ma, Xie Chen
Constraints on Axion-Photon Mixing from Fast Radio Burst Dispersion Measures
Fast radio bursts (FRBs) offer a powerful probe of the ionized Universe through their dispersion measures (DM). While a significant fraction of the DM arises from the intergalactic medium (IGM), th...
Gunalan Muthusami, Gopal Kashyap
Exploring the Viability of Fisher Discriminants in Galaxy Morphology Classification
One of the major challenges in astronomy involves accurately classifying galaxies, particularly distinguishing between different galaxy types. While many complex algorithms have shown strong perfor...
Sazatul Nadhilah Zakaria, Santtosh Muniyandy, John Y. H. Soo
A Hybrid Neural-Assisted Unscented Kalman Filter for Unmanned Ground Vehicle Navigation
Modern autonomous navigation for unmanned ground vehicles relies on different estimators to fuse inertial sensors and GNSS measurements. However, the constant noise covariance matrices often strugg...
Gal Versano, Itzik Klein
Beyond BFS: A Comparative Study of Rooted Spanning Tree Algorithms on GPUs
Rooted spanning trees (RSTs) are a core primitive in parallel graph analytics, underpinning algorithms such as biconnected components and planarity testing. On GPUs, RST construction has traditiona...
Abhijeet Sahu, Srikar Vilas Donur
Chunk-Boundary Artifact in Action-Chunked Generative Policies: A Noise-Sensitive Failure Mechanism
Action chunking has become a central design choice for generative visuomotor policies, yet the execution discontinuities that arise at chunk boundaries remain poorly understood. In a frozen pretrai...
Rui Wang
Tokenization Allows Multimodal Large Language Models to Understand, Generate and Edit Architectural Floor Plans
Architectural floor plan design demands joint reasoning over geometry, semantics, and spatial hierarchy, which remains a major challenge for current AI systems. Although recent diffusion and langua...
Sizhong Qin, Ramon Elias Weber, Xinzheng Lu
Learnable Template Matching Approach for Micro-Deformation Monitoring based on Integrated Sensing and Communication Platform
Existing integrated sensing and communication (ISAC) platforms fail to fully utilize the shared spectrum and aperture resources for sensing, resulting in poor sensing performance. Specifically, wea...
Zhuoyang Liu, Yixiang Luomei, Feng Xu
Double-twisted surface spectrum from hybridized Majorana Kramers pairs and wallpaper fermions
We theoretically investigate the superconducting surface states of wallpaper fermions, which are surface quasiparticles of topological nonsymmorphic crystalline insulators protected by a wallpaper ...
Kaito Yoda, Ai Yamakage
The Density of Cross-Persistence Diagrams and Its Applications
Topological Data Analysis (TDA) provides powerful tools to explore the shape and structure of data through topological features such as clusters, loops, and voids. Persistence diagrams are a corner...
Alexander Mironenko, Evgeny. Burnaev, Serguei Barannikov
Sema: A High-performance System for LLM-based Semantic Query Processing
The integration of Large Language Models (LLMs) into data analytics has unlocked powerful capabilities for reasoning over bulk structured and unstructured data. However, existing systems typically ...
Kangkang Qi, Dongyang Xie, Wenbo Li, Hao Zhang, Yuanyuan Zhu, Jeffrey Xu Yu, Kangfei Zhao
Taming OpenClaw: Security Analysis and Mitigation of Autonomous LLM Agent Threats
Autonomous Large Language Model (LLM) agents, exemplified by OpenClaw, demonstrate remarkable capabilities in executing complex, long-horizon tasks. However, their tightly coupled instant-messaging...
Xinhao Deng, Yixiang Zhang, Jiaqing Wu, Jiaqi Bai, Sibo Yi, Zhuoheng Zou, Yue Xiao, Rennai Qiu, J...