Papers
Research papers from arXiv and related sources
KARL: Knowledge Agents via Reinforcement Learning
We present a system for training enterprise search agents via reinforcement learning that achieves state-of-the-art performance across a diverse suite of hard-to-verify agentic search tasks. Our wo...
Jonathan D. Chang, Andrew Drozdov, Shubham Toshniwal, Owen Oertell, Alexander Trott, Jacob Portes...
V2N-Based Algorithm and Communication Protocol for Autonomous Non-Stop Intersections
Intersections are critical areas for road safety and traffic efficiency, accounting for a significant portion of vehicle crashes and fatalities. While connected and autonomous vehicle (CAV) technol...
Lorenzo Farina, Lorenzo Mario Amorosa, Marco Rapelli, Barbara Maví Masini, Claudio Casetti, Aless...
Equivalent Circuit Modeling of Mutually Resistively Coupled Microwave Cavities with Enhanced Phase Sensitivity Using Thin Metallic Foils
We formulate and validate an equivalent circuit model describing mutual resistive coupling between three microwave cavity resonators interconnected via thin metallic foils. Each cavity is represent...
Michael T. Hatzon, Graeme R. Flower, Robert C. Crew, Jeremy F. Bourhill, Michael E. Tobar
Federated Causal Discovery Across Heterogeneous Datasets under Latent Confounding
Causal discovery across multiple datasets is often constrained by data privacy regulations and cross-site heterogeneity, limiting the use of conventional methods that require a single, centralized ...
Maximilian Hahn, Alina Zajak, Dominik Heider, Adèle Helena Ribeiro
ARC-TGI: Human-Validated Task Generators with Reasoning Chain Templates for ARC-AGI
The Abstraction and Reasoning Corpus (ARC-AGI) probes few-shot abstraction and rule induction on small visual grids, but progress is difficult to measure on static collections of hand-authored puzz...
Jens Lehmann, Syeda Khushbakht, Nikoo Salehfard, Nur A Zarin Nishat, Dhananjay Bhandiwad, Andrei ...
TW-Sound580K: A Regional Audio-Text Dataset with Verification-Guided Curation for Localized Audio-Language Modeling
Large Audio-Language Models (LALMs) typically struggle with localized dialectal prosody due to the scarcity of specialized corpora. We present TW-Sound580K, a Taiwanese audio-text instruction datas...
Hao-Hui Xie, Ho-Lam Chung, Yi-Cheng Lin, Ke-Han Lu, Wenze Ren, Xie Chen, Hung-yi Lee
Wire Your Way: Hardware-Contextualized Guidance and In-situ Tests for Personalized Circuit Prototyping
The increasing popularity of microcontroller platforms like Arduino enables diverse end-user developers to participate in circuit prototyping. Traditionally, follow-along tutorials serve as an esse...
Punn Lertjaturaphat, Jungwoo Rhee, Jaewon You, Andrea Bianchi
Constrained Symplectic Quantization: Disclosing the Deterministic Framework Behind Quantum Mechanics
Symplectic quantization is a functional approach to quantum field theory that allows sampling of quantum fluctuations directly in Minkowski space time by means of a generalized Hamiltonian dynamics...
Martina Giachello, Francesco Scardino, Giacomo Gradenigo
A 360-degree Multi-camera System for Blue Emergency Light Detection Using Color Attention RT-DETR and the ABLDataset
This study presents an advanced system for detecting blue lights on emergency vehicles, developed using ABLDataset, a curated dataset that includes images of European emergency vehicles under vario...
Francisco Vacalebri-Lloret, Lucas Banchero, Jose J. Lopez, Jose M. Mossi
MUTEX: Leveraging Multilingual Transformers and Conditional Random Fields for Enhanced Urdu Toxic Span Detection
Urdu toxic span detection remains limited because most existing systems rely on sentence-level classification and fail to identify the specific toxic spans within those text. It is further exacerba...
Inayat Arshad, Fajar Saleem, Ijaz Hussain
Sound Mode and Scale-Dependent Growth in Two-Fluid Dynamical Dark Energy
We investigate the effects of dynamical dark energy (DDE) on the growth of cosmic structure using a two-fluid model. This framework allows the dark energy equation of state to smoothly cross the ph...
Frans van Die, Vincent Desjacques
Exploiting Intermediate Reconstructions in Optical Coherence Tomography for Test-Time Adaption of Medical Image Segmentation
Primary health care frequently relies on low-cost imaging devices, which are commonly used for screening purposes. To ensure accurate diagnosis, these systems depend on advanced reconstruction algo...
Thomas Pinetz, Veit Hucke, Hrvoje Bogunovic
Haptics in Cognition: Disruptor or Enabler of Memory?
This exploratory pilot study investigates the impact of haptic perception --specifically tactile sensitivity (touch) and kinaesthetic intensity (movement)-- on learning, operationalized as informat...
Bibeg Limbu, Irene-Angelica Chounta
Observational and Thermodynamic aspects of one-dimensional Dark Energy EoS parametrization models
We examine the observational viability and physical implications of the Gong-Zhang (GZ) dark--energy equation-of-state parametrizations using exclusively late-time cosmological probes. Two one-dime...
Anirban Chatterjee, Yungui Gong
Poisoning the Inner Prediction Logic of Graph Neural Networks for Clean-Label Backdoor Attacks
Graph Neural Networks (GNNs) have achieved remarkable results in various tasks. Recent studies reveal that graph backdoor attacks can poison the GNN model to predict test nodes with triggers attach...
Yuxiang Zhang, Bin Ma, Enyan Dai
Think, Then Verify: A Hypothesis-Verification Multi-Agent Framework for Long Video Understanding
Long video understanding is challenging due to dense visual redundancy, long-range temporal dependencies, and the tendency of chain-of-thought and retrieval-based agents to accumulate semantic drif...
Zheng Wang, Haoran Chen, Haoxuan Qin, Zhipeng Wei, Tianwen Qian, Cong Bai
Gravitational instantons from closed superstring field theory
We test exact marginality of the deformation describing the resolution of a $\mathbb{Z}_2$ orbifold by analyzing the closed superstring equations of motion to third order in the size, including $α'...
Ivo Sachs, Xianghang Zhang
Differential Privacy in Two-Layer Networks: How DP-SGD Harms Fairness and Robustness
Differentially private learning is essential for training models on sensitive data, but empirical studies consistently show that it can degrade performance, introduce fairness issues like disparate...
Ruichen Xu, Kexin Chen
Causally Robust Reward Learning from Reason-Augmented Preference Feedback
Preference-based reward learning is widely used for shaping agent behavior to match a user's preference, yet its sparse binary feedback makes it especially vulnerable to causal confusion. The learn...
Minjune Hwang, Yigit Korkmaz, Daniel Seita, Erdem Bıyık
LLM-Grounded Explainability for Port Congestion Prediction via Temporal Graph Attention Networks
Port congestion at major maritime hubs disrupts global supply chains, yet existing prediction systems typically prioritize forecasting accuracy without providing operationally interpretable explana...
Zhiming Xue, Yujue Wang