Personal Assistant Web

TESTING

KARL: Knowledge Agents via Reinforcement Learning

We present a system for training enterprise search agents via reinforcement learning that achieves state-of-the-art performance across a diverse suite of hard-to-verify agentic search tasks. Our wo...

Jonathan D. Chang, Andrew Drozdov, Shubham Toshniwal, Owen Oertell, Alexander Trott, Jacob Portes...

2603.05218 • 2026-03-05

V2N-Based Algorithm and Communication Protocol for Autonomous Non-Stop Intersections

Intersections are critical areas for road safety and traffic efficiency, accounting for a significant portion of vehicle crashes and fatalities. While connected and autonomous vehicle (CAV) technol...

Lorenzo Farina, Lorenzo Mario Amorosa, Marco Rapelli, Barbara Maví Masini, Claudio Casetti, Aless...

2603.05165 • 2026-03-05

Equivalent Circuit Modeling of Mutually Resistively Coupled Microwave Cavities with Enhanced Phase Sensitivity Using Thin Metallic Foils

We formulate and validate an equivalent circuit model describing mutual resistive coupling between three microwave cavity resonators interconnected via thin metallic foils. Each cavity is represent...

Michael T. Hatzon, Graeme R. Flower, Robert C. Crew, Jeremy F. Bourhill, Michael E. Tobar

2603.05150 • 2026-03-05

Federated Causal Discovery Across Heterogeneous Datasets under Latent Confounding

Causal discovery across multiple datasets is often constrained by data privacy regulations and cross-site heterogeneity, limiting the use of conventional methods that require a single, centralized ...

Maximilian Hahn, Alina Zajak, Dominik Heider, Adèle Helena Ribeiro

2603.05149 • 2026-03-05

ARC-TGI: Human-Validated Task Generators with Reasoning Chain Templates for ARC-AGI

The Abstraction and Reasoning Corpus (ARC-AGI) probes few-shot abstraction and rule induction on small visual grids, but progress is difficult to measure on static collections of hand-authored puzz...

Jens Lehmann, Syeda Khushbakht, Nikoo Salehfard, Nur A Zarin Nishat, Dhananjay Bhandiwad, Andrei ...

2603.05099 • 2026-03-05

TW-Sound580K: A Regional Audio-Text Dataset with Verification-Guided Curation for Localized Audio-Language Modeling

Large Audio-Language Models (LALMs) typically struggle with localized dialectal prosody due to the scarcity of specialized corpora. We present TW-Sound580K, a Taiwanese audio-text instruction datas...

Hao-Hui Xie, Ho-Lam Chung, Yi-Cheng Lin, Ke-Han Lu, Wenze Ren, Xie Chen, Hung-yi Lee

2603.05094 • 2026-03-05

Wire Your Way: Hardware-Contextualized Guidance and In-situ Tests for Personalized Circuit Prototyping

The increasing popularity of microcontroller platforms like Arduino enables diverse end-user developers to participate in circuit prototyping. Traditionally, follow-along tutorials serve as an esse...

Punn Lertjaturaphat, Jungwoo Rhee, Jaewon You, Andrea Bianchi

2603.05085 • 2026-03-05

Constrained Symplectic Quantization: Disclosing the Deterministic Framework Behind Quantum Mechanics

Symplectic quantization is a functional approach to quantum field theory that allows sampling of quantum fluctuations directly in Minkowski space time by means of a generalized Hamiltonian dynamics...

Martina Giachello, Francesco Scardino, Giacomo Gradenigo

2603.05072 • 2026-03-05

A 360-degree Multi-camera System for Blue Emergency Light Detection Using Color Attention RT-DETR and the ABLDataset

This study presents an advanced system for detecting blue lights on emergency vehicles, developed using ABLDataset, a curated dataset that includes images of European emergency vehicles under vario...

Francisco Vacalebri-Lloret, Lucas Banchero, Jose J. Lopez, Jose M. Mossi

2603.05058 • 2026-03-05

MUTEX: Leveraging Multilingual Transformers and Conditional Random Fields for Enhanced Urdu Toxic Span Detection

Urdu toxic span detection remains limited because most existing systems rely on sentence-level classification and fail to identify the specific toxic spans within the text. It is further exacerba...

Inayat Arshad, Fajar Saleem, Ijaz Hussain

2603.05057 • 2026-03-05

Sound Mode and Scale-Dependent Growth in Two-Fluid Dynamical Dark Energy

We investigate the effects of dynamical dark energy (DDE) on the growth of cosmic structure using a two-fluid model. This framework allows the dark energy equation of state to smoothly cross the ph...

Frans van Die, Vincent Desjacques

2603.05049 • 2026-03-05

Exploiting Intermediate Reconstructions in Optical Coherence Tomography for Test-Time Adaption of Medical Image Segmentation

Primary health care frequently relies on low-cost imaging devices, which are commonly used for screening purposes. To ensure accurate diagnosis, these systems depend on advanced reconstruction algo...

Thomas Pinetz, Veit Hucke, Hrvoje Bogunovic

2603.05041 • 2026-03-05

Haptics in Cognition: Disruptor or Enabler of Memory?

This exploratory pilot study investigates the impact of haptic perception, specifically tactile sensitivity (touch) and kinaesthetic intensity (movement), on learning, operationalized as informat...

Bibeg Limbu, Irene-Angelica Chounta

2603.05019 • 2026-03-05

Observational and Thermodynamic aspects of one-dimensional Dark Energy EoS parametrization models

We examine the observational viability and physical implications of the Gong-Zhang (GZ) dark-energy equation-of-state parametrizations using exclusively late-time cosmological probes. Two one-dime...

Anirban Chatterjee, Yungui Gong

2603.05009 • 2026-03-05

Poisoning the Inner Prediction Logic of Graph Neural Networks for Clean-Label Backdoor Attacks

Graph Neural Networks (GNNs) have achieved remarkable results in various tasks. Recent studies reveal that graph backdoor attacks can poison the GNN model to predict test nodes with triggers attach...

Yuxiang Zhang, Bin Ma, Enyan Dai

2603.05004 • 2026-03-05

Think, Then Verify: A Hypothesis-Verification Multi-Agent Framework for Long Video Understanding

Long video understanding is challenging due to dense visual redundancy, long-range temporal dependencies, and the tendency of chain-of-thought and retrieval-based agents to accumulate semantic drif...

Zheng Wang, Haoran Chen, Haoxuan Qin, Zhipeng Wei, Tianwen Qian, Cong Bai

2603.04977 • 2026-03-05

Gravitational instantons from closed superstring field theory

We test exact marginality of the deformation describing the resolution of a $\mathbb{Z}_2$ orbifold by analyzing the closed superstring equations of motion to third order in the size, including $α'...

Ivo Sachs, Xianghang Zhang

2603.04953 • 2026-03-05

Differential Privacy in Two-Layer Networks: How DP-SGD Harms Fairness and Robustness

Differentially private learning is essential for training models on sensitive data, but empirical studies consistently show that it can degrade performance, introduce fairness issues like disparate...

Ruichen Xu, Kexin Chen

2603.04881 • 2026-03-05

Causally Robust Reward Learning from Reason-Augmented Preference Feedback

Preference-based reward learning is widely used for shaping agent behavior to match a user's preference, yet its sparse binary feedback makes it especially vulnerable to causal confusion. The learn...

Minjune Hwang, Yigit Korkmaz, Daniel Seita, Erdem Bıyık

2603.04861 • 2026-03-05

LLM-Grounded Explainability for Port Congestion Prediction via Temporal Graph Attention Networks

Port congestion at major maritime hubs disrupts global supply chains, yet existing prediction systems typically prioritize forecasting accuracy without providing operationally interpretable explana...

Zhiming Xue, Yujue Wang

2603.04818 • 2026-03-05
