Papers
Research papers from arXiv and related sources
Video Detector: A Dual-Phase Vision-Based System for Real-Time Traffic Intersection Control and Intelligent Transportation Analysis
Urban traffic management increasingly requires intelligent sensing systems capable of adapting to dynamic traffic conditions without costly infrastructure modifications. Vision-based vehicle detect...
Mustafa Fatih Şen, Halûk Gümüşkaya, Şenol Pazar
vPET-ABC: Fast Voxelwise Approximate Bayesian Inference for Kinetic Modeling in PET
Dynamic PET kinetic modeling increasingly demands voxelwise uncertainty quantification and robust model selection. Yet total-body PET (TB-PET) data volumes make conventional Bayesian approaches, su...
Qinlin Gu, Gaelle M. Emvalomenos, Evan D. Morris, Clara Grazian, Steven R. Meikle
PCodeTrans: Translate Decompiled Pseudocode to Compilable and Executable Equivalent
Decompilation is foundational to binary analysis, yet conventional tools prioritize human readability over strict recompilability and verifiable runtime correctness. While recent LLM-based approach...
Yuxin Cui, Zeyu Gao, Shuxian He, Siliang Qin, Chao Zhang
Counterexample Guided Branching via Directional Relaxation Analysis in Complete Neural Network Verification
Deep Neural Networks demonstrate exceptional performance but remain vulnerable to adversarial perturbations, necessitating formal verification for safety-critical deployment. To address the computa...
Jingyang Li, Fu Song, Guoqiang Li
SimCert: Probabilistic Certification for Behavioral Similarity in Deep Neural Network Compression
Deploying Deep Neural Networks (DNNs) on resource-constrained embedded systems requires aggressive model compression techniques like quantization and pruning. However, ensuring that the compressed ...
Jingyang Li, Fu Song, Guoqiang Li
Universe Routing: Why Self-Evolving Agents Need Epistemic Control
A critical failure mode of current lifelong agents is not lack of knowledge, but the inability to decide how to reason. When an agent encounters "Is this coin fair?" it must recognize whether to in...
Zhaohui Geoffrey Wang
Face-to-Face: A Video Dataset for Multi-Person Interaction Modeling
Modeling the reactive tempo of human conversation remains difficult because most audio-visual datasets portray isolated speakers delivering short monologues. We introduce \textbf{Face-to-Face with ...
Ernie Chu, Vishal M. Patel
CORAL: COntextual Reasoning And Local Planning in A Hierarchical VLM Framework for Underwater Monitoring
Oyster reefs are critical ecosystem species that sustain biodiversity, filter water, and protect coastlines, yet they continue to decline globally. Restoring these ecosystems requires regular under...
Zhenqi Wu, Yuanjie Lu, Xuesu Xiao, Xiaomin Lin
$p^2$RAG: Privacy-Preserving RAG Service Supporting Arbitrary Top-$k$ Retrieval
Retrieval-Augmented Generation (RAG) enables large language models to use external knowledge, but outsourcing the RAG service raises privacy concerns for both data owners and users. Privacy-preserv...
Yulong Ming, Mingyue Wang, Jijia Yang, Cong Wang, Xiaohua Jia
Investigating the Impact of Speech Enhancement on Audio Deepfake Detection in Noisy Environments
Logical Access (LA) attacks, also known as audio deepfake attacks, use Text-to-Speech (TTS) or Voice Conversion (VC) methods to generate spoofed speech data. This can represent a serious threat to ...
Anacin, Angela, Shruti Kshirsagar, Anderson R. Avila
Online Learning for Supervisory Switching Control
We study supervisory switching control for partially-observed linear dynamical systems. The objective is to identify and deploy the best controller for the unknown system by periodically selecting ...
Haoyuan Sun, Ali Jadbabaie
The Rise of Null Hypothesis Significance Testing (NHST): Institutional Massification and the Emergence of a Procedural Epistemology
It has long been a puzzle why, despite sustained reform efforts, many applied scientific fields remain dominated by Null Hypothesis Significance Testing (NHST), a framework that dichotomizes study ...
Carol Ting
Towards Privacy-Preserving Machine Translation at the Inference Stage: A New Task and Benchmark
Current online translation services require sending user text to cloud servers, posing a risk of privacy leakage when the text contains sensitive information. This risk hinders the application of o...
Wei Shao, Lemao Liu, Yinqiao Li, Guoping Huang, Shuming Shi, Linqi Song
A Skill-augmented Agentic Framework and Benchmark for Multi-Video Understanding
Multimodal Large Language Models have achieved strong performance in single-video understanding, yet their ability to reason across multiple videos remains limited. Existing approaches typically co...
Yue Zhang, Liqiang Jing, Jia Li, Yapeng Tian, Xinya Du, Yunhui Guo, Vibhav Gogate
GameUIAgent: An LLM-Powered Framework for Automated Game UI Design with Structured Intermediate Representation
Game UI design requires consistent visual assets across rarity tiers yet remains a predominantly manual process. We present GameUIAgent, an LLM-powered agentic framework that translates natural lan...
Wei Zeng, Fengwei An, Zhen Liu, Jian Zhao
Multimodal Deep Learning for Early Prediction of Patient Deterioration in the ICU: Integrating Time-Series EHR Data with Clinical Notes
Early identification of patients at risk for clinical deterioration in the intensive care unit (ICU) remains a critical challenge. Delayed recognition of impending adverse events, including mortali...
Binesh Sadanandan
Quantum-Kinetic Dark Energy (QKDE): An effective dark energy framework with a covariantly completed time-dependent scalar kinetic normalization
A minimal effective dark-energy framework - Quantum-Kinetic Dark Energy (QKDE) - is developed in which the scalar kinetic normalization carries a slow background time dependence through a covariant...
Daniel Brown
Visual Confused Deputy: Exploiting and Defending Perception Failures in Computer-Using Agents
Computer-using agents (CUAs) act directly on graphical user interfaces, yet their perception of the screen is often unreliable. Existing work largely treats these failures as performance limitation...
Xunzhuo Liu, Bowei He, Xue Liu, Andy Luo, Haichen Zhang, Huamin Chen
AdapterTune: Zero-Initialized Low-Rank Adapters for Frozen Vision Transformers
Frozen-backbone transfer with Vision Transformers faces two under-addressed issues: optimization instability when adapters are naively inserted into a fixed feature extractor, and the absence of pr...
Salim Khazem
Beyond Local Code Optimization: Multi-Agent Reasoning for Software System Optimization
Large language models and AI agents have recently shown promise in automating software performance optimization, but existing approaches predominantly rely on local, syntax-driven code transformati...
Huiyun Peng, Parth Vinod Patil, Antonio Zhong Qiu, George K. Thiruvathukal, James C. Davis