Papers
Research papers from arXiv and related sources
How Open is Open TTS? A Practical Evaluation of Open Source TTS Tools for Romanian
Open-source text-to-speech (TTS) frameworks have emerged as highly adaptable platforms for developing speech synthesis systems across a wide range of languages. However, their applicability is not ...
Teodora Răgman, Adrian Bogdan Stânea, Horia Cucu, Adriana Stan
Granular Ball Guided Stable Latent Domain Discovery for Domain-General Crowd Counting
Single-source domain generalization for crowd counting remains highly challenging because a single labeled source domain often contains heterogeneous latent domains, while test data may exhibit sev...
Fan Chen, Shuyin Xia, Yi Wang, Xinbo Gao
Predicting Grain Growth Evolution Under Complex Thermal Profiles with Deep Learning through Thermal Descriptor Modulation
Predicting microstructure evolution during thermomechanical treatment is essential for determining the final mechanical properties of a material, yet conventional simulations based on Partial Diffe...
Pungponhavoan Tep, Marc Bernacki
Hierarchical Spatial-Temporal Graph-Enhanced Model for Map-Matching
The integration of GNSS data into portable devices has led to the generation of vast amounts of trajectory data, which is crucial for applications such as map-matching. To tackle the limitations of...
Anjun Gao, Zhenglin Wan, Pingfu Chao, Shunyu Yao
Minimal Sufficient Representations for Self-interpretable Deep Neural Networks
Deep neural networks (DNNs) achieve remarkable predictive performance but remain difficult to interpret, largely due to overparameterization that obscures the minimal structure required for interpr...
Zhiyao Tan, Liu Li, Huazhen Lin
Blind Quality Enhancement for G-PCC Compressed Dynamic Point Clouds
Point cloud compression often introduces noticeable reconstruction artifacts, which makes quality enhancement necessary. Existing approaches typically assume prior knowledge of the distortion level...
Tian Guo, Hui Yuan, Chang Sun, Wei Zhang, Raouf Hamzaoui, Sam Kwong
Sensing-Assisted Adaptive Beam Probing with Calibrated Multimodal Priors and Uncertainty-Aware Scheduling
Highly directional mmWave/THz links require rapid beam alignment, yet exhaustive codebook sweeps incur prohibitive training overhead. This letter proposes a sensing-assisted adaptive probing policy...
Abidemi Orimogunje, Vukan Ninkovic, Ognjen Kundacina, Hyunwoo Park, Sunwoo Kim, Dejan Vukobratovi...
Precision Tests of Isospin Symmetry through Coulomb excitation of A = 62 Nuclei
Isospin symmetry in the $A=62$ mass system was investigated through Coulomb excitation reactions at the RIKEN Radioactive Isotope Beam Factory. Beams of $^{62}$Zn, $^{62}$Ga, and $^{62}$Ge were stu...
K. Wimmer, T. Hüyük, S. M. Lenzi, A. Poves, F. Browne, P. Doornenbal, T. Koiwai, T. Arici, M. A. ...
COVTrack++: Learning Open-Vocabulary Multi-Object Tracking from Continuous Videos via a Synergistic Paradigm
Multi-Object Tracking (MOT) has traditionally focused on a few specific categories, restricting its applicability to real-world scenarios involving diverse objects. Open-Vocabulary Multi-Object Tra...
Zekun Qian, Wei Feng, Ruize Han, Junhui Hou
Gravitational mass generation and consistent non-minimal couplings: cubics and quartics of a massive vector
An attempt to evade the strict uniqueness of consistent interactions involving spin-2 particles is made by modifying the Noether procedure from the outset. A vector field is introduced, coupled to ...
Carlo Marzo
UW-VOS: A Large-Scale Dataset for Underwater Video Object Segmentation
Underwater Video Object Segmentation (VOS) is essential for marine exploration, yet open-air methods suffer significant degradation due to color distortion, low contrast, and prevalent camouflage. ...
Hongshen Zhao, Jingkang Tai, Yuhang Wu, Wenkang Zhang, Xi Lan, Shangyan Wang, Tianyu Zhang, Wanko...
MonoSIM: An open source SIL framework for Ackermann Vehicular Systems with Monocular Vision
This paper presents an open-source Software-in-the-Loop (SIL) simulation platform designed for autonomous Ackerman vehicle research and education. The proposed framework focuses on simplicity, whil...
Shantanu Rahman, Nayeb Hasin, Mainul Islam, Md. Zubair Alom Rony, Golam Sarowar
Variable-Length Audio Fingerprinting
Audio fingerprinting converts audio to much lower-dimensional representations, allowing distorted recordings to still be recognized as their originals through similar fingerprints. Existing deep le...
Hongjie Chen, Hanyu Meng, Huimin Zeng, Ryan A. Rossi, Lie Lu, Josh Kimball
An Empirical Analysis of Google Play Data Safety Disclosures: A Consistency Study of Privacy Indicators in Mobile Gaming Apps
The Google Play marketplace has introduced the Data Safety section to improve transparency regarding how mobile applications (apps) collect, share, and protect user data. This mechanism requires de...
Bakheet Aljedaani
A cube dismantling problem related to bootstrap percolation
An $n\times n\times\dots\times n$ hypercube is made from $n^d$ unit hypercubes. Two unit hypercubes are neighbours if they share a $(d-1)$-dimensional face. In each step of a dismantling process, w...
János Barát, Ian M. Wanless
SM-Net: Learning a Continuous Spectral Manifold from Multiple Stellar Libraries
We present SM-Net, a machine-learning model that learns a continuous spectral manifold from multiple high-resolution stellar libraries. SM-Net generates stellar spectra directly from the fundamenta...
Omar Anwar, Aaron S. G. Robotham, Luca Cortese, Kevin Vinsen
Off-Policy Safe Reinforcement Learning with Constrained Optimistic Exploration
When safety is formulated as a limit of cumulative cost, safe reinforcement learning (RL) aims to learn policies that maximize return subject to the cost constraint in data collection and deploymen...
Guopeng Li, Matthijs T. J. Spaan, Julian F. P. Kooij
The Luna Bound Propagator for Formal Analysis of Neural Networks
The parameterized CROWN analysis, a.k.a., alpha-CROWN, has emerged as a practically successful bound propagation method for neural network verification. However, existing implementations of alpha-C...
Henry LeCates, Haoze Wu
Joint Source-Channel-Check Coding with HARQ for Reliable Semantic Communications
Semantic communication has emerged as a promising paradigm for improving transmission efficiency and task-level reliability, yet most existing reliability-enhancement approaches rely on retransmiss...
Boyuan Li, Shuoyao Wang, Suzhi Bi, Liping Qian, Yunlong Cai
MLE-UVAD: Minimal Latent Entropy Autoencoder for Fully Unsupervised Video Anomaly Detection
In this paper, we address the challenging problem of single-scene, fully unsupervised video anomaly detection (VAD), where raw videos containing both normal and abnormal events are used directly fo...
Yuang Geng, Junkai Zhou, Kang Yang, Pan He, Zhuoyang Zhou, Jose C. Principe, Joel Harley, Ivan Ru...