Papers
Research papers from arXiv and related sources
Surgical Post-Training: Cutting Errors, Keeping Knowledge
Enhancing the reasoning capabilities of Large Language Models (LLMs) via post-training is often constrained by the trade-off between efficiency and catastrophic forgetting. While prior research emp...
Wenye Lin, Kai Han
$\mathcal{H}$-EFTCAMB: A Cobaya-Integrated, Python-Wrapped Extension of EFTCAMB for Covariant Horndeski Gravity
We present $\mathcal{H}\mathtt{-EFTCAMB}$, the official successor to $\mathtt{EFTCAMB}$. The original $\mathtt{EFTCAMB}$ is designed as a consistent and numerically stable implementation of the eff...
Gen Ye, Shijie Lin, Jiaming Pan, Dani de Boe, Stan Verhoeve, Marco Raveri, Bin Hu, Noemi Fruscian...
HeRo: Adaptive Orchestration of Agentic RAG on Heterogeneous Mobile SoC
With the increasing computational capability of mobile devices, deploying agentic retrieval-augmented generation (RAG) locally on heterogeneous System-on-Chips (SoCs) has become a promising way to ...
Maoliang Li, Jiayu Chen, Zihao Zheng, Ziqian Li, Xinhao Sun, Guojie Luo, Chenchen Liu, Xiang Chen
Transform-Invariant Generative Ray Path Sampling for Efficient Radio Propagation Modeling
Ray tracing has become a standard for accurate radio propagation modeling, but suffers from exponential computational complexity, as the number of candidate paths scales with the number of objects ...
Jérome Eertmans, Enrico M. Vitucci, Vittorio Degli-Esposti, Nicola Di Cicco, Laurent Jacques, Cla...
CeProAgents: A Hierarchical Agents System for Automated Chemical Process Development
The development of chemical processes, a cornerstone of chemical engineering, presents formidable challenges due to its multi-faceted nature, integrating specialized knowledge, conceptual design, a...
Yuhang Yang, Ruikang Li, Jifei Ma, Kai Zhang, Qi Liu, Jianyu Han, Yonggan Bu, Jibin Zhou, Defu Li...
LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence
Understanding and predicting judicial outcomes demands nuanced analysis of legal documents. Traditional approaches treat judgments and proceedings as unstructured text, limiting the effectiveness o...
Anka Chandrahas Tummepalli, Preethu Rose Anish
PromptStereo: Zero-Shot Stereo Matching via Structure and Motion Prompts
Modern stereo matching methods have leveraged monocular depth foundation models to achieve superior zero-shot generalization performance. However, most existing methods primarily focus on extractin...
Xianqi Wang, Hao Yang, Hangtian Wang, Junda Cheng, Gangwei Xu, Min Lin, Xin Yang
QCAgent: An agentic framework for quality-controllable pathology report generation from whole slide image
Recent methods for pathology report generation from whole-slide images (WSIs) are capable of producing slide-level diagnostic descriptions but fail to ground fine-grained statements in localized visu...
Rundong Wang, Wei Ba, Ying Zhou, Yingtai Li, Bowen Liu, Baizhi Wang, Yuhao Wang, Zhidong Yang, Ku...
An Investigation of the Relation Between Immersion and Learning Across Three Domains
We investigate the relationship between immersion and learning across three domains (cultural heritage, environmental awareness, and high school physics) through the lens of the Cognitive Affective...
Paolo Boffi, Alberto Gallace, Pier Luca Lanzi
Learning Structured Reasoning via Tractable Trajectory Control
Large language models can exhibit emergent reasoning behaviors, often manifested as recurring lexical patterns (e.g., "wait," indicating verification). However, complex reasoning trajectories remai...
Po-Nien Kung, Zhen Yang, Jeffrey Luo, Cheng-Fu Yang, Haikang Deng, Zi-Yi Dou, Yinfei Yang, Nanyun...
Learning to Draft: Adaptive Speculative Decoding with Reinforcement Learning
Speculative decoding accelerates large language model (LLM) inference by using a small draft model to generate candidate tokens for a larger target model to verify. The efficacy of this technique h...
Jiebin Zhang, Zhenghan Yu, Liang Wang, Nan Yang, Eugene J. Yu, Zheng Li, Yifan Song, Dawei Zhu, X...
Who Explains Privacy Policies to Me? Embodied and Textual LLM-Powered Privacy Assistants in Virtual Reality
Virtual Reality (VR) systems collect fine-grained behavioral and biometric data, yet privacy policies are rarely read or understood due to their complex language, length, and poor integration into ...
Vincent Freiberger, Moritz Dresch, Florian Alt, Arthur Fleig, Viktorija Paneva
Episode-wise spectro-polarimetry of GRB 220107A: Testing the hypothesis of evolving radiation mechanisms
We investigate the spectro-polarimetric properties of the long-duration GRB 220107A, which exhibited two distinct emission episodes separated by a 40 s quiescent gap, to test whether such multi-epi...
Rahul Gupta, Rushikesh Sonawane, Shabnam Iyyani, D. Frederiks, Judith Racusin, Tanmoy Chattopadha...
DeLo: Dual Decomposed Low-Rank Experts Collaboration for Continual Missing Modality Learning
Adapting Large Multimodal Models (LMMs) to real-world scenarios poses the dual challenges of learning from sequential data streams while handling frequent modality incompleteness, a task known as C...
Xiwei Liu, Yulong Li, Feilong Tang, Imran Razzak
SEED-SET: Scalable Evolving Experimental Design for System-level Ethical Testing
As autonomous systems such as drones become increasingly deployed in high-stakes, human-centric domains, it is critical to evaluate the ethical alignment since failure to do so imposes imminent da...
Anjali Parashar, Yingke Li, Eric Yang Yu, Fei Chen, James Neidhoefer, Devesh Upadhyay, Chuchu Fan
Assessing Crime Disclosure Patterns in a Large-Scale Cybercrime Forum
Cybercrime forums play a central role in the cybercrime ecosystem, serving as hubs for the exchange of illicit goods, services, and knowledge. Previous studies have explored the market and social s...
Raphael Hoheisel, Tom Meurs, Jai Wientjes, Marianne Junger, Abhishta Abhishta, Masarah Paquet-Clo...
The Invisibility Hypothesis: Promises of AGI and the Future of the Global South
Discussions surrounding Artificial General Intelligence have largely focused on technical feasibility, timelines, and existential risk, often treating its social impact as being the same across dif...
L. Julian Lechuga Lopez, Luis Lara
Closing the Gap Between Float and Posit Hardware Efficiency
The b-posit, or bounded posit, is a variation of the posit format designed for high performance computing (HPC) and AI applications. Unlike traditional floating-point formats (floats), posits use v...
Aditya Anirudh Jonnalagadda, Rishi Thotli, John L. Gustafson
Mapping properties of the $S$-operator
In this paper, we study the $\ell^p\to \ell^r$ estimates for the $S$-operator arising in restriction problems for spheres over finite fields. We establish a necessary and sufficient condition for t...
Hunseok Kang, Doowon Koh, Changhun Yang
Evaluating and Understanding Scheming Propensity in LLM Agents
As frontier language models are increasingly deployed as autonomous agents pursuing complex, long-term objectives, there is increased risk of scheming: agents covertly pursuing misaligned goals. Pr...
Mia Hopman, Jannes Elstner, Maria Avramidou, Amritanshu Prasad, David Lindner