Personal Assistant Web

AI LLM

A Benchmarking Framework for Model Datasets

Empirical and LLM-based research in model-driven engineering increasingly relies on datasets of software models, for instance, to train or evaluate machine learning techniques for modeling support....

Philipp-Lorenz Glaser, Lola Burgueño, Dominik Bork

2603.05250 • 2026-03-05

View PDF

AI LLM

GCAgent: Enhancing Group Chat Communication through Dialogue Agents System

As a key form in online social platforms, group chat is a popular space for interest exchange or problem-solving, but its effectiveness is often hindered by inactivity and management challenges. Wh...

Zijie Meng, Zheyong Xie, Zheyu Ye, Chonggang Lu, Zuozhu Liu, Zihan Niu, Yao Hu, Shaosheng Cao

2603.05240 • 2026-03-05

View PDF

AI LLM

SlideSparse: Fast and Flexible (2N-2):2N Structured Sparsity

NVIDIA's 2:4 Sparse Tensor Cores deliver 2x throughput but demand strict 50% pruning -- a ratio that collapses LLM reasoning accuracy (Qwen3: 54% to 15%). Milder $(2N-2):2N$ patterns (e.g., 6:8, 25...

Hanyong Shao, Yingbo Hao, Ting Song, Yan Xia, Di Zhang, Shaohan Huang, Xun Wu, Songchen Xu, Le Xu...

2603.05232 • 2026-03-05

View PDF

AI LLM

Boosting ASR Robustness via Test-Time Reinforcement Learning with Audio-Text Semantic Rewards

Recently, Automatic Speech Recognition (ASR) systems (e.g., Whisper) have achieved remarkable accuracy improvements but remain highly sensitive to real-world unseen data (data with large distributi...

Linghan Fang, Tianxin Xie, Li Liu

2603.05231 • 2026-03-05

View PDF

AI LLM

Not All Trust is the Same: Effects of Decision Workflow and Explanations in Human-AI Decision Making

A central challenge in AI-assisted decision making is achieving warranted, well-calibrated trust. Both overtrust (accepting incorrect AI recommendations) and undertrust (rejecting correct advice) s...

Laura Spillner, Rachel Ringe, Robert Porzel, Rainer Malaka

2603.05229 • 2026-03-05

View PDF

TESTING

The Geometric Inductive Bias of Grokking: Bypassing Phase Transitions via Architectural Topology

Mechanistic interpretability typically relies on post-hoc analysis of trained networks. We instead adopt an interventional approach: testing hypotheses a priori by modifying architectural topology ...

Alper Yıldırım

2603.05228 • 2026-03-05

View PDF

AI LLM

AI+HW 2035: Shaping the Next Decade

Artificial intelligence (AI) and hardware (HW) are advancing at unprecedented rates, yet their trajectories have become inseparably intertwined. The global research community lacks a cohesive, long...

Deming Chen, Jason Cong, Azalia Mirhoseini, Christos Kozyrakis, Subhasish Mitra, Jinjun Xiong, Cl...

2603.05225 • 2026-03-05

View PDF

TESTING

KARL: Knowledge Agents via Reinforcement Learning

We present a system for training enterprise search agents via reinforcement learning that achieves state-of-the-art performance across a diverse suite of hard-to-verify agentic search tasks. Our wo...

Jonathan D. Chang, Andrew Drozdov, Shubham Toshniwal, Owen Oertell, Alexander Trott, Jacob Portes...

2603.05218 • 2026-03-05

View PDF

AI LLM

Scaling Real-Time Traffic Analytics on Edge-Cloud Fabrics for City-Scale Camera Networks

Real-time city-scale traffic analytics requires processing 100s-1000s of CCTV streams under strict latency, bandwidth, and compute limits. We present a scalable AI-driven Intelligent Transportation...

Akash Sharma, Pranjal Naman, Roopkatha Banerjee, Priyanshu Pansari, Sankalp Gawali, Mayank Arya, ...

2603.05217 • 2026-03-05

View PDF

AI LLM

Core-based Hierarchies for Efficient GraphRAG

Retrieval-Augmented Generation (RAG) enhances large language models by incorporating external knowledge. However, existing vector-based methods often fail on global sensemaking tasks that require r...

Jakir Hossain, Ahmet Erdem Sarıyüce

2603.05207 • 2026-03-05

View PDF

AI LLM

Diffusion LLMs can think EoS-by-EoS

Diffusion LLMs have been proposed as an alternative to autoregressive LLMs, excelling especially at complex reasoning tasks with interdependent sub-goals. Curiously, this is particularly true if th...

Sarah Breckner, Sebastian Schuster

2603.05197 • 2026-03-05

View PDF

AI LLM

Small Changes, Big Impact: Demographic Bias in LLM-Based Hiring Through Subtle Sociocultural Markers in Anonymised Resumes

Large Language Models (LLMs) are increasingly deployed in resume screening pipelines. Although explicit PII (e.g., names) is commonly redacted, resumes typically retain subtle sociocultural markers...

Bryan Chen Zhengyu Tan, Shaun Khoo, Bich Ngoc Doan, Zhengyuan Liu, Nancy F. Chen, Roy Ka-Wei Lee

2603.05189 • 2026-03-05

View PDF

AI LLM

Escaping the Hydrolysis Trap: An Agentic Workflow for Inverse Design of Durable Photocatalytic Covalent Organic Frameworks

Covalent organic frameworks (COFs) are promising photocatalysts for solar hydrogen production, yet the most electronically favorable linkages, imines, hydrolyze rapidly in water, creating a stabili...

Iman Peivaste, Nicolas D. Boscher, Ahmed Makradi, Salim Belouettar

2603.05188 • 2026-03-05

View PDF

AI LLM

Mario: Multimodal Graph Reasoning with Large Language Models

Recent advances in large language models (LLMs) have opened new avenues for multimodal reasoning. Yet, most existing methods still rely on pretrained vision-language models (VLMs) to encode image-t...

Yuanfu Sun, Kang Li, Pengkang Guo, Jiajin Liu, Qiaoyu Tan

2603.05181 • 2026-03-05

View PDF

AI LLM

SWARM-SLR AIssistant: A Unified Framework for Scalable Systematic Literature Review Automation

Despite a growing ecosystem of tools supporting Systematic Literature Reviews (SLRs), integrating them into user-friendly workflows remains challenging. The Streamlined Workflow for Automating Mach...

Tim Wittenborg, Allard Oelen, Manuel Prinz

2603.05177 • 2026-03-05

View PDF

AI LLM

Incentive Aware AI Regulations: A Credal Characterisation

While high-stakes ML applications demand strict regulations, strategic ML providers often evade them to lower development costs. To address this challenge, we cast AI regulation as a mechanism desi...

Anurag Singh, Julian Rodemann, Rajeev Verma, Siu Lun Chau, Krikamol Muandet

2603.05175 • 2026-03-05

View PDF

AI LLM

Guidelines for the Annotation and Visualization of Legal Argumentation Structures in Chinese Judicial Decisions

This guideline proposes a systematic and operational annotation framework for representing the structure of legal argumentation in judicial decisions. Grounded in theories of legal reasoning and ar...

Kun Chen, Xianglei Liao, Kaixue Fei, Yi Xing, Xinrui Li

2603.05171 • 2026-03-05

View PDF

AI LLM

Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity

Semi-structured N:M sparsity and low-bit quantization (e.g., 1.58-bit BitNet) are two promising approaches for improving the efficiency of large language models (LLMs), yet they have largely been s...

Di Zhang, Xun Wu, Shaohan Huang, Yudong Wang, Hanyong Shao, Yingbo Hao, Zewen Chi, Li Dong, Ting ...

2603.05168 • 2026-03-05

View PDF

AI LLM

C2-Faith: Benchmarking LLM Judges for Causal and Coverage Faithfulness in Chain-of-Thought Reasoning

Large language models (LLMs) are increasingly used as judges of chain-of-thought (CoT) reasoning, but it remains unclear whether they can reliably assess process faithfulness rather than just answe...

Avni Mittal, Rauno Arike

2603.05167 • 2026-03-05

View PDF

TESTING

V2N-Based Algorithm and Communication Protocol for Autonomous Non-Stop Intersections

Intersections are critical areas for road safety and traffic efficiency, accounting for a significant portion of vehicle crashes and fatalities. While connected and autonomous vehicle (CAV) technol...

Lorenzo Farina, Lorenzo Mario Amorosa, Marco Rapelli, Barbara Maví Masini, Claudio Casetti, Aless...

2603.05165 • 2026-03-05

View PDF

Papers