
SibylSense: Adaptive Rubric Learning via Memory Tuning and Adversarial Probing

Authors

Yifei Xu, Guilherme Potje, Shivam Shandilya, Tiancheng Yuan, Leonardo de Oliveira Nunes, Rakshanda Agarwal, Saeid Asgari, Adam Atkinson, Emre Kıcıman, Songwu Lu, Ranveer Chandra, Tusher Chakraborty

Abstract

Designing aligned and robust rewards for open-ended generation remains a key barrier to RL post-training. Rubrics provide structured, interpretable supervision, but scaling rubric construction is difficult: expert rubrics are costly, prompted rubrics are often superficial or inconsistent, and fixed-pool discriminative rubrics can saturate and drift, enabling reward hacking. We present SibylSense, an inference-time learning approach that adapts a frozen rubric generator through a tunable memory bank of validated rubric items. Memory is updated via verifier-based item rewards, measured as the discriminative gaps between reference and candidate answers over a handful of examples. SibylSense alternates memory tuning with a rubric-adversarial policy update that produces rubric-satisfying candidate answers, shrinking the discriminative gaps and driving the rubric generator to capture new quality dimensions. Experiments on two open-ended tasks show that SibylSense yields more discriminative rubrics and improves downstream RL performance over static and non-adaptive baselines.
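The core selection mechanism the abstract describes — keeping a rubric item only if a verifier scores reference answers noticeably higher on it than candidate answers — can be sketched as follows. This is a minimal illustration, not the paper's implementation; all names (`discriminative_gap`, `MemoryBank`, the verifier signature, and the threshold value) are assumptions for the sketch.

```python
from dataclasses import dataclass, field
from statistics import mean
from typing import Callable, List

# Hypothetical verifier interface: scores how well an answer
# satisfies one rubric item, returning a value in [0, 1].
Verifier = Callable[[str, str], float]

def discriminative_gap(item: str, references: List[str],
                       candidates: List[str], verify: Verifier) -> float:
    """Gap between mean verifier scores on reference vs. candidate answers."""
    ref_score = mean(verify(item, a) for a in references)
    cand_score = mean(verify(item, a) for a in candidates)
    return ref_score - cand_score

@dataclass
class MemoryBank:
    """Tunable memory of validated rubric items (illustrative)."""
    threshold: float = 0.2  # assumed cutoff, not from the paper
    items: List[str] = field(default_factory=list)

    def update(self, proposed: List[str], references: List[str],
               candidates: List[str], verify: Verifier) -> List[str]:
        # Admit only items that separate references from candidates;
        # as the adversarial policy closes these gaps, saturated items
        # stop qualifying and new quality dimensions must be proposed.
        for item in proposed:
            if discriminative_gap(item, references, candidates, verify) >= self.threshold:
                self.items.append(item)
        return self.items
```

Under this sketch, the alternating loop in the abstract would repeatedly (1) run `update` with freshly proposed rubric items, then (2) train the policy against the current `items`, which shrinks their gaps and forces the next round of memory tuning to surface new discriminative items.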

Metadata

arXiv ID: 2602.20751
Provider: ARXIV
Primary Category: cs.CL
Published: 2026-02-24
Fetched: 2026-02-25 06:05
