Research

Paper

AI LLM March 11, 2026

Pneuma-Seeker: A Relational Reification Mechanism to Align AI Agents with Human Work over Relational Data

Authors

Muhammad Imam Luthfi Balaka, John Hillesland, Kemal Badur, Raul Castro Fernandez

Abstract

When faced with data problems, many data workers cannot articulate their information need precisely enough for software to help. Although LLMs interpret natural-language requests, they behave brittly when intent is under-specified, e.g., hallucinating fields, assuming join paths, or producing ungrounded answers. We present Pneuma-Seeker, a system built around a central idea: relational reification. Pneuma-Seeker represents a user's evolving information need as a relational schema: a concrete, analysis-ready data model shared between user and system. Rather than answering prompts directly, Pneuma-Seeker iteratively refines this schema, then discovers and prepares relevant sources to construct a relation and executable program that compute the answer. Pneuma-Seeker employs an LLM-powered agentic architecture with conductor-style planning and macro- and micro-level context management to operate effectively over heterogeneous relational corpora. We evaluate Pneuma-Seeker across multiple domains against state-of-the-art academic and industrial baselines, demonstrating higher answer accuracy. Deployment in a real organization highlights trust and inspectability as essential requirements for LLM-mediated data systems.

Metadata

arXiv ID: 2603.10747
Provider: ARXIV
Primary Category: cs.DB
Published: 2026-03-11
Fetched: 2026-03-12 04:21

Related papers

Raw Data (Debug)
{
  "raw_xml": "<entry>\n    <id>http://arxiv.org/abs/2603.10747v1</id>\n    <title>Pneuma-Seeker: A Relational Reification Mechanism to Align AI Agents with Human Work over Relational Data</title>\n    <updated>2026-03-11T13:20:16Z</updated>\n    <link href='https://arxiv.org/abs/2603.10747v1' rel='alternate' type='text/html'/>\n    <link href='https://arxiv.org/pdf/2603.10747v1' rel='related' title='pdf' type='application/pdf'/>\n    <summary>When faced with data problems, many data workers cannot articulate their information need precisely enough for software to help. Although LLMs interpret natural-language requests, they behave brittly when intent is under-specified, e.g., hallucinating fields, assuming join paths, or producing ungrounded answers.\n  We present Pneuma-Seeker, a system built around a central idea: relational reification. Pneuma-Seeker represents a user's evolving information need as a relational schema: a concrete, analysis-ready data model shared between user and system. Rather than answering prompts directly, Pneuma-Seeker iteratively refines this schema, then discovers and prepares relevant sources to construct a relation and executable program that compute the answer. Pneuma-Seeker employs an LLM-powered agentic architecture with conductor-style planning and macro- and micro-level context management to operate effectively over heterogeneous relational corpora.\n  We evaluate Pneuma-Seeker across multiple domains against state-of-the-art academic and industrial baselines, demonstrating higher answer accuracy. Deployment in a real organization highlights trust and inspectability as essential requirements for LLM-mediated data systems.</summary>\n    <category scheme='http://arxiv.org/schemas/atom' term='cs.DB'/>\n    <published>2026-03-11T13:20:16Z</published>\n    <arxiv:primary_category term='cs.DB'/>\n    <author>\n      <name>Muhammad Imam Luthfi Balaka</name>\n    </author>\n    <author>\n      <name>John Hillesland</name>\n    </author>\n    <author>\n      <name>Kemal Badur</name>\n    </author>\n    <author>\n      <name>Raul Castro Fernandez</name>\n    </author>\n  </entry>"
}