Learning Inflation Narratives from Reddit: How Lightweight LLMs Reveal Forward-Looking Economic Signals

Authors

Ryuichi Saito, Sho Tsugawa

Abstract

Public perceptions and expectations of inflation shape household spending, wage bargaining, and policy support, making them key determinants of macroeconomic outcomes. However, current measures rely on infrequent surveys and offer limited insight into underlying narratives and sector-specific concerns. This paper presents a novel approach to measuring public perception of inflation, using lightweight large language models (LLMs) fine-tuned on domain-specific Reddit data. We created an inflation classifier trained on posts related to components of the U.S. Consumer Price Index (CPI). When applied to more than 10 years of Reddit discussions (2012-2022), this classifier produces monthly Reddit inflation scores (RIS), which we validated against actual economic indicators. Our results show that fine-tuned lightweight LLMs perform well even with smaller training datasets, and that the Reddit inflation scores correlate strongly with CPI (r = 0.91) and closely align with the University of Michigan Inflation Expectation series (MICH). Importantly, Granger causality tests suggest that social media-based inflation scores often precede movements in both CPI and MICH, indicating their potential as predictive, forward-looking economic signals. Furthermore, change-point and lexical analyses uncovered shifts in inflation-related narratives across sectors such as groceries, transportation, and housing, revealing dimensions of inflation concern that are not directly observable in aggregate price indices. By complementing traditional economic indicators with narrative-rich signals, this study demonstrates how NLP-based measures can support earlier detection of inflationary pressures and more timely policy responses.
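As a rough illustration of the pipeline the abstract describes (per-post classification, monthly aggregation, comparison with an official series), the following Python sketch may help. It is not the authors' implementation. It assumes the monthly score is simply the share of posts a classifier flags as expressing inflation concern, which the abstract does not confirm, and it uses a plain lagged Pearson correlation as a much simpler stand-in for the Granger causality tests mentioned above. The function names `monthly_scores`, `pearson`, and `lagged_correlation` and the toy data are hypothetical.

```python
from collections import defaultdict
from math import sqrt


def monthly_scores(posts):
    """Aggregate (month, label) pairs into one score per month.

    `posts` is an iterable of ("YYYY-MM", label) pairs, where label is 1 if a
    (hypothetical) classifier flagged the post and 0 otherwise.
    Assumption: the score is the share of flagged posts in that month; the
    paper's actual RIS definition may differ.
    """
    flagged = defaultdict(int)
    total = defaultdict(int)
    for month, label in posts:
        flagged[month] += label
        total[month] += 1
    return {month: flagged[month] / total[month] for month in sorted(total)}


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


def lagged_correlation(signal, target, lag):
    """Correlate signal[t] with target[t + lag] for a non-negative lag.

    A high value at lag > 0 means the signal tends to move before the target.
    This is only a descriptive check, not a Granger causality test.
    """
    if lag < 0:
        raise ValueError("lag must be non-negative")
    if lag == 0:
        return pearson(signal, target)
    return pearson(signal[:-lag], target[lag:])


if __name__ == "__main__":
    # Toy data only: labels and the "official" series are made up.
    toy_posts = [
        ("2021-01", 1), ("2021-01", 0),
        ("2021-02", 1), ("2021-02", 1),
        ("2021-03", 0), ("2021-03", 1),
        ("2021-04", 1), ("2021-04", 1),
    ]
    scores = list(monthly_scores(toy_posts).values())
    toy_official = [1.4, 1.5, 2.6, 1.7]
    print(lagged_correlation(scores, toy_official, lag=1))
```

In practice, a formal test on real series would also need stationarity checks and a principled choice of lag length, which this sketch leaves out.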

Metadata

arXiv ID: 2603.21501

Provider: ARXIV

Primary Category: cs.SI

Published: 2026-03-23

Fetched: 2026-03-24 06:02

Additional metadata

Comment: 19 pages, accepted at The 20th International AAAI Conference on Web and Social Media (ICWSM'26)

Updated: 2026-03-23T02:50:06Z

Abstract page: https://arxiv.org/abs/2603.21501v1

PDF: https://arxiv.org/pdf/2603.21501v1