Research

Paper

TESTING March 12, 2026

Beyond BFS: A Comparative Study of Rooted Spanning Tree Algorithms on GPUs

Authors

Abhijeet Sahu, Srikar Vilas Donur

Abstract

Rooted spanning trees (RSTs) are a core primitive in parallel graph analytics, underpinning algorithms such as biconnected components and planarity testing. On GPUs, RST construction has traditionally relied on breadth-first search (BFS) due to its simplicity and work efficiency. However, BFS incurs an O(D) step complexity, which severely limits parallelism on high-diameter and power-law graphs. We present a comparative study of alternative RST construction strategies on modern GPUs. We introduce a GPU adaptation of the Path Reversal RST (PR-RST) algorithm, optimizing its pointer-jumping and broadcast operations for modern GPU architecture. In addition, we evaluate an integrated approach that combines a state-of-the-art connectivity framework (GConn) with Eulerian tour-based rooting. Across more than 10 real-world graphs, our results show that the GConn-based approach achieves up to 300x speedup over optimized BFS on high-diameter graphs. These findings indicate that the O(log n) step complexity of connectivity-based methods can outweigh their structural overhead on modern hardware, motivating a rethinking of RST construction in GPU graph analytics.

Metadata

arXiv ID: 2603.11645
Provider: ARXIV
Primary Category: cs.DC
Published: 2026-03-12
Fetched: 2026-03-13 06:02

Related papers

Raw Data (Debug)
{
  "raw_xml": "<entry>\n    <id>http://arxiv.org/abs/2603.11645v1</id>\n    <title>Beyond BFS: A Comparative Study of Rooted Spanning Tree Algorithms on GPUs</title>\n    <updated>2026-03-12T08:13:36Z</updated>\n    <link href='https://arxiv.org/abs/2603.11645v1' rel='alternate' type='text/html'/>\n    <link href='https://arxiv.org/pdf/2603.11645v1' rel='related' title='pdf' type='application/pdf'/>\n    <summary>Rooted spanning trees (RSTs) are a core primitive in parallel graph analytics, underpinning algorithms such as biconnected components and planarity testing. On GPUs, RST construction has traditionally relied on breadth-first search (BFS) due to its simplicity and work efficiency. However, BFS incurs an O(D) step complexity, which severely limits parallelism on high-diameter and power-law graphs. We present a comparative study of alternative RST construction strategies on modern GPUs. We introduce a GPU adaptation of the Path Reversal RST (PR-RST) algorithm, optimizing its pointer-jumping and broadcast operations for modern GPU architecture. In addition, we evaluate an integrated approach that combines a state-of-the-art connectivity framework (GConn) with Eulerian tour-based rooting. Across more than 10 real-world graphs, our results show that the GConn-based approach achieves up to 300x speedup over optimized BFS on high-diameter graphs. These findings indicate that the O(log n) step complexity of connectivity-based methods can outweigh their structural overhead on modern hardware, motivating a rethinking of RST construction in GPU graph analytics.</summary>\n    <category scheme='http://arxiv.org/schemas/atom' term='cs.DC'/>\n    <category scheme='http://arxiv.org/schemas/atom' term='cs.DS'/>\n    <published>2026-03-12T08:13:36Z</published>\n    <arxiv:primary_category term='cs.DC'/>\n    <author>\n      <name>Abhijeet Sahu</name>\n    </author>\n    <author>\n      <name>Srikar Vilas Donur</name>\n    </author>\n  </entry>"
}