Research

Paper

TESTING February 23, 2026

GPU-Native Compressed Neighbor Lists with a Space-Filling-Curve Data Layout

Authors

Felix Thaler, Sebastian Keller

Abstract

We have developed a compressed neighbor list for short-range particle-particle interaction based on a space- filling curve (SFC) memory layout and particle clusters. The neighbor list can be constructed efficiently on GPUs, supporting NVIDIA and AMD hardware, and has a memory footprint of only 4 bytes per particle to store approximately 200 neighbors. Compared to the highly-optimized domain-specific neighbor list implementation of GROMACS, a molecular dynamics code, it has a comparable cluster overhead and delivers similar performance in a neighborhood pass. Thanks to the SFC-based data layout and the support for varying interaction radii per particle, our neighbor list performs well for systems with high density contrasts, such as those encountered in many astrophysical and cosmological applications. Due to the close relation between SFCs and octrees, our neighbor list seamlessly integrates with octree-based domain decomposition and multipole-based methods for long-range gravitational or electrostatic interactions. To demonstrate the coupling between long- and short-range forces, we simulate an Evrard collapse, a standard test case for the coupling between hydrodynamical and gravitational forces, on up to 1024 GPUs, and compare our results to the analytical solution.

Metadata

arXiv ID: 2602.19873
Provider: ARXIV
Primary Category: cs.CE
Published: 2026-02-23
Fetched: 2026-02-24 04:38

Related papers

Raw Data (Debug)
{
  "raw_xml": "<entry>\n    <id>http://arxiv.org/abs/2602.19873v1</id>\n    <title>GPU-Native Compressed Neighbor Lists with a Space-Filling-Curve Data Layout</title>\n    <updated>2026-02-23T14:21:12Z</updated>\n    <link href='https://arxiv.org/abs/2602.19873v1' rel='alternate' type='text/html'/>\n    <link href='https://arxiv.org/pdf/2602.19873v1' rel='related' title='pdf' type='application/pdf'/>\n    <summary>We have developed a compressed neighbor list for short-range particle-particle interaction based on a space- filling curve (SFC) memory layout and particle clusters. The neighbor list can be constructed efficiently on GPUs, supporting NVIDIA and AMD hardware, and has a memory footprint of only 4 bytes per particle to store approximately 200 neighbors. Compared to the highly-optimized domain-specific neighbor list implementation of GROMACS, a molecular dynamics code, it has a comparable cluster overhead and delivers similar performance in a neighborhood pass. Thanks to the SFC-based data layout and the support for varying interaction radii per particle, our neighbor list performs well for systems with high density contrasts, such as those encountered in many astrophysical and cosmological applications. Due to the close relation between SFCs and octrees, our neighbor list seamlessly integrates with octree-based domain decomposition and multipole-based methods for long-range gravitational or electrostatic interactions. To demonstrate the coupling between long- and short-range forces, we simulate an Evrard collapse, a standard test case for the coupling between hydrodynamical and gravitational forces, on up to 1024 GPUs, and compare our results to the analytical solution.</summary>\n    <category scheme='http://arxiv.org/schemas/atom' term='cs.CE'/>\n    <category scheme='http://arxiv.org/schemas/atom' term='astro-ph.IM'/>\n    <category scheme='http://arxiv.org/schemas/atom' term='cs.DS'/>\n    <published>2026-02-23T14:21:12Z</published>\n    <arxiv:comment>Accepted at IPDPS 2026</arxiv:comment>\n    <arxiv:primary_category term='cs.CE'/>\n    <author>\n      <name>Felix Thaler</name>\n    </author>\n    <author>\n      <name>Sebastian Keller</name>\n    </author>\n  </entry>"
}