Research

Paper

TESTING February 26, 2026

Kernel Integrated $R^2$: A Measure of Dependence

Authors

Pouya Roudaki, Shakeel Gavioli-Akilagun, Florian Kalinke, Mona Azadkia, Zoltán Szabó

Abstract

We introduce kernel integrated $R^2$, a new measure of statistical dependence that combines the local normalization principle of the recently introduced integrated $R^2$ with the flexibility of reproducing kernel Hilbert spaces (RKHSs). The proposed measure extends integrated $R^2$ from scalar responses to responses taking values on general spaces equipped with a characteristic kernel, allowing to measure dependence of multivariate, functional, and structured data, while remaining sensitive to tail behaviour and oscillatory dependence structures. We establish that (i) this new measure takes values in $[0,1]$, (ii) equals zero if and only if independence holds, and (iii) equals one if and only if the response is almost surely a measurable function of the covariates. Two estimators are proposed: a graph-based method using $K$-nearest neighbours and an RKHS-based method built on conditional mean embeddings. We prove consistency and derive convergence rates for the graph-based estimator, showing its adaptation to intrinsic dimensionality. Numerical experiments on simulated data and a real data experiment in the context of dependency testing for media annotations demonstrate competitive power against state-of-the-art dependence measures, particularly in settings involving non-linear and structured relationships.

Metadata

arXiv ID: 2602.22985
Provider: ARXIV
Primary Category: stat.ML
Published: 2026-02-26
Fetched: 2026-02-27 04:35

Related papers

Raw Data (Debug)
{
  "raw_xml": "<entry>\n    <id>http://arxiv.org/abs/2602.22985v1</id>\n    <title>Kernel Integrated $R^2$: A Measure of Dependence</title>\n    <updated>2026-02-26T13:29:12Z</updated>\n    <link href='https://arxiv.org/abs/2602.22985v1' rel='alternate' type='text/html'/>\n    <link href='https://arxiv.org/pdf/2602.22985v1' rel='related' title='pdf' type='application/pdf'/>\n    <summary>We introduce kernel integrated $R^2$, a new measure of statistical dependence that combines the local normalization principle of the recently introduced integrated $R^2$ with the flexibility of reproducing kernel Hilbert spaces (RKHSs). The proposed measure extends integrated $R^2$ from scalar responses to responses taking values on general spaces equipped with a characteristic kernel, allowing to measure dependence of multivariate, functional, and structured data, while remaining sensitive to tail behaviour and oscillatory dependence structures. We establish that (i) this new measure takes values in $[0,1]$, (ii) equals zero if and only if independence holds, and (iii) equals one if and only if the response is almost surely a measurable function of the covariates. Two estimators are proposed: a graph-based method using $K$-nearest neighbours and an RKHS-based method built on conditional mean embeddings. We prove consistency and derive convergence rates for the graph-based estimator, showing its adaptation to intrinsic dimensionality. Numerical experiments on simulated data and a real data experiment in the context of dependency testing for media annotations demonstrate competitive power against state-of-the-art dependence measures, particularly in settings involving non-linear and structured relationships.</summary>\n    <category scheme='http://arxiv.org/schemas/atom' term='stat.ML'/>\n    <category scheme='http://arxiv.org/schemas/atom' term='cs.IT'/>\n    <category scheme='http://arxiv.org/schemas/atom' term='cs.LG'/>\n    <published>2026-02-26T13:29:12Z</published>\n    <arxiv:primary_category term='stat.ML'/>\n    <author>\n      <name>Pouya Roudaki</name>\n    </author>\n    <author>\n      <name>Shakeel Gavioli-Akilagun</name>\n    </author>\n    <author>\n      <name>Florian Kalinke</name>\n    </author>\n    <author>\n      <name>Mona Azadkia</name>\n    </author>\n    <author>\n      <name>Zoltán Szabó</name>\n    </author>\n  </entry>"
}