The Chi-Square Test of Distance Correlation

Author(s)Shen, Cencheng
Author(s)Panda, Sambit
Author(s)Vogelstein, Joshua T.
Date Accessioned2022-01-26T21:13:22Z
Date Available2022-01-26T21:13:22Z
Publication Date2021-07-19
DescriptionThis is an Accepted Manuscript of an article published by Taylor & Francis in Journal of Computational and Graphical Statistics on 07/19/2021 available online: http://www.tandfonline.com/10.1080/10618600.2021.1938585.en_US
AbstractDistance correlation has gained much recent attention in the data science community: the sample statistic is straightforward to compute and asymptotically equals zero if and only if independence, making it an ideal choice to discover any type of dependency structure given sufficient sample size. One major bottleneck is the testing process: because the null distribution of distance correlation depends on the underlying random variables and metric choice, it typically requires a permutation test to estimate the null and compute the p-value, which is very costly for large amount of data. To overcome the difficulty, in this article, we propose a chi-squared test for distance correlation. Method-wise, the chi-squared test is nonparametric, extremely fast, and applicable to bias-corrected distance correlation using any strong negative type metric or characteristic kernel. The test exhibits a similar testing power as the standard permutation test, and can be used for K-sample and partial testing. Theory-wise, we show that the underlying chi-squared distribution well approximates and dominates the limiting null distribution in upper tail, prove the chi-squared test can be valid and universally consistent for testing independence, and establish a testing power inequality with respect to the permutation test. Supplementary files for this article are available online.en_US
SponsorThis work was supported by the National Science Foundation award DMS-1921310, the National Institute of Health award R01MH120482, and DARPA L2M program FA8650-18-2-7834.en_US
CitationCencheng Shen, Sambit Panda & Joshua T. Vogelstein (2021) The Chi-Square Test of Distance Correlation, Journal of Computational and Graphical Statistics, DOI: 10.1080/10618600.2021.1938585en_US
ISSN1537-2715
URLhttps://udspace.udel.edu/handle/19716/30127
Languageen_USen_US
PublisherJournal of Computational and Graphical Statisticsen_US
KeywordsCentered chi-squared distributionen_US
KeywordsNonparametric testen_US
KeywordsTesting independenceen_US
KeywordsUnbiased distance covarianceen_US
TitleThe Chi-Square Test of Distance Correlationen_US
TypeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
The Chi-Square Test of Distance.pdf
Size:
1.24 MB
Format:
Adobe Portable Document Format
Description:
Main article
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.22 KB
Format:
Item-specific license agreed upon to submission
Description: