D'Agostino's K-squared test

Article snapshot taken from Wikipedia under the Creative Commons Attribution-ShareAlike license.
Goodness-of-fit measure in statistics

In statistics, D'Agostino's K² test, named for Ralph D'Agostino, is a goodness-of-fit measure of departure from normality; that is, the test aims to gauge the compatibility of given data with the null hypothesis that the data are a realization of independent, identically distributed Gaussian random variables. The test is based on transformations of the sample kurtosis and skewness, and has power only against alternatives in which the distribution is skewed and/or kurtic (has nonzero excess kurtosis).

Skewness and kurtosis

In the following, {xi} denotes a sample of n observations, g1 and g2 are the sample skewness and kurtosis, the mj are the j-th sample central moments, and x̄ is the sample mean. In the literature on normality testing, the skewness and kurtosis are frequently denoted √β1 and β2 respectively. Such notation can be inconvenient since, for example, √β1 can be a negative quantity.

The sample skewness and kurtosis are defined as

$$
\begin{aligned}
g_1 &= \frac{m_3}{m_2^{3/2}} = \frac{\frac{1}{n}\sum_{i=1}^{n}\left(x_i-\bar{x}\right)^{3}}{\left(\frac{1}{n}\sum_{i=1}^{n}\left(x_i-\bar{x}\right)^{2}\right)^{3/2}}, \\
g_2 &= \frac{m_4}{m_2^{2}} - 3 = \frac{\frac{1}{n}\sum_{i=1}^{n}\left(x_i-\bar{x}\right)^{4}}{\left(\frac{1}{n}\sum_{i=1}^{n}\left(x_i-\bar{x}\right)^{2}\right)^{2}} - 3.
\end{aligned}
$$
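As a minimal sketch of these definitions, the following Python function computes g1 and g2 from the sample central moments with the 1/n normalization. The function name `sample_skew_kurt` and the example data are illustrative assumptions, not part of the original article.

```python
def sample_skew_kurt(xs):
    """Return (g1, g2): sample skewness and excess kurtosis.

    Hypothetical helper name; uses the 1/n central moments m_j defined above.
    """
    n = len(xs)
    mean = sum(xs) / n
    # j-th sample central moments m_2, m_3, m_4
    m2 = sum((x - mean) ** 2 for x in xs) / n
    m3 = sum((x - mean) ** 3 for x in xs) / n
    m4 = sum((x - mean) ** 4 for x in xs) / n
    g1 = m3 / m2 ** 1.5
    g2 = m4 / m2 ** 2 - 3.0
    return g1, g2


# Illustrative data: one large value on the right makes g1 positive.
g1, g2 = sample_skew_kurt([1.0, 2.0, 3.0, 4.0, 10.0])
print(g1, g2)
```

Note that g2 is the excess kurtosis: the subtraction of 3 makes its target value 0 for a normal distribution.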

These quantities consistently estimate the theoretical skewness and kurtosis of the distribution, respectively. Moreover, if the sample indeed comes from a normal population, then the exact finite sample distributions of the skewness and kurtosis can themselves be analysed in terms of their means μ1, variances μ2, skewnesses γ1, and kurtosis γ2. This has been done by Pearson (1931), who derived the following expressions:

$$
\begin{aligned}
\mu_1(g_1) &= 0, \\
\mu_2(g_1) &= \frac{6(n-2)}{(n+1)(n+3)}, \\
\gamma_1(g_1) &\equiv \frac{\mu_3(g_1)}{\mu_2(g_1)^{3/2}} = 0, \\
\gamma_2(g_1) &\equiv \frac{\mu_4(g_1)}{\mu_2(g_1)^{2}} - 3 = \frac{36(n-7)(n^{2}+2n-5)}{(n-2)(n+5)(n+7)(n+9)}.
\end{aligned}
$$

and

$$
\begin{aligned}
\mu_1(g_2) &= -\frac{6}{n+1}, \\
\mu_2(g_2) &= \frac{24n(n-2)(n-3)}{(n+1)^{2}(n+3)(n+5)}, \\
\gamma_1(g_2) &\equiv \frac{\mu_3(g_2)}{\mu_2(g_2)^{3/2}} = \frac{6(n^{2}-5n+2)}{(n+7)(n+9)}\sqrt{\frac{6(n+3)(n+5)}{n(n-2)(n-3)}}, \\
\gamma_2(g_2) &\equiv \frac{\mu_4(g_2)}{\mu_2(g_2)^{2}} - 3 = \frac{36(15n^{6}-36n^{5}-628n^{4}+982n^{3}+5777n^{2}-6402n+900)}{n(n-3)(n-2)(n+7)(n+9)(n+11)(n+13)}.
\end{aligned}
$$

For example, for a sample of size n = 1000 drawn from a normally distributed population, the sample skewness g1 has mean 0 and a standard deviation of about 0.08, and the sample kurtosis g2 has mean about 0 (more precisely −6/1001 ≈ −0.006) and a standard deviation of about 0.15.
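The n = 1000 figures can be checked by evaluating Pearson's expressions directly. The sketch below does this with the Python standard library; the function names `skewness_moments` and `kurtosis_moments` are illustrative assumptions.

```python
import math


def skewness_moments(n):
    """(mean, variance, skewness, excess kurtosis) of g1 under normality."""
    mu1 = 0.0
    mu2 = 6.0 * (n - 2) / ((n + 1) * (n + 3))
    gamma1 = 0.0
    gamma2 = (36.0 * (n - 7) * (n ** 2 + 2 * n - 5)
              / ((n - 2) * (n + 5) * (n + 7) * (n + 9)))
    return mu1, mu2, gamma1, gamma2


def kurtosis_moments(n):
    """(mean, variance, skewness, excess kurtosis) of g2 under normality."""
    mu1 = -6.0 / (n + 1)
    mu2 = 24.0 * n * (n - 2) * (n - 3) / ((n + 1) ** 2 * (n + 3) * (n + 5))
    gamma1 = (6.0 * (n ** 2 - 5 * n + 2) / ((n + 7) * (n + 9))
              * math.sqrt(6.0 * (n + 3) * (n + 5) / (n * (n - 2) * (n - 3))))
    poly = (15 * n ** 6 - 36 * n ** 5 - 628 * n ** 4 + 982 * n ** 3
            + 5777 * n ** 2 - 6402 * n + 900)
    gamma2 = (36.0 * poly
              / (n * (n - 3) * (n - 2) * (n + 7) * (n + 9) * (n + 11) * (n + 13)))
    return mu1, mu2, gamma1, gamma2


# Standard deviations of g1 and g2 for n = 1000
sd_g1 = math.sqrt(skewness_moments(1000)[1])
sd_g2 = math.sqrt(kurtosis_moments(1000)[1])
print(round(sd_g1, 2), round(sd_g2, 2))
```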

Transformed sample skewness and kurtosis

The sample skewness g1 and kurtosis g2 are both asymptotically normal. However, their convergence to the limiting distribution is slow, especially for g2. For example, even with n = 5000 observations, the expressions above give the sample kurtosis g2 a skewness of about 0.2 and an excess kurtosis of about 0.1, which is not negligible. To remedy this, it has been suggested to transform g1 and g2 so that their distributions are as close to standard normal as possible.

In particular, D'Agostino & Pearson (1973) suggested the following transformation for sample skewness:

$$
Z_1(g_1) = \delta \operatorname{asinh}\left(\frac{g_1}{\alpha\sqrt{\mu_2}}\right),
$$

where constants α and δ are computed as

$$
\begin{aligned}
W^{2} &= \sqrt{2\gamma_2+4} - 1, \\
\delta &= 1/\sqrt{\ln W}, \\
\alpha^{2} &= 2/(W^{2}-1),
\end{aligned}
$$

and where μ2 = μ2(g1) is the variance of g1 and γ2 = γ2(g1) is its excess kurtosis, both given by the expressions in the previous section.
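A minimal sketch of this transformation in Python follows. The function name `z_skewness` is an illustrative assumption, and so is the guard for small samples: it reflects the observation that γ2(g1) is not positive for n ≤ 7, in which case W² ≤ 1 and neither ln W nor α² gives a usable value.

```python
import math


def z_skewness(g1, n):
    """Transformed sample skewness Z1(g1) for a sample of size n."""
    if n < 8:
        # Assumption of this sketch: for n <= 7 the formula has no valid
        # value because gamma2(g1) <= 0 makes W^2 <= 1.
        raise ValueError("n must be at least 8")
    mu2 = 6.0 * (n - 2) / ((n + 1) * (n + 3))            # variance of g1
    gamma2 = (36.0 * (n - 7) * (n ** 2 + 2 * n - 5)
              / ((n - 2) * (n + 5) * (n + 7) * (n + 9)))  # excess kurtosis of g1
    w2 = math.sqrt(2.0 * gamma2 + 4.0) - 1.0              # W^2
    delta = 1.0 / math.sqrt(math.log(math.sqrt(w2)))      # 1 / sqrt(ln W)
    alpha = math.sqrt(2.0 / (w2 - 1.0))
    return delta * math.asinh(g1 / (alpha * math.sqrt(mu2)))


print(z_skewness(0.5, 100))
```

The transform is odd in g1, so a sample skewness of zero maps to Z1 = 0. For very large n it approaches the plain standardization g1/√μ2.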

Similarly, Anscombe & Glynn (1983) suggested a transformation for g2, which works reasonably well for sample sizes of 20 or greater:

$$
Z_2(g_2) = \sqrt{\frac{9A}{2}}\left\{1 - \frac{2}{9A} - \left(\frac{1-2/A}{1+\frac{g_2-\mu_1}{\sqrt{\mu_2}}\sqrt{2/(A-4)}}\right)^{1/3}\right\},
$$

where

$$
A = 6 + \frac{8}{\gamma_1}\left(\frac{2}{\gamma_1} + \sqrt{1+4/\gamma_1^{2}}\right),
$$

and μ1 = μ1(g2), μ2 = μ2(g2), γ1 = γ1(g2) are the quantities computed by Pearson.
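The kurtosis transformation can be sketched in the same way. The function name `z_kurtosis` is an illustrative assumption. Rejecting n < 20 is a choice made in this sketch to follow the sample-size guidance above, and the signed cube root is likewise an assumption made here to keep the result real if the inner ratio turns negative.

```python
import math


def z_kurtosis(g2, n):
    """Transformed sample kurtosis Z2(g2) for a sample of size n."""
    if n < 20:
        # Choice of this sketch, following the n >= 20 guidance in the text.
        raise ValueError("n must be at least 20")
    mu1 = -6.0 / (n + 1)                                  # mean of g2
    mu2 = (24.0 * n * (n - 2) * (n - 3)
           / ((n + 1) ** 2 * (n + 3) * (n + 5)))          # variance of g2
    gamma1 = (6.0 * (n ** 2 - 5 * n + 2) / ((n + 7) * (n + 9))
              * math.sqrt(6.0 * (n + 3) * (n + 5) / (n * (n - 2) * (n - 3))))
    a = 6.0 + 8.0 / gamma1 * (2.0 / gamma1 + math.sqrt(1.0 + 4.0 / gamma1 ** 2))
    x = (g2 - mu1) / math.sqrt(mu2)                       # standardized g2
    ratio = (1.0 - 2.0 / a) / (1.0 + x * math.sqrt(2.0 / (a - 4.0)))
    # Signed real cube root (assumption: keeps the value real if ratio < 0).
    cube_root = math.copysign(abs(ratio) ** (1.0 / 3.0), ratio)
    return math.sqrt(9.0 * a / 2.0) * (1.0 - 2.0 / (9.0 * a) - cube_root)


print(z_kurtosis(1.0, 100), z_kurtosis(-0.8, 100))
```

Heavier-than-normal tails (large g2) give a positive Z2, and lighter tails give a negative one.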

Omnibus K² statistic

Statistics Z1 and Z2 can be combined to produce an omnibus test, able to detect deviations from normality due to either skewness or kurtosis (D'Agostino, Belanger & D'Agostino 1990):

$$
K^{2} = Z_1(g_1)^{2} + Z_2(g_2)^{2}
$$

If the null hypothesis of normality is true, then K² is approximately χ²-distributed with 2 degrees of freedom.
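Under this approximation the p-value has a closed form, because the upper tail probability of a χ² distribution with 2 degrees of freedom is exp(−x/2). The sketch below combines two already-transformed statistics; the function name `k2_p_value` and the input values are illustrative assumptions.

```python
import math


def k2_p_value(z1, z2):
    """Return (K^2, approximate p-value) from transformed skewness and kurtosis."""
    k2 = z1 ** 2 + z2 ** 2
    # Upper tail of chi-square with 2 degrees of freedom: P(X > x) = exp(-x / 2)
    p = math.exp(-k2 / 2.0)
    return k2, p


# Hypothetical values of Z1 and Z2, for illustration only.
k2, p = k2_p_value(1.2, -2.1)
# 5% critical value of chi-square(2), obtained by inverting exp(-x / 2) = 0.05
critical_5pct = -2.0 * math.log(0.05)
print(k2, p, critical_5pct)
```

Normality would be rejected at the 5% level when K² exceeds the critical value, which is roughly 5.99.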

Note that the statistics g1 and g2 are not independent, only uncorrelated. Their transforms Z1 and Z2 are therefore also dependent (Shenton & Bowman 1977), which makes the validity of the χ² approximation questionable. Simulations show that under the null hypothesis the K² test statistic is characterized by

sample size           expected value   standard deviation   95% quantile
n = 20                1.971            2.339                6.373
n = 50                2.017            2.308                6.339
n = 100               2.026            2.267                6.271
n = 250               2.012            2.174                6.129
n = 500               2.009            2.113                6.063
n = 1000              2.000            2.062                6.038
χ²(2) distribution    2.000            2.000                5.991
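A simulation of this kind can be sketched with the Python standard library alone. This is not the procedure behind the table: the names `k_squared` and `simulate_null`, the seed, the replication count and the crude empirical quantile are all assumptions of this sketch, and a run this small will reproduce the tabulated values only roughly.

```python
import math
import random


def k_squared(xs):
    """K^2 for one sample, using the formulas in this article (n >= 20 assumed)."""
    n = len(xs)
    mean = sum(xs) / n
    dev = [x - mean for x in xs]
    m2 = sum(d ** 2 for d in dev) / n
    m3 = sum(d ** 3 for d in dev) / n
    m4 = sum(d ** 4 for d in dev) / n
    g1 = m3 / m2 ** 1.5
    g2 = m4 / m2 ** 2 - 3.0

    # Z1: transformed skewness
    var_g1 = 6.0 * (n - 2) / ((n + 1) * (n + 3))
    kurt_g1 = (36.0 * (n - 7) * (n * n + 2 * n - 5)
               / ((n - 2) * (n + 5) * (n + 7) * (n + 9)))
    w2 = math.sqrt(2.0 * kurt_g1 + 4.0) - 1.0
    delta = 1.0 / math.sqrt(0.5 * math.log(w2))   # ln W = 0.5 * ln(W^2)
    alpha = math.sqrt(2.0 / (w2 - 1.0))
    z1 = delta * math.asinh(g1 / (alpha * math.sqrt(var_g1)))

    # Z2: transformed kurtosis
    mean_g2 = -6.0 / (n + 1)
    var_g2 = 24.0 * n * (n - 2) * (n - 3) / ((n + 1) ** 2 * (n + 3) * (n + 5))
    skew_g2 = (6.0 * (n * n - 5 * n + 2) / ((n + 7) * (n + 9))
               * math.sqrt(6.0 * (n + 3) * (n + 5) / (n * (n - 2) * (n - 3))))
    a = 6.0 + 8.0 / skew_g2 * (2.0 / skew_g2 + math.sqrt(1.0 + 4.0 / skew_g2 ** 2))
    x = (g2 - mean_g2) / math.sqrt(var_g2)
    ratio = (1.0 - 2.0 / a) / (1.0 + x * math.sqrt(2.0 / (a - 4.0)))
    cube_root = math.copysign(abs(ratio) ** (1.0 / 3.0), ratio)
    z2 = math.sqrt(4.5 * a) * (1.0 - 2.0 / (9.0 * a) - cube_root)
    return z1 ** 2 + z2 ** 2


def simulate_null(n, reps, seed=0):
    """Mean, standard deviation and rough 95% quantile of K^2 under normality."""
    rng = random.Random(seed)
    values = sorted(k_squared([rng.gauss(0.0, 1.0) for _ in range(n)])
                    for _ in range(reps))
    mean = sum(values) / reps
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / (reps - 1))
    q95 = values[int(0.95 * reps)]   # crude empirical quantile, no interpolation
    return mean, sd, q95


mean, sd, q95 = simulate_null(n=50, reps=2000)
print(mean, sd, q95)
```

With more replications the estimates should settle closer to the corresponding row of the table; the 95% quantile is the slowest to stabilize because it depends on the tail.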
