Chapter 14-7. Kolmogorov-Smirnov Test

Recommended Post: 【Statistics】 Chapter 14. Statistical Test

1. General Kolmogorov-Smirnov Test

2. Parametric Kolmogorov-Smirnov Test

3. Cramér–von Mises Test

4. Kolmogorov-Smirnov Two-Sample Test

5. Kolmogorov-Smirnov Independence Test

1. General Kolmogorov-Smirnov Test

⑴ Define the sample (i.e., empirical) distribution function as follows: Step function form

⑵ Reject the null hypothesis when the following value is large for the two-sided Kolmogorov-Smirnov test

Figure 1. Kolmogorov-Smirnov test statistic

⑶ Null hypothesis

① The null distribution of D does not depend on F₀ (assumed to be continuous), and this is summarized in a table for various sample sizes n

② In other words, the null hypothesis is that F̂ and F₀, the two probability distributions, are the same.

⑷ Theory: Under the null hypothesis, √n D asymptotically (n → ∞) follows the distribution of the maximum absolute value of a Brownian bridge

⑸ Simulation: The p-value can be computed through Monte Carlo simulation

① In practice, since the distribution of D under the null distribution does not depend on F₀, this only needs to be performed once for each sample size

② For example, F₀ can be set to Unif(0, 1)

2. Parametric Kolmogorov-Smirnov Test

⑴ The test statistic is defined as follows

⑵ The p-value is usually estimated through parametric bootstrap: Here, F̂ and θ̂ are recalculated for each bootstrap sample

⑶ When 𝒢 is a family of normal distributions: That is, when testing whether a given distribution follows a normal distribution (Normality Test)

① This test is often called the Lilliefors normality test

② The above test statistic can be adjusted for all distributions within the family, and this is performed through Monte Carlo simulation

③ Therefore, the null distribution of this test statistic is summarized in a table

④ R code: ks.test(dat, "pnorm", mean=mu, sd=sigma)

⑷ The same logic applies to distributions of other location-scale families

① Here, G₀ is a given distribution defined on the real set ℝ

② Location: mean, median, quantiles/percentiles, etc

③ Scale: standard deviation, median absolute deviation, etc

3. Cramér–von Mises Test

⑴ A variation of the Kolmogorov-Smirnov test

⑵ The Cramér–von Mises test rejects the null hypothesis when the following value is large:

① f₀(x) = dF₀(x)/dx is the probability density function (PDF) under the null hypothesis.

② Formulate the equation similarly to MSE (mean-squared error).

⑶ This has a simple closed-form expression that does not require integration:

① Here, X₍₁₎ ≤ ⋯ ≤ X_(n) is the ordered sample, known as order statistics.

② The null distribution of D does not depend on F₀ and has been tabulated.

③ The asymptotic null distribution is also known but is complicated.

④ Therefore, Monte Carlo simulations can be used to compute the p-value.

4. Kolmogorov-Smirnov Two-Sample Test

⑴ The (one-sided) Kolmogorov-Smirnov test rejects for large values of

⑵ Theory

① The distribution of D_m,n⁺ can be computed exactly and efficiently using some recursion formulas.

② In the large-sample limit

③ This happens to be the same limiting distribution as in the one-sample case with sample size ⎣ mn / (m+n) ⎦.

5. Kolmogorov-Smirnov Independence Test

⑴ Overview

① In the spirit of the Kolmogorov-Smirnov test, Hoeffding (1948) — and others later — proposed a test of indepence based on the empirical CDFs.

② Feature 1. If X and Y are continuous, the test is distribution-free. The asymptotic distribution is also known in closed form.

③ Feature 2. The test is universally consistent against any alternative to independence.

⑵ Statistic

① Null hypothesis H₀: Two random variables are independent.

② A test rejects for large values of H.

③ F_n^X is the empirical CDF of X₁, ···, X_n.

④ F_n^Y is the empirical CDF of Y₁, ···, Y_n.

⑤ F_n^XY is the joint empirical CDF of (X₁, Y₁), ···, (X_n, Y_n).

Input: 2025.03.23 18:32

2310

Chapter 14-7. Kolmogorov-Smirnov Test

1. General Kolmogorov-Smirnov Test

2. Parametric Kolmogorov-Smirnov Test

3. Cramér–von Mises Test

4. Kolmogorov-Smirnov Two-Sample Test

5. Kolmogorov-Smirnov Independence Test

results matching ""

No results matching ""