Last modified: September 21, 2024

This article is written in: 🇺🇸

F-Distribution (Continuous)

The F-distribution, also known as the Fisher-Snedecor distribution, is a continuous probability distribution that arises in hypothesis testing when comparing the variances of two normally distributed populations. The F-distribution is denoted as $X \sim F(d_1, d_2)$, where $d_1$ and $d_2$ are the degrees of freedom for the numerator and denominator, respectively.

Probability Density Function (PDF)

The PDF of an F-distribution is given by:

$$f(x) = \frac{\sqrt{\frac{(d_1x)^{d_1} \cdot d_2^{d_2}}{(d_1x + d_2)^{d_1 + d_2}}}}{xB(d_1/2, d_2/2)}$$

where $B(\cdot)$ is the beta function.

output(9)

Cumulative Distribution Function (CDF)

The CDF of an F-distribution can be expressed in terms of the regularized incomplete beta function, $I_x(a, b)$:

$$F(x) = I_{\frac{d_1x}{d_1x + d_2}}\left(\frac{d_1}{2}, \frac{d_2}{2}\right)$$

output(10)

Expected Value and Variance

The expected value (mean) of an F-distribution is given by:

$$E[X] = \frac{d_2}{d_2 - 2}, \text{ for } d_2 > 2$$

The variance of an F-distribution is given by:

$$\text{Var}(X) = \begin{cases} \frac{2d_2^2(d_1 + d_2 - 2)}{d_1(d_2 - 2)^2(d_2 - 4)}, & \text{if}\ d_2 > 4 \\ \text{undefined}, & \text{if}\ d_2 \le 4 \end{cases} $$

Example: Variability in Agricultural Yields

An agronomist is assessing the variability in yield between two wheat varieties. Random samples yield the following data:

To compare the variances, the agronomist employs an F-test.

Given:

I. Compute the F-statistic:

The F-statistic is calculated as the ratio of the two sample variances.

$$ F = \frac{20}{30} = \frac{2}{3} \approx 0.67 $$

II. Determine the degrees of freedom (df) for the test:

The degrees of freedom for each sample are calculated as the sample size minus one.

Applications

F-distributions are primarily used in hypothesis testing, such as the F-test for testing the equality of the variances of two normally distributed populations, and in the analysis of variance (ANOVA) for comparing the means of multiple groups. They are also used in regression analysis to test the overall significance of a model and in constructing confidence intervals for the ratio of two population variances.

Table of Contents

    F-Distribution (Continuous)
    1. Probability Density Function (PDF)
    2. Cumulative Distribution Function (CDF)
    3. Expected Value and Variance
    4. Example: Variability in Agricultural Yields
    5. Applications