Last modified: November 26, 2018
This article is written in: 🇺🇸
Student's t-Distribution (Continuous)
The Student's t-distribution, or simply t-distribution, is a continuous probability distribution that arises when estimating the mean of a normally distributed population in situations where the sample size is small and the population standard deviation is unknown. The t-distribution is denoted as , where is the number of degrees of freedom.
Probability Density Function (PDF)
The PDF of a t-distribution is given by:
where is the gamma function.
Cumulative Distribution Function (CDF)
The CDF of a t-distribution can be expressed in terms of the regularized incomplete beta function, :
Expected Value and Variance
The expected value (mean) of a t-distribution is equal to 0 for :
The variance of a t-distribution is given by:
Moment Generating Functions and Moments
The moment generating function (MGF) of a t-distribution does not exist due to the heavy tails of the distribution. However, we can compute the moments directly from the PDF.
- First Moment (Mean):
- Second Moment (Variance + Mean^2):
Example: Analyzing the Effect of Diet on Blood Pressure
A researcher investigates whether two diets lead to different average systolic blood pressure. The following data is collected from a random sample:
- Diet A: Sample size (n1) = 20, Mean (mean1) = 120, Standard Deviation (s1) = 8
- Diet B: Sample size (n2) = 25, Mean (mean2) = 130, Standard Deviation (s2) = 10
To compare the means, a two-sample t-test is employed.
I. Calculation of the T-statistic:
The t-statistic is calculated using the formula for the two-sample t-test:
For the given data:
This value indicates the number of standard deviations the sample mean difference is from 0.
II. Estimation of Degrees of Freedom:
The degrees of freedom can be estimated using the Welch-Satterthwaite equation:
These degrees of freedom are used in determining the critical value of t from the t-distribution table for the hypothesis test.
Applications
Student's t-distributions are widely used in hypothesis testing (such as the one-sample t-test, two-sample t-test, and paired t-test), confidence interval estimation for the population mean, and in regression analysis. They are particularly useful when the sample size is small and the population standard deviation is unknown.