Skip to content
Sahithyan's S3
1
Sahithyan's S3 — Applied Statistics

Student's t Distribution

A continuous sampling distribution. Used to estimate population mean of a population with unknown population standard deviation using a small sample. First described by William Sealy Gosset under the pseudonym “Student” in 1908.

Suppose a sample of size nn is taken from a population N(μ,σ2)N(\mu, \sigma^2). It’s mean is xˉ\bar{x} and standard deviation is ss. The tt-statistic is given by:

tn1=xˉμs/nt_{n-1} = \frac{\bar{x} - \mu}{s / \sqrt{n}}

Here, tn1t_{n-1} represents the t-statistic with n1n-1 degrees of freedom.

  • Symmetric about 0
  • Bell-shaped
  • Heavier Tails
    Compared to the normal distribution. This means it is more prone to producing values that fall far from its mean.

Usually n1n-1, where nn is the sample size. Denoted by ν\nu.

As degrees of freedom increase:

  • Curve becomes narrower
  • Approaches the standard normal distribution.

For ν>1\nu > 1, the mean is 00.

For ν>2\nu > 2:

νν2\frac{\nu}{\nu - 2} t=xˉμ0s/nt = \frac{\bar{x} - \mu_0}{s / \sqrt{n}}

Mathematically, the t distribution is defined by the following probability density function (PDF):

f(t)=Γ(ν+12)νπΓ(ν2)(1+t2ν)ν+12f(t) = \frac{\Gamma\left(\frac{\nu+1}{2}\right)}{\sqrt{\nu\pi}\,\Gamma\left(\frac{\nu}{2}\right)} \left(1 + \frac{t^2}{\nu}\right)^{-\frac{\nu+1}{2}}

where:

  • tt is the value of the random variable,
  • ν\nu is the degrees of freedom (typically, ν=n1\nu = n - 1 for a sample of size nn),
  • Γ\Gamma is the gamma function.

For large degrees of freedom (typically ν>30\nu > 30), the t distribution and normal distributions becomes indistinguishable.

Used to make inferences about means when sample sizes are small and the population standard deviation is unknown.

Most commonly used in the following scenarios:

  • Estimating the Mean: When you want to estimate the mean of a population based on a small sample and the population standard deviation is unknown.
  • Hypothesis Testing: In t-tests (such as one-sample, two-sample, and paired t-tests) to determine if there is a significant difference between means.
  • Confidence Intervals: To construct confidence intervals for the mean when the sample size is small.