7.3 Hotelling’s \(T^2\) distribution
Recall that in univariate statistics, Student’s \(t\)-distribution appears as the sampling distribution of \(\frac{\bar{x}-\mu}{s/\sqrt{n}}\), which is used for hypothesis tests and constructing confidence intervals.
Hotelling’s \(T^2\) distribution is the multivariate analogue of Student’s \(t\)-distribution. It plays an important role in multivariate hypothesis testing and confidence region construction, just as the Student \(t\)-distribution does in the univariate setting.
This is reminiscent of the definition of the Student \(t\)-distribution: if \(x \sim N(0,1)\) and \(v\sim \chi^2_n\), then \[T = \frac{x}{\sqrt{v/n}} \sim t_n.\] Hotelling’s \(T^2\) distribution looks similar (albeit working with the square):a MVN random variable ‘divided’ by a Wishart r.v. divided by the degrees of freedom.
We can generalise the definition with the following result.
This result gives rise to an important corollary used in hypothesis testing when \(\boldsymbol{\Sigma}\) is unknown.
Corollary 7.3 If \(\bar{\mathbf x}\) and \(\mathbf S\) are the mean and covariance matrix based on a sample of size \(n\) from \(N_p({\boldsymbol{\mu}},\boldsymbol{\Sigma})\) then \[ (n-1)(\bar{\mathbf x}-{\boldsymbol{\mu}})^\top \mathbf S^{-1} (\bar{\mathbf x}-{\boldsymbol{\mu}}) \sim T^2(p,n-1).\]
Proof. We have seen earlier that \(\bar{\mathbf x} \sim N_p({\boldsymbol{\mu}},\frac{1}{n}\boldsymbol{\Sigma})\). Let \(\mathbf x^\ast = n^{1/2} \bar{\mathbf x}\) and let \({\boldsymbol{\mu}}^\ast = n^{1/2} {\boldsymbol{\mu}}\). Then \(\mathbf x^\ast=n^{1/2} \bar{\mathbf x} \sim N_p({\boldsymbol{\mu}}^\ast, \boldsymbol{\Sigma})\).
From Proposition 7.10 we know \(n\mathbf S\sim W_p(\boldsymbol{\Sigma},n-1)\), and from Theorem 7.4 we know \(\bar{\mathbf x}\) and \(\mathbf S\) are independent. Applying Proposition 7.11 with \(\mathbf x= \mathbf x^\ast\) and \(\mathbf M= n\mathbf S\) we obtain \[ (n-1)(\mathbf x^\ast - {\boldsymbol{\mu}}^\ast)^\top (n\mathbf S)^{-1} (\mathbf x^\ast - {\boldsymbol{\mu}}^\ast) \sim T^2(p,n-1),\] and given \(\mathbf x^\ast - {\boldsymbol{\mu}}^\ast = n^{1/2} (\mathbf x-{\boldsymbol{\mu}})\) then \[\begin{align*} &(n-1)(\mathbf x^\ast - {\boldsymbol{\mu}}^\ast)^\top (n\mathbf S)^{-1} (\mathbf x^\ast - {\boldsymbol{\mu}}^\ast)\\ & \qquad \qquad = (n-1)n^{1/2}(\bar{\mathbf x}-{\boldsymbol{\mu}})^\top n^{-1} \mathbf S^{-1} n^{1/2}(\bar{\mathbf x}-{\boldsymbol{\mu}}) \\ &\qquad \qquad = (n-1)(\bar{\mathbf x}-{\boldsymbol{\mu}})^\top \mathbf S^{-1} (\bar{\mathbf x}-{\boldsymbol{\mu}}). \end{align*}\]Hotelling’s \(T^2\) distribution is not often included in statistical tables but the next result tells us that Hotelling’s \(T^2\) is a scale transformation of an \(F\) distribution.
We can apply this result to the previous corollary.
We’ll use this result to do hypothesis tests.