LOADING
1612 字
8 分钟
测试公式
2026-05-03
无标签

二元正态分布的详细推导

一、联合密度函数的推导

1.1 协方差矩阵的基本性质

给定:

X=[X1X2]N2(μ,Σ)X = \begin{bmatrix} X_1 \\ X_2 \end{bmatrix} \sim N_2(\mu, \Sigma)

其中:

μ=[μ1μ2],Σ=[σ11σ12σ21σ22]=[σ12ρσ1σ2ρσ1σ2σ22]>0\mu = \begin{bmatrix} \mu_1 \\ \mu_2 \end{bmatrix}, \quad \Sigma = \begin{bmatrix} \sigma_{11} & \sigma_{12} \\ \sigma_{21} & \sigma_{22} \end{bmatrix} = \begin{bmatrix} \sigma_1^2 & \rho\sigma_1\sigma_2 \\ \rho\sigma_1\sigma_2 & \sigma_2^2 \end{bmatrix} > 0

注记Σ>0\Sigma > 0 表示 Σ\Sigma 是正定矩阵。

1.2 计算协方差矩阵的行列式

Σ=det(Σ)=σ12ρσ1σ2ρσ1σ2σ22=σ12σ22(ρσ1σ2)2=σ12σ22(1ρ2)|\Sigma| = \det(\Sigma) = \begin{vmatrix} \sigma_1^2 & \rho\sigma_1\sigma_2 \\ \rho\sigma_1\sigma_2 & \sigma_2^2 \end{vmatrix} = \sigma_1^2 \cdot \sigma_2^2 - (\rho\sigma_1\sigma_2)^2 = \sigma_1^2\sigma_2^2(1 - \rho^2)

由于 Σ>0\Sigma > 0(正定),故 Σ>0|\Sigma| > 0,即: σ12σ22(1ρ2)>0    1ρ2>0    ρ<1\sigma_1^2\sigma_2^2(1 - \rho^2) > 0 \implies 1 - \rho^2 > 0 \implies |\rho| < 1

1.3 计算协方差矩阵的逆矩阵

对于 2×22 \times 2 矩阵 A=[abcd]A = \begin{bmatrix} a & b \\ c & d \end{bmatrix},其逆矩阵为: A1=1adbc[dbca]A^{-1} = \frac{1}{ad - bc} \begin{bmatrix} d & -b \\ -c & a \end{bmatrix}

伴随矩阵求逆

应用到 Σ\Sigma 上:

Σ1=1Σ[σ22ρσ1σ2ρσ1σ2σ12]=1σ12σ22(1ρ2)[σ22ρσ1σ2ρσ1σ2σ12]\Sigma^{-1} = \frac{1}{|\Sigma|} \begin{bmatrix} \sigma_2^2 & -\rho\sigma_1\sigma_2 \\ -\rho\sigma_1\sigma_2 & \sigma_1^2 \end{bmatrix} = \frac{1}{\sigma_1^2\sigma_2^2(1 - \rho^2)} \begin{bmatrix} \sigma_2^2 & -\rho\sigma_1\sigma_2 \\ -\rho\sigma_1\sigma_2 & \sigma_1^2 \end{bmatrix}

化简得:

Σ1=11ρ2[1σ12ρσ1σ2ρσ1σ21σ22]\Sigma^{-1} = \frac{1}{1 - \rho^2} \begin{bmatrix} \frac{1}{\sigma_1^2} & -\frac{\rho}{\sigma_1\sigma_2} \\ -\frac{\rho}{\sigma_1\sigma_2} & \frac{1}{\sigma_2^2} \end{bmatrix}

1.4 二次型的展开

x=[x1x2]x = \begin{bmatrix} x_1 \\ x_2 \end{bmatrix},则: xμ=[x1μ1x2μ2]x - \mu = \begin{bmatrix} x_1 - \mu_1 \\ x_2 - \mu_2 \end{bmatrix}

计算二次型 (xμ)Σ1(xμ)(x - \mu)'\Sigma^{-1}(x - \mu)

(xμ)Σ1(xμ)=[x1μ1x2μ2]11ρ2[1σ12ρσ1σ2ρσ1σ21σ22][x1μ1x2μ2]=11ρ2[x1μ1x2μ2][x1μ1σ12ρ(x2μ2)σ1σ2ρ(x1μ1)σ1σ2+x2μ2σ22]=11ρ2[(x1μ1)(x1μ1σ12ρ(x2μ2)σ1σ2)+(x2μ2)(ρ(x1μ1)σ1σ2+x2μ2σ22)]=11ρ2[(x1μ1)2σ12ρ(x1μ1)(x2μ2)σ1σ2ρ(x1μ1)(x2μ2)σ1σ2+(x2μ2)2σ22]=11ρ2[(x1μ1)2σ122ρ(x1μ1)(x2μ2)σ1σ2+(x2μ2)2σ22]\begin{aligned} (x - \mu)'\Sigma^{-1}(x - \mu) &= \begin{bmatrix} x_1 - \mu_1 & x_2 - \mu_2 \end{bmatrix} \cdot \frac{1}{1 - \rho^2} \begin{bmatrix} \frac{1}{\sigma_1^2} & -\frac{\rho}{\sigma_1\sigma_2} \\ -\frac{\rho}{\sigma_1\sigma_2} & \frac{1}{\sigma_2^2} \end{bmatrix} \cdot \begin{bmatrix} x_1 - \mu_1 \\ x_2 - \mu_2 \end{bmatrix} \\ &= \frac{1}{1 - \rho^2} \begin{bmatrix} x_1 - \mu_1 & x_2 - \mu_2 \end{bmatrix} \cdot \begin{bmatrix} \frac{x_1 - \mu_1}{\sigma_1^2} - \frac{\rho(x_2 - \mu_2)}{\sigma_1\sigma_2} \\ -\frac{\rho(x_1 - \mu_1)}{\sigma_1\sigma_2} + \frac{x_2 - \mu_2}{\sigma_2^2} \end{bmatrix} \\ &= \frac{1}{1 - \rho^2} \left[ (x_1 - \mu_1)\left(\frac{x_1 - \mu_1}{\sigma_1^2} - \frac{\rho(x_2 - \mu_2)}{\sigma_1\sigma_2}\right) + (x_2 - \mu_2)\left(-\frac{\rho(x_1 - \mu_1)}{\sigma_1\sigma_2} + \frac{x_2 - \mu_2}{\sigma_2^2}\right) \right] \\ &= \frac{1}{1 - \rho^2} \left[ \frac{(x_1 - \mu_1)^2}{\sigma_1^2} - \frac{\rho(x_1 - \mu_1)(x_2 - \mu_2)}{\sigma_1\sigma_2} - \frac{\rho(x_1 - \mu_1)(x_2 - \mu_2)}{\sigma_1\sigma_2} + \frac{(x_2 - \mu_2)^2}{\sigma_2^2} \right] \\ &= \frac{1}{1 - \rho^2} \left[ \frac{(x_1 - \mu_1)^2}{\sigma_1^2} - 2\rho\frac{(x_1 - \mu_1)(x_2 - \mu_2)}{\sigma_1\sigma_2} + \frac{(x_2 - \mu_2)^2}{\sigma_2^2} \right] \end{aligned}

1.5 联合密度函数

根据多元正态分布的密度函数公式:

f(x)=1(2π)p/2Σ1/2exp[12(xμ)Σ1(xμ)]f(x) = \frac{1}{(2\pi)^{p/2}|\Sigma|^{1/2}} \exp\left[-\frac{1}{2}(x - \mu)'\Sigma^{-1}(x - \mu)\right]

对于 p=2p = 2,有:

(2π)p/2=(2π)2/2=2π(2\pi)^{p/2} = (2\pi)^{2/2} = 2\piΣ1/2=σ12σ22(1ρ2)=σ1σ21ρ2|\Sigma|^{1/2} = \sqrt{\sigma_1^2\sigma_2^2(1 - \rho^2)} = \sigma_1\sigma_2\sqrt{1 - \rho^2}

因此,联合密度函数为:

f(x1,x2)=12πσ1σ21ρ2exp[12(1ρ2)((x1μ1)2σ122ρ(x1μ1)(x2μ2)σ1σ2+(x2μ2)2σ22)]f(x_1, x_2) = \frac{1}{2\pi\sigma_1\sigma_2\sqrt{1 - \rho^2}} \exp\left[-\frac{1}{2(1 - \rho^2)}\left(\frac{(x_1 - \mu_1)^2}{\sigma_1^2} - 2\rho\frac{(x_1 - \mu_1)(x_2 - \mu_2)}{\sigma_1\sigma_2} + \frac{(x_2 - \mu_2)^2}{\sigma_2^2}\right)\right]

二、边缘密度函数的推导

2.1 X1X_1 的边缘密度函数

边缘密度函数 fX1(x1)f_{X_1}(x_1) 通过对 x2x_2 积分得到: fX1(x1)=+f(x1,x2)dx2f_{X_1}(x_1) = \int_{-\infty}^{+\infty} f(x_1, x_2) dx_2

将联合密度函数代入:

fX1(x1)=+12πσ1σ21ρ2exp[12(1ρ2)((x1μ1)2σ122ρ(x1μ1)(x2μ2)σ1σ2+(x2μ2)2σ22)]dx2f_{X_1}(x_1) = \int_{-\infty}^{+\infty} \frac{1}{2\pi\sigma_1\sigma_2\sqrt{1 - \rho^2}} \exp\left[-\frac{1}{2(1 - \rho^2)}\left(\frac{(x_1 - \mu_1)^2}{\sigma_1^2} - 2\rho\frac{(x_1 - \mu_1)(x_2 - \mu_2)}{\sigma_1\sigma_2} + \frac{(x_2 - \mu_2)^2}{\sigma_2^2}\right)\right] dx_2

2.2 完成平方

将指数中的表达式关于 x2x_2 完成平方:

A=12(1ρ2)A = \frac{1}{2(1 - \rho^2)},则指数部分为:

Aσ22(x2μ2)2+2Aρ(x1μ1)σ1σ2(x2μ2)A(x1μ1)2σ12=Aσ22[(x2μ2)22ρσ2(x1μ1)σ1(x2μ2)]A(x1μ1)2σ12=Aσ22[(x2μ2)22ρσ2(x1μ1)σ1(x2μ2)+ρ2σ22(x1μ1)2σ12ρ2σ22(x1μ1)2σ12]A(x1μ1)2σ12=Aσ22[(x2μ2)ρσ2(x1μ1)σ1]2+Aρ2(x1μ1)2σ12A(x1μ1)2σ12=Aσ22[(x2μ2)ρσ2(x1μ1)σ1]2A(1ρ2)(x1μ1)2σ12\begin{aligned} &-\frac{A}{\sigma_2^2}(x_2 - \mu_2)^2 + \frac{2A\rho(x_1 - \mu_1)}{\sigma_1\sigma_2}(x_2 - \mu_2) - \frac{A(x_1 - \mu_1)^2}{\sigma_1^2} \\ &= -\frac{A}{\sigma_2^2}\left[(x_2 - \mu_2)^2 - 2\rho\frac{\sigma_2(x_1 - \mu_1)}{\sigma_1}(x_2 - \mu_2)\right] - \frac{A(x_1 - \mu_1)^2}{\sigma_1^2} \\ &= -\frac{A}{\sigma_2^2}\left[(x_2 - \mu_2)^2 - 2\rho\frac{\sigma_2(x_1 - \mu_1)}{\sigma_1}(x_2 - \mu_2) + \rho^2\frac{\sigma_2^2(x_1 - \mu_1)^2}{\sigma_1^2} - \rho^2\frac{\sigma_2^2(x_1 - \mu_1)^2}{\sigma_1^2}\right] - \frac{A(x_1 - \mu_1)^2}{\sigma_1^2} \\ &= -\frac{A}{\sigma_2^2}\left[(x_2 - \mu_2) - \rho\frac{\sigma_2(x_1 - \mu_1)}{\sigma_1}\right]^2 + \frac{A\rho^2(x_1 - \mu_1)^2}{\sigma_1^2} - \frac{A(x_1 - \mu_1)^2}{\sigma_1^2} \\ &= -\frac{A}{\sigma_2^2}\left[(x_2 - \mu_2) - \rho\frac{\sigma_2(x_1 - \mu_1)}{\sigma_1}\right]^2 - \frac{A(1 - \rho^2)(x_1 - \mu_1)^2}{\sigma_1^2} \end{aligned}

2.3 积分计算

代入 A=12(1ρ2)A = \frac{1}{2(1 - \rho^2)}

fX1(x1)=12πσ1σ21ρ2exp[12(x1μ1)2σ12]×+exp[12(1ρ2)σ22((x2μ2)ρσ2(x1μ1)σ1)2]dx2\begin{aligned} f_{X_1}(x_1) &= \frac{1}{2\pi\sigma_1\sigma_2\sqrt{1 - \rho^2}} \exp\left[-\frac{1}{2}\cdot\frac{(x_1 - \mu_1)^2}{\sigma_1^2}\right] \\ &\quad \times \int_{-\infty}^{+\infty} \exp\left[-\frac{1}{2(1 - \rho^2)\sigma_2^2}\left((x_2 - \mu_2) - \rho\frac{\sigma_2(x_1 - \mu_1)}{\sigma_1}\right)^2\right] dx_2 \end{aligned}

y=(x2μ2)ρσ2(x1μ1)σ1y = (x_2 - \mu_2) - \rho\frac{\sigma_2(x_1 - \mu_1)}{\sigma_1},则 dy=dx2dy = dx_2,积分变为:

+exp[y22(1ρ2)σ22]dy\int_{-\infty}^{+\infty} \exp\left[-\frac{y^2}{2(1 - \rho^2)\sigma_2^2}\right] dy

这是一个高斯积分,其值为:

+ey2/(2σ2)dy=σ2π\int_{-\infty}^{+\infty} e^{-y^2/(2\sigma^2)} dy = \sigma\sqrt{2\pi}

这里 σ2=(1ρ2)σ22\sigma^2 = (1 - \rho^2)\sigma_2^2,所以:

+exp[y22(1ρ2)σ22]dy=σ22π(1ρ2)\int_{-\infty}^{+\infty} \exp\left[-\frac{y^2}{2(1 - \rho^2)\sigma_2^2}\right] dy = \sigma_2\sqrt{2\pi(1 - \rho^2)}

2.4 最终结果

代入得:

fX1(x1)=12πσ1σ21ρ2exp[12(x1μ1)2σ12]σ22π(1ρ2)=12πσ1exp[12(x1μ1)2σ12]\begin{aligned} f_{X_1}(x_1) &= \frac{1}{2\pi\sigma_1\sigma_2\sqrt{1 - \rho^2}} \exp\left[-\frac{1}{2}\cdot\frac{(x_1 - \mu_1)^2}{\sigma_1^2}\right] \cdot \sigma_2\sqrt{2\pi(1 - \rho^2)} \\ &= \frac{1}{\sqrt{2\pi}\sigma_1} \exp\left[-\frac{1}{2}\cdot\frac{(x_1 - \mu_1)^2}{\sigma_1^2}\right] \end{aligned}

因此,X1X_1 的边缘密度函数为:

fX1(x1)=12πσ1exp[(x1μ1)22σ12]f_{X_1}(x_1) = \frac{1}{\sqrt{2\pi}\sigma_1} \exp\left[-\frac{(x_1 - \mu_1)^2}{2\sigma_1^2}\right]

X1N(μ1,σ12)X_1 \sim N(\mu_1, \sigma_1^2)

2.5 X2X_2 的边缘密度函数

同理,X2X_2 的边缘密度函数为:

fX2(x2)=12πσ2exp[(x2μ2)22σ22]f_{X_2}(x_2) = \frac{1}{\sqrt{2\pi}\sigma_2} \exp\left[-\frac{(x_2 - \mu_2)^2}{2\sigma_2^2}\right]

X2N(μ2,σ22)X_2 \sim N(\mu_2, \sigma_2^2)


三、相关系数 ρ\rho 的统计意义

3.1 相关系数的定义

相关系数 ρ\rho 定义为:

ρ=Cov(X1,X2)Var(X1)Var(X2)=σ12σ1σ2\rho = \frac{\text{Cov}(X_1, X_2)}{\sqrt{\text{Var}(X_1)\text{Var}(X_2)}} = \frac{\sigma_{12}}{\sigma_1\sigma_2}

其中:

  • Cov(X1,X2)=σ12=ρσ1σ2\text{Cov}(X_1, X_2) = \sigma_{12} = \rho\sigma_1\sigma_2X1X_1X2X_2 的协方差
  • Var(X1)=σ12\text{Var}(X_1) = \sigma_1^2Var(X2)=σ22\text{Var}(X_2) = \sigma_2^2 分别是 X1X_1X2X_2 的方差

3.2 ρ\rho 的取值范围

由柯西-施瓦茨不等式:

Cov(X1,X2)Var(X1)Var(X2)|\text{Cov}(X_1, X_2)| \leq \sqrt{\text{Var}(X_1)\text{Var}(X_2)}

因此: ρ1|\rho| \leq 1

在非退化情况下(Σ>0\Sigma > 0),有 ρ<1|\rho| < 1

3.3 ρ\rho 的几何解释

3.3.1 线性相关程度

ρ\rho 衡量了 X1X_1X2X_2 之间的线性相关程度:

  • ρ=1\rho = 1X1X_1X2X_2 完全正相关,存在 a>0a > 0 使得 X2=aX1+bX_2 = aX_1 + b 几乎必然成立
  • ρ=1\rho = -1X1X_1X2X_2 完全负相关,存在 a<0a < 0 使得 X2=aX1+bX_2 = aX_1 + b 几乎必然成立
  • ρ=0\rho = 0X1X_1X2X_2 不相关(在正态分布下等价于独立)

3.3.2 联合密度函数的等高线

联合密度函数的等高线由方程:

(x1μ1)2σ122ρ(x1μ1)(x2μ2)σ1σ2+(x2μ2)2σ22=常数\frac{(x_1 - \mu_1)^2}{\sigma_1^2} - 2\rho\frac{(x_1 - \mu_1)(x_2 - \mu_2)}{\sigma_1\sigma_2} + \frac{(x_2 - \mu_2)^2}{\sigma_2^2} = \text{常数}

定义,这是一个椭圆方程。ρ\rho 决定了椭圆的形状和方向:

  • ρ1|\rho| \to 1:椭圆变得越来越扁,趋向于一条直线
  • ρ=0\rho = 0:椭圆退化为圆(当 σ1=σ2\sigma_1 = \sigma_2 时)或轴对齐的椭圆

3.4 ρ\rho 与条件分布的关系

3.4.1 条件期望

给定 X1=x1X_1 = x_1X2X_2 的条件期望为: E[X2X1=x1]=μ2+ρσ2σ1(x1μ1)E[X_2 | X_1 = x_1] = \mu_2 + \rho\frac{\sigma_2}{\sigma_1}(x_1 - \mu_1)

这是一条关于 x1x_1 的直线,斜率为 ρσ2σ1\rho\frac{\sigma_2}{\sigma_1}

3.4.2 条件方差

给定 X1=x1X_1 = x_1X2X_2 的条件方差为: Var(X2X1=x1)=σ22(1ρ2)\text{Var}(X_2 | X_1 = x_1) = \sigma_2^2(1 - \rho^2)

这说明:

  • ρ1|\rho| \to 1:条件方差 0\to 0,即 X2X_2 几乎完全由 X1X_1 决定
  • ρ=0\rho = 0:条件方差 =σ22= \sigma_2^2,即 X2X_2X1X_1 无关

3.5 ρ\rho 的统计推断

3.5.1 样本相关系数

给定样本 (x1i,x2i)(x_{1i}, x_{2i})i=1,2,,ni = 1, 2, \dots, n,样本相关系数为: r=i=1n(x1ixˉ1)(x2ixˉ2)i=1n(x1ixˉ1)2i=1n(x2ixˉ2)2r = \frac{\sum_{i=1}^n (x_{1i} - \bar{x}_1)(x_{2i} - \bar{x}_2)}{\sqrt{\sum_{i=1}^n (x_{1i} - \bar{x}_1)^2 \sum_{i=1}^n (x_{2i} - \bar{x}_2)^2}}

3.5.2 假设检验

检验 H0:ρ=0H_0: \rho = 0 vs H1:ρ0H_1: \rho \neq 0

H0H_0 下,统计量: t=rn21r2t(n2)t = r\sqrt{\frac{n-2}{1-r^2}} \sim t(n-2)

3.6 ρ\rho 的实际意义

3.6.1 预测能力

ρ2\rho^2(决定系数)表示 X1X_1 能够解释 X2X_2 变异的比例: ρ2=1Var(X2X1)Var(X2)\rho^2 = 1 - \frac{\text{Var}(X_2 | X_1)}{\text{Var}(X_2)}

3.6.2 风险分散

在投资组合理论中,ρ\rho 决定了资产组合的风险分散效果:

  • ρ=1\rho = 1:无法分散风险
  • ρ=1\rho = -1:可以完全对冲风险
  • ρ=0\rho = 0:可以部分分散风险

四、总结

4.1 联合密度函数

f(x1,x2)=12πσ1σ21ρ2exp[12(1ρ2)((x1μ1)2σ122ρ(x1μ1)(x2μ2)σ1σ2+(x2μ2)2σ22)]f(x_1, x_2) = \frac{1}{2\pi\sigma_1\sigma_2\sqrt{1 - \rho^2}} \exp\left[-\frac{1}{2(1 - \rho^2)}\left(\frac{(x_1 - \mu_1)^2}{\sigma_1^2} - 2\rho\frac{(x_1 - \mu_1)(x_2 - \mu_2)}{\sigma_1\sigma_2} + \frac{(x_2 - \mu_2)^2}{\sigma_2^2}\right)\right]

4.2 边缘密度函数

fX1(x1)=12πσ1exp[(x1μ1)22σ12],X1N(μ1,σ12)f_{X_1}(x_1) = \frac{1}{\sqrt{2\pi}\sigma_1} \exp\left[-\frac{(x_1 - \mu_1)^2}{2\sigma_1^2}\right], \quad X_1 \sim N(\mu_1, \sigma_1^2)

fX2(x2)=12πσ2exp[(x2μ2)22σ22],X2N(μ2,σ22)f_{X_2}(x_2) = \frac{1}{\sqrt{2\pi}\sigma_2} \exp\left[-\frac{(x_2 - \mu_2)^2}{2\sigma_2^2}\right], \quad X_2 \sim N(\mu_2, \sigma_2^2)

4.3 ρ\rho 的统计意义

  1. 定义ρ=Cov(X1,X2)σ1σ2\rho = \frac{\text{Cov}(X_1, X_2)}{\sigma_1\sigma_2}
  2. 取值范围ρ<1|\rho| < 1(非退化情况)
  3. 线性相关程度:衡量 X1X_1X2X_2 之间的线性相关程度
  4. 条件分布
    • E[X2X1=x1]=μ2+ρσ2σ1(x1μ1)E[X_2 | X_1 = x_1] = \mu_2 + \rho\frac{\sigma_2}{\sigma_1}(x_1 - \mu_1)
    • Var(X2X1=x1)=σ22(1ρ2)\text{Var}(X_2 | X_1 = x_1) = \sigma_2^2(1 - \rho^2)
  5. 决定系数ρ2\rho^2 表示解释比例
  6. 几何意义:决定联合密度函数等高线的形状和方向
测试公式
/posts/测试公式/
作者
Xs
发布于
2026-05-03
许可协议
CC BY-NC-SA 4.0

部分信息可能已经过时