WeHUSTER
LOG IN
Open main menu
Home
四级真题
六级真题
《线性代数入门》答案
微积分考试真题
About
6.3 奇异值分解
6.3.1
\\
略。
\\
6.3.2
\quad
矩阵
A
A
A
的 QR 分解
A
=
Q
R
A=QR
A
=
QR
,且
R
R
R
有奇异值分解
R
=
U
Σ
V
T
R=U\varSigma V^\mathrm{T}
R
=
U
Σ
V
T
,求
A
A
A
的奇异值分解。
\\
解:
\\
显然矩阵
Q
U
QU
Q
U
也是正交矩阵,于是
A
A
A
的奇异值分解是
A
=
Q
U
Σ
V
T
.
A=QU\varSigma V^\mathrm{T}.\\
A
=
Q
U
Σ
V
T
.
6.3.3
\quad
设
A
A
A
的奇异值分解为
A
=
U
Σ
V
T
A=U\varSigma V^\mathrm{T}
A
=
U
Σ
V
T
,求矩阵
[
O
A
T
A
O
]
\begin{bmatrix} O&A^\mathrm{T}\\ A&O \end{bmatrix}
[
O
A
A
T
O
]
的谱分解。
\\
解:
\\
注意
A
A
A
不一定是方阵且可能不满秩。
\\
设
rank
(
A
)
=
r
\operatorname{rank}(A)=r
rank
(
A
)
=
r
,则
U
=
[
u
1
⋯
u
r
u
r
+
1
⋯
u
n
]
,
V
=
[
v
1
⋯
v
r
v
r
+
1
⋯
v
n
]
U=\begin{bmatrix} \bm{u}_1&\cdots&\bm{u}_r&\bm{u}_{r+1}&\cdots&\bm{u}_{n} \end{bmatrix},V=\begin{bmatrix} \bm{v}_1&\cdots&\bm{v}_r&\bm{v}_{r+1}&\cdots&\bm{v}_{n} \end{bmatrix}
U
=
[
u
1
⋯
u
r
u
r
+
1
⋯
u
n
]
,
V
=
[
v
1
⋯
v
r
v
r
+
1
⋯
v
n
]
。
A
A
A
的简化奇异值分解为
A
=
σ
1
u
1
v
1
T
+
⋯
+
σ
r
u
r
v
r
T
A=\sigma_1\bm{u}_1\bm{v}_1^{\mathrm{T}}+\cdots+\sigma_r\bm{u}_r\bm{v}_r^{\mathrm{T}}
A
=
σ
1
u
1
v
1
T
+
⋯
+
σ
r
u
r
v
r
T
,其中
u
1
⋯
u
r
\bm{u}_1\cdots\bm{u}_r
u
1
⋯
u
r
正交,
v
1
⋯
v
r
\bm{v}_1\cdots\bm{v}_r
v
1
⋯
v
r
正交。
\\
先考虑
1
≤
i
≤
r
1\leq i \leq r
1
≤
i
≤
r
的情况,此时
A
v
i
=
σ
i
u
i
,
A
T
u
i
=
σ
i
v
i
A\bm{v}_i=\sigma_i\bm{u}_i, A^{\mathrm{T}}\bm{u}_i=\sigma_i\bm{v}_i
A
v
i
=
σ
i
u
i
,
A
T
u
i
=
σ
i
v
i
,于是
[
O
A
T
A
O
]
[
v
i
u
i
]
=
[
A
T
u
i
A
v
i
]
=
σ
i
[
v
i
u
i
]
\begin{bmatrix} O&A^{\mathrm{T}}\\ A&O \end{bmatrix} \begin{bmatrix} \bm{v}_i\\ \bm{u}_i \end{bmatrix} = \begin{bmatrix} A^{\mathrm{T}}\bm{u}_i\\ A\bm{v}_i \end{bmatrix} = \sigma_i\begin{bmatrix} \bm{v}_i\\ \bm{u}_i \end{bmatrix}
[
O
A
A
T
O
]
[
v
i
u
i
]
=
[
A
T
u
i
A
v
i
]
=
σ
i
[
v
i
u
i
]
[
O
A
T
A
O
]
[
v
i
−
u
i
]
=
[
−
A
T
u
i
A
v
i
]
=
−
σ
i
[
v
i
−
u
i
]
\begin{bmatrix} O&A^{\mathrm{T}}\\ A&O \end{bmatrix} \begin{bmatrix} \bm{v}_i\\ -\bm{u}_i \end{bmatrix} = \begin{bmatrix} -A^{\mathrm{T}}\bm{u}_i\\ A\bm{v}_i \end{bmatrix} = -\sigma_i\begin{bmatrix} \bm{v}_i\\ -\bm{u}_i \end{bmatrix}
[
O
A
A
T
O
]
[
v
i
−
u
i
]
=
[
−
A
T
u
i
A
v
i
]
=
−
σ
i
[
v
i
−
u
i
]
所以
σ
i
\sigma_i
σ
i
是特征值,对应的特征向量是
[
v
i
u
i
]
\begin{bmatrix} \bm{v}_i\\ \bm{u}_i \end{bmatrix}
[
v
i
u
i
]
,
−
σ
i
-\sigma_i
−
σ
i
也是特征值,对应的特征向量是
[
v
i
−
u
i
]
\begin{bmatrix} \bm{v}_i\\ -\bm{u}_i \end{bmatrix}
[
v
i
−
u
i
]
。这里有
2
r
2r
2
r
个特征值。
\\
再考虑
r
<
i
≤
n
r \lt i \leq n
r
<
i
≤
n
,显然此时
A
v
i
=
A
T
u
i
=
0
A\bm{v}_i=A^{\mathrm{T}}\bm{u}_i=\bm{0}
A
v
i
=
A
T
u
i
=
0
,于是
[
O
A
T
A
O
]
[
0
u
i
]
=
[
A
T
u
i
0
]
=
0
\begin{bmatrix} O&A^{\mathrm{T}}\\ A&O \end{bmatrix} \begin{bmatrix} \bm{0}\\ \bm{u}_i \end{bmatrix} = \begin{bmatrix} A^{\mathrm{T}}\bm{u}_i\\ \bm{0} \end{bmatrix} = \bm{0}
[
O
A
A
T
O
]
[
0
u
i
]
=
[
A
T
u
i
0
]
=
0
[
O
A
T
A
O
]
[
v
i
0
]
=
[
0
A
v
i
]
=
0
\begin{bmatrix} O&A^{\mathrm{T}}\\ A&O \end{bmatrix} \begin{bmatrix} \bm{v}_i\\ \bm{0} \end{bmatrix} = \begin{bmatrix} \bm{0}\\ A\bm{v}_i \end{bmatrix} = \bm{0}
[
O
A
A
T
O
]
[
v
i
0
]
=
[
0
A
v
i
]
=
0
所以
0
0
0
是特征值,对应的特征向量是
[
0
u
i
]
\begin{bmatrix} \bm{0}\\ \bm{u}_i \end{bmatrix}
[
0
u
i
]
和
[
v
i
0
]
\begin{bmatrix} \bm{v}_i\\ \bm{0} \end{bmatrix}
[
v
i
0
]
。这里有
2
(
n
−
r
)
2(n-r)
2
(
n
−
r
)
个特征值。
\\
设
U
1
=
[
u
1
⋯
u
r
]
,
U
2
=
[
u
r
+
1
⋯
u
n
]
,
V
1
=
[
v
1
⋯
v
r
]
,
V
2
=
[
v
r
+
1
⋯
v
n
]
U_1=\begin{bmatrix} \bm{u}_1&\cdots&\bm{u}_r \end{bmatrix},U_2=\begin{bmatrix} \bm{u}_{r+1}&\cdots&\bm{u}_n \end{bmatrix},V_1=\begin{bmatrix} \bm{v}_1&\cdots&\bm{v}_r \end{bmatrix},V_2=\begin{bmatrix} \bm{v}_{r+1}&\cdots&\bm{v}_n \end{bmatrix}
U
1
=
[
u
1
⋯
u
r
]
,
U
2
=
[
u
r
+
1
⋯
u
n
]
,
V
1
=
[
v
1
⋯
v
r
]
,
V
2
=
[
v
r
+
1
⋯
v
n
]
,则
U
=
[
U
1
U
2
]
,
V
=
[
V
1
V
2
]
U=\begin{bmatrix} U_1&U_2 \end{bmatrix},V=\begin{bmatrix} V_1&V_2 \end{bmatrix}
U
=
[
U
1
U
2
]
,
V
=
[
V
1
V
2
]
,所以谱分解为
[
O
A
T
A
O
]
=
[
V
1
V
1
O
V
2
U
1
−
U
1
U
2
O
]
[
σ
1
⋱
σ
r
−
σ
1
⋱
−
σ
r
0
⋱
0
]
[
V
1
T
U
1
T
V
1
T
−
U
1
T
O
U
2
T
V
2
T
O
]
\begin{bmatrix} O&A^{\mathrm{T}}\\ A&O \end{bmatrix} = \begin{bmatrix} V_1&V_1&O&V_2\\ U_1&-U_1&U_2&O \end{bmatrix} \begin{bmatrix} \sigma_1\\ &\ddots\\ &&\sigma_r\\ &&&-\sigma_1\\ &&&&\ddots\\ &&&&&-\sigma_r\\ &&&&&&0\\ &&&&&&&\ddots\\ &&&&&&&&0 \end{bmatrix} \begin{bmatrix} V_1^{\mathrm{T}}&U_1^{\mathrm{T}}\\ V_1^{\mathrm{T}}&-U_1^{\mathrm{T}}\\ O&U_2^{\mathrm{T}}\\ V_2^{\mathrm{T}}&O\\ \end{bmatrix}
[
O
A
A
T
O
]
=
[
V
1
U
1
V
1
−
U
1
O
U
2
V
2
O
]
σ
1
⋱
σ
r
−
σ
1
⋱
−
σ
r
0
⋱
0
V
1
T
V
1
T
O
V
2
T
U
1
T
−
U
1
T
U
2
T
O
6.3.4
\quad
设矩阵
A
=
[
1
0
−
1
1
]
A=\begin{bmatrix}1&0\\-1&1\end{bmatrix}
A
=
[
1
−
1
0
1
]
,考虑单位圆
C
=
{
v
∈
R
2
∣
∥
v
∥
=
1
}
C=\set{\bm{v}\in\mathbb{R}^2 \vert \Vert \bm{v} \Vert=1}
C
=
{
v
∈
R
2
∣∥
v
∥
=
1
}
及其在
A
A
A
对应的线性变换
A
\bm{A}
A
下的像
A
(
C
)
=
{
A
v
∈
R
2
∣
∥
v
∥
=
1
}
.
A(C)=\set{A\bm{v}\in\mathbb{R}^2 \vert \Vert \bm{v} \Vert=1}.\\
A
(
C
)
=
{
A
v
∈
R
2
∣∥
v
∥
=
1
}
.
1.设
w
∈
A
(
C
)
\bm{w}\in A(C)
w
∈
A
(
C
)
,证明
w
T
(
A
A
T
)
−
1
w
=
1.
\bm{w}^\mathrm{T}(AA^\mathrm{T})^{-1}\bm{w}=1.\\
w
T
(
A
A
T
)
−
1
w
=
1.
证明:
\\
设
w
=
A
v
,
∥
v
∥
=
1
\bm{w}=A\bm{v},\Vert \bm{v} \Vert = 1
w
=
A
v
,
∥
v
∥
=
1
,则
w
T
(
A
A
T
)
−
1
w
=
v
T
A
T
A
−
T
A
−
1
A
v
=
v
T
v
=
1.
\bm{w}^\mathrm{T}(AA^\mathrm{T})^{-1}\bm{w}=\bm{v}^\mathrm{T}A^{\mathrm{T}}A^{-\mathrm{T}}A^{-1}A\bm{v}=\bm{v}^{\mathrm{T}}\bm{v}=1.\\
w
T
(
A
A
T
)
−
1
w
=
v
T
A
T
A
−
T
A
−
1
A
v
=
v
T
v
=
1.
2.求
A
A
A
的奇异值分解
A
=
U
Σ
V
T
.
A=U\varSigma V^\mathrm{T}.\\
A
=
U
Σ
V
T
.
解:
\\
A
=
[
1
−
5
10
−
2
5
1
+
5
10
+
2
5
2
10
−
2
5
2
10
+
2
5
]
[
5
+
1
2
5
−
1
2
]
[
−
1
−
5
10
+
2
5
2
10
+
2
5
−
1
+
5
10
+
2
5
2
10
−
2
5
]
A=\begin{bmatrix} \cfrac{1-\sqrt{5}}{\sqrt{10-2\sqrt{5}}}&\cfrac{1+\sqrt{5}}{\sqrt{10+2\sqrt{5}}}\\ \cfrac{2}{\sqrt{10-2\sqrt{5}}}&\cfrac{2}{\sqrt{10+2\sqrt{5}}} \end{bmatrix} \begin{bmatrix} \cfrac{\sqrt{5}+1}{2}&\\ &\cfrac{\sqrt{5}-1}{2}\\ \end{bmatrix} \begin{bmatrix} \cfrac{-1-\sqrt{5}}{\sqrt{10+2\sqrt{5}}}&\cfrac{2}{\sqrt{10+2\sqrt{5}}}\\ \cfrac{-1+\sqrt{5}}{\sqrt{10+2\sqrt{5}}}&\cfrac{2}{\sqrt{10-2\sqrt{5}}}\\ \end{bmatrix}
A
=
10
−
2
5
1
−
5
10
−
2
5
2
10
+
2
5
1
+
5
10
+
2
5
2
2
5
+
1
2
5
−
1
10
+
2
5
−
1
−
5
10
+
2
5
−
1
+
5
10
+
2
5
2
10
−
2
5
2
3. 注意
V
,
U
V,U
V
,
U
为二阶正交矩阵,对应的线性变换是旋转或反射,而
Σ
\varSigma
Σ
是对角矩阵,对应伸缩变换。从几何上看,曲线
V
T
(
C
)
,
Σ
V
T
(
C
)
,
U
Σ
V
T
(
C
)
V^{\mathrm{T}}(C),\varSigma V^{\mathrm{T}}(C),U\varSigma V^{\mathrm{T}}(C)
V
T
(
C
)
,
Σ
V
T
(
C
)
,
U
Σ
V
T
(
C
)
分别是什么形状?
\\
解:
\\
圆,椭圆,椭圆。
\\
6.3.5
\quad
设矩阵
A
A
A
的奇异值分解是
A
=
U
Σ
V
T
.
A=U\varSigma V^\mathrm{T}.\\
A
=
U
Σ
V
T
.
1. 证明
A
A
T
=
U
(
Σ
Σ
T
)
U
T
,
A
T
A
=
V
(
Σ
T
Σ
)
V
T
AA^\mathrm{T}=U(\varSigma\varSigma^\mathrm{T})U^\mathrm{T},A^\mathrm{T}A=V(\varSigma^\mathrm{T}\varSigma)V^\mathrm{T}
A
A
T
=
U
(
Σ
Σ
T
)
U
T
,
A
T
A
=
V
(
Σ
T
Σ
)
V
T
分别是这两个对称矩阵的谱分解,并得到
A
A
T
AA^\mathrm{T}
A
A
T
和
A
T
A
A^TA
A
T
A
的非零特征值相同。
\\
2. 对任意
A
A
A
的奇异值
σ
≠
0
\sigma\neq0
σ
=
0
,设
v
\bm{v}
v
和
w
\bm{w}
w
分别是
A
T
A
A^\mathrm{T}A
A
T
A
和
A
A
T
AA^\mathrm{T}
A
A
T
的属于
σ
2
\sigma^2
σ
2
的特征向量,证明
A
v
A\bm{v}
A
v
和
A
T
w
A^\mathrm{T}\bm{w}
A
T
w
分别是
A
A
T
AA^\mathrm{T}
A
A
T
和
A
T
A
A^\mathrm{T}A
A
T
A
的属于
σ
2
\sigma^2
σ
2
的特征向量.
\\
1.
\\
略。
\\
2. 证明:
\\
A
T
A
v
=
σ
2
v
⟹
A
A
T
A
v
=
σ
2
A
v
⟹
A
v
A^{\mathrm{T}}A\bm{v}=\sigma^2\bm{v} \implies AA^{\mathrm{T}}A\bm{v}=\sigma^2A\bm{v} \implies A\bm{v}
A
T
A
v
=
σ
2
v
⟹
A
A
T
A
v
=
σ
2
A
v
⟹
A
v
是
A
A
T
AA^{\mathrm{T}}
A
A
T
属于
σ
2
\sigma^2
σ
2
的特征向量。
\\
A
A
T
w
=
σ
2
w
⟹
A
T
A
A
T
w
=
σ
2
A
T
w
⟹
A
T
w
AA^{\mathrm{T}}\bm{w}=\sigma^2\bm{w} \implies A^{\mathrm{T}}AA^{\mathrm{T}}\bm{w}=\sigma^2A^{\mathrm{T}}\bm{w} \implies A^{\mathrm{T}}\bm{w}
A
A
T
w
=
σ
2
w
⟹
A
T
A
A
T
w
=
σ
2
A
T
w
⟹
A
T
w
是
A
T
A
A^{\mathrm{T}}A
A
T
A
属于
σ
2
\sigma^2
σ
2
的特征向量。
\\
6.3.6 (极分解)
\quad
对
n
n
n
阶方阵
A
A
A
,存在正交矩阵
Q
Q
Q
和对称半正定矩阵
S
S
S
,使得
A
=
Q
S
.
A=QS.\\
A
=
QS
.
分解式
A
=
Q
S
A=QS
A
=
QS
称为
A
A
A
的极分解. 容易看到,
A
=
S
1
Q
1
A=S_1Q_1
A
=
S
1
Q
1
,即方阵分解为对称半正定矩阵和正交 矩阵的乘积,也存在。
\\
证明:
\\
A
=
U
Σ
V
T
=
U
V
T
V
Σ
V
T
A=U\varSigma V^{\mathrm{T}}=UV^{\mathrm{T}}V \varSigma V^{\mathrm{T}}
A
=
U
Σ
V
T
=
U
V
T
V
Σ
V
T
。显然
Q
=
U
V
T
Q=UV^{\mathrm{T}}
Q
=
U
V
T
是正交矩阵,
S
=
V
Σ
V
T
S=V \varSigma V^{\mathrm{T}}
S
=
V
Σ
V
T
是半正定的对称矩阵,所以
A
A
A
存在极分解
A
=
Q
S
A=QS
A
=
QS
。同理,
A
=
U
Σ
V
T
=
U
Σ
U
T
U
V
T
A=U\varSigma V^{\mathrm{T}}=U \varSigma U^{\mathrm{T}}UV^{\mathrm{T}}
A
=
U
Σ
V
T
=
U
Σ
U
T
U
V
T
,其中
S
1
=
U
Σ
U
T
S_1=U \varSigma U^{\mathrm{T}}
S
1
=
U
Σ
U
T
是对称半正定矩阵,
Q
1
=
U
V
T
Q_1=UV^{\mathrm{T}}
Q
1
=
U
V
T
是正交矩阵,所以分解
A
=
S
1
Q
1
A=S_1Q_1
A
=
S
1
Q
1
也存在。
\\
6.3.7
\quad
证明矩阵的广义逆唯一。
\\
证明:
\\
由广义逆的性质,有
A
A
+
A
=
A
,
A
+
A
A
+
=
A
+
AA^{+}A=A,A^{+}AA^{+}=A^{+}
A
A
+
A
=
A
,
A
+
A
A
+
=
A
+
,
A
A
+
AA^{+}
A
A
+
和
A
+
A
A^{+}A
A
+
A
均为对称矩阵。设
A
A
A
有两个不同的广义逆
X
1
,
X
2
X_1,X_2
X
1
,
X
2
,则
X
1
=
X
1
A
X
1
=
X
1
A
X
2
A
X
1
=
(
X
1
A
)
T
(
X
2
A
)
T
X
1
=
(
X
2
A
X
1
A
)
T
X
1
=
(
X
2
A
)
T
X
1
=
X
2
A
X
1
=
X
2
A
X
2
A
X
1
=
X
2
(
A
X
2
)
T
(
A
X
1
)
T
=
X
2
(
A
X
1
A
X
2
)
T
=
X
2
(
A
X
2
)
T
=
X
2
A
X
2
=
X
2
\begin{align*} X_1&=X_1AX_1=X_1AX_2AX_1=(X_1A)^{\mathrm{T}}(X_2A)^{\mathrm{T}}X_1=(X_2AX_1A)^{\mathrm{T}}X_1=(X_2A)^{\mathrm{T}}X_1\\ &=X_2AX_1=X_2AX_2AX_1=X_2(AX_2)^{\mathrm{T}}(AX_1)^{\mathrm{T}}=X_2(AX_1AX_2)^{\mathrm{T}}=X_2(AX_2)^{\mathrm{T}}\\ &=X_2AX_2\\ &=X_2 \end{align*}
X
1
=
X
1
A
X
1
=
X
1
A
X
2
A
X
1
=
(
X
1
A
)
T
(
X
2
A
)
T
X
1
=
(
X
2
A
X
1
A
)
T
X
1
=
(
X
2
A
)
T
X
1
=
X
2
A
X
1
=
X
2
A
X
2
A
X
1
=
X
2
(
A
X
2
)
T
(
A
X
1
)
T
=
X
2
(
A
X
1
A
X
2
)
T
=
X
2
(
A
X
2
)
T
=
X
2
A
X
2
=
X
2
即
X
1
=
X
2
X_1=X_2
X
1
=
X
2
,于是矩阵的广义逆唯一。
\\
6.3.8 (谱范数的性质)
\quad
证明命题 6.3.7. 矩阵的谱范数满足:
\\
1.
∥
A
∥
≥
0
\Vert A \Vert \geq 0
∥
A
∥
≥
0
,且
∥
A
∥
=
0
\Vert A \Vert=0
∥
A
∥
=
0
当且仅当
A
=
O
;
A=O;\\
A
=
O
;
2.
∥
k
A
∥
=
∣
k
∣
∥
A
∥
;
\Vert kA \Vert = \vert k \vert \Vert A \Vert ;\\
∥
k
A
∥
=
∣
k
∣∥
A
∥
;
3.
∥
A
+
B
∥
≤
∥
A
∥
+
∥
B
∥
;
\Vert A+B \Vert \leq \Vert A \Vert + \Vert B \Vert ;\\
∥
A
+
B
∥
≤
∥
A
∥
+
∥
B
∥
;
4.
∥
A
B
∥
≤
∥
A
∥
∥
B
∥
;
\Vert AB \Vert \leq \Vert A \Vert \Vert B \Vert ;\\
∥
A
B
∥
≤
∥
A
∥∥
B
∥
;
5. 如果
U
,
V
U,V
U
,
V
是正交矩阵,则
∥
U
A
V
T
∥
=
∥
A
∥
.
\Vert UAV^{\mathrm{T}} \Vert = \Vert A \Vert.
∥
U
A
V
T
∥
=
∥
A
∥.
\\
1.证明:
\\
∥
A
∥
=
max
x
≠
0
∥
A
x
∥
∥
x
∥
≥
0
\Vert A \Vert = \displaystyle\max_{\bm{x}\neq\bm{0}} \cfrac{\Vert A\bm{x} \Vert}{\Vert \bm{x} \Vert} \geq 0
∥
A
∥
=
x
=
0
max
∥
x
∥
∥
A
x
∥
≥
0
,而
∥
A
∥
=
0
⟺
∀
x
≠
0
,
∥
A
x
∥
=
0
⟺
N
(
A
)
=
R
n
⟺
A
=
O
.
\Vert A \Vert = 0 \iff \forall\bm{x}\neq 0,\quad \Vert A\bm{x} \Vert=0 \iff \mathcal{N}(A)=\mathbb{R}^n \iff A=O. \\
∥
A
∥
=
0
⟺
∀
x
=
0
,
∥
A
x
∥
=
0
⟺
N
(
A
)
=
R
n
⟺
A
=
O
.
2.证明:
\\
∥
k
A
∥
=
max
x
≠
0
∥
k
A
x
∥
∥
x
∥
=
∣
k
∣
∥
A
∥
.
\Vert kA \Vert = \displaystyle\max_{\bm{x}\neq\bm{0}} \cfrac{\Vert kA\bm{x} \Vert}{\Vert \bm{x} \Vert} = \vert k \vert \Vert A \Vert. \\
∥
k
A
∥
=
x
=
0
max
∥
x
∥
∥
k
A
x
∥
=
∣
k
∣∥
A
∥.
3.证明:
\\
由谱范数定义,
∥
A
+
B
∥
=
max
x
≠
0
∥
A
x
+
B
x
∥
∥
x
∥
≤
max
x
≠
0
∥
A
x
∥
+
∥
B
x
∥
∥
x
∥
(由三角不等式)
≤
max
x
≠
0
∥
A
x
∥
x
+
max
x
≠
0
∥
B
x
∥
x
(因为最大值不一定同时取到)
=
∥
A
∥
+
∥
B
∥
.
\begin{align*} \Vert A+B \Vert&=\displaystyle\max_{\bm{x}\neq\bm{0}} \cfrac{\Vert A\bm{x}+B\bm{x} \Vert}{\Vert \bm{x} \Vert}\\ &\leq \displaystyle\max_{\bm{x}\neq\bm{0}} \cfrac{\Vert A\bm{x} \Vert+\Vert B\bm{x} \Vert}{\Vert \bm{x} \Vert}\quad\text{(由三角不等式)}\\ &\leq \displaystyle\max_{\bm{x}\neq\bm{0}} \cfrac{\Vert A\bm{x} \Vert}{\bm{x}}+\displaystyle\max_{\bm{x}\neq\bm{0}} \cfrac{\Vert B\bm{x} \Vert}{\bm{x}}\quad\text{(因为最大值不一定同时取到)}\\ &=\Vert A \Vert+\Vert B \Vert. \end{align*}
∥
A
+
B
∥
=
x
=
0
max
∥
x
∥
∥
A
x
+
B
x
∥
≤
x
=
0
max
∥
x
∥
∥
A
x
∥
+
∥
B
x
∥
(
由三角不等式
)
≤
x
=
0
max
x
∥
A
x
∥
+
x
=
0
max
x
∥
B
x
∥
(
因为最大值不一定同时取到
)
=
∥
A
∥
+
∥
B
∥.
即
∥
A
+
B
∥
≤
∥
A
∥
+
∥
B
∥
.
\Vert A+B \Vert \leq \Vert A \Vert+\Vert B \Vert.\\
∥
A
+
B
∥
≤
∥
A
∥
+
∥
B
∥.
4.证明:
\\
由谱范数定义,
∥
A
B
∥
=
max
x
≠
0
∥
A
B
x
∥
∥
x
∥
=
max
B
x
≠
0
∥
A
B
x
∥
∥
B
x
∥
∥
B
x
∥
∥
x
∥
≤
max
y
∈
R
(
B
)
,
y
≠
0
∥
A
y
∥
∥
y
∥
max
x
≠
0
∥
B
x
∥
∥
x
∥
(因为最大值不一定同时取到)
≤
max
y
≠
0
∥
A
y
∥
∥
y
∥
max
x
≠
0
∥
B
x
∥
∥
x
∥
(因为
y
的取值范围变大了)
=
∥
A
∥
∥
B
∥
.
\begin{align*} \Vert AB \Vert &= \displaystyle\max_{\bm{x}\neq\bm{0}} \cfrac{\Vert AB\bm{x} \Vert}{\Vert \bm{x} \Vert}\\ &=\displaystyle\max_{B\bm{x}\neq\bm{0}} \cfrac{\Vert AB\bm{x} \Vert}{\Vert B\bm{x} \Vert}\cfrac{\Vert B\bm{x} \Vert}{\Vert \bm{x} \Vert}\\ &\leq \displaystyle\max_{\bm{y}\in\mathcal{R}(B),\bm{y}\neq\bm{0}} \cfrac{\Vert A\bm{y} \Vert}{\Vert \bm{y} \Vert}\displaystyle\max_{\bm{x}\neq\bm{0}} \cfrac{\Vert B\bm{x} \Vert}{\Vert \bm{x} \Vert}\quad\text{(因为最大值不一定同时取到)}\\ &\leq \displaystyle\max_{\bm{y}\neq\bm{0}} \cfrac{\Vert A\bm{y} \Vert}{\Vert \bm{y} \Vert}\displaystyle\max_{\bm{x}\neq\bm{0}} \cfrac{\Vert B\bm{x} \Vert}{\Vert \bm{x} \Vert}\quad\text{(因为$\bm{y}$的取值范围变大了)}\\ &=\Vert A \Vert \Vert B \Vert. \end{align*}
∥
A
B
∥
=
x
=
0
max
∥
x
∥
∥
A
B
x
∥
=
B
x
=
0
max
∥
B
x
∥
∥
A
B
x
∥
∥
x
∥
∥
B
x
∥
≤
y
∈
R
(
B
)
,
y
=
0
max
∥
y
∥
∥
A
y
∥
x
=
0
max
∥
x
∥
∥
B
x
∥
(
因为最大值不一定同时取到
)
≤
y
=
0
max
∥
y
∥
∥
A
y
∥
x
=
0
max
∥
x
∥
∥
B
x
∥
(
因为
y
的取值范围变大了
)
=
∥
A
∥∥
B
∥.
即
∥
A
B
∥
≤
∥
A
∥
∥
B
∥
.
\Vert AB \Vert \leq \Vert A \Vert \Vert B \Vert.\\
∥
A
B
∥
≤
∥
A
∥∥
B
∥.
5. 证明:
\\
正交矩阵不改变向量的长度,于是
∥
U
A
∥
=
max
x
≠
0
∥
U
A
x
∥
∥
x
∥
=
max
x
≠
0
∥
A
x
∥
∥
x
∥
=
∥
A
∥
\Vert UA \Vert = \max_{\bm{x}\neq\bm{0}} \cfrac{\Vert UA\bm{x} \Vert}{\Vert \bm{x} \Vert} = \max_{\bm{x}\neq\bm{0}} \cfrac{\Vert A\bm{x} \Vert}{\Vert \bm{x} \Vert} = \Vert A \Vert
∥
U
A
∥
=
x
=
0
max
∥
x
∥
∥
U
A
x
∥
=
x
=
0
max
∥
x
∥
∥
A
x
∥
=
∥
A
∥
∥
A
V
T
∥
=
max
x
≠
0
∥
A
V
T
x
∥
∥
x
∥
=
max
x
≠
0
∥
A
V
T
x
∥
∥
V
T
x
∥
=
max
y
≠
0
∥
A
y
∥
∥
y
∥
=
∥
A
∥
\Vert AV^{\mathrm{T}} \Vert = \max_{\bm{x}\neq\bm{0}} \cfrac{\Vert AV^{\mathrm{T}}\bm{x} \Vert}{\Vert \bm{x} \Vert} = \max_{\bm{x}\neq\bm{0}} \cfrac{\Vert AV^{\mathrm{T}}\bm{x} \Vert}{\Vert V^{\mathrm{T}}\bm{x} \Vert} = \max_{\bm{y}\neq\bm{0}} \cfrac{\Vert A\bm{y} \Vert}{\Vert \bm{y} \Vert} = \Vert A \Vert
∥
A
V
T
∥
=
x
=
0
max
∥
x
∥
∥
A
V
T
x
∥
=
x
=
0
max
∥
V
T
x
∥
∥
A
V
T
x
∥
=
y
=
0
max
∥
y
∥
∥
A
y
∥
=
∥
A
∥
所以
∥
U
A
V
T
∥
=
∥
A
V
T
∥
=
∥
U
A
∥
=
∥
A
∥
.
\Vert UAV^{\mathrm{T}} \Vert=\Vert AV^{\mathrm{T}} \Vert=\Vert UA \Vert=\Vert A \Vert.\\
∥
U
A
V
T
∥
=
∥
A
V
T
∥
=
∥
U
A
∥
=
∥
A
∥.
6.3.9
\quad
证明矩阵任意特征值的绝对值不大于其最大的奇异值。
\\
证明:
\\
σ
m
a
x
=
max
x
≠
0
∥
A
x
∥
∥
x
∥
\sigma_{max}=\displaystyle\max_{\bm{x}\neq\bm{0}}\cfrac{\Vert A\bm{x} \Vert}{\Vert \bm{x} \Vert}
σ
ma
x
=
x
=
0
max
∥
x
∥
∥
A
x
∥
,而对任意特征值
λ
\lambda
λ
都有
∥
A
v
∥
=
∥
λ
v
∥
=
∣
λ
∣
∥
v
∥
⟹
∣
λ
∣
=
∥
A
v
∥
∥
v
∥
\Vert A\bm{v} \Vert=\Vert \lambda\bm{v} \Vert=\vert\lambda\vert\Vert \bm{v} \Vert \implies \vert\lambda\vert=\cfrac{\Vert A\bm{v} \Vert}{\Vert \bm{v} \Vert}
∥
A
v
∥
=
∥
λ
v
∥
=
∣
λ
∣∥
v
∥
⟹
∣
λ
∣
=
∥
v
∥
∥
A
v
∥
,其中
v
\bm{v}
v
是是对应的特征向量。即
∣
λ
∣
≤
σ
m
a
x
.
\vert\lambda\vert \leq \sigma_{max}.\\
∣
λ
∣
≤
σ
ma
x
.
6.3.10
\quad
证明或者举出反例。
\\
1.
n
n
n
阶方阵
A
A
A
为正交矩阵当且仅当它的
n
n
n
个奇异值都是1。
\\
2.
n
n
n
阶方阵的
n
n
n
个奇异值的乘积等于所有特征值的乘积。
\\
3.设
n
n
n
阶方阵
A
A
A
和
A
+
I
n
A+I_n
A
+
I
n
的奇异值分解分别为
A
=
U
Σ
V
T
,
A
+
I
n
=
U
(
Σ
+
I
n
)
V
T
.
A=U\varSigma V^{\mathrm{T}},A+I_n=U(\varSigma+I_n)V^{\mathrm{T}}.
A
=
U
Σ
V
T
,
A
+
I
n
=
U
(
Σ
+
I
n
)
V
T
.
证明
A
A
A
是对称矩阵。
\\
4. 如果
n
n
n
阶方阵
A
A
A
的
n
n
n
个奇异值就是它的
n
n
n
个特征值,则
A
A
A
是对称矩阵。
\\
1.证明:
\\
A
A
A
为正交矩阵
⟺
A
T
A
=
A
A
T
=
I
n
⟺
n
\iff A^{\mathrm{T}}A=AA^{\mathrm{T}}=I_n\iff n
⟺
A
T
A
=
A
A
T
=
I
n
⟺
n
个奇异值都是1。
\\
2.
\\
显然不正确,奇异值乘积不小于0,而特征值的乘积可以为任意实数。
\\
3.证明:
\\
A
+
I
n
=
U
(
Σ
+
I
n
)
V
T
=
U
Σ
V
T
+
U
V
T
⟹
U
V
T
=
I
n
⟹
U
=
V
⟹
A
=
U
Σ
U
T
⟹
A
A+I_n=U(\varSigma+I_n)V^{\mathrm{T}}=U\varSigma V^{\mathrm{T}}+UV^{\mathrm{T}} \implies UV^{\mathrm{T}}=I_n \implies U=V \implies A=U\varSigma U^{\mathrm{T}} \implies A
A
+
I
n
=
U
(
Σ
+
I
n
)
V
T
=
U
Σ
V
T
+
U
V
T
⟹
U
V
T
=
I
n
⟹
U
=
V
⟹
A
=
U
Σ
U
T
⟹
A
是对称矩阵。
\\
4.证明:
\\
要证
A
A
A
是对称矩阵,只需要证
A
−
A
T
=
O
A-A^{\mathrm{T}}=O
A
−
A
T
=
O
。由Frobenius范数性质可得
trace
(
A
T
A
)
=
O
⟺
A
=
O
.
\operatorname{trace}(A^{\mathrm{T}}A)=O \iff A=O.
trace
(
A
T
A
)
=
O
⟺
A
=
O
.
所以
trace
(
(
A
−
A
T
)
T
(
A
−
A
T
)
)
=
trace
(
A
A
T
+
A
T
A
−
A
2
−
(
A
T
)
2
)
=
trace
(
A
A
T
)
+
trace
(
A
T
A
)
−
trace
(
A
2
)
−
trace
(
(
A
T
)
2
)
=
2
trace
(
A
T
A
)
−
2
trace
(
A
2
)
=
2
∑
i
=
1
n
σ
i
2
−
2
∑
i
=
1
n
λ
i
2
=
0
\begin{align*} \operatorname{trace}((A-A^{\mathrm{T}})^{\mathrm{T}}(A-A^{\mathrm{T}}))&=\operatorname{trace}(AA^{\mathrm{T}}+A^{\mathrm{T}}A-A^2-(A^{\mathrm{T}})^2)\\ &= \operatorname{trace}(AA^{\mathrm{T}})+\operatorname{trace}(A^{\mathrm{T}}A)-\operatorname{trace}(A^2)-\operatorname{trace}((A^{\mathrm{T}})^2)\\ &=2\operatorname{trace}(A^{\mathrm{T}}A)-2\operatorname{trace}(A^2)\\ &=2\sum_{i=1}^n \sigma_i^2 - 2\sum_{i=1}^n \lambda_i^2\\ &=0 \end{align*}
trace
((
A
−
A
T
)
T
(
A
−
A
T
))
=
trace
(
A
A
T
+
A
T
A
−
A
2
−
(
A
T
)
2
)
=
trace
(
A
A
T
)
+
trace
(
A
T
A
)
−
trace
(
A
2
)
−
trace
((
A
T
)
2
)
=
2
trace
(
A
T
A
)
−
2
trace
(
A
2
)
=
2
i
=
1
∑
n
σ
i
2
−
2
i
=
1
∑
n
λ
i
2
=
0
于是
A
−
A
T
=
O
⟹
A
A-A^{\mathrm{T}}=O \implies A
A
−
A
T
=
O
⟹
A
是对称矩阵。
\\
6.3.11 (Frobenius范数的性质)
\quad
证明命题 6.3.15.
\quad
矩阵的Frobenius范数满足:
\\
1.
∥
A
∥
F
≥
0
\Vert A \Vert_{\mathrm{F}} \geq 0
∥
A
∥
F
≥
0
,且
∥
A
∥
F
=
0
\Vert A \Vert_{\mathrm{F}}=0
∥
A
∥
F
=
0
当且仅当
A
=
O
;
A=O;\\
A
=
O
;
2.
∥
k
A
∥
F
=
∣
k
∣
∥
A
∥
F
;
\Vert kA \Vert_{\mathrm{F}} = \vert k \vert \Vert A \Vert_{\mathrm{F}} ;\\
∥
k
A
∥
F
=
∣
k
∣∥
A
∥
F
;
3.
∥
A
+
B
∥
F
≤
∥
A
∥
F
+
∥
B
∥
F
;
\Vert A+B \Vert_{\mathrm{F}} \leq \Vert A \Vert_{\mathrm{F}} + \Vert B \Vert_{\mathrm{F}} ;\\
∥
A
+
B
∥
F
≤
∥
A
∥
F
+
∥
B
∥
F
;
4.
∥
A
B
∥
F
≤
∥
A
∥
∥
B
∥
F
,
∥
A
B
∥
F
≤
∥
A
∥
F
∥
B
∥
;
\Vert AB \Vert_{\mathrm{F}} \leq \Vert A \Vert \Vert B \Vert_{\mathrm{F}},\Vert AB \Vert_{\mathrm{F}} \leq \Vert A \Vert_{\mathrm{F}} \Vert B \Vert ;\\
∥
A
B
∥
F
≤
∥
A
∥∥
B
∥
F
,
∥
A
B
∥
F
≤
∥
A
∥
F
∥
B
∥
;
5. 如果
U
,
V
U,V
U
,
V
是正交矩阵,则
∥
U
A
V
T
∥
F
=
∥
A
∥
F
.
\Vert UAV^{\mathrm{T}} \Vert_{\mathrm{F}} = \Vert A \Vert_{\mathrm{F}}.\\
∥
U
A
V
T
∥
F
=
∥
A
∥
F
.
1. 证明:
\\
由Frobenius范数定义
∥
A
∥
F
=
trace
(
A
T
A
)
=
∑
i
=
1
m
∑
j
=
1
n
∣
a
i
j
∣
2
\Vert A \Vert_{\mathrm{F}}=\sqrt{\operatorname{trace}(A^{\mathrm{T}}A)}=\sqrt{\displaystyle\sum_{i=1}^{m}\displaystyle\sum_{j=1}^{n}\vert a_{ij} \vert^2}
∥
A
∥
F
=
trace
(
A
T
A
)
=
i
=
1
∑
m
j
=
1
∑
n
∣
a
ij
∣
2
立得。
\\
2. 证明:
\\
∥
k
A
∥
F
=
k
2
trace
(
A
T
A
)
=
∣
k
∣
trace
(
A
T
A
)
=
∣
k
∣
∥
A
∥
F
.
\Vert kA \Vert_{\mathrm{F}}=\sqrt{k^2\operatorname{trace}(A^{\mathrm{T}}A)}=\vert k \vert \sqrt{\operatorname{trace}(A^{\mathrm{T}}A)}=\vert k \vert \Vert A \Vert_{\mathrm{F}}.\\
∥
k
A
∥
F
=
k
2
trace
(
A
T
A
)
=
∣
k
∣
trace
(
A
T
A
)
=
∣
k
∣∥
A
∥
F
.
3. 证明:
\\
由三角不等式,
∥
A
+
B
∥
F
=
∑
i
=
1
m
∑
j
=
1
n
∣
a
i
j
+
b
i
j
∣
2
≤
∑
i
=
1
m
∣
a
i
j
∣
2
+
∑
j
=
1
n
∣
b
i
j
∣
2
=
∥
A
∥
F
+
∥
B
∥
F
.
\Vert A+B \Vert_{\mathrm{F}}=\sqrt{\displaystyle\sum_{i=1}^{m}\displaystyle\sum_{j=1}^{n}\vert a_{ij}+b_{ij} \vert^2} \leq \sqrt{\displaystyle\sum_{i=1}^{m}\vert a_{ij} \vert^2}+\sqrt{\displaystyle\sum_{j=1}^{n}\vert b_{ij} \vert^2}=\Vert A \Vert_{\mathrm{F}} + \Vert B \Vert_{\mathrm{F}}.\\
∥
A
+
B
∥
F
=
i
=
1
∑
m
j
=
1
∑
n
∣
a
ij
+
b
ij
∣
2
≤
i
=
1
∑
m
∣
a
ij
∣
2
+
j
=
1
∑
n
∣
b
ij
∣
2
=
∥
A
∥
F
+
∥
B
∥
F
.
4. 证明:
\\
∥
A
∥
=
max
∥
A
x
∥
∥
x
∥
⟹
∥
A
∥
≥
∥
A
x
∥
∥
x
∥
⟹
∥
A
x
∥
≤
∥
A
∥
∥
x
∥
.
\Vert A \Vert = \max\cfrac{\Vert A\bm{x} \Vert}{\Vert \bm{x} \Vert} \implies \Vert A \Vert \geq \cfrac{\Vert A\bm{x} \Vert}{\Vert \bm{x} \Vert} \implies \Vert A\bm{x} \Vert \leq \Vert A \Vert \Vert \bm{x} \Vert.\\
∥
A
∥
=
max
∥
x
∥
∥
A
x
∥
⟹
∥
A
∥
≥
∥
x
∥
∥
A
x
∥
⟹
∥
A
x
∥
≤
∥
A
∥∥
x
∥.
设
B
=
[
b
1
b
2
⋯
b
n
]
B=\begin{bmatrix} \bm{b}_1&\bm{b}_2&\cdots&\bm{b}_n \end{bmatrix}
B
=
[
b
1
b
2
⋯
b
n
]
,则
A
B
=
[
A
b
1
A
b
2
⋯
A
b
n
]
AB=\begin{bmatrix} A\bm{b}_1&A\bm{b}_2&\cdots&A\bm{b}_n \end{bmatrix}
A
B
=
[
A
b
1
A
b
2
⋯
A
b
n
]
,由Frobenius范数定义有
∥
A
B
∥
F
2
=
∑
i
=
1
n
∥
A
b
i
∥
2
≤
∥
A
∥
2
∑
i
=
1
n
∥
b
i
∥
2
=
∥
A
∥
2
∥
B
∥
F
2
.
\Vert AB \Vert_{\mathrm{F}}^2=\displaystyle\sum_{i=1}^n \Vert A\bm{b}_i \Vert^2 \leq \Vert A \Vert^2 \displaystyle\sum_{i=1}^n \Vert \bm{b}_i \Vert^2 = \Vert A \Vert^2 \Vert B \Vert_{\mathrm{F}}^2.
∥
A
B
∥
F
2
=
i
=
1
∑
n
∥
A
b
i
∥
2
≤
∥
A
∥
2
i
=
1
∑
n
∥
b
i
∥
2
=
∥
A
∥
2
∥
B
∥
F
2
.
即
∥
A
B
∥
F
≤
∥
A
∥
∥
B
∥
F
.
\Vert AB \Vert_{\mathrm{F}} \leq \Vert A \Vert \Vert B \Vert_{\mathrm{F}}.\\
∥
A
B
∥
F
≤
∥
A
∥∥
B
∥
F
.
同理,设
A
=
[
a
1
T
a
2
T
⋮
a
n
T
]
A=\begin{bmatrix} \bm{a}_1^{\mathrm{T}}\\ \bm{a}_2^{\mathrm{T}}\\ \vdots\\ \bm{a}_n^{\mathrm{T}} \end{bmatrix}
A
=
a
1
T
a
2
T
⋮
a
n
T
可得
∥
A
B
∥
F
≤
∥
A
∥
F
∥
B
∥
.
\Vert AB \Vert_{\mathrm{F}} \leq \Vert A \Vert_{\mathrm{F}} \Vert B \Vert.\\
∥
A
B
∥
F
≤
∥
A
∥
F
∥
B
∥.
5. 证明:
\\
∥
U
A
∥
F
=
trace
(
A
T
U
T
U
A
)
=
trace
(
A
T
A
)
=
∥
A
∥
F
\Vert UA \Vert_{\mathrm{F}} = \sqrt{\operatorname{trace}(A^{\mathrm{T}}U^{\mathrm{T}}UA)}=\sqrt{\operatorname{trace}(A^{\mathrm{T}}A)}=\Vert A \Vert_{\mathrm{F}}
∥
U
A
∥
F
=
trace
(
A
T
U
T
U
A
)
=
trace
(
A
T
A
)
=
∥
A
∥
F
∥
A
V
T
∥
F
=
trace
(
V
A
T
A
V
T
)
=
trace
(
V
T
V
A
T
A
)
=
trace
(
A
T
A
)
=
∥
A
∥
F
\Vert AV^{\mathrm{T}} \Vert_{\mathrm{F}} = \sqrt{\operatorname{trace}(VA^{\mathrm{T}}AV^{\mathrm{T}})}=\sqrt{\operatorname{trace}(V^{\mathrm{T}}VA^{\mathrm{T}}A)}=\sqrt{\operatorname{trace}(A^{\mathrm{T}}A)}=\Vert A \Vert_{\mathrm{F}}
∥
A
V
T
∥
F
=
trace
(
V
A
T
A
V
T
)
=
trace
(
V
T
V
A
T
A
)
=
trace
(
A
T
A
)
=
∥
A
∥
F
于是
∥
U
A
V
T
∥
F
=
∥
U
A
∥
F
=
∥
A
∥
F
.
\Vert UAV^{\mathrm{T}} \Vert_{\mathrm{F}}=\Vert UA \Vert_{\mathrm{F}}=\Vert A \Vert_{\mathrm{F}}.\\
∥
U
A
V
T
∥
F
=
∥
U
A
∥
F
=
∥
A
∥
F
.
6.3.12
\quad
证明命题 6.3.16.
\quad
对任意矩阵
A
A
A
,其Frobenius范数平方
∥
A
∥
F
2
\Vert A \Vert_{\mathrm{F}}^2
∥
A
∥
F
2
等于
A
A
A
所有奇异值的平方和。因此
∥
A
∥
F
≥
∥
A
∥
\Vert A \Vert_{\mathrm{F}} \geq \Vert A \Vert
∥
A
∥
F
≥
∥
A
∥
\\
证明:
\\
∥
A
∥
F
2
=
trace
(
A
T
A
)
=
∑
i
=
1
n
σ
i
2
≥
σ
max
2
=
∥
A
∥
2
.
\Vert A \Vert_{\mathrm{F}}^2=\operatorname{trace}(A^{\mathrm{T}}A)=\sum_{i=1}^n \sigma_i^2 \geq \sigma_{\max}^2=\Vert A \Vert^2.
∥
A
∥
F
2
=
trace
(
A
T
A
)
=
i
=
1
∑
n
σ
i
2
≥
σ
m
a
x
2
=
∥
A
∥
2
.
6.3.13 (樊畿迹定理)
\quad
对任意
n
n
n
阶对称矩阵
A
∈
R
n
×
n
A\in\mathbb{R}^{n\times n}
A
∈
R
n
×
n
,假设特征值为
λ
1
⩾
λ
2
⩾
.
.
.
⩾
λ
n
\lambda_1\geqslant\lambda_2\geqslant...\geqslant\lambda_n
λ
1
⩾
λ
2
⩾
...
⩾
λ
n
,对应特征向量为
u
1
,
u
2
,
⋯
,
u
n
\bm{u}_1,\bm{u}_2,\cdots,\bm{u}_n
u
1
,
u
2
,
⋯
,
u
n
,则
max
n
×
m
矩阵
Q
:
Q
T
Q
=
I
trace
(
Q
T
A
Q
)
=
∑
i
=
1
m
λ
i
{\displaystyle\max_{\substack{n\times m \text{矩阵} Q:\\Q^{\mathrm{T}}Q=I}}}\operatorname{trace}(Q^{\mathrm{T}}AQ)=\displaystyle\sum_{i=1}^{m}\lambda_i
n
×
m
矩阵
Q
:
Q
T
Q
=
I
max
trace
(
Q
T
A
Q
)
=
i
=
1
∑
m
λ
i
,且
Q
=
[
u
1
u
2
⋯
u
m
]
Q=\begin{bmatrix}\bm{u}_1&\bm{u}_2&\cdots&\bm{u}_m\end{bmatrix}
Q
=
[
u
1
u
2
⋯
u
m
]
时取得最大值。
\\
证明:
\\
设
Q
=
[
q
1
q
2
⋯
q
m
]
Q=\begin{bmatrix} \bm{q}_1&\bm{q}_2&\cdots&\bm{q}_m \end{bmatrix}
Q
=
[
q
1
q
2
⋯
q
m
]
,且列向量两两正交。则
trace
(
Q
T
A
Q
)
=
q
1
T
A
q
1
+
q
2
T
A
q
2
+
⋯
+
q
m
T
A
q
m
.
\operatorname{trace}(Q^{\mathrm{T}}AQ)=\bm{q}_1^{\mathrm{T}}A\bm{q}_1+\bm{q}_2^{\mathrm{T}}A\bm{q}_2+\cdots+\bm{q}_m^{\mathrm{T}}A\bm{q}_m.
trace
(
Q
T
A
Q
)
=
q
1
T
A
q
1
+
q
2
T
A
q
2
+
⋯
+
q
m
T
A
q
m
.
\\
注意到
q
i
T
A
q
i
=
q
i
T
A
q
i
q
i
T
q
i
\bm{q}_i^{\mathrm{T}}A\bm{q}_i=\cfrac{\bm{q}_i^{\mathrm{T}}A\bm{q}_i}{\bm{q}_i^{\mathrm{T}}\bm{q}_i}
q
i
T
A
q
i
=
q
i
T
q
i
q
i
T
A
q
i
是
q
i
\bm{q}_i
q
i
关于
A
A
A
的Rayleigh商。于是
max
q
i
T
q
i
=
1
q
i
T
A
q
i
=
λ
1
,
max
k
≤
i
≤
m
q
i
T
A
q
i
=
λ
k
\displaystyle\max_{\bm{q}_i^{\mathrm{T}}\bm{q}_i=1} \bm{q}_i^{\mathrm{T}}A\bm{q}_i=\lambda_1, \displaystyle\max_{k\leq i \leq m} \bm{q}_i^{\mathrm{T}}A\bm{q}_i=\lambda_k
q
i
T
q
i
=
1
max
q
i
T
A
q
i
=
λ
1
,
k
≤
i
≤
m
max
q
i
T
A
q
i
=
λ
k
,当且仅当
q
i
\bm{q}_i
q
i
为对应特征值时取得最大值。
\\
所以
max
n
×
m
矩阵
Q
:
Q
T
Q
=
I
trace
(
Q
T
A
Q
)
=
∑
i
=
1
m
λ
i
{\displaystyle\max_{\substack{n\times m \text{矩阵} Q:\\Q^{\mathrm{T}}Q=I}}}\operatorname{trace}(Q^{\mathrm{T}}AQ)=\displaystyle\sum_{i=1}^{m}\lambda_i
n
×
m
矩阵
Q
:
Q
T
Q
=
I
max
trace
(
Q
T
A
Q
)
=
i
=
1
∑
m
λ
i
,当且仅当
Q
=
[
u
1
u
2
⋯
u
m
]
Q=\begin{bmatrix}\bm{u}_1&\bm{u}_2&\cdots&\bm{u}_m\end{bmatrix}
Q
=
[
u
1
u
2
⋯
u
m
]
时取得最大值。
\\
6.3.14
\\
略。唯一需要注意的是不要浪费时间去算第四问
B
B
B
的最佳秩1逼近。这里直接给出
B
B
B
的奇异值分解,供读者观摩。
\\
显然
B
=
[
−
2
−
1
1
2
−
9
−
1
−
1
11
]
B=\begin{bmatrix} -2 & -1 & 1 & 2 \\ -9 & -1 & -1 & 11 \end{bmatrix}
B
=
[
−
2
−
9
−
1
−
1
1
−
1
2
11
]
,于是
B
B
B
的奇异值分解为
[
−
2
−
1
1
2
−
9
−
1
−
1
11
]
=
U
Σ
V
T
\begin{bmatrix} -2 & -1 & 1 & 2 \\ -9 & -1 & -1 & 11 \end{bmatrix} =U\varSigma V^{\mathrm{T}}
[
−
2
−
9
−
1
−
1
1
−
1
2
11
]
=
U
Σ
V
T
其中
U
=
[
−
9409
44036
+
1
2
−
9409
44036
+
1
2
9409
44036
+
1
2
−
9409
44036
+
1
2
]
U= \begin{bmatrix} \sqrt{-\sqrt{\frac{9409}{44036}}+\frac{1}{2}} & -\sqrt{\sqrt{\frac{9409}{44036}}+\frac{1}{2}} \\ \sqrt{\sqrt{\frac{9409}{44036}}+\frac{1}{2}} & \sqrt{-\sqrt{\frac{9409}{44036}}+\frac{1}{2}} \end{bmatrix}
U
=
−
44036
9409
+
2
1
44036
9409
+
2
1
−
44036
9409
+
2
1
−
44036
9409
+
2
1
Σ
=
[
11009
+
107
0
0
0
0
−
11009
+
107
0
0
]
\varSigma= \begin{bmatrix} \sqrt{\sqrt{11009}+107} & 0 & 0 & 0 \\ 0 & \sqrt{-\sqrt{11009}+107} & 0 & 0 \end{bmatrix}
Σ
=
[
11009
+
107
0
0
−
11009
+
107
0
0
0
0
]
V
T
=
[
−
0.633
0.148
−
174
87
103
4785
9570
−
0.081
0.546
11
174
174
−
4785
9570
−
0.054
−
0.816
7
174
174
31
4785
9570
176066361
2131342400
+
133
440
−
176066361
2131342400
+
133
440
0
4785
110
]
T
V^{\mathrm{T}}= \begin{bmatrix} -0.633 & 0.148 & \frac{-\sqrt{174}}{87} & \frac{103\sqrt{4785}}{9570} \\ -0.081 & 0.546 & \frac{11\sqrt{174}}{174} & \frac{-\sqrt{4785}}{9570} \\ -0.054 & -0.816 & \frac{7\sqrt{174}}{174} & \frac{31\sqrt{4785}}{9570} \\ \sqrt{\sqrt{\frac{176066361}{2131342400}}+\frac{133}{440}} & \sqrt{-\sqrt{\frac{176066361}{2131342400}}+\frac{133}{440}} & 0 & \frac{\sqrt{4785}}{110} \end{bmatrix}^{\mathrm{T}}
V
T
=
−
0.633
−
0.081
−
0.054
2131342400
176066361
+
440
133
0.148
0.546
−
0.816
−
2131342400
176066361
+
440
133
87
−
174
174
11
174
174
7
174
0
9570
103
4785
9570
−
4785
9570
31
4785
110
4785
T
用numpy进行奇异值分解后计算秩一逼近
B
1
=
σ
1
u
1
v
1
T
=
[
−
1.7910
−
0.2283
−
0.1528
2.1722
−
9.0413
−
1.1528
−
0.7716
10.9658
]
B_1=\sigma_1\bm{u}_1\bm{v}_1^{\mathrm{T}}= \begin{bmatrix} -1.7910 & -0.2283 & -0.1528 & 2.1722 \\ -9.0413 & -1.1528 & -0.7716 & 10.9658 \end{bmatrix}
B
1
=
σ
1
u
1
v
1
T
=
[
−
1.7910
−
9.0413
−
0.2283
−
1.1528
−
0.1528
−
0.7716
2.1722
10.9658
]
可以看到秩一逼近已经很接近真实值了。
\\
6.3.15
\quad
考虑子空间
M
,
N
\mathcal{M},\mathcal{N}
M
,
N
,其对应的正交投影矩阵为
P
,
Q
.
P,Q.
P
,
Q
.
我们想要研究矩阵
H
=
P
(
P
+
Q
)
+
Q
+
Q
(
P
+
Q
)
+
P
.
H=P\left(P+Q\right)^{+}Q+Q\left(P+Q\right)^{+}P.
H
=
P
(
P
+
Q
)
+
Q
+
Q
(
P
+
Q
)
+
P
.
\\
1.计算
(
P
+
Q
)
(
P
+
Q
)
+
(P+Q)(P+Q)^{+}
(
P
+
Q
)
(
P
+
Q
)
+
的列空间和零空间,该矩阵是否为一个正交投影矩阵?
\\
2.计算
(
P
+
Q
)
+
(
P
+
Q
)
\left(P+Q\right)^{+}\left(P+Q\right)
(
P
+
Q
)
+
(
P
+
Q
)
的列空间和零空间,该矩阵是否为一个正交投影矩阵?和前一矩阵有何关联?
\\
3.证明
Q
(
P
+
Q
)
+
(
P
+
Q
)
=
Q
,
(
P
+
Q
)
(
P
+
Q
)
+
Q
=
Q
.
Q(P+Q)^{+}(P+Q)=Q,(P+Q)(P+Q)^{+}Q=Q.
Q
(
P
+
Q
)
+
(
P
+
Q
)
=
Q
,
(
P
+
Q
)
(
P
+
Q
)
+
Q
=
Q
.
\\
4.证明
H
=
2
P
(
P
+
Q
)
+
Q
=
2
Q
(
P
+
Q
)
+
P
.
H=2P(P+Q)^{+}Q=2Q(P+Q)^{+}P.
H
=
2
P
(
P
+
Q
)
+
Q
=
2
Q
(
P
+
Q
)
+
P
.
\\
5.假设
T
T
T
是
M
∩
N
\mathcal{M}\cap\mathcal{N}
M
∩
N
上的正交投影矩阵,证明
H
P
=
H
Q
=
H
T
=
H
.
HP=HQ=HT=H.
H
P
=
H
Q
=
H
T
=
H
.
\\
6.证明
H
T
=
T
.
HT=T.
H
T
=
T
.
\\
于是
H
=
T
H=T
H
=
T
,由此即得
M
∩
N
\mathcal{M}\cap\mathcal{N}
M
∩
N
的正交投影矩阵的表达式。
\\
1.解:
\\
设
P
+
Q
=
A
P+Q=A
P
+
Q
=
A
,由广义逆性质可知
A
A
+
=
U
r
U
r
T
AA^{+}=U_rU_r^{\mathrm{T}}
A
A
+
=
U
r
U
r
T
,其中
U
r
U_r
U
r
是
R
(
A
)
\mathcal{R}(A)
R
(
A
)
的一组标准正交基,于是
R
(
A
A
+
)
=
R
(
A
)
=
R
(
P
+
Q
)
\mathcal{R}(AA^{+})=\mathcal{R}(A)=\mathcal{R}(P+Q)
R
(
A
A
+
)
=
R
(
A
)
=
R
(
P
+
Q
)
。同理可得
N
(
A
A
+
)
=
N
(
P
+
Q
)
\mathcal{N}(AA^+)=\mathcal{N}(P+Q)
N
(
A
A
+
)
=
N
(
P
+
Q
)
。
\\
接下来我们将证明,
R
(
P
+
Q
)
=
M
+
N
,
N
(
P
+
Q
)
=
(
M
+
N
)
⊥
\mathcal{R}(P+Q)=\mathcal{M}+\mathcal{N}, \mathcal{N}(P+Q)=(\mathcal{M}+\mathcal{N})^{\perp}
R
(
P
+
Q
)
=
M
+
N
,
N
(
P
+
Q
)
=
(
M
+
N
)
⊥
。
\\
对
∀
x
∈
N
(
P
+
Q
)
,
(
P
+
Q
)
x
=
0
⟹
P
x
+
Q
x
=
0
⟹
P
x
=
−
Q
x
\forall \bm{x}\in \mathcal{N}(P+Q), (P+Q)\bm{x}=\bm{0} \implies P\bm{x}+Q\bm{x}=\bm{0} \implies P\bm{x}=-Q\bm{x}
∀
x
∈
N
(
P
+
Q
)
,
(
P
+
Q
)
x
=
0
⟹
P
x
+
Q
x
=
0
⟹
P
x
=
−
Q
x
。而
P
P
P
是
M
\mathcal{M}
M
的正交投影矩阵,
Q
Q
Q
是
N
\mathcal{N}
N
的正交投影矩阵,
P
x
∈
M
,
Q
x
∈
N
P\bm{x}\in \mathcal{M}, Q\bm{x}\in \mathcal{N}
P
x
∈
M
,
Q
x
∈
N
,这说明
∀
x
∈
N
(
P
+
Q
)
,
P
x
∈
M
∩
N
,
Q
x
∈
M
∩
N
\forall \bm{x} \in \mathcal{N}(P+Q), P\bm{x}\in \mathcal{M}\cap\mathcal{N}, Q\bm{x}\in \mathcal{M}\cap\mathcal{N}
∀
x
∈
N
(
P
+
Q
)
,
P
x
∈
M
∩
N
,
Q
x
∈
M
∩
N
。
\\
设
M
∩
N
\mathcal{M}\cap\mathcal{N}
M
∩
N
的正交投影矩阵是
T
T
T
,则
P
T
=
Q
T
=
T
=
T
T
=
(
P
T
)
T
=
(
Q
T
)
T
=
T
P
=
T
Q
PT=QT=T=T^{\mathrm{T}}=(PT)^{\mathrm{T}}=(QT)^{\mathrm{T}}=TP=TQ
PT
=
QT
=
T
=
T
T
=
(
PT
)
T
=
(
QT
)
T
=
TP
=
TQ
于是
P
x
=
−
Q
x
⟹
T
P
x
=
−
T
Q
x
⟹
T
x
=
−
T
x
⟹
T
x
=
0
⟹
T
P
x
=
T
Q
x
=
0
\begin{align*} P\bm{x}=-Q\bm{x}&\implies TP\bm{x}=-TQ\bm{x}\\ &\implies T\bm{x}=-T\bm{x}\\ &\implies T\bm{x}=\bm{0}\\ &\implies TP\bm{x}=TQ\bm{x}=\bm{0} \end{align*}
P
x
=
−
Q
x
⟹
TP
x
=
−
TQ
x
⟹
T
x
=
−
T
x
⟹
T
x
=
0
⟹
TP
x
=
TQ
x
=
0
又因为
P
x
∈
M
∩
N
,
Q
x
∈
M
∩
N
P\bm{x}\in\mathcal{M}\cap\mathcal{N},Q\bm{x}\in\mathcal{M}\cap\mathcal{N}
P
x
∈
M
∩
N
,
Q
x
∈
M
∩
N
,且
T
T
T
是
M
∩
N
\mathcal{M}\cap\mathcal{N}
M
∩
N
的正交投影矩阵,所以
T
P
x
=
P
x
=
0
,
T
Q
x
=
Q
x
=
0
⟹
N
(
P
+
Q
)
⊆
N
(
P
)
∩
N
(
Q
)
TP\bm{x}=P\bm{x}=\bm{0}, TQ\bm{x}=Q\bm{x}=\bm{0} \implies \mathcal{N}(P+Q)\subseteq\mathcal{N}(P)\cap\mathcal{N}(Q)
TP
x
=
P
x
=
0
,
TQ
x
=
Q
x
=
0
⟹
N
(
P
+
Q
)
⊆
N
(
P
)
∩
N
(
Q
)
。显然
N
(
P
)
∩
N
(
Q
)
⊆
N
(
P
+
Q
)
\mathcal{N}(P)\cap\mathcal{N}(Q)\subseteq\mathcal{N}(P+Q)
N
(
P
)
∩
N
(
Q
)
⊆
N
(
P
+
Q
)
也成立,于是
N
(
P
+
Q
)
=
N
(
P
)
∩
N
(
Q
)
=
M
⊥
∩
N
⊥
.
\mathcal{N}(P+Q)=\mathcal{N}(P)\cap\mathcal{N}(Q)=\mathcal{M}^\perp \cap \mathcal{N}^\perp. \\
N
(
P
+
Q
)
=
N
(
P
)
∩
N
(
Q
)
=
M
⊥
∩
N
⊥
.
由De Morgan定律 (见练习3.3.8),
M
⊥
∩
N
⊥
=
(
M
+
N
)
⊥
\mathcal{M}^\perp \cap \mathcal{N}^\perp = (\mathcal{M}+\mathcal{N})^\perp
M
⊥
∩
N
⊥
=
(
M
+
N
)
⊥
,即
N
(
A
A
+
)
=
N
(
P
+
Q
)
=
M
⊥
∩
N
⊥
=
(
M
+
N
)
⊥
\mathcal{N}(AA^{+})=\mathcal{N}(P+Q)=\mathcal{M}^\perp \cap \mathcal{N}^\perp =(\mathcal{M}+\mathcal{N})^{\perp}
N
(
A
A
+
)
=
N
(
P
+
Q
)
=
M
⊥
∩
N
⊥
=
(
M
+
N
)
⊥
。由正交补的性质可得,
R
(
A
A
+
)
=
R
(
P
+
Q
)
=
M
+
N
.
\mathcal{R}(AA^{+})=\mathcal{R}(P+Q)=\mathcal{M}+\mathcal{N}.\\
R
(
A
A
+
)
=
R
(
P
+
Q
)
=
M
+
N
.
(
A
A
+
)
2
=
(
A
A
+
)
T
=
A
A
+
(AA^{+})^2=(AA^{+})^{\mathrm{T}}=AA^{+}
(
A
A
+
)
2
=
(
A
A
+
)
T
=
A
A
+
,所以
A
A
+
AA^{+}
A
A
+
是正交投影矩阵。
\\
2.解:
\\
设
P
+
Q
=
A
P+Q=A
P
+
Q
=
A
,由于
A
A
A
是对称矩阵,由广义逆性质和前面讨论可知
A
A
+
AA^{+}
A
A
+
和
A
+
A
A^{+}A
A
+
A
有相同的零空间和列空间。即
R
(
A
+
A
)
=
R
(
P
+
Q
)
=
M
+
N
,
N
(
A
A
+
)
=
N
(
P
+
Q
)
=
(
M
+
N
)
⊥
.
\mathcal{R}(A^{+}A)=\mathcal{R}(P+Q)=\mathcal{M}+\mathcal{N}, \mathcal{N}(AA^{+})=\mathcal{N}(P+Q)=(\mathcal{M}+\mathcal{N})^{\perp}. \\
R
(
A
+
A
)
=
R
(
P
+
Q
)
=
M
+
N
,
N
(
A
A
+
)
=
N
(
P
+
Q
)
=
(
M
+
N
)
⊥
.
(
A
+
A
)
2
=
(
A
+
A
)
T
=
A
+
A
(A^{+}A)^2=(A^{+}A)^{\mathrm{T}}=A^{+}A
(
A
+
A
)
2
=
(
A
+
A
)
T
=
A
+
A
,所以
A
A
+
AA^{+}
A
A
+
是正交投影矩阵,且
A
+
A
=
A
A
+
.
A^{+}A=AA^{+}.\\
A
+
A
=
A
A
+
.
3.证明:
\\
由前面讨论可知,
A
A
+
=
A
+
A
AA^{+}=A^{+}A
A
A
+
=
A
+
A
是
M
+
N
\mathcal{M}+\mathcal{N}
M
+
N
上的正交投影矩阵,而
R
(
Q
)
⊆
M
+
N
\mathcal{R}(Q)\subseteq\mathcal{M}+\mathcal{N}
R
(
Q
)
⊆
M
+
N
,所以
A
A
+
Q
=
Q
AA^{+}Q=Q
A
A
+
Q
=
Q
,等号两侧取转置立得
Q
A
+
A
=
Q
.
QA^{+}A=Q.\\
Q
A
+
A
=
Q
.
4.证明:
\\
由第(3)小问结论
Q
=
Q
(
P
+
Q
)
+
(
P
+
Q
)
=
Q
(
P
+
Q
)
+
P
+
Q
(
P
+
Q
)
+
Q
Q=Q(P+Q)^{+}(P+Q)=Q(P+Q)^{+}P+Q(P+Q)^{+}Q
Q
=
Q
(
P
+
Q
)
+
(
P
+
Q
)
=
Q
(
P
+
Q
)
+
P
+
Q
(
P
+
Q
)
+
Q
Q
=
(
P
+
Q
)
(
P
+
Q
)
+
Q
=
P
(
P
+
Q
)
+
Q
+
Q
(
P
+
Q
)
+
Q
Q=(P+Q)(P+Q)^{+}Q=P(P+Q)^{+}Q+Q(P+Q)^{+}Q
Q
=
(
P
+
Q
)
(
P
+
Q
)
+
Q
=
P
(
P
+
Q
)
+
Q
+
Q
(
P
+
Q
)
+
Q
于是
Q
(
P
+
Q
)
+
P
=
P
(
P
+
Q
)
+
Q
⟹
H
=
2
P
(
P
+
Q
)
+
Q
=
2
Q
(
P
+
Q
)
+
P
.
Q(P+Q)^{+}P=P(P+Q)^{+}Q\implies H=2P(P+Q)^{+}Q=2Q(P+Q)^{+}P.\\
Q
(
P
+
Q
)
+
P
=
P
(
P
+
Q
)
+
Q
⟹
H
=
2
P
(
P
+
Q
)
+
Q
=
2
Q
(
P
+
Q
)
+
P
.
5.证明:
\\
H
P
=
2
Q
(
P
+
Q
)
+
P
2
=
2
Q
(
P
+
Q
)
+
P
=
H
.
HP=2Q(P+Q)^{+}P^2=2Q(P+Q)^{+}P=H.
H
P
=
2
Q
(
P
+
Q
)
+
P
2
=
2
Q
(
P
+
Q
)
+
P
=
H
.
H
Q
=
2
P
(
P
+
Q
)
+
Q
2
=
2
P
(
P
+
Q
)
+
Q
=
H
.
HQ=2P(P+Q)^{+}Q^2=2P(P+Q)^{+}Q=H.
H
Q
=
2
P
(
P
+
Q
)
+
Q
2
=
2
P
(
P
+
Q
)
+
Q
=
H
.
由De Morgan定律 (见练习3.3.8),
(
M
∩
N
)
⊥
=
M
⊥
+
N
⊥
(\mathcal{M}\cap\mathcal{N})^{\perp}=\mathcal{M}^{\perp}+\mathcal{N}^{\perp}
(
M
∩
N
)
⊥
=
M
⊥
+
N
⊥
。设
P
(
M
∩
N
)
⊥
P_{(\mathcal{M}\cap\mathcal{N})^{\perp}}
P
(
M
∩
N
)
⊥
表示
(
M
∩
N
)
⊥
(\mathcal{M}\cap\mathcal{N})^\perp
(
M
∩
N
)
⊥
上的正交投影矩阵,
P
M
⊥
,
P
N
⊥
P_{\mathcal{M}^{\perp}},P_{\mathcal{N}^{\perp}}
P
M
⊥
,
P
N
⊥
分别表示
M
⊥
,
N
⊥
\mathcal{M}^{\perp},\mathcal{N}^{\perp}
M
⊥
,
N
⊥
上的正交投影矩阵,则
T
=
P
M
∩
N
=
I
−
P
(
M
∩
N
)
⊥
=
I
−
P
M
⊥
+
N
⊥
T=P_{\mathcal{M}\cap\mathcal{N}}=I-P_{(\mathcal{M}\cap\mathcal{N})^{\perp}}=I-P_{\mathcal{M}^\perp+\mathcal{N}^\perp}
T
=
P
M
∩
N
=
I
−
P
(
M
∩
N
)
⊥
=
I
−
P
M
⊥
+
N
⊥
设
B
=
P
M
⊥
+
P
N
⊥
=
I
−
P
+
I
−
Q
=
2
I
−
P
−
Q
B=P_{\mathcal{M}^{\perp}}+P_{\mathcal{N}^{\perp}}=I-P+I-Q=2I-P-Q
B
=
P
M
⊥
+
P
N
⊥
=
I
−
P
+
I
−
Q
=
2
I
−
P
−
Q
。由第(2)问结论可得
P
M
⊥
+
N
⊥
=
B
B
+
P_{\mathcal{M}^\perp+\mathcal{N}^\perp}=BB^{+}
P
M
⊥
+
N
⊥
=
B
B
+
,即
B
B
+
BB^{+}
B
B
+
是
R
(
B
)
=
M
⊥
+
N
⊥
\mathcal{R}(B)=\mathcal{M}^{\perp}+\mathcal{N}^{\perp}
R
(
B
)
=
M
⊥
+
N
⊥
上的正交投影矩阵。因此
H
T
=
H
(
I
−
P
M
⊥
+
N
⊥
)
=
H
(
I
−
B
B
+
)
=
H
−
H
B
B
+
=
H
−
H
(
2
I
−
P
−
Q
)
B
+
=
H
−
(
2
H
−
H
−
H
)
B
+
=
H
\begin{align*} HT&=H(I-P_{\mathcal{M}^\perp+\mathcal{N}^\perp})\\ &=H(I-BB^{+})\\ &=H-HBB^{+}\\ &=H-H(2I-P-Q)B^{+}\\ &=H-(2H-H-H)B^{+}\\ &=H \end{align*}
H
T
=
H
(
I
−
P
M
⊥
+
N
⊥
)
=
H
(
I
−
B
B
+
)
=
H
−
H
B
B
+
=
H
−
H
(
2
I
−
P
−
Q
)
B
+
=
H
−
(
2
H
−
H
−
H
)
B
+
=
H
6.证明:
\\
考虑矩阵
H
T
−
T
HT-T
H
T
−
T
的零空间。设
x
∈
R
n
\bm{x}\in\mathbb{R}^n
x
∈
R
n
,分两种情况讨论:
\\
(a)若
x
∈
(
M
∩
N
)
⊥
\bm{x}\in (\mathcal{M}\cap\mathcal{N})^{\perp}
x
∈
(
M
∩
N
)
⊥
,
\\
(
H
T
−
T
)
x
=
H
T
x
−
T
x
=
0
−
0
=
0
(HT-T)\bm{x}=HT\bm{x}-T\bm{x}=\bm{0}-\bm{0}=\bm{0}
(
H
T
−
T
)
x
=
H
T
x
−
T
x
=
0
−
0
=
0
。所以
(
M
∩
N
)
⊥
⊆
N
(
H
T
−
T
)
(\mathcal{M}\cap\mathcal{N})^{\perp} \subseteq \mathcal{N}(HT-T)
(
M
∩
N
)
⊥
⊆
N
(
H
T
−
T
)
。
\\
(b)若
x
∈
(
M
∩
N
)
\bm{x}\in(\mathcal{M}\cap\mathcal{N})
x
∈
(
M
∩
N
)
,
\\
则
(
H
T
−
T
)
x
=
H
x
−
x
=
P
(
P
+
Q
)
+
Q
x
+
Q
(
P
+
Q
)
+
P
x
−
x
=
P
(
P
+
Q
)
+
x
+
Q
(
P
+
Q
)
+
x
−
x
=
(
P
+
Q
)
(
P
+
Q
)
+
x
−
x
=
A
A
+
x
−
x
\begin{align*} (HT-T)\bm{x}&=H\bm{x}-\bm{x}\\ &=P\left(P+Q\right)^{+}Q\bm{x}+Q\left(P+Q\right)^{+}P\bm{x}-\bm{x}\\ &=P\left(P+Q\right)^{+}\bm{x}+Q\left(P+Q\right)^{+}\bm{x}-\bm{x}\\ &=(P+Q)\left(P+Q\right)^{+}\bm{x}-\bm{x}\\ &=AA^{+}\bm{x}-\bm{x} \end{align*}
(
H
T
−
T
)
x
=
H
x
−
x
=
P
(
P
+
Q
)
+
Q
x
+
Q
(
P
+
Q
)
+
P
x
−
x
=
P
(
P
+
Q
)
+
x
+
Q
(
P
+
Q
)
+
x
−
x
=
(
P
+
Q
)
(
P
+
Q
)
+
x
−
x
=
A
A
+
x
−
x
注意到
A
A
+
AA^{+}
A
A
+
是
M
+
N
\mathcal{M}+\mathcal{N}
M
+
N
上的正交投影矩阵,而
x
∈
(
M
∩
N
)
⊆
(
M
+
N
)
\bm{x}\in(\mathcal{M}\cap\mathcal{N})\subseteq (\mathcal{M}+\mathcal{N})
x
∈
(
M
∩
N
)
⊆
(
M
+
N
)
,于是
A
A
+
x
−
x
=
x
−
x
=
0
AA^{+}\bm{x}-\bm{x}=\bm{x}-\bm{x}=\bm{0}
A
A
+
x
−
x
=
x
−
x
=
0
。即
(
M
∩
N
)
⊆
N
(
H
T
−
T
)
(\mathcal{M}\cap\mathcal{N}) \subseteq \mathcal{N}(HT-T)
(
M
∩
N
)
⊆
N
(
H
T
−
T
)
。
\\
这表明
N
(
H
T
−
T
)
=
R
n
⟺
H
T
−
T
=
O
⟺
H
T
=
T
.
\mathcal{N}(HT-T)=\mathbb{R}^n \iff HT-T=O \iff HT=T.
N
(
H
T
−
T
)
=
R
n
⟺
H
T
−
T
=
O
⟺
H
T
=
T
.